BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 010616
(506 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224124910|ref|XP_002319454.1| predicted protein [Populus trichocarpa]
gi|222857830|gb|EEE95377.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 780 bits (2014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/509 (71%), Positives = 436/509 (85%), Gaps = 5/509 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K+L FCLW L +C LLPASSNGL RIGLKKR LDL ++ A I R+E G G
Sbjct: 1 MGNKILLKAFCLWAL-TCFLLPASSNGLVRIGLKKRHLDLQTIKDAIIARQEG-KAGVGA 58
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
S H LG SD DI+PLKN++DAQY GEIGIGSPPQNF+V+FDTGSSNLWVPSSKCYFSI
Sbjct: 59 SSRVHDLGSSDGDIIPLKNYLDAQYLGEIGIGSPPQNFTVVFDTGSSNLWVPSSKCYFSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS +S+TYT+ G CEI+YGSGS+SGFFSQDNV+VGD+VVKDQVF+EAT+EG
Sbjct: 119 ACYFHSKYKSSRSSTYTKNGNFCEIHYGSGSVSGFFSQDNVQVGDLVVKDQVFVEATKEG 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL+F+L +FDGI+GLGF+EI+VG+ VP+W NM++Q LV +EVFSFWLNR+P+A+EGGE+V
Sbjct: 179 SLSFILGKFDGILGLGFQEISVGNVVPLWYNMIQQDLVDDEVFSFWLNRNPEAKEGGELV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDPKHFKGKHTYVPVT+KGYWQ +GD LIG STG+CEGGCAAIVDSGTSLLAGPT
Sbjct: 239 FGGVDPKHFKGKHTYVPVTQKGYWQINMGDFLIGKHSTGLCEGGCAAIVDSGTSLLAGPT 298
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
P++TEINHAIG EG+VSAECK VVS YGDLIW+L++SG+ P KVC Q+GLC FN A+
Sbjct: 299 PIITEINHAIGAEGLVSAECKEVVSHYGDLIWELIISGVQPSKVCTQLGLCIFNEAKSAR 358
Query: 361 TGIKTVVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
TGI++VVEKEN SAG+ C+AC+M V+WVQNQL++K TKE ++Y+++LC+SLP+P
Sbjct: 359 TGIESVVEKENKEKSSAGNDLPCTACQMLVIWVQNQLREKATKETAINYLDKLCESLPSP 418
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MG+S IDC+ I TMPN++FTIGDK F+L+PEQYILKTGEGIA+VCISGFMA D+PPPRGP
Sbjct: 419 MGQSSIDCNSISTMPNITFTIGDKPFSLTPEQYILKTGEGIAQVCISGFMALDVPPPRGP 478
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHT+FD G L +GFAEAA
Sbjct: 479 LWILGDVFMGAYHTIFDYGNLEVGFAEAA 507
>gi|294440430|gb|ADE74632.1| aspartic protease 1 [Nicotiana tabacum]
Length = 506
Score = 756 bits (1952), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/508 (69%), Positives = 430/508 (84%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
ME+K L + LW + +LP SS+ L R+GLKK+ LD++S+NAAR+ R ++RY G
Sbjct: 1 MERKHLCAALLLWAIVY-FVLPVSSDNLLRVGLKKQSLDVNSINAARVARLQDRY--GKN 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ +LGDSD DI+ LKN++DAQY+GEIG+GSPPQ F VIFDTGSSNLWVPSS+CYFS
Sbjct: 58 VNGIEKKLGDSDLDIVSLKNYLDAQYYGEIGVGSPPQKFKVIFDTGSSNLWVPSSRCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+FHS+YK+ KS TYT G+SC I YG+GSISG FSQDNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWFHSKYKASKSTTYTRNGESCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+A PVW NMV QGLV E+VFSFW+NRD A+EGGE+
Sbjct: 178 PSITFIIAKFDGILGLGFQEISVGNATPVWYNMVGQGLVKEQVFSFWINRDATAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVD HFKG HTYVP+T+KGYWQF +GD LIGN STGVC GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDSNHFKGNHTYVPLTQKGYWQFNMGDFLIGNASTGVCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVT+INHAIG EG+VS ECK +VSQYG++IW+LLVSG+ P++VC Q GLC FNGA++V
Sbjct: 298 TTVVTQINHAIGAEGIVSMECKTIVSQYGEMIWNLLVSGVKPDQVCSQAGLCYFNGAQHV 357
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S+ I+TVVE+E S G++ +C+ACEMAVVW+QNQLKQK+TKE+VL Y+N+LC+ LP+P
Sbjct: 358 SSNIRTVVERETEGSSVGEAPLCTACEMAVVWMQNQLKQKETKERVLEYVNQLCEKLPSP 417
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES+IDC I MPN++FTI DK + L+PEQYILKTGEGI +C+SGF A D+PPPRGP
Sbjct: 418 MGESVIDCSMISAMPNITFTIKDKAYVLTPEQYILKTGEGITTICMSGFAALDVPPPRGP 477
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVFMGVYHTVFD G R+GFAEA
Sbjct: 478 LWILGDVFMGVYHTVFDYGNSRLGFAEA 505
>gi|82623417|gb|ABB87123.1| aspartic protease precursor-like [Solanum tuberosum]
Length = 506
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/509 (68%), Positives = 427/509 (83%), Gaps = 6/509 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
ME+K L + LW + +C LPASS L RIGLKK RLD++S+ AAR+ + ++RY G
Sbjct: 1 MEKKHLCAALLLWAI-TCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD DI+PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ HS+YK+ KS+TYT G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHSKYKASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 178 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +VT+INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++V
Sbjct: 298 TTIVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHV 357
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S+ I+TVVE+E S G++ +C+ACEMAVVW+QNQLKQ TKEKVL Y+N+LC+ +P+P
Sbjct: 358 SSNIRTVVERETEGSSVGEAPLCTACEMAVVWMQNQLKQAGTKEKVLEYVNQLCEKIPSP 417
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES IDC+ I +MP++SFTI DK F L+PEQYILKTGEG+A +C+SGF A D+PPPRGP
Sbjct: 418 MGESTIDCNSISSMPDISFTIKDKAFVLTPEQYILKTGEGVATICVSGFAALDVPPPRGP 477
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 478 LWILGDVFMGPYHTVFDYGKSQVGFAEAA 506
>gi|171854659|dbj|BAG16519.1| putative aspartic protease [Capsicum chinense]
Length = 506
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/490 (70%), Positives = 421/490 (85%), Gaps = 5/490 (1%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAGVSGVRHRLGDSDEDILPLK 78
+LPASS+ L RIGLKK +D++S+NAAR+ R ++RY G ++G+ + SD DI+PLK
Sbjct: 19 VLPASSDNLLRIGLKKHHVDVNSINAARVARLQDRY--GKHLNGLEKKSDGSDVDIVPLK 76
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSS+CYFSI+C+FH +YK+ KS+TYT
Sbjct: 77 NYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSRCYFSIACWFHHKYKAGKSSTYTR 136
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GKSC I YG+GSISG FSQDNV+VGD+VVKDQVFIEATRE S+TF++ +FDGI+GLGF+
Sbjct: 137 NGKSCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATREPSITFIIGKFDGILGLGFQ 196
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VG+A PVW NMV+QGLV E VFSFW NRD +EGGE+VFGGVDPKHFKG HTYVP+
Sbjct: 197 EISVGNATPVWYNMVDQGLVKEPVFSFWFNRDASTKEGGELVFGGVDPKHFKGNHTYVPL 256
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGPT +VT++NHAIG EGVVSA
Sbjct: 257 TQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQLNHAIGAEGVVSA 316
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGD 376
ECK +VSQYG+++WDLLVSG+ P++VC Q GLC FNGAE+VS+ I+TVVE+EN S G+
Sbjct: 317 ECKTIVSQYGEVLWDLLVSGVRPDQVCSQAGLCFFNGAEHVSSNIRTVVERENEGSSVGE 376
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
+ +C+ CEMAVVW+QNQLKQ+ TKE+VL Y+++LC+ LP+PMGES++DC+ I ++PN++F
Sbjct: 377 APLCTVCEMAVVWIQNQLKQQGTKERVLEYVDQLCEKLPSPMGESVVDCNSISSLPNITF 436
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
TI DK F L+PEQYILKTGEGIA +CISGF AFD+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 437 TIKDKAFVLTPEQYILKTGEGIASICISGFAAFDVPPPRGPLWILGDVFMGPYHTVFDYG 496
Query: 497 KLRIGFAEAA 506
++GFAEAA
Sbjct: 497 NSQVGFAEAA 506
>gi|350535356|ref|NP_001234702.1| aspartic protease precursor [Solanum lycopersicum]
gi|951449|gb|AAB18280.1| aspartic protease precursor [Solanum lycopersicum]
Length = 506
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/509 (67%), Positives = 425/509 (83%), Gaps = 6/509 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
M++K L + LW +A C LPASS L RIGLKK RLD+ S+ AAR+ + ++RY G
Sbjct: 1 MDKKHLCAALLLWAIA-CSALPASSGDLFRIGLKKHRLDVDSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD +PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIYKVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ HS+Y++ KS+TYT G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHSKYQASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 178 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HT VP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDPKHFKGNHTCVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +VT+INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +G+++V
Sbjct: 298 TTIVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGIRPDQVCSQAGLCFLDGSQHV 357
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S+ I+TVVE+E S G++ +C+ACEMAVVW+QNQLKQ+QTKEKVL Y+N+LC+ +P+P
Sbjct: 358 SSNIRTVVERETEGSSVGEAPLCTACEMAVVWMQNQLKQEQTKEKVLEYVNQLCEKIPSP 417
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES IDC+RI +MP+++FTI D F L+PEQYILKTGEG+A +C+SGF A D+PPPRGP
Sbjct: 418 MGESAIDCNRISSMPDITFTIKDTAFVLTPEQYILKTGEGVATICVSGFAALDVPPPRGP 477
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 478 LWILGDVFMGPYHTVFDYGKSQVGFAEAA 506
>gi|255543036|ref|XP_002512581.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223548542|gb|EEF50033.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 494
Score = 731 bits (1887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/505 (69%), Positives = 408/505 (80%), Gaps = 19/505 (3%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L + FCLW L +C LPASSNGL +I LKKR LDL S+NAAR R+ER A S
Sbjct: 6 LWMAAFCLWAL-TCSFLPASSNGLMKISLKKRPLDLDSINAARTARQERKTRIAASS--- 61
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
L D D++PLKN++D QYFGEI IGSPPQ F+VIFDTGSSNLW+PS+KCYFS++CYF
Sbjct: 62 -MLHSPDPDMIPLKNYLDTQYFGEISIGSPPQTFTVIFDTGSSNLWIPSAKCYFSLACYF 120
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HSRYKS +S TY G +C+I YG+GSI GFFSQD VEVG++VV++QVFIEATREGSLTF
Sbjct: 121 HSRYKSSRSTTYIRNGTTCKIRYGTGSIVGFFSQDTVEVGNLVVRNQVFIEATREGSLTF 180
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDGI GLGF+EI+VGDAVPVW NMV+QGLV + VFSFWLN DPDA+EGGE+VFGGV
Sbjct: 181 VLAKFDGIFGLGFQEISVGDAVPVWYNMVQQGLVGDPVFSFWLNNDPDAKEGGELVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D KH++GKHTYVPVT+KGYWQF +GD +IGN ST DSGTSLLAGPTP+V
Sbjct: 241 DEKHYRGKHTYVPVTQKGYWQFNMGDFIIGNHST-----------DSGTSLLAGPTPIVA 289
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
EINHAIG EG+VSAECK VVSQYG+LIWDLL+SG+ P KVC Q+GLC F G Y S I+
Sbjct: 290 EINHAIGAEGIVSAECKEVVSQYGNLIWDLLISGVQPGKVCSQLGLCTFRGDRYESNVIE 349
Query: 365 TVVEKENV---SAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
+VVE+EN+ S GD +C+ACEM V+WVQNQLK KQTKE L Y+N+LC+SLP+PMGES
Sbjct: 350 SVVEEENMEGSSVGDDVLCTACEMLVIWVQNQLKHKQTKEAALEYVNKLCESLPSPMGES 409
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
IIDC MPN+ FTIGDK F L+PEQYILKTGEGIA VCISGFMA D+PPPRGPLWIL
Sbjct: 410 IIDCASTTGMPNIIFTIGDKQFQLTPEQYILKTGEGIASVCISGFMALDVPPPRGPLWIL 469
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFM VYHTVFD G L++GFAEAA
Sbjct: 470 GDVFMRVYHTVFDFGDLQVGFAEAA 494
>gi|359487701|ref|XP_002276363.2| PREDICTED: aspartic proteinase oryzasin-1-like [Vitis vinifera]
gi|296089851|emb|CBI39670.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/510 (67%), Positives = 420/510 (82%), Gaps = 12/510 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q ++ + FCLW L C LLP S+G RIGLKKR LD +++ ARI + + +GG GV
Sbjct: 1 MRQGVVWAAFCLWALI-CPLLPVYSHGSVRIGLKKRPLDFNNMRTARIAQMQGKIGG-GV 58
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
H D D + + LKN++DAQYFGEIGIG+PPQNF+V+FDTGSSNLWVPSSKCYFSI
Sbjct: 59 MSKYHGFDDPDGEFVSLKNYLDAQYFGEIGIGTPPQNFTVVFDTGSSNLWVPSSKCYFSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+FH++YK+R S+TYT+IG+ EI+YGSGSISGFFSQDNVEVG +VVKDQVFIEATREG
Sbjct: 119 ACFFHNKYKARLSSTYTKIGRPGEIHYGSGSISGFFSQDNVEVGSLVVKDQVFIEATREG 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+ I+VG+A PVW M++QGL+ EE+FSFWLNR+P+A EGGEIV
Sbjct: 179 SLTFALAKFDGIMGLGFQGISVGNATPVWSTMLQQGLLHEELFSFWLNRNPNANEGGEIV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HF+GKHT+VPVT+ GYWQF +GD LI NQ+TGVCEGGC+AIVDSGTSL+AGPT
Sbjct: 239 FGGVDKRHFRGKHTFVPVTQAGYWQFRMGDFLISNQTTGVCEGGCSAIVDSGTSLIAGPT 298
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
VVT+INHAIG EG+VS ECK VVSQYG+++WDLLVSG+LP KVC QIGLC S
Sbjct: 299 LVVTQINHAIGAEGIVSMECKEVVSQYGNMMWDLLVSGVLPSKVCSQIGLCM------AS 352
Query: 361 TGIKTVVEKENVSA----GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPN 416
GI+TVVEKE + + GD C+ACEM VW+Q+QLKQ +TK+KVL Y+ ELC SLP+
Sbjct: 353 PGIRTVVEKEKMESVEEVGDVVFCNACEMIAVWIQSQLKQMKTKDKVLRYVTELCGSLPS 412
Query: 417 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 476
PMGES+IDC + MPN++F IGDK F+L+P+QYIL+TG+G A VC+SGF A D+PPP+G
Sbjct: 413 PMGESVIDCTSVANMPNITFIIGDKAFDLTPDQYILRTGDGSATVCLSGFTALDVPPPKG 472
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLWILG++FMGVYHTVFD G LRIGFAEAA
Sbjct: 473 PLWILGEIFMGVYHTVFDFGDLRIGFAEAA 502
>gi|351725345|ref|NP_001237345.1| aspartic proteinase 2 [Glycine max]
gi|15425751|dbj|BAB64296.1| aspartic proteinase 2 [Glycine max]
Length = 508
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/515 (66%), Positives = 419/515 (81%), Gaps = 18/515 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M QK L +VFCLW L +C LLP+ S G+ RIGLKKR LDL S+NAAR R+ G+
Sbjct: 1 MGQKHLVTVFCLWAL-TCSLLPSFSFGILRIGLKKRPLDLDSINAARKARE-------GL 52
Query: 61 SGVRHRLGDSD--------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVP 112
VR +G D EDI+PLKN++DAQYFGEIGIG PPQ F+V+FDTGSSNLWVP
Sbjct: 53 RSVRPMMGAHDQFIGKSKGEDIVPLKNYLDAQYFGEIGIGIPPQPFTVVFDTGSSNLWVP 112
Query: 113 SSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQV 172
SSKCYF+++CY H+ Y ++KS T+ + G SC+INYG+GSISGFFSQDNV+VG VVK Q
Sbjct: 113 SSKCYFTLACYTHNWYTAKKSKTHVKNGTSCKINYGTGSISGFFSQDNVKVGSAVVKHQD 172
Query: 173 FIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD 232
FIEAT EGSLTFL A+FDGI+GLGF+EI+V +AVPVW MVEQ L+SE+VFSFWLN DP+
Sbjct: 173 FIEATHEGSLTFLSAKFDGILGLGFQEISVENAVPVWFKMVEQKLISEKVFSFWLNGDPN 232
Query: 233 AEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
A++GGE+VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD +G STGVCEGGCAAIVDSG
Sbjct: 233 AKKGGELVFGGVDPKHFKGNHTYVPITEKGYWQIEMGDFFVGGVSTGVCEGGCAAIVDSG 292
Query: 293 TSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
TSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P+ +C Q+GLC+
Sbjct: 293 TSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPDDICSQVGLCS 352
Query: 353 FNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINEL 410
+ S GI+ V EKE ++A D+ +CS+C+M V+W+QNQLKQK TK++V +Y+N+L
Sbjct: 353 SKRHQSKSAGIEMVTEKEQEELAARDTPLCSSCQMLVLWIQNQLKQKATKDRVFNYVNQL 412
Query: 411 CDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFD 470
C+SLP+P GES+I C+ + MPN++FTIG+K F L+PEQYIL+TGEGI EVC+SGF+AFD
Sbjct: 413 CESLPSPSGESVISCNSLSKMPNITFTIGNKPFVLTPEQYILRTGEGITEVCLSGFIAFD 472
Query: 471 LPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 473 VPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 507
>gi|12231178|dbj|BAB20972.1| aspartic proteinase 4 [Nepenthes alata]
Length = 505
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/509 (69%), Positives = 422/509 (82%), Gaps = 7/509 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L +FC L SC S++GL RIGLK++ D +S+ A RI RK G+
Sbjct: 1 MGHRNLWVIFCFCALISCFF-STSADGLVRIGLKRQFSDSNSIRAVRIARKAGM--NQGL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
++ GDSD DI+ LKN++DAQY+GEIGIGSPPQ FSVIFDTGSSNLWVPSSKCYFS+
Sbjct: 58 KRFQYSFGDSDTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSV 117
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IGKSCEI+YGSGSISGFFSQD VEVG++ VK+QVFIEA+RE
Sbjct: 118 ACYFHSKYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREK 177
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+EI+VGD VPVW NMVEQGLVSE+VFSFW NRDP A+ GGEIV
Sbjct: 178 SLTFALAKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D KHF G+H YVP+T+KGYWQFE+G+ LIGN STG C GGC AIVDSGTSLLAGP
Sbjct: 238 FGGIDEKHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPM 297
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
VVTE+NHAIG EG+ S ECK VV QYGD+IWDLLVSG+ P+K+C Q+ LC FN A+++S
Sbjct: 298 HVVTEVNHAIGAEGIASMECKEVVYQYGDMIWDLLVSGVQPDKICSQLALC-FNDAQFLS 356
Query: 361 TGIKTVVEKE---NVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GIKTV+E+E N S D +C+ACEMAVVW+QNQL+++ TKEKVL+YINELCDSLP+P
Sbjct: 357 IGIKTVIERENRKNSSVADDFLCTACEMAVVWIQNQLRREVTKEKVLNYINELCDSLPSP 416
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES+IDCD IP MPNV+FTIG+K F L+PEQY+LK GEG A VC+SGF+A D+PPP GP
Sbjct: 417 MGESVIDCDSIPYMPNVTFTIGEKPFKLTPEQYVLKAGEGDAMVCLSGFIALDVPPPSGP 476
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMGVYHTVFD G L++GFAE+A
Sbjct: 477 LWILGDVFMGVYHTVFDFGNLKLGFAESA 505
>gi|356505735|ref|XP_003521645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/508 (66%), Positives = 418/508 (82%), Gaps = 4/508 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M QK L +V CLW L +C LLP+ S G+ RIGLKKR LD+ S+NAAR R+ G + +
Sbjct: 1 MGQKHLVTVLCLWAL-TCSLLPSFSFGILRIGLKKRPLDIDSINAARKAREGLRSGRSMM 59
Query: 61 SGVRHRLGDSD-EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+G S ED++PLKN+MDAQYFGEIGIG+PPQ F+V+FDTGSSNLWVPSSKCYF+
Sbjct: 60 GAHDQYIGKSKGEDLVPLKNYMDAQYFGEIGIGTPPQPFTVVFDTGSSNLWVPSSKCYFT 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
++CY H+ Y ++KS T+ + G SC+I+YG+GSISGFFSQDNV+VG VVK Q FIEAT E
Sbjct: 120 LACYTHNWYTAKKSKTHAKNGTSCKISYGTGSISGFFSQDNVKVGSAVVKHQDFIEATHE 179
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSLTFL A+FDGI+GLGF+EI+V ++VPVW MVEQ L+SE+VFSFWLN DP+A++GGE+
Sbjct: 180 GSLTFLSAKFDGILGLGFQEISVENSVPVWYKMVEQKLISEKVFSFWLNGDPNAKKGGEL 239
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 240 VFGGVDPKHFKGNHTYVPITEKGYWQIEIGDFFIGGVSTGVCEGGCAAIVDSGTSLLAGP 299
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
TPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P+ +C Q+GLC+ E
Sbjct: 300 TPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPDDICSQVGLCSSKRHESK 359
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S GI+ V EKE ++A D+ +CS+C+M V+W+QNQLKQK TK++V +Y+N+LC+SLP+P
Sbjct: 360 SAGIEMVTEKEQGELTARDNPLCSSCQMLVLWIQNQLKQKATKDRVFNYVNQLCESLPSP 419
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GES+I C+ + MPN++FTIG+K F L+PEQYILKTGEGI EVC+SGF+AFD+PPP+GP
Sbjct: 420 SGESVISCNSLSKMPNITFTIGNKPFVLTPEQYILKTGEGITEVCLSGFIAFDVPPPKGP 479
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVFM YHTVFD G L++GFAEA
Sbjct: 480 LWILGDVFMRAYHTVFDYGNLQVGFAEA 507
>gi|356534977|ref|XP_003536026.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 713 bits (1841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/509 (69%), Positives = 427/509 (83%), Gaps = 5/509 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K L VFCLW L +C LLP+ S GL RIGLKKR LDL S+ AAR+ R++ +G +
Sbjct: 2 MGHKYLWLVFCLWAL-TCSLLPSFSFGLMRIGLKKRDLDLDSIRAARMVREKPRLGRPVL 60
Query: 61 SGVRHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
H LG DE I+PLKN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 61 GAYDHDLGKPIDEGIVPLKNYLDAQYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFS 120
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H YKS+KS TYT+ G SC+I YGSGSISGFFS+D+V+VGDVVVK+Q FIEATRE
Sbjct: 121 IACYTHHWYKSKKSKTYTKNGTSCKIGYGSGSISGFFSKDHVKVGDVVVKNQDFIEATRE 180
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL+F+LA+FDG++GLGF+EI+V +AVPVW NMV+Q LVSE+VFSFWLN DP A++GGE+
Sbjct: 181 GSLSFVLAKFDGLLGLGFQEISVENAVPVWYNMVKQNLVSEQVFSFWLNGDPKAKDGGEL 240
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGG+DPKHFKG H YVPVTKKGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 241 IFGGIDPKHFKGDHIYVPVTKKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGP 300
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVTEINHAIG EGV+S ECK VVS+YG+L+WDLLVSG+ P+ VC Q+GLC F A+
Sbjct: 301 TTVVTEINHAIGAEGVLSVECKEVVSEYGELLWDLLVSGVRPDDVCSQVGLC-FKRAKSE 359
Query: 360 STGIKTVVEK--ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S GI+ V EK +SA D+A+C++C+M VVW+QNQLKQK+TKE V +Y+N+LC+SLP+P
Sbjct: 360 SNGIEMVTEKGQRELSAKDTALCTSCQMLVVWIQNQLKQKKTKEIVFNYVNQLCESLPSP 419
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GES++DC+ I +PN++FT+GDK F L+PEQYILKTGEGIAEVC+SGF+AFD+PPPRGP
Sbjct: 420 NGESVVDCNSIYGLPNITFTVGDKPFTLTPEQYILKTGEGIAEVCLSGFIAFDIPPPRGP 479
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 480 LWILGDVFMRVYHTVFDYGNLRVGFAKAA 508
>gi|357511707|ref|XP_003626142.1| Aspartic proteinase [Medicago truncatula]
gi|355501157|gb|AES82360.1| Aspartic proteinase [Medicago truncatula]
Length = 504
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/508 (66%), Positives = 417/508 (82%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q VFCL +C LLP+ S G+ RIGL+KR LDLH+++A ++ R+++ G +
Sbjct: 1 MVQTHFVVVFCLLAF-TCSLLPSFSFGMMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPM 59
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ H+ SD+ I+PLKN+MDAQYFGEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS+
Sbjct: 60 M-LAHK--SSDDAIVPLKNYMDAQYFGEIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSL 116
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H+ YK++KS TY + G SC+I+YG+GSISG+FSQDNV+VG VVK Q FIEATREG
Sbjct: 117 ACYTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEATREG 176
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL+FL +FDGI GLGF+EI+V A+PVW NM+EQ L+ E+VFSFWLN +P+A++GGE+V
Sbjct: 177 SLSFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELV 236
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDPKHFKGKHTYVPVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPT
Sbjct: 237 FGGVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPT 296
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
PVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P VC Q+GLC+ G + S
Sbjct: 297 PVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNS 356
Query: 361 TGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
GI+ V +KE +SA D+ +CS+C+M V+WVQNQLKQK TKE+V +Y+N+LC+SLP+P
Sbjct: 357 AGIEMVTDKEQSELSAKDTPLCSSCQMLVLWVQNQLKQKATKERVFNYVNQLCESLPSPS 416
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES+I C+ I MPN+SFTIG+K F L+PEQYIL+TGEGI +VC+SGF+AFD+PPP+GPL
Sbjct: 417 GESVISCNDISKMPNISFTIGNKPFVLTPEQYILRTGEGITQVCLSGFIAFDVPPPKGPL 476
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFM YHTVFD G L++GFAEAA
Sbjct: 477 WILGDVFMRAYHTVFDYGNLQVGFAEAA 504
>gi|50540937|gb|AAT77954.1| Asp [Solanum tuberosum]
Length = 497
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/507 (66%), Positives = 413/507 (81%), Gaps = 18/507 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
M++K L + LW + +C LPASS L RIGLKK RLD++S+ AAR+ + ++RY G
Sbjct: 1 MDKKHLCAALLLWAI-TCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD DI+PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ H G+SC I Y +GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHRD------------GESCSIRYETGSISGHFSMDNVQVGDLVVKDQVFIEATRE 165
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 166 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGEL 225
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 226 VFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 285
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +V +INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++V
Sbjct: 286 TTIVAQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHV 345
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S+ IKTVVE+E S G++ +C+ACEMAVVW+QNQLKQ+ TKEKVL Y+N+LC+ +P+P
Sbjct: 346 SSNIKTVVERETEGSSVGEAPLCTACEMAVVWMQNQLKQEGTKEKVLEYVNQLCEKIPSP 405
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES IDC+ I +MP+++FTI DK F L+PEQYILKTGEG+A +C+SGF A D+PPPRGP
Sbjct: 406 MGESAIDCNNISSMPDITFTIKDKAFVLTPEQYILKTGEGVATICVSGFAALDVPPPRGP 465
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAE 504
LWILGDVFMG YHTVFD GK ++GFAE
Sbjct: 466 LWILGDVFMGPYHTVFDYGKSQVGFAE 492
>gi|114786427|gb|ABI78942.1| aspartic protease [Ipomoea batatas]
Length = 508
Score = 709 bits (1830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/515 (69%), Positives = 415/515 (80%), Gaps = 16/515 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K L + LWV+A C +LPASS L R+GLKK LD +S+ AA+ R + G
Sbjct: 1 MAWKYLCASILLWVIA-CSVLPASSEKLLRVGLKKNPLDFNSIKAAKAARVQGKCG---- 55
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
G ++LGDSD I+ LKN++DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 56 KGANNKLGDSDTGIVSLKNYLDAQYYGEISIGSPPQKFTVIFDTGSSNLWVPSSKCYFSI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IG SC I YGSGSISGF SQDNV VGD+VVKDQVFIE T+E
Sbjct: 116 ACYFHSKYKSSKSSTYTKIGTSCSITYGSGSISGFLSQDNVGVGDLVVKDQVFIETTKEP 175
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDG++GLGF+EI+V D VPVW NMVEQGLV E VFSFWLNRD +AEEGGE++
Sbjct: 176 SLTFVLAKFDGLLGLGFQEISVEDVVPVWYNMVEQGLVDEPVFSFWLNRDTNAEEGGELI 235
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP HFKGKHTYVPVT+KGYWQFE+GD LIGN STG CEGGCAAIVDSGTSLL GPT
Sbjct: 236 FGGVDPNHFKGKHTYVPVTQKGYWQFEMGDFLIGNSSTGFCEGGCAAIVDSGTSLLTGPT 295
Query: 301 --------PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
+VTEINHAIG EGVVS ECK +VSQYG++IWDLLVSG+ P++VC Q+GLC
Sbjct: 296 TIVTEINHAIVTEINHAIGAEGVVSTECKEIVSQYGNMIWDLLVSGVKPDEVCSQVGLCF 355
Query: 353 FNGAEYVSTGIKTVVEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 411
FNGA + I VVEK+N S +C+ACEMAVVW+QNQLKQK KEKV Y+N+LC
Sbjct: 356 FNGA--AGSNIGMVVEKDNEGKSSSDPMCTACEMAVVWMQNQLKQKVVKEKVFDYVNQLC 413
Query: 412 DSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDL 471
+ +P+PMGES IDC+ I MPNV+F I DK F L+PEQYILKTGEG+A +C+SGF+A D+
Sbjct: 414 EKIPSPMGESTIDCNSISNMPNVTFKIADKDFVLTPEQYILKTGEGVATICVSGFLAMDV 473
Query: 472 PPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
P PRGPLWILGDVFMGVYHTVFD G L+IGFAEAA
Sbjct: 474 PAPRGPLWILGDVFMGVYHTVFDYGNLQIGFAEAA 508
>gi|359487589|ref|XP_003633616.1| PREDICTED: aspartic proteinase-like [Vitis vinifera]
Length = 510
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/508 (65%), Positives = 415/508 (81%), Gaps = 5/508 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q L + FCLW L + LL ASS+GL RIGLKK RLD + + AAR+ R+ + +GG V
Sbjct: 3 MRQGYLWAAFCLWAL-TFPLLQASSDGLVRIGLKKWRLDYNRIRAARMARRAKSIGGV-V 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD + + L+N+MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI
Sbjct: 61 KSMYQGLGDSDGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSI 120
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FHS+Y SR S TY ++GK EI+YGSGSISG FSQDNV+VG + +K+QVFIEATRE
Sbjct: 121 ACLFHSKYNSRLSTTYIDLGKEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREA 180
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL F+L +FDGI+GLGF EI VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIV
Sbjct: 181 SLVFVLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIV 240
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HFKG+HTY +T+KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT
Sbjct: 241 FGGVDKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPT 300
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+VTEINHAIG EG+VS ECK VVSQYG++IWDLL+S + P+ VC QIGLC FNG++ S
Sbjct: 301 AIVTEINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQIES 360
Query: 361 TGIKTVVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
IKTVVE+E+ G+ C+ACEM V+W+QNQLKQ++TKE + SY+ ELC SLP+P
Sbjct: 361 PRIKTVVEEEDARGTKVGNEVWCTACEMTVIWIQNQLKQRKTKEIIFSYVTELCQSLPSP 420
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES++DC R+P MP+V+FTI DK F L+P++Y+LKTGEGI VC+SGF+A D+PPPRGP
Sbjct: 421 MGESVVDCGRVPYMPDVTFTIADKHFTLTPKEYVLKTGEGITTVCLSGFIALDVPPPRGP 480
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGD+FMGVYHTVFD G L++GFAEA
Sbjct: 481 LWILGDIFMGVYHTVFDYGNLQVGFAEA 508
>gi|356575293|ref|XP_003555776.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/509 (68%), Positives = 423/509 (83%), Gaps = 5/509 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M L VFCLW L +C LLP+ S GL RIGLKKR LDL S+ AAR+ R+ +G +
Sbjct: 1 MGHNYLWLVFCLWAL-TCSLLPSFSFGLLRIGLKKRDLDLDSIRAARMVRENLRLGRPVL 59
Query: 61 SGVRHRLGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+G +DE I+PLKN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 GANDQYIGKPTDEGIVPLKNYLDAQYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFS 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H YKS+KS TYT+ G SC+I YGSGSISGFFS+D+V+VGDVVVK+Q FIEATRE
Sbjct: 120 IACYTHHWYKSKKSKTYTKNGTSCKIRYGSGSISGFFSKDHVKVGDVVVKNQDFIEATRE 179
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL+F+LA+FDG++GLGF+EI+V +AVPVW NMV+Q LVSE+VFSFWLN DP + GGE+
Sbjct: 180 GSLSFVLAKFDGLLGLGFQEISVENAVPVWYNMVKQNLVSEQVFSFWLNGDPKVKNGGEL 239
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG+H YVPVTKKGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 240 VFGGVDPKHFKGEHIYVPVTKKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGP 299
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVTEINHAIG EGV+S ECK VVS+YG+L+WDLLVSG+ P+ VC Q+GLC F +
Sbjct: 300 TTVVTEINHAIGAEGVLSVECKEVVSEYGELLWDLLVSGVRPDDVCSQVGLC-FKRTKSE 358
Query: 360 STGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
S GI+ V EKE +S D+A+C++C+M VVW+QNQLKQK+TKE V +Y+N+LC+SLP+P
Sbjct: 359 SNGIEMVTEKEQRELSTKDTALCTSCQMLVVWIQNQLKQKKTKEIVFNYVNQLCESLPSP 418
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GES++DC+ I +PN++FT+GDK F L+PEQYILKTGEGIAEVC+SGF+AFD+PPPRGP
Sbjct: 419 NGESVVDCNSIYGLPNITFTVGDKPFTLTPEQYILKTGEGIAEVCLSGFIAFDIPPPRGP 478
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 479 LWILGDVFMRVYHTVFDYGNLRVGFAKAA 507
>gi|13897888|gb|AAK48494.1|AF259982_1 putative aspartic protease [Ipomoea batatas]
Length = 504
Score = 701 bits (1809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/511 (69%), Positives = 421/511 (82%), Gaps = 12/511 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPA--SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M +K L + F LW + C LPA S N L R+GLKKR LDL S+ AA+ R +GG
Sbjct: 1 MGRKYLCNAFLLWAVV-CTALPAAYSDNNLLRVGLKKRPLDLESIKAAKGAR----LGGK 55
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
GV +LGDSDE I+ L N++DAQY+GEI IGSPPQNF+VIFDTGSSNLWVPSSKCY
Sbjct: 56 YGKGVNKKLGDSDEGIVSLNNYLDAQYYGEISIGSPPQNFTVIFDTGSSNLWVPSSKCYL 115
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CYFHS+YKS KS+TYT+IGKSC I YGS SISGF SQD+V++GD++VKDQVFIE TR
Sbjct: 116 SIACYFHSKYKSSKSSTYTQIGKSCSITYGSVSISGFLSQDDVQLGDLLVKDQVFIETTR 175
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E SLTF++A+FDGI+GLGF+EI+V + VPVW +MVEQGLV E VFSFWLNRDP AE GGE
Sbjct: 176 EPSLTFIIAKFDGILGLGFQEISVENVVPVWYDMVEQGLVDEPVFSFWLNRDPKAEVGGE 235
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKHFKG+HTYVPVT+KGYWQ +LGD LIGN STG CEGGCA IVDSGTSLL G
Sbjct: 236 LVFGGVDPKHFKGEHTYVPVTQKGYWQIDLGDFLIGNSSTGYCEGGCAVIVDSGTSLLTG 295
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT VVTEIN+AIG EGVV AECK VVS+YG++IWDLLVSGL ++VC ++GLC NGA +
Sbjct: 296 PTAVVTEINYAIGPEGVVCAECKEVVSEYGEMIWDLLVSGLRADQVCSELGLCFLNGAWH 355
Query: 359 VSTGIKTVVEKE---NVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 415
S+ IKTVVEKE N+++ + +C+ CEMAV+W+QNQLKQK KEKV Y+++LC+ LP
Sbjct: 356 ESSIIKTVVEKEAEGNLTS--NPLCTTCEMAVIWLQNQLKQKGIKEKVFEYVDQLCEKLP 413
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
+P GES+IDC+ I +MPNV+F IGDK F L+PEQYILKTGEGIA VC+SGF+A D+PPP+
Sbjct: 414 SPDGESVIDCNSISSMPNVTFVIGDKDFVLTPEQYILKTGEGIAAVCVSGFLALDVPPPQ 473
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GPLWILGDVFMG YHTVFD G L++GFAEAA
Sbjct: 474 GPLWILGDVFMGAYHTVFDYGNLQVGFAEAA 504
>gi|357511709|ref|XP_003626143.1| Aspartic proteinase [Medicago truncatula]
gi|355501158|gb|AES82361.1| Aspartic proteinase [Medicago truncatula]
Length = 478
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/481 (67%), Positives = 403/481 (83%), Gaps = 5/481 (1%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+ RIGL+KR LDLH+++A ++ R+++ G + + H+ SD+ I+PLKN+MDAQYFG
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPMM-LAHK--SSDDAIVPLKNYMDAQYFG 57
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CY H+ YK++KS TY + G SC+I+Y
Sbjct: 58 EIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKNGTSCKISY 117
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GSISG+FSQDNV+VG VVK Q FIEATREGSL+FL +FDGI GLGF+EI+V A+P
Sbjct: 118 GTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQEISVERALP 177
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
VW NM+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E
Sbjct: 178 VWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIE 237
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQY
Sbjct: 238 MGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQY 297
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEM 385
G+LIWDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M
Sbjct: 298 GELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQM 357
Query: 386 AVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNL 445
V+WVQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L
Sbjct: 358 LVLWVQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVL 417
Query: 446 SPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+PEQYIL+TGEGI +VC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 418 TPEQYILRTGEGITQVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 477
Query: 506 A 506
A
Sbjct: 478 A 478
>gi|357131833|ref|XP_003567538.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 503
Score = 695 bits (1793), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/506 (63%), Positives = 409/506 (80%), Gaps = 3/506 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW L+ LLL ASS+G+ RI L K+RLD +L AA++ R++R + +G
Sbjct: 1 MGPRHLLWVTCLWTLSCALLLGASSDGVLRINLSKKRLDKEALTAAKLARQQRNVLRSGD 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R+ LG SD+DI+PL N++D QY+GEIG+G+PPQNF+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 61 GSYRY-LGVSDDDIVPLDNYLDTQYYGEIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 119
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H +YKS KS+TY + G++C I+YGSGSI+GFFS+D+V VGD+VVK+Q FIE TRE
Sbjct: 120 ACYLHHKYKSTKSSTYKKNGETCTISYGSGSIAGFFSEDSVLVGDLVVKNQKFIETTREA 179
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S +F++ +FDGI+GLGF EI+VG A PVW +M EQ L+++++FSFWLNRDPDA GGE+V
Sbjct: 180 SPSFIIGKFDGILGLGFPEISVGSAPPVWQSMQEQKLIAKDIFSFWLNRDPDAPTGGELV 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD KH+KGKHTYVPVT+KGYWQF++GD+LIG QSTG C GGCAAIVDSGTSLLAGPT
Sbjct: 240 FGGVDQKHYKGKHTYVPVTRKGYWQFDMGDLLIGGQSTGFCAGGCAAIVDSGTSLLAGPT 299
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+V ++NHAIG EG++S ECK VV +YG++I +LLV+ P+KVC QIGLC F+G + VS
Sbjct: 300 TIVAQVNHAIGAEGIISMECKEVVREYGEMILELLVAQTRPQKVCSQIGLCVFDGTKSVS 359
Query: 361 TGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
I++VVEKEN G +C+ACEMAVVW+QNQL+Q QTKE +L Y N+LC+ LP+P GE
Sbjct: 360 NQIESVVEKEN--RGSDLLCTACEMAVVWIQNQLRQNQTKELILQYANQLCERLPSPNGE 417
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFMAFD+PPPRGPLWI
Sbjct: 418 STVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQSGQTICISGFMAFDIPPPRGPLWI 477
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD G +IGFA++A
Sbjct: 478 LGDVFMGAYHTVFDFGDSKIGFAKSA 503
>gi|224056377|ref|XP_002298827.1| predicted protein [Populus trichocarpa]
gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/500 (65%), Positives = 404/500 (80%), Gaps = 13/500 (2%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG---VRHRLGDS 70
+++S L P ++GL RIGLKKR+ + ++ AA++ KE G + +R+ GD+
Sbjct: 1 MISSALSPP--NDGLIRIGLKKRKYERNNRLAAKLESKE----GESIKKYHLLRNLGGDA 54
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
D DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YK
Sbjct: 55 EDTDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYK 114
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S S TY E GKS EI+YG+G+ISGFFSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+F
Sbjct: 115 SSHSRTYKENGKSAEIHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKF 174
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF+EI+VG AVPVW NMVEQGLV E VFSFW NR+ D +EGGEIVFGGVDP H+
Sbjct: 175 DGILGLGFQEISVGKAVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHY 234
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG+HTYVPVT+KGYWQF++GD+LIG Q++G C GCAAI DSGTSLLAGPT ++TE+NHA
Sbjct: 235 KGEHTYVPVTQKGYWQFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHA 294
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK 369
IG GVVS ECK VV+QYGD I ++L++ P+K+C QIGLC F+G VS GI++VV +
Sbjct: 295 IGATGVVSQECKAVVAQYGDTIMEMLLAKDQPQKICAQIGLCTFDGTRGVSMGIESVVNE 354
Query: 370 ENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
A D A+CS CEMAVVW+QNQLKQ QT+E++L Y+NELC+ LP+PMGES +DCD
Sbjct: 355 HAQKASDGFHDAMCSTCEMAVVWMQNQLKQNQTQERILDYVNELCERLPSPMGESAVDCD 414
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+ +MPNVSFTIG ++F LSPEQY+LK GEG CISGF A D+PPPRGPLWILGDVFM
Sbjct: 415 GLSSMPNVSFTIGGRVFELSPEQYVLKVGEGDVAQCISGFTALDVPPPRGPLWILGDVFM 474
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G +HTVFD G +R+GFAEA
Sbjct: 475 GSFHTVFDYGNMRVGFAEAT 494
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/501 (66%), Positives = 402/501 (80%), Gaps = 13/501 (2%)
Query: 18 CLLL-----PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR--LGDS 70
CLLL S+ L RIGLKKR+ D + AA + KER A + R + L +S
Sbjct: 15 CLLLFPIVFSISNERLVRIGLKKRKFDQNYRLAAHLDSKEREAFRASLKKYRLQGNLQES 74
Query: 71 DE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
++ DI+ LKN++DAQYFGEIGIG+PPQNF+VIFDTGSSNLWVPSSKCYFSI+CY HSRYK
Sbjct: 75 EDIDIVALKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHSRYK 134
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S+TY GK +I YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+F
Sbjct: 135 SSRSSTYKANGKPADIQYGTGAISGFFSEDNVQVGDLVVKNQEFIEATREPSITFLVAKF 194
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF+EI+VG+AVPVW NMV QGLV E VFSFW NRDP+ + GGE+VFGG+DPKHF
Sbjct: 195 DGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRDPEDDIGGEVVFGGMDPKHF 254
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG HTYVP+T+KGYWQF++GD+LIGNQ+TG+C GGC+AI DSGTSL+ GPT ++ ++NHA
Sbjct: 255 KGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAIADSGTSLITGPTAIIAQVNHA 314
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK 369
IG GVVS ECK VVSQYG+ I D+L+S P K+C QIGLC F+G VSTGI++VV
Sbjct: 315 IGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIGLCTFDGTRGVSTGIESVVH- 373
Query: 370 ENV--SAGD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
ENV + GD A+CS CEM V+W+QNQLKQ QT+E++L YINELCD LP+PMGES +DC
Sbjct: 374 ENVGKATGDLHDAMCSTCEMTVIWMQNQLKQNQTQERILEYINELCDRLPSPMGESAVDC 433
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ TMPNVSFTIG KIF LSPEQY+LK GEG C+SGF A D+PPPRGPLWILGDVF
Sbjct: 434 SSLSTMPNVSFTIGGKIFELSPEQYVLKVGEGDVAQCLSGFTALDVPPPRGPLWILGDVF 493
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
MG +HTVFD G L++GFAEAA
Sbjct: 494 MGQFHTVFDYGNLQVGFAEAA 514
>gi|297848226|ref|XP_002891994.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
gi|297337836|gb|EFH68253.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 686 bits (1771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/507 (65%), Positives = 403/507 (79%), Gaps = 11/507 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
KL+ V C + LA LL P +S+ L +GLKKRRL+L + A+R+ RK ++
Sbjct: 7 RNKLIHQVICFYFLA-ILLHPTTSSDLFHVGLKKRRLELDDIRASRVIRKLKHSQRLTNY 65
Query: 62 GVRHRLG--DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
LG S++D + LKN++DAQY+G IGIG+P Q F VIFDTGSSNLWVPSSKCY S
Sbjct: 66 PSFATLGGDSSNQDQVILKNYLDAQYYGVIGIGTPSQEFEVIFDTGSSNLWVPSSKCYLS 125
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
++CY H +YKS KS TY + GK+C I YGSGSISGFFS+DNV+VGD+VVK+Q FIEATRE
Sbjct: 126 LACYLHPKYKSTKSKTYIKNGKTCTITYGSGSISGFFSEDNVKVGDLVVKNQEFIEATRE 185
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSLTFLLA+FDG++GLGF+EI+VG+AVPVW NMV+QGLV ++VFSFWLNRD +AE GGEI
Sbjct: 186 GSLTFLLAKFDGLLGLGFQEISVGNAVPVWYNMVDQGLVRDKVFSFWLNRDTEAEVGGEI 245
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDP HFKGKHTYVPVT+KGYWQF +GDI +G+ STG CE GC AI+DSGTSLLAGP
Sbjct: 246 VFGGVDPAHFKGKHTYVPVTRKGYWQFNMGDIFVGSNSTGFCEQGCDAIMDSGTSLLAGP 305
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T V+ +INHAIG EG+VSAECK VVSQYG++IW+LLV +LP +VC+++GLC F G E
Sbjct: 306 TTVIAQINHAIGAEGIVSAECKDVVSQYGEMIWNLLVKRVLPRQVCKELGLCVF-GQE-- 362
Query: 360 STGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMG 419
TGIKTVV+KE S +C CEMAVVWVQ +LK +TKEKV Y+N+LC+SLP+P G
Sbjct: 363 -TGIKTVVDKER----SSVLCEVCEMAVVWVQTKLKVNETKEKVFEYVNQLCESLPSPAG 417
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
ESIIDC+ I MP+V+FTIG F+LSP+QYILKTG G AE+CISGF AFDLPPP GPLW
Sbjct: 418 ESIIDCNNIKNMPSVTFTIGGNPFSLSPQQYILKTGVGNAEMCISGFSAFDLPPPTGPLW 477
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEAA 506
I+GDVFMG YHTVFDS L+IG AEA
Sbjct: 478 IIGDVFMGAYHTVFDSDNLQIGIAEAT 504
>gi|359483345|ref|XP_003632941.1| PREDICTED: aspartic proteinase isoform 2 [Vitis vinifera]
gi|359483347|ref|XP_002262915.2| PREDICTED: aspartic proteinase isoform 1 [Vitis vinifera]
Length = 514
Score = 684 bits (1764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/490 (67%), Positives = 401/490 (81%), Gaps = 6/490 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG-VRH-RLGDS-DEDILPLKN 79
A+++GL RIGLKK +LD + AAR+ KE A + RH LGDS D DI+ LKN
Sbjct: 25 ATTDGLFRIGLKKMKLDQNDQLAARLESKEGESLRASIRKYFRHGNLGDSQDTDIVGLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ CYFHS+YKS +S+TY +
Sbjct: 85 YMDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GKS +I+YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKSADIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++G++LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS E
Sbjct: 265 QKGYWQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGD 376
CK VV+QYG+ I DLL+S P+K+C QIGLC F+G V GI++VV+++N S
Sbjct: 325 CKTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGIESVVDEKNGDKSSGVH 384
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
A CSACEMAVVW+Q+QL+Q QTKE++L Y+NELCD LP+PMGES +DC ++ +MPNVS
Sbjct: 385 DAGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSPMGESAVDCLQLSSMPNVSL 444
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
TIG K+F+LS +Y+LK GEG A CISGF+A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 445 TIGGKVFDLSANEYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILGDVFMGRYHTVFDYG 504
Query: 497 KLRIGFAEAA 506
+R+GFAEAA
Sbjct: 505 NMRVGFAEAA 514
>gi|226497182|ref|NP_001152501.1| retrotransposon protein SINE subclass precursor [Zea mays]
gi|195624058|gb|ACG33859.1| retrotransposon protein SINE subclass [Zea mays]
gi|195656921|gb|ACG47928.1| retrotransposon protein SINE subclass [Zea mays]
gi|413946824|gb|AFW79473.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946825|gb|AFW79474.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
gi|413946826|gb|AFW79475.1| retrotransposon protein SINE subclass isoform 3 [Zea mays]
Length = 504
Score = 683 bits (1763), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/508 (62%), Positives = 403/508 (79%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS I++VVEK+ G C+ACEMAVVW+QNQL++ +TKE +L+Y N+LC+ LP+P
Sbjct: 359 VSNPIESVVEKQK--RGSDLFCTACEMAVVWIQNQLRENKTKELILNYANQLCERLPSPN 416
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFMAFD+PPPRGPL
Sbjct: 417 GESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFMAFDVPPPRGPL 476
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 477 WILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|413946821|gb|AFW79470.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946822|gb|AFW79471.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
Length = 545
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/508 (62%), Positives = 403/508 (79%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 42 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 100
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 101 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 159
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 160 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 219
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 220 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 279
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 280 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 339
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 340 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 399
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS I++VVEK+ G C+ACEMAVVW+QNQL++ +TKE +L+Y N+LC+ LP+P
Sbjct: 400 VSNPIESVVEKQK--RGSDLFCTACEMAVVWIQNQLRENKTKELILNYANQLCERLPSPN 457
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFMAFD+PPPRGPL
Sbjct: 458 GESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFMAFDVPPPRGPL 517
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 518 WILGDVFMGAYHTVFDFGENRIGFAKSA 545
>gi|255578112|ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 514
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/504 (64%), Positives = 410/504 (81%), Gaps = 7/504 (1%)
Query: 10 FCLWVLA-SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHR 66
FCL +L C +S++GL RIGLKKR+ D ++ AA+ KE A + +R
Sbjct: 11 FCLILLPLVCATASSSNDGLVRIGLKKRKFDQNNRVAAQFESKEGEAFRASIKKYHIRGN 70
Query: 67 LGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
LGD+++ DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFH
Sbjct: 71 LGDAEDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFH 130
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
S+YKS +S+TY + GKS +I+YG+G+ISGFFSQDNV+VG++V+K+Q FIEATRE S+TFL
Sbjct: 131 SKYKSGQSSTYKKNGKSADIHYGTGAISGFFSQDNVKVGELVIKNQEFIEATREPSITFL 190
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
+A+FDGI+GLGF+EI+VG+AVPVW NMV QGLV E VFSFW NR+ D +EGGEIVFGG+D
Sbjct: 191 VAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMD 250
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+KG+HTYVPVT+KGYWQF++GD+LI ++TG+C GCAAI DSGTSLLAGPT ++TE
Sbjct: 251 PNHYKGEHTYVPVTQKGYWQFDMGDVLIDGKTTGICSSGCAAIADSGTSLLAGPTTIITE 310
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
+NHAIG GVVS ECK VV+QYG+ I +L++ P+K+C QIGLC F+G+ VS GI++
Sbjct: 311 VNHAIGATGVVSQECKAVVAQYGETIIAMLLAKDQPQKICSQIGLCTFDGSRGVSMGIES 370
Query: 366 VVEK--ENVSAG-DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 422
VV + + V+ G A+CS CEMAVVW+QNQLKQ QT+E +L+Y+NELC+ LP+PMGES
Sbjct: 371 VVNEKIQEVAGGLHDAMCSTCEMAVVWMQNQLKQNQTQEHILNYVNELCERLPSPMGESA 430
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
+DC + TMPNVSFTIG ++F+L+PEQY+LK G+G A CISGF A D+PPPRGPLWILG
Sbjct: 431 VDCGSLSTMPNVSFTIGGRVFDLAPEQYVLKVGDGEAAQCISGFTALDVPPPRGPLWILG 490
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVFMG +HTVFD G R+GFAE A
Sbjct: 491 DVFMGPFHTVFDYGNKRVGFAEVA 514
>gi|194706186|gb|ACF87177.1| unknown [Zea mays]
Length = 504
Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/508 (62%), Positives = 402/508 (79%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVP T+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 LVFGGVDPKHYKGDHTYVPATRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS I++VVEK+ G C+ACEMAVVW+QNQL++ +TKE +L+Y N+LC+ LP+P
Sbjct: 359 VSNPIESVVEKQK--RGSDLFCTACEMAVVWIQNQLRENKTKELILNYANQLCERLPSPN 416
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFMAFD+PPPRGPL
Sbjct: 417 GESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFMAFDVPPPRGPL 476
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 477 WILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|219887925|gb|ACL54337.1| unknown [Zea mays]
Length = 504
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/508 (62%), Positives = 402/508 (79%), Gaps = 6/508 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 PVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS I++VVEK+ G C+ACEMAVVW+QNQL++ +TKE +L+Y N+LC+ LP+P
Sbjct: 359 VSNPIESVVEKQK--RGSDLFCTACEMAVVWIQNQLRENKTKELILNYANQLCERLPSPN 416
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES +DC +I MPN++FTI +K F L+PEQYI+K + +CISGFMAFD+PPPRGPL
Sbjct: 417 GESTVDCHQISKMPNLAFTIANKTFTLTPEQYIVKLEQAGQTICISGFMAFDVPPPRGPL 476
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD G+ RIGFA++A
Sbjct: 477 WILGDVFMGAYHTVFDFGENRIGFAKSA 504
>gi|449454758|ref|XP_004145121.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449472326|ref|XP_004153558.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 514
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/506 (64%), Positives = 404/506 (79%), Gaps = 7/506 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
+ CL++L S ++ + SN GL R+GLKK LD + AAR+ K+ + A
Sbjct: 9 AFLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYNPN 68
Query: 67 --LGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACH 128
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S+TY + G S I YG+G++SGFFS DNV+VGD+VVK+Q+FIEATRE LT
Sbjct: 129 FHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLT 188
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 189 FLVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGG 248
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKH+KGKHTYVPVT+KGYWQF++GD+LI + TG CEGGC+AI DSGTSLLAGPT +V
Sbjct: 249 VDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGCSAIADSGTSLLAGPTTIV 308
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG +GV+S ECK VV QYG I DLL+S P+K+C QI LC F+G VS GI
Sbjct: 309 TMINHAIGAKGVMSQECKAVVQQYGQTIMDLLLSEADPKKICSQIKLCTFDGTRGVSMGI 368
Query: 364 KTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
++VV++ + D +CS CEM VVW+QNQL+Q QTKE++++YINELCD +P+PMG+
Sbjct: 369 ESVVDENAGKSSDGLRDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCDRMPSPMGQ 428
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S +DC + +MP+VSFTIGDK+F+L+PE+YILK GEG A CISGF AFD+PPPRGPLWI
Sbjct: 429 SAVDCGTLSSMPSVSFTIGDKVFDLAPEEYILKVGEGAAAQCISGFTAFDIPPPRGPLWI 488
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD GKLR+GFAEAA
Sbjct: 489 LGDVFMGRYHTVFDFGKLRVGFAEAA 514
>gi|224115794|ref|XP_002317126.1| predicted protein [Populus trichocarpa]
gi|222860191|gb|EEE97738.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/503 (64%), Positives = 404/503 (80%), Gaps = 10/503 (1%)
Query: 13 WVLASCLL----LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+VL S LL L S++GL RIGLKK + D ++ AAR+ +E + LG
Sbjct: 11 FVLLSFLLFAVVLSESNDGLLRIGLKKVKFDKNNRIAARLDSQEALRASIRKYNLLGNLG 70
Query: 69 DS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
+S D DI+ LKN+ DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPSSKCY S++CYFHS+
Sbjct: 71 ESEDTDIVALKNYFDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACYFHSK 130
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S KS++Y + GKS EI YGSGSISGFFS D VEVG++VVKDQ FIEAT+E S+TFL+
Sbjct: 131 YNSGKSSSYKKNGKSAEIQYGSGSISGFFSIDAVEVGNLVVKDQEFIEATKEPSITFLVG 190
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EIAVG AVPVWDNM++QGL+ E VFSFWLNR+ D EEGGEIVFGG+DP
Sbjct: 191 KFDGILGLGFKEIAVGGAVPVWDNMIKQGLIKEPVFSFWLNRNADDEEGGEIVFGGMDPN 250
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KGKHTYVPVT+KGYWQF++GD+++G++STG C GGCAAI DSGTSLLAGPT ++T IN
Sbjct: 251 HYKGKHTYVPVTQKGYWQFDMGDVIVGDKSTGYCAGGCAAIADSGTSLLAGPTAIITMIN 310
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV 367
HAIG GVVS +CK VVSQYG++I DLL+S + P+K+C QIGLC F+G +S GI++VV
Sbjct: 311 HAIGASGVVSQQCKAVVSQYGEVIMDLLLSEVQPKKICSQIGLCTFDGTRGISMGIQSVV 370
Query: 368 EKENVSA----GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESII 423
++ N + GD A+CSACEMAV W+++QL+Q QT+++VL Y N+LC+ +PNP G+S +
Sbjct: 371 DEGNDKSSGVLGD-AMCSACEMAVFWMRSQLQQNQTQDRVLDYANQLCERVPNPTGQSTV 429
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC + +MP ++FTIG K F L+PE+YILK G+G A CISGF A D+PPPRGPLWILGD
Sbjct: 430 DCGSVLSMPRIAFTIGGKEFELAPEEYILKVGQGSAAQCISGFTALDIPPPRGPLWILGD 489
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
VFMG YHTVFDSGKLR+GFAEAA
Sbjct: 490 VFMGRYHTVFDSGKLRVGFAEAA 512
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/488 (65%), Positives = 396/488 (81%), Gaps = 6/488 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR--LGDSDE-DILPLKNFM 81
++GL RIGLKK +LD ++ AAR+ K+ A + R R LGDS+E DI+ LKN+M
Sbjct: 27 NDGLVRIGLKKMKLDPNNRLAARLDSKDGEALRAFIKKYRFRNNLGDSEETDIVALKNYM 86
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
DAQY+GEIGIG+P Q F+VIFDTGSSNLWV S+KCYFS++CYFH +YK+ S+TY + GK
Sbjct: 87 DAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTKCYFSVACYFHEKYKASDSSTYKKDGK 146
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G+ISGFFS D+V+VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI+
Sbjct: 147 PASIQYGTGAISGFFSYDHVQVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFKEIS 206
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
VGDAVPVW NM++QGL+ E VFSFWLNR+ D E GGEIVFGGVDP H+KGKHTYVPVT+K
Sbjct: 207 VGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEAGGEIVFGGVDPNHYKGKHTYVPVTQK 266
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQF++GD+LI ++ TG C G CAAI DSGTSLLAGP+ V+T INHAIG GVVS ECK
Sbjct: 267 GYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSLLAGPSTVITMINHAIGATGVVSQECK 326
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN-VSAG--DSA 378
VV QYG I DLL++ P+K+C QIGLC FNGA VSTGI++VV++ N S+G A
Sbjct: 327 AVVQQYGRTIIDLLIAEAQPQKICSQIGLCTFNGAHGVSTGIESVVDESNGKSSGVLRDA 386
Query: 379 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 438
+C ACEMAVVW+QNQ++Q QT++++LSY+NELCD +PNPMGES +DC + +MP +SFTI
Sbjct: 387 MCPACEMAVVWMQNQVRQNQTQDRILSYVNELCDRVPNPMGESAVDCGSLSSMPTISFTI 446
Query: 439 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 498
G K+F+L+PE+YILK GEG CISGF A D+PPPRGPLWILGD+FMG YHTVFD GKL
Sbjct: 447 GGKVFDLTPEEYILKVGEGSEAQCISGFTALDIPPPRGPLWILGDIFMGRYHTVFDFGKL 506
Query: 499 RIGFAEAA 506
R+GFAEAA
Sbjct: 507 RVGFAEAA 514
>gi|224118038|ref|XP_002331542.1| predicted protein [Populus trichocarpa]
gi|222873766|gb|EEF10897.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 674 bits (1740), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/489 (64%), Positives = 397/489 (81%), Gaps = 6/489 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFM 81
AS++GL RIGLKK +LD ++ AAR+ KE + LG+S D DI+ LKN++
Sbjct: 25 ASNDGLLRIGLKKVKLDKNNRIAARLDSKETLRASIRKYNLCGNLGESEDTDIVALKNYL 84
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D+QY+GEIG+GSPPQ F+VIFDTGSSNLWVPSSKCY S++CYFHS+Y S KS+TY + GK
Sbjct: 85 DSQYYGEIGVGSPPQKFTVIFDTGSSNLWVPSSKCYLSVACYFHSKYDSGKSSTYKKNGK 144
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
S EI YGSGSISGFFS D VEVG +VVKDQ FIEAT+E ++TFL+A+FDGI+GLGF+EI+
Sbjct: 145 SAEIRYGSGSISGFFSNDAVEVGGLVVKDQEFIEATKEPNITFLVAKFDGILGLGFKEIS 204
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
VGDAVPVWDNM++ GL+ E VFSFWLNR+ + EEGGEIVFGG+DP H+KGKHT+VPVT+K
Sbjct: 205 VGDAVPVWDNMIKHGLIKEPVFSFWLNRNAEDEEGGEIVFGGMDPNHYKGKHTFVPVTRK 264
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQF +GD+ IG++ TG C GCAAI DSGTSLLAGPT ++T IN AIG GVVS +CK
Sbjct: 265 GYWQFNMGDVHIGDKPTGYCASGCAAIADSGTSLLAGPTTIITMINQAIGASGVVSQQCK 324
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA----GDS 377
VVSQYG+ I DLL+S P+++C QIGLC F+G +S GI++VV++ N + GD
Sbjct: 325 AVVSQYGEAIMDLLLSQAQPKRICSQIGLCTFDGTRGISIGIQSVVDEGNDKSSGFLGD- 383
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
A+C ACEMAVVW+++QLKQ QT++++L Y+N+LC+ +PNPMGES +DC+ +P+MP V+FT
Sbjct: 384 AMCPACEMAVVWMRSQLKQNQTQDRILDYVNQLCERMPNPMGESAVDCESVPSMPTVAFT 443
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
IG K F L+PE+YILK G+G A CISGF A D+PPPRGPLWILGD+FMG YHTVFDSGK
Sbjct: 444 IGGKEFELAPEEYILKVGQGSAAQCISGFTALDIPPPRGPLWILGDIFMGRYHTVFDSGK 503
Query: 498 LRIGFAEAA 506
LR+GFAEAA
Sbjct: 504 LRVGFAEAA 512
>gi|115461973|ref|NP_001054586.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|78099760|sp|P42211.2|ASPRX_ORYSJ RecName: Full=Aspartic proteinase; Flags: Precursor
gi|46485798|gb|AAS98423.1| aspartic proteinase [Oryza sativa Japonica Group]
gi|113578137|dbj|BAF16500.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|215694423|dbj|BAG89416.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 496
Score = 671 bits (1731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/505 (60%), Positives = 400/505 (79%), Gaps = 11/505 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVST 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G VS
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
GI++VV+KEN+ G A+CS CEMAVVW++NQL++ +TKE +L+Y N+LC+ LP+P GES
Sbjct: 354 GIESVVDKENL--GSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGES 411
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+ C +I MPN++FTI +K F L+PEQYI+K +G VCISGFMAFD+PPPRGPLWIL
Sbjct: 412 TVSCHQISKMPNLAFTIANKTFILTPEQYIVKLEQGGQTVCISGFMAFDIPPPRGPLWIL 471
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHTVFD GK RIGFA++A
Sbjct: 472 GDVFMGAYHTVFDFGKDRIGFAKSA 496
>gi|218143|dbj|BAA02242.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 496
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/505 (60%), Positives = 399/505 (79%), Gaps = 11/505 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDQVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVST 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G VS
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
GI++VV+KEN+ G A+CS CEMAVVW++NQL++ +TKE +L+Y N+LC+ LP+P GES
Sbjct: 354 GIESVVDKENL--GSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGES 411
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+ C +I MPN++FTI +K F L+PEQYI+K +G VCISGFMAFD+PPPRGPLWIL
Sbjct: 412 TVSCHQISKMPNLAFTIANKTFILTPEQYIVKLEQGGQTVCISGFMAFDIPPPRGPLWIL 471
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHTVFD GK RIGFA++A
Sbjct: 472 GDVFMGAYHTVFDFGKDRIGFAKSA 496
>gi|255554815|ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 511
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/487 (64%), Positives = 390/487 (80%), Gaps = 3/487 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMD 82
A ++GL R+GLKK +LD +S AAR+ K A V R D DI+ LKN++D
Sbjct: 25 APNDGLVRLGLKKMKLDENSRLAARLESKNAEALRASVRKYGLRGDSKDTDIVALKNYLD 84
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEIGIG+PPQ F+V+FDTGSSNLWVPSSKC FS++C+FHSRYKS +S+TY + GKS
Sbjct: 85 AQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSSKCIFSVACFFHSRYKSGQSSTYKKNGKS 144
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
EI+YGSG+ISGFFS DNV VG++VVKDQ FIEAT+E +TF+ A+FDGI+GLGF+EI+V
Sbjct: 145 AEIHYGSGAISGFFSSDNVVVGNLVVKDQEFIEATKEPGVTFVAAKFDGILGLGFQEISV 204
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+AVPVW NM++QGL+ E VFSFWLNR+ EEGGEIVFGGVD H+KGKHTYVPVT+KG
Sbjct: 205 GNAVPVWYNMIKQGLIKEPVFSFWLNRNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQKG 264
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQFE+GD+LIG++ T C GGC+AI DSGTSLLAGPT VVT IN AIG GV S ECK
Sbjct: 265 YWQFEMGDVLIGHKPTEYCAGGCSAIADSGTSLLAGPTTVVTLINEAIGATGVASQECKT 324
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---DSAV 379
V++QYG+ I DLL++ P+K+C QIGLC F+G VS GI++VV+ N + A+
Sbjct: 325 VIAQYGETIMDLLIAEAQPKKICSQIGLCTFDGTRGVSMGIQSVVDDNNDKSSGIVRDAM 384
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
CSACEM VVW+QNQL++ QT++++L+Y+NELCD +PNP+GESI+DC I +MP VSFTIG
Sbjct: 385 CSACEMTVVWMQNQLRENQTQDRILNYVNELCDRIPNPLGESIVDCGSISSMPVVSFTIG 444
Query: 440 DKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 499
K+F+LSP++YILK GEG CISGFMA D+PPPRGPLWILGD+FMG YHTVFD G LR
Sbjct: 445 GKVFDLSPQEYILKVGEGAQAQCISGFMALDVPPPRGPLWILGDIFMGRYHTVFDYGNLR 504
Query: 500 IGFAEAA 506
+GFAEAA
Sbjct: 505 VGFAEAA 511
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/492 (63%), Positives = 395/492 (80%), Gaps = 20/492 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGV---------SGVRHRLGDSDE-DILPL 77
L R+GLKKR+LD +I R + G G G+ + LG+SD+ DI+ L
Sbjct: 30 LLRVGLKKRKLD-------QINRLSSHYGCKGKGSTSPSIWKHGLGNGLGNSDDADIISL 82
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+MDAQYFGEIGIGSPPQ F+VIFDTGSSNLWVPS+KCYFSI+CY H +YKS KS+TY
Sbjct: 83 KNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHPKYKSFKSSTYA 142
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ GKS I+YG+G+ISGFFSQD+V++GD+VV++Q FIEAT+E S+TF+ A+FDGI+GLGF
Sbjct: 143 KNGKSAAIHYGTGAISGFFSQDHVKMGDLVVENQDFIEATKEPSITFVAAKFDGILGLGF 202
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VGDAVP W NM++QGLV+E VFSFWLNR + EEGGEIVFGGVDP H+KG+HTYVP
Sbjct: 203 QEISVGDAVPAWYNMIDQGLVNEPVFSFWLNRKSEEEEGGEIVFGGVDPNHYKGEHTYVP 262
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++ D+L+G ++TG C GGC+AI DSGTSLLAGPT ++ +INHAIG G+VS
Sbjct: 263 VTRKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIIVQINHAIGASGLVS 322
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD- 376
ECK VVSQYG I D LV+ P+K+C QIGLC F+G VS GI++VVEK ++ D
Sbjct: 323 QECKAVVSQYGKAILDALVAEAQPQKICSQIGLCTFDGKRGVSMGIESVVEKNPGNSSDG 382
Query: 377 --SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
A+C+ACEMAVVW+QNQL+Q +T+E++L+Y+NELC+ LP+PMGES +DC + +MPNV
Sbjct: 383 LQDAMCTACEMAVVWMQNQLRQNRTEEQILNYVNELCNRLPSPMGESSVDCGSLSSMPNV 442
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
S TIG K+F+LSPE+Y+LK GEG+A CISGF+A D+ PPRGPLWILGD+FMG YHTVFD
Sbjct: 443 SLTIGGKVFDLSPEKYVLKVGEGVAAQCISGFIALDIAPPRGPLWILGDIFMGQYHTVFD 502
Query: 495 SGKLRIGFAEAA 506
G L +GFAEAA
Sbjct: 503 YGNLSVGFAEAA 514
>gi|1326165|gb|AAB03108.1| aspartic protease [Brassica napus]
Length = 506
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/509 (62%), Positives = 402/509 (78%), Gaps = 17/509 (3%)
Query: 7 RSVFCLWVLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSG 62
++V +++ L L AS+ +G R+GLKK + D S AA + K+ + + G G
Sbjct: 6 KTVALSLIVSFLLFLSASAERNDGTFRVGLKKLKFDPRSRIAAPVGSKQLKPLRGYG--- 62
Query: 63 VRHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
LGDS D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+
Sbjct: 63 ----LGDSGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIA 118
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C FHS+YKS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E
Sbjct: 119 CLFHSKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPG 178
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF+LA+FDGI+GLGF+EI+VG+A PVW NM++QGL+ E VFSFWLNR+ + EEGGE+VF
Sbjct: 179 ITFVLAKFDGILGLGFQEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDEEGGELVF 238
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVDP HFKG+HTYVPVT+KGYWQF++GD+LIG TG CE GC+AI DSGTSLLAGPT
Sbjct: 239 GGVDPNHFKGEHTYVPVTQKGYWQFDMGDVLIGGAPTGYCESGCSAIADSGTSLLAGPTT 298
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVST 361
V+T INHAIG GVVS +CK+VV QYG I DLL+S P+K+C QIGLC F+G VS
Sbjct: 299 VITMINHAIGAAGVVSQQCKIVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGKRGVSM 358
Query: 362 GIKTVVEKENVSA----GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GI++VV+KEN + GD+A CSACEMAVVW+Q+QL+Q T+E++L YIN+LC+ LP+P
Sbjct: 359 GIESVVDKENAKSSSGVGDAA-CSACEMAVVWIQSQLRQNMTQERILDYINDLCERLPSP 417
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG A CISGF+A D+ PPRGP
Sbjct: 418 MGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPAAQCISGFIALDVAPPRGP 477
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 478 LWILGDVFMGKYHTVFDFGKEQVGFAEAA 506
>gi|122890420|emb|CAM12780.1| aspartic proteinase [Fagopyrum esculentum]
Length = 506
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/493 (65%), Positives = 394/493 (79%), Gaps = 6/493 (1%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
S + L ++N L R+GLKKR+LD + A+R K+ M G+ + GD D I+
Sbjct: 17 SPIALSVANNDLVRVGLKKRKLDPTNRPASRFGCKKHLMQKYGLG---NGFGDDDTGIIS 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS KCY SI+C+FHS+YKS KS+TY
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSSTY 133
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GKS EI+YG+G+ISG+FSQDNV+VGD+VV++Q FIEATRE SLTF+ A+FDGI+GLG
Sbjct: 134 VKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILGLG 193
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG AVPVW NMV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+
Sbjct: 194 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEVGGEIVFGGIDPAHHKGEHTYL 253
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF+L D+L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVV
Sbjct: 254 PVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVV 313
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAG 375
S ECK VVSQYG I DLLVS P K+C QIGLC F+G VS GI++VV+K + S+G
Sbjct: 314 SQECKTVVSQYGKQILDLLVSQTQPRKICSQIGLCTFDGTRGVSMGIESVVDKNVDKSSG 373
Query: 376 D--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPN 433
+ A CSACEMAVVW+QNQLKQ QT++++L Y N+LC+ LP+PMGES +DC + T+P
Sbjct: 374 NLKDATCSACEMAVVWMQNQLKQNQTEDRILDYANQLCERLPSPMGESAVDCGSLSTLPT 433
Query: 434 VSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVF 493
VSFT+G K F L+PEQYIL+ GEG A CISGF+A D+PPPRGPLWILGD+FMG YHTVF
Sbjct: 434 VSFTLGGKTFALAPEQYILQVGEGPATQCISGFIALDVPPPRGPLWILGDIFMGQYHTVF 493
Query: 494 DSGKLRIGFAEAA 506
D G +++GFAEAA
Sbjct: 494 DHGNMQVGFAEAA 506
>gi|222630120|gb|EEE62252.1| hypothetical protein OsJ_17039 [Oryza sativa Japonica Group]
Length = 501
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/513 (59%), Positives = 398/513 (77%), Gaps = 22/513 (4%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVST 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G VS
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
GI++VV+KEN+ G A+CS CEMAVVW++NQL++ +TKE +L+Y N+LC+ LP+P GES
Sbjct: 354 GIESVVDKENL--GSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGES 411
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQ--------YILKTGEGIAEVCISGFMAFDLPP 473
+ C +I MPN++FTI +K F L+PEQ Y K G+ VCISGFMAFD+PP
Sbjct: 412 TVSCHQISKMPNLAFTIANKTFILTPEQDPDAFEVVYYFKRGQ---TVCISGFMAFDIPP 468
Query: 474 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PRGPLWILGDVFMG YHTVFD GK RIGFA++A
Sbjct: 469 PRGPLWILGDVFMGAYHTVFDFGKDRIGFAKSA 501
>gi|312282703|dbj|BAJ34217.1| unnamed protein product [Thellungiella halophila]
Length = 506
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/507 (62%), Positives = 396/507 (78%), Gaps = 15/507 (2%)
Query: 7 RSVFCLWVLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
R+V +++ L ASS +G R+GLKK +LD + AARI+ ++ A
Sbjct: 6 RTVAVSLIVSFLLFFSASSERNDGTVRVGLKKLKLDPKNRLAARISSEQEKPLRA----- 60
Query: 64 RHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
LGDS D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+C
Sbjct: 61 -FSLGDSGDADIVALKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIAC 119
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H +YKS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +
Sbjct: 120 LLHPKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGI 179
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+LA+FDGI+GLGF+EI+VG+A PVW NM++QGL+ E VFSFWLNR+ + +EGGE+VFG
Sbjct: 180 TFVLAKFDGILGLGFKEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDDEGGELVFG 239
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP HFKGKHTYVPVT+KGYWQF++GD+LIGN TG CE GC+AI DSGTSLLAGPT +
Sbjct: 240 GVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGNAPTGFCESGCSAIADSGTSLLAGPTTI 299
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTG 362
+T INHAIG GVVS +CK VV QYG I +LL+S P+K+C QIGLC FNG VS G
Sbjct: 300 ITMINHAIGAAGVVSQQCKTVVDQYGRTILELLLSETQPKKICSQIGLCTFNGKRGVSMG 359
Query: 363 IKTVVEKENVS----AGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
I++VV+KEN GD+A CSACEMAVVW+Q+QL+Q T+E++L Y NELC+ LP+PM
Sbjct: 360 IESVVDKENAKLSNGVGDAA-CSACEMAVVWIQSQLRQNMTQERILDYANELCERLPSPM 418
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG A CISGF+A D+ PPRGPL
Sbjct: 419 GESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPAAQCISGFIALDVAPPRGPL 478
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVFMG YHTVFD GK ++GFAEA
Sbjct: 479 WILGDVFMGKYHTVFDFGKEQVGFAEA 505
>gi|77808107|gb|AAV84085.2| aspartic proteinase 9 [Fagopyrum esculentum]
Length = 506
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/493 (65%), Positives = 394/493 (79%), Gaps = 6/493 (1%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
S + L ++N L R+GLKKR+LD + A+R K+ M G+ + GD D I+
Sbjct: 17 SPISLSVANNDLVRVGLKKRKLDPTNRPASRFGCKKHLMQKYGLG---NGFGDDDTGIIS 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS KCY SI+C+FHS+YKS KS+TY
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSSTY 133
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GKS EI+YG+G+ISG+FSQDNV+VGD+VV++Q FIEATRE SLTF+ A+FDGI+GLG
Sbjct: 134 VKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILGLG 193
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG AVPVW NMV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+
Sbjct: 194 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEIGGEIVFGGIDPAHHKGEHTYL 253
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF+L D+L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVV
Sbjct: 254 PVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVV 313
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAG 375
S ECK VVSQYG I DLLVS P K+C QIGLC F+G VS GI++VV+K + S+G
Sbjct: 314 SQECKTVVSQYGKQILDLLVSQTQPRKICSQIGLCTFDGTRGVSMGIESVVDKNVDKSSG 373
Query: 376 D--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPN 433
+ A CSACEMAVVW+QNQLKQ QT++++L Y N+LC+ LP+PMGES +DC + T+P
Sbjct: 374 NLKDATCSACEMAVVWMQNQLKQNQTEDRILDYANQLCERLPSPMGESAVDCGSLSTLPT 433
Query: 434 VSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVF 493
VSFT+G K F L+PEQYIL+ GEG A CISGF+A D+PPPRGPLWILGD+FMG YHTVF
Sbjct: 434 VSFTLGGKTFALAPEQYILQVGEGPATQCISGFIALDVPPPRGPLWILGDIFMGQYHTVF 493
Query: 494 DSGKLRIGFAEAA 506
D G +++GFAEAA
Sbjct: 494 DHGNMQVGFAEAA 506
>gi|1030715|dbj|BAA06876.1| aspartic protease [Oryza sativa]
gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa]
Length = 509
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/494 (64%), Positives = 390/494 (78%), Gaps = 14/494 (2%)
Query: 22 PASSN-GLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDSDEDILPLK 78
PAS+ GL RI LKKR +D +S AAR++ +E R +G G + + + DI+ LK
Sbjct: 21 PASAEEGLVRIALKKRPIDENSRVAARLSGEEGARRLGLRGANSLGGGG--GEGDIVALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C+FHSRYKS +S+TY +
Sbjct: 79 NYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GK I YG+GSI+GFFS+D+V VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+
Sbjct: 139 NGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VGDAVPVW MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV
Sbjct: 199 EISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
++KGYWQFE+GD+LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 259 SQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS- 377
ECK VVSQYG I DLL++ P K+C Q+GLC F+G VS GIK+VV+ E AG+S
Sbjct: 319 ECKTVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDE---AGESN 375
Query: 378 -----AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
+C+ACEMAVVW+QNQL Q +T++ +L+YIN+LCD LP+PMGES +DC + +MP
Sbjct: 376 GLQSGPMCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMP 435
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
+SFTIG K F L PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTV
Sbjct: 436 EISFTIGAKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTV 495
Query: 493 FDSGKLRIGFAEAA 506
FD GK+R+GFA++A
Sbjct: 496 FDYGKMRVGFAKSA 509
>gi|115465497|ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor
gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica
Group]
gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group]
gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group]
gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group]
Length = 509
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/494 (64%), Positives = 390/494 (78%), Gaps = 14/494 (2%)
Query: 22 PASS-NGLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDSDEDILPLK 78
PAS+ GL RI LKKR +D +S AAR++ +E R +G G + + + DI+ LK
Sbjct: 21 PASAAEGLVRIALKKRPIDENSRVAARLSGEEGARRLGLRGANSLGGGG--GEGDIVALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C+FHSRYKS +S+TY +
Sbjct: 79 NYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GK I YG+GSI+GFFS+D+V VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+
Sbjct: 139 NGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VGDAVPVW MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV
Sbjct: 199 EISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
++KGYWQFE+GD+LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 259 SQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS- 377
ECK VVSQYG I DLL++ P K+C Q+GLC F+G VS GIK+VV+ E AG+S
Sbjct: 319 ECKTVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDE---AGESN 375
Query: 378 -----AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
+C+ACEMAVVW+QNQL Q +T++ +L+YIN+LCD LP+PMGES +DC + +MP
Sbjct: 376 GLQSGPMCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMP 435
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
+SFTIG K F L PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTV
Sbjct: 436 EISFTIGGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTV 495
Query: 493 FDSGKLRIGFAEAA 506
FD GK+R+GFA++A
Sbjct: 496 FDYGKMRVGFAKSA 509
>gi|356532081|ref|XP_003534602.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/505 (61%), Positives = 391/505 (77%), Gaps = 17/505 (3%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-- 67
FCLW L L+ A ++GL RIGLKK +L+ H + + R S +H L
Sbjct: 12 FCLWTLLFSLVFCAPNDGLGRIGLKKVKLNTHDVEGLKEFRS---------SIRKHHLQN 62
Query: 68 ---GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
G + D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFSI+C+
Sbjct: 63 ILGGAEETDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACFM 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H+RY+S +S+TY E G S I YG+G+ISGFFS D+V+VGD+VVKDQ FIEATRE +TF
Sbjct: 123 HARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIEATREPGVTF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLGF++I+VG AVPVW +MVEQGLV + VFSFWLNR P+ E GGE+VFGG
Sbjct: 183 VAAKFDGILGLGFQDISVGYAVPVWYSMVEQGLVKDPVFSFWLNRKPEEENGGELVFGGA 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+KGKHTYVPVT+KGYWQF++GD+LI + TG C C+AI DSGTSLLAGPT VVT
Sbjct: 243 DPAHYKGKHTYVPVTRKGYWQFDMGDVLIAGKPTGYCADDCSAIADSGTSLLAGPTTVVT 302
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
IN AIG GVVS EC+ VV+QYG I +LL++ P+K+C QIGLC F+G VS GI+
Sbjct: 303 MINQAIGASGVVSKECRSVVNQYGQTILELLLAEAKPKKICSQIGLCTFDGTHGVSMGIE 362
Query: 365 TVVEK-ENVSAGD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
+VV+K E S+G A CSACEMAV+W+QNQL+Q QT+++++ Y NELCD LPNPMG+S
Sbjct: 363 SVVDKNERKSSGSIRDAGCSACEMAVIWMQNQLRQNQTEDRIIDYANELCDKLPNPMGQS 422
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+DC+++ +MP VSFTIG K+F+LSP++YILK GEG CISGF A D+PPPRGPLWIL
Sbjct: 423 SVDCEKLSSMPIVSFTIGGKVFDLSPQEYILKVGEGPEAQCISGFTALDVPPPRGPLWIL 482
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 483 GDVFMGRYHTIFDYGKLRVGFAEAA 507
>gi|218188020|gb|EEC70447.1| hypothetical protein OsI_01478 [Oryza sativa Indica Group]
Length = 495
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/506 (62%), Positives = 392/506 (77%), Gaps = 11/506 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW+L+ +LL AS +GL RI L K+RLD +L+ A++ R+E +
Sbjct: 1 MGRNHLCLVTCLWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESH------ 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R R +DI+PL N++D QYFGEIGIG+PPQNF+VIFDTGSSNLWVPS KCYFSI
Sbjct: 55 ---RLRADGLGDDIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H RYKS+ S++Y + G+SC I+YGSGSI+GFFS+D+V VGD+ VK+Q+FIE TRE
Sbjct: 112 ACYLHHRYKSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF++ +FDGI+GLGF EI+VG A P+W M EQ L+ ++VFSFWLNRDPDA GGE++
Sbjct: 172 SLTFIIGKFDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELI 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG HTYVPVT+KGYWQFE+GD+LI + STG C GGCAAI DSGTSLL GPT
Sbjct: 232 FGGVDPNHYKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPT 291
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+V +INHAIG EG+VS ECK VV YGD+I ++L++ P K+C QIGLCAF+G V
Sbjct: 292 TIVAQINHAIGAEGIVSMECKQVVRDYGDMILEMLIAQASPMKLCSQIGLCAFDGTRSVR 351
Query: 361 TGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
I++VV+KE V G C+ACEMAVVW+QNQL+ QT+E +L Y ++LC+ LP+P GE
Sbjct: 352 NNIESVVDKEKV--GSDLSCTACEMAVVWIQNQLRHNQTRELILQYADQLCERLPSPNGE 409
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S +DCD I MPN+SFTI +K F L+PEQY++K + VCISGFMAFD+PPPRGPLWI
Sbjct: 410 SAVDCDEISNMPNLSFTIANKTFTLTPEQYVVKLEQQGQTVCISGFMAFDVPPPRGPLWI 469
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 470 LGDVFMGAYHTVFDFGKNRIGFAESA 495
>gi|296089849|emb|CBI39668.3| unnamed protein product [Vitis vinifera]
Length = 430
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 296/428 (69%), Positives = 363/428 (84%), Gaps = 3/428 (0%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI+C FHS+Y SR S TY ++G
Sbjct: 1 MDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSIACLFHSKYNSRLSTTYIDLG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K EI+YGSGSISG FSQDNV+VG + +K+QVFIEATRE SL F+L +FDGI+GLGF EI
Sbjct: 61 KEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREASLVFVLGKFDGILGLGFEEI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIVFGGVD +HFKG+HTY +T+
Sbjct: 121 VVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQ 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT +VTEINHAIG EG+VS EC
Sbjct: 181 KGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDS 377
K VVSQYG++IWDLL+S + P+ VC QIGLC FNG++ S IKTVVE+E+ G+
Sbjct: 241 KEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQIESPRIKTVVEEEDARGTKVGNE 300
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
C+ACEM V+W+QNQLKQ++TKE + SY+ ELC SLP+PMGES++DC R+P MP+V+FT
Sbjct: 301 VWCTACEMTVIWIQNQLKQRKTKEIIFSYVTELCQSLPSPMGESVVDCGRVPYMPDVTFT 360
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
I DK F L+P++Y+LKTGEGI VC+SGF+A D+PPPRGPLWILGD+FMGVYHTVFD G
Sbjct: 361 IADKHFTLTPKEYVLKTGEGITTVCLSGFIALDVPPPRGPLWILGDIFMGVYHTVFDYGN 420
Query: 498 LRIGFAEA 505
L++GFAEA
Sbjct: 421 LQVGFAEA 428
>gi|384040313|gb|AFH58568.1| aspartic acid protease [Ananas comosus]
Length = 514
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/512 (61%), Positives = 399/512 (77%), Gaps = 16/512 (3%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGG 57
L L VL +L AS++GL RIGLKKR +D ++ AAR+ KE RY
Sbjct: 8 LAVAILLSVLLHQSILLASADGLVRIGLKKRPIDENNRIAARLVEKEEGPLLAARRY--- 64
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
G+ G + G+ + DI+ LKN+M+AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCY
Sbjct: 65 -GLRGAPLKEGE-ETDIIALKNYMNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 122
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
FSI+C FH++YKS +S++Y + GKS I+YG+G+ISGFFS D+V+VGD+VVK Q FIEAT
Sbjct: 123 FSIACLFHTKYKSGRSSSYHKNGKSASIHYGTGAISGFFSTDHVKVGDLVVKTQDFIEAT 182
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
+E S+TF++A+FDGI+GLGF+EI+VG+AVPVW NMV+QGL+ E VFSFW NR+ + EGG
Sbjct: 183 KEPSVTFVVAKFDGILGLGFQEISVGNAVPVWYNMVDQGLIKEPVFSFWFNRNANDGEGG 242
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
EIVFGG DP H+KG HTYVPVT+KGYWQFE+GD+L+G QSTG C GGCAAI DSGTSLLA
Sbjct: 243 EIVFGGADPNHYKGNHTYVPVTQKGYWQFEMGDVLVGGQSTGFCNGGCAAIADSGTSLLA 302
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPT ++ EIN IG GVVS ECK VV++YG I +L++ + P K+C IGLC F+G +
Sbjct: 303 GPTTIIAEINQKIGASGVVSQECKAVVAEYGQQILQMLLAEVQPGKICSSIGLCTFDGKQ 362
Query: 358 YVSTGIKTVVEKEN--VSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
VS GI++VV K+ +AG S A+C+ CEMAVVW+QNQ+ Q QT+E + +Y+N+LC+ L
Sbjct: 363 GVSAGIESVVNKDTRRSAAGLSDAMCNVCEMAVVWMQNQISQNQTQELIFNYLNQLCEKL 422
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P+PMGES +DC + +MP++SFTIG K F+L PEQYIL+ GEG A CISGF A D+PPP
Sbjct: 423 PSPMGESSVDCSSVASMPDISFTIGGKKFSLKPEQYILQVGEGYAAQCISGFTALDVPPP 482
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
RGPLWILGDVFMG YHTVFD G +R+GFA+AA
Sbjct: 483 RGPLWILGDVFMGAYHTVFDYGNMRVGFADAA 514
>gi|223946977|gb|ACN27572.1| unknown [Zea mays]
gi|238014788|gb|ACR38429.1| unknown [Zea mays]
gi|413946556|gb|AFW79205.1| aspartic proteinase oryzasin-1 isoform 1 [Zea mays]
gi|413946557|gb|AFW79206.1| aspartic proteinase oryzasin-1 isoform 2 [Zea mays]
Length = 510
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/490 (63%), Positives = 383/490 (78%), Gaps = 10/490 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKER---YMGGAGVSGVRHRLGDSDEDILPLKNF 80
SS GL R+ LKK +D + AAR++ +ER + GA G GD D D++ LKN+
Sbjct: 24 SSEGLVRVALKKLPVDQNGRVAARLSAEERQRLLLRGANALGSG---GDDDSDVIALKNY 80
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
M+AQYFGEIG+GSP Q F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + G
Sbjct: 81 MNAQYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNG 140
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI
Sbjct: 141 KPAAIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEI 200
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+VG+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+
Sbjct: 201 SVGNATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTR 260
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS EC
Sbjct: 261 KGYWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQEC 320
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGD 376
K VVSQYG I DLL++ P K+C Q+GLC F+G VS GI++VV+ K N
Sbjct: 321 KTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKS 380
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
+C+ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++F
Sbjct: 381 DPMCNACEMAVVWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAF 440
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
TIG K F L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD G
Sbjct: 441 TIGGKKFKLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYG 500
Query: 497 KLRIGFAEAA 506
KLR+GFAE+A
Sbjct: 501 KLRVGFAESA 510
>gi|357132502|ref|XP_003567869.1| PREDICTED: phytepsin-like [Brachypodium distachyon]
Length = 505
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/507 (61%), Positives = 396/507 (78%), Gaps = 13/507 (2%)
Query: 7 RSVFCLW--VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSGV 63
R V L+ VL LL + + GL RI LKKR +D ++ A R++ +E +++GGA
Sbjct: 5 RVVLVLFAAVLLQALLPASEAEGLVRIALKKRPIDQNNRVATRLSGEEGQHLGGA----- 59
Query: 64 RHRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ LG DE DI+ L+N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C
Sbjct: 60 -NSLGSEDEGDIVALQNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIAC 118
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
YFHSRYK+ +S+TY + GK I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +
Sbjct: 119 YFHSRYKAGQSSTYKKNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGV 178
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF++A+FDGI+GLGF+EI+VG AVPVW M+EQGL+S+ VFSFW NR EGGEIVFG
Sbjct: 179 TFMVAKFDGILGLGFQEISVGKAVPVWYKMIEQGLISDPVFSFWFNRHAGEGEGGEIVFG 238
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G+DPKH+ G+HTYVPVT+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT +
Sbjct: 239 GMDPKHYIGEHTYVPVTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAI 298
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTG 362
+TEIN IG GVVS ECK VVSQYG I DLL++ P+K+C Q+GLC F+G VS G
Sbjct: 299 ITEINEKIGAAGVVSQECKTVVSQYGQQILDLLLAETQPKKICSQVGLCTFDGTRGVSAG 358
Query: 363 IKTVVEKENVSAG---DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMG 419
I++VV+ E + +C+ACEMAVVW+QNQL Q +T++ +L+YIN+LCD LP+PMG
Sbjct: 359 IRSVVDDEAEKSNGLHSDPMCNACEMAVVWMQNQLSQNKTQDVILNYINQLCDRLPSPMG 418
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
ES +DC + +MP + FTIG K F L PE+YILK GEG A CISGF A D+PPPRGPLW
Sbjct: 419 ESSVDCGSLASMPEIEFTIGGKKFALKPEEYILKVGEGPAAQCISGFTAMDIPPPRGPLW 478
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEAA 506
ILGDVFMG YHTVFD GKLR+GFA+AA
Sbjct: 479 ILGDVFMGPYHTVFDYGKLRVGFAKAA 505
>gi|15221141|ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana]
gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor
gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana]
Length = 506
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/487 (63%), Positives = 387/487 (79%), Gaps = 12/487 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFMDA 83
++G R+GLKK +LD + AAR+ K+ A +RLGDS D D++ LKN++DA
Sbjct: 27 NDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRA------YRLGDSGDADVVVLKNYLDA 80
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFS++C H +YKS +S+TY + GK+
Sbjct: 81 QYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYEKNGKAA 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF++A+FDGI+GLGF+EI+VG
Sbjct: 141 AIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGFQEISVG 200
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGY
Sbjct: 201 KAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGY 260
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF++GD+LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK V
Sbjct: 261 WQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTV 320
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS----AGDSAV 379
V QYG I DLL+S P+K+C QIGLC F+G VS GI++VV+KEN GD+A
Sbjct: 321 VDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENAKLSNGVGDAA- 379
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG
Sbjct: 380 CSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIG 439
Query: 440 DKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 499
K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G +
Sbjct: 440 GKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQ 499
Query: 500 IGFAEAA 506
+GFAEAA
Sbjct: 500 VGFAEAA 506
>gi|115436054|ref|NP_001042785.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|8467954|dbj|BAA96578.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113532316|dbj|BAF04699.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|215694819|dbj|BAG90010.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215701475|dbj|BAG92899.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222618242|gb|EEE54374.1| hypothetical protein OsJ_01384 [Oryza sativa Japonica Group]
Length = 495
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/506 (61%), Positives = 391/506 (77%), Gaps = 11/506 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW+L+ +LL AS +GL RI L K+RLD +L+ A++ R+E +
Sbjct: 1 MGRNHLCLVTCLWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESH------ 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R R +DI+PL N++D QYFGEIGIG+PPQNF+VIFDTGSSNLWVPS KCYFSI
Sbjct: 55 ---RLRADGLGDDIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H RYKS+ S++Y + G+SC I+YGSGSI+GFFS+D+V VGD+ VK+Q+FIE TRE
Sbjct: 112 ACYLHHRYKSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF++ +FDGI+GLGF EI+VG A P+W M EQ L+ ++VFSFWLNRDPDA GGE++
Sbjct: 172 SLTFIIGKFDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELI 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG HTYVPVT+KGYWQFE+GD+LI + STG C GGCAAI DSGTSLL GPT
Sbjct: 232 FGGVDPNHYKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPT 291
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+V +INHAIG EG+VS ECK VV YGD+I ++L++ P K+C QIGLCAF+G V
Sbjct: 292 TIVAQINHAIGAEGIVSMECKQVVRDYGDMILEMLIAQASPMKLCSQIGLCAFDGTRSVR 351
Query: 361 TGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
I++VV+KE V G C+ACEMAVVW+QNQL+ QT+E +L Y ++LC+ LP+P GE
Sbjct: 352 NNIESVVDKEKV--GSDLSCTACEMAVVWIQNQLRHNQTRELILQYADQLCERLPSPNGE 409
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S +DCD I MPN+SFTI +K F L+PEQY++K + VCISGFMAFD+PPPRGPLWI
Sbjct: 410 SAVDCDEISNMPNLSFTIANKTFTLTPEQYVVKLEQQGQTVCISGFMAFDVPPPRGPLWI 469
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFM YHTVFD GK RIGFAE+A
Sbjct: 470 LGDVFMAAYHTVFDFGKNRIGFAESA 495
>gi|1354272|gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/487 (63%), Positives = 387/487 (79%), Gaps = 12/487 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFMDA 83
++G R+GLKK +LD + AAR+ K+ A +RLGDS D D++ LKN++DA
Sbjct: 7 NDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRA------YRLGDSGDADVVVLKNYLDA 60
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFS++C H +YKS +S+TY + GK+
Sbjct: 61 QYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYEKNGKAA 120
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF++A+FDGI+GLGF+EI+VG
Sbjct: 121 AIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGFQEISVG 180
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGY
Sbjct: 181 KAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGY 240
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF++GD+LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK V
Sbjct: 241 WQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTV 300
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS----AGDSAV 379
V QYG I DLL+S P+K+C QIGLC F+G VS GI++VV+KEN GD+A
Sbjct: 301 VDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIESVVDKENAKLSNGVGDAA- 359
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES +DC ++ TMP VS TIG
Sbjct: 360 CSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIG 419
Query: 440 DKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 499
K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGPLWILGDVFMG YHTVFD G +
Sbjct: 420 GKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGNEQ 479
Query: 500 IGFAEAA 506
+GFAEAA
Sbjct: 480 VGFAEAA 486
>gi|226503984|ref|NP_001148782.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195622118|gb|ACG32889.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 510
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/486 (63%), Positives = 379/486 (77%), Gaps = 4/486 (0%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
SS GL R+ LKK +D + AAR++ +ER S GD D D++ LKN+M+A
Sbjct: 24 SSEGLVRVALKKLPVDQNGRVAARLSAEERQRLLLRGSNALGSGGDDDSDVIALKNYMNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYFGEIG+GSP Q F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + GK
Sbjct: 84 QYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPA 143
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI+VG
Sbjct: 144 AIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVG 203
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGY
Sbjct: 204 NATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGY 263
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK V
Sbjct: 264 WQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTV 323
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAV 379
VSQYG I DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +
Sbjct: 324 VSQYGQQILDLLLAETQPTKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPM 383
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
C+ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG
Sbjct: 384 CNACEMAVVWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIG 443
Query: 440 DKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 499
K F L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR
Sbjct: 444 GKKFKLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLR 503
Query: 500 IGFAEA 505
+GFAE+
Sbjct: 504 VGFAES 509
>gi|356555682|ref|XP_003546159.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 507
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/505 (62%), Positives = 388/505 (76%), Gaps = 17/505 (3%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-- 67
FCLW L L+ A ++GLRRIGLKK +LD + + R S +H L
Sbjct: 12 FCLWTLLFPLVFCAPNDGLRRIGLKKVKLDTDDVVGFKEFRS---------SIRKHHLQN 62
Query: 68 ---GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
G D D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C+
Sbjct: 63 ILGGAEDTDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACFM 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H+RY+S +S+TY E G S I YG+G+ISGFFS D+V+VGD+VVKDQ FIEATRE +TF
Sbjct: 123 HARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIEATREPGVTF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLGF+EI+VG AVPVW MVEQGLV + VFSFWLNR P+ E GGE+VFGG
Sbjct: 183 VAAKFDGILGLGFQEISVGYAVPVWYTMVEQGLVKDPVFSFWLNRKPEEENGGELVFGGA 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+KGKHTYVPVT+KGYWQF++GD+LI + TG C C+AI DSGTSLLAGPT V+T
Sbjct: 243 DPAHYKGKHTYVPVTRKGYWQFDMGDVLISGKPTGYCTNDCSAIADSGTSLLAGPTTVIT 302
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
IN AIG GVVS EC+ VV+QYG I +LL++ P+K+C QIGLC F+G VS GI+
Sbjct: 303 MINQAIGAAGVVSKECRSVVNQYGQTILELLLAEAKPKKICSQIGLCTFDGTHGVSMGIE 362
Query: 365 TVVEK-ENVSAG--DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
+VV+K E S+G A CSACEMAV+W+QNQL+Q QT+++++ Y NELC+ LPNPMG S
Sbjct: 363 SVVDKNEKKSSGGIRDAGCSACEMAVIWMQNQLRQNQTEDRIIDYANELCEKLPNPMGPS 422
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+DC ++ +MP VSFTIG K+F+LSPE+YILK GEG CISGF A D+PPPRGPLWIL
Sbjct: 423 SVDCGKLSSMPIVSFTIGGKVFDLSPEEYILKVGEGPEAQCISGFTALDVPPPRGPLWIL 482
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 483 GDVFMGRYHTIFDYGKLRVGFAEAA 507
>gi|357134751|ref|XP_003568979.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 498
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/506 (60%), Positives = 388/506 (76%), Gaps = 11/506 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
++ L CLW L+ LL AS +GL RI L K+ L+ +LNAA++ R++
Sbjct: 3 QRHLFLVTTCLWALSCAGLLHASPPDGLLRINLNKKSLNYEALNAAKLARQQ-------- 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
V ++ S+ DI+PL ++++ QYFG IG+G+PPQNF+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 55 DSVHLKISSSNSDIVPLVDYLNTQYFGVIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 114
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H +YKS KS+TY G+S +I YGSG+ISGFFS DNV VGD+VVK Q FIE TRE
Sbjct: 115 ACYLHHKYKSSKSSTYKADGESAKITYGSGAISGFFSNDNVLVGDLVVKKQKFIETTRET 174
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S TF++ +FDGI+GLGF EI+VG A PVW +M +Q L++++VFSFWLNR+ DA GGE+V
Sbjct: 175 SATFIIGKFDGILGLGFPEISVGKAPPVWMSMQKQKLLADDVFSFWLNRNADATSGGELV 234
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+KG HTYVPV++KGYWQF +GD+LI QSTG C GCAAIVDSGTSLLAGPT
Sbjct: 235 FGGVDSNHYKGNHTYVPVSRKGYWQFNMGDLLIDGQSTGFCAKGCAAIVDSGTSLLAGPT 294
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+V ++NHAIG EG++S ECK VVSQYG++I DLL++ P+KVC Q+GLC F+G VS
Sbjct: 295 AIVAQVNHAIGAEGIISTECKEVVSQYGEMILDLLLAQTEPQKVCSQVGLCLFDGTHSVS 354
Query: 361 TGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
GI++VV KENV G +C+ACEMAVVW++NQL++ +TKE +L Y N+LC+ LP+P GE
Sbjct: 355 KGIESVVGKENV--GSDVMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPNGE 412
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S + C I MPN++FTI K F L+PEQYI+K + VCISGFMAFD+PPPRGPLWI
Sbjct: 413 STVSCHEISKMPNLAFTIAGKTFVLTPEQYIVKLEQSGQTVCISGFMAFDIPPPRGPLWI 472
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD G+ RIGFAE+A
Sbjct: 473 LGDVFMGAYHTVFDFGEDRIGFAESA 498
>gi|225460913|ref|XP_002279049.1| PREDICTED: aspartic proteinase [Vitis vinifera]
gi|297737462|emb|CBI26663.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/509 (60%), Positives = 387/509 (76%), Gaps = 9/509 (1%)
Query: 7 RSVFCLWVLASCLLLP---ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
R+V L+ + P AS GL RIGLKKR D + AARI K+ G +
Sbjct: 6 RTVAVALFLSILMFSPEFSASDGGLVRIGLKKRAFDQTNRLAARIESKQGEALGTSIRKY 65
Query: 64 R---HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ G ++ L N+MDAQYFGEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS+
Sbjct: 66 NLHGNAAGSKHTYVVALHNYMDAQYFGEISIGTPPQKFTVIFDTGSSNLWVPSSKCYFSV 125
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS +S+TY + G S +I+YG+G+ISGFFS+D+V+VGD+ V +Q FIEAT+E
Sbjct: 126 ACYFHSKYKSSQSSTYKKNGTSADIHYGTGAISGFFSKDDVKVGDLAVINQEFIEATKEP 185
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S+TF LA+FDGI+GLGF+EI+VG+AVPVW NM+ Q L+ E +FSFW NR+ + E GGEIV
Sbjct: 186 SITFALAKFDGILGLGFQEISVGNAVPVWYNMINQELIKEPIFSFWFNRNSNEEVGGEIV 245
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D H+KGKHTYVPVTKKGYWQF+LGD++IG ++TG C GC+AI DSGTSLLAGPT
Sbjct: 246 FGGIDSDHYKGKHTYVPVTKKGYWQFDLGDVMIGGKTTGFCASGCSAIADSGTSLLAGPT 305
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
++TE+NHAIG G VS EC+ VV QYG +I D+L++ P+K+C QIGLCAFNG VS
Sbjct: 306 TIITEVNHAIGASGFVSQECRAVVQQYGQIIIDMLLTKEQPQKICSQIGLCAFNGIRGVS 365
Query: 361 TGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GI++VV++ N A D +CSAC MAVVW+QN+L Q +T +++L Y+NELCD LP+P
Sbjct: 366 MGIESVVDENNSKASDGLHDTMCSACSMAVVWIQNKLGQNETIDRILKYVNELCDRLPSP 425
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES +DC + +MPNVS TIG K+F+LSP+QYILK GEG CISGF A D+PPP GP
Sbjct: 426 MGESAVDCGSLSSMPNVSLTIGGKVFDLSPKQYILKVGEGEIAQCISGFTALDVPPPHGP 485
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD G +++GFAEAA
Sbjct: 486 LWILGDVFMGQYHTVFDYGNMKVGFAEAA 514
>gi|297849560|ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 506
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/512 (60%), Positives = 396/512 (77%), Gaps = 16/512 (3%)
Query: 4 KLLRSVFCLWVLASCLLLPAS----SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
K+ + ++ S LL ++ ++G R+GLKK +LD + AAR+ K+ A
Sbjct: 2 KIYSTTVAFSLIVSFLLFFSAFSERNDGTFRVGLKKLKLDSKNRLAARVESKQDKPLRA- 60
Query: 60 VSGVRHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+ LG+S D D++ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYF
Sbjct: 61 -----YSLGNSEDADVVVLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYF 115
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S++C H +YKS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+
Sbjct: 116 SLACLLHPKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATK 175
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E +TF++A+FDGI+GLGF+EI+VG+A PVW NM++QGL+ E VFSFW NR+ D EEGGE
Sbjct: 176 EPGITFVVAKFDGILGLGFQEISVGNATPVWYNMLKQGLIKEPVFSFWFNRNADEEEGGE 235
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+LIG TG CE GC+AI DSGTSLLAG
Sbjct: 236 LVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAG 295
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT ++T INHAIG GVVS +CK VV QYG I DLL+S P+K+C QIGLC F+G
Sbjct: 296 PTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRG 355
Query: 359 VSTGIKTVVEKENVS----AGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
VS GI++VV+KEN GD+A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ L
Sbjct: 356 VSMGIESVVDKENSKLSNGVGDAA-CSACEMAVVWIQSQLRQNMTQERILNYVNELCERL 414
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P+PMGES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PP
Sbjct: 415 PSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPP 474
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
RGPLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 475 RGPLWILGDVFMGKYHTVFDFGNEQVGFAEAA 506
>gi|261264941|gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
Length = 513
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/489 (64%), Positives = 394/489 (80%), Gaps = 5/489 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGD-SDEDILPLKN 79
AS+ GL RIGLKK +LD ++ AA++ K+ + A + +R GD D DI+ LKN
Sbjct: 25 ASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YKS S+TY +
Sbjct: 85 YMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GK +I+YG+G+ISG+FSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFKE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LI Q+TG C GC+AI DSGTSLLAGPT ++TE+NHAIG GVVS E
Sbjct: 265 QKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG--DS 377
CK VV++YG+ I +L+ P K+C QIGLC F+G VS I++VV+ ++
Sbjct: 325 CKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGVRGVSMDIESVVDNTRKASNGLRD 384
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
A+CS CEM VVW+QNQLKQ QT++++L+Y+NELCD LP+PMGES +DC + ++PNVS T
Sbjct: 385 AMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLT 444
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
IG ++F+LSPEQY+LK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 445 IGGRVFDLSPEQYVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGN 504
Query: 498 LRIGFAEAA 506
R+GFAEAA
Sbjct: 505 QRVGFAEAA 513
>gi|326494022|dbj|BAJ85473.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511208|dbj|BAJ87618.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/508 (60%), Positives = 393/508 (77%), Gaps = 12/508 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW ++ + ASS +GL RI L KR L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWAISCAVPHHASSRDGLLRINLNKRSLTHESLAAAKAARQ------- 53
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPSSKCYF
Sbjct: 54 -YGALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +Y+S +S TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FIEATR
Sbjct: 113 SIACYLHPKYRSSRSTTYKADGENCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F+L +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFILGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I ++L++ P+KVC QIGLC F+G +
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILEMLIAQTQPQKVCSQIGLCLFDGTQS 352
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS GI+++V KENV G +C+ACEMAVVW++NQL++ +TKE +L Y N+LC+ LP+P
Sbjct: 353 VSNGIESIVGKENV--GSDLMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPN 410
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES + C + MPN++F I +K F L+PEQYI+K + VCISGFMAFD+PPPRGPL
Sbjct: 411 GESTVSCHEMSKMPNLAFAIANKTFVLTPEQYIVKLEQSGQTVCISGFMAFDIPPPRGPL 470
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 471 WILGDVFMGGYHTVFDFGKDRIGFAESA 498
>gi|261264943|gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
Length = 513
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/489 (64%), Positives = 395/489 (80%), Gaps = 5/489 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGD-SDEDILPLKN 79
AS+ GL RIGLKK +LD ++ AA++ K+ + A + +R GD D DI+ LKN
Sbjct: 25 ASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YKS S+TY +
Sbjct: 85 YMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GK +I+YG+G+ISG+FSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFKE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LI Q+TG C C+AI DSGTSLLAGPT ++TE+NHAIG GVVS E
Sbjct: 265 QKGYWQFDMGDVLIDGQTTGFCVTTCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG--DS 377
CK VV++YG+ I +L+ P K+C QIGLC F+G + VS I++VV+ + ++
Sbjct: 325 CKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGTQGVSMDIESVVDNTHKASNGLRD 384
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
A+CS CEM VVW+QNQLKQ QT++++L+Y+NELCD LP+PMGES +DC + ++PNVS T
Sbjct: 385 AMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLT 444
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
IG ++F+LSPEQY+LK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 445 IGGRVFDLSPEQYVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDYGN 504
Query: 498 LRIGFAEAA 506
R+GFAEAA
Sbjct: 505 QRVGFAEAA 513
>gi|73912435|dbj|BAE20414.1| aspartic proteinase [Triticum aestivum]
Length = 498
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/508 (60%), Positives = 391/508 (76%), Gaps = 12/508 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW L+ + ASS +GL RI L K+ L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWALSCAVPHHASSRDGLLRINLNKKSLTHESLAAAKAARQH------ 54
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPS+KCYF
Sbjct: 55 --DALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSAKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +YKS KS+TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FI TR
Sbjct: 113 SIACYLHPKYKSSKSSTYKADGETCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIGTTR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F++ +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFIVGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV+++GYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPDHYKGNHTYVPVSRRGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I +LL++ P+KVC QIGLC F+G
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILELLIAQTQPQKVCSQIGLCLFDGTHS 352
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS GI++VV KENV G +C+ACEMAVVW++NQL++ +TKE +L Y N+LC+ LP+P
Sbjct: 353 VSNGIESVVGKENV--GSDVMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPN 410
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GES + C + MPN++FTI K F L+PEQY++K + VCISGFMAFD+PPPRGPL
Sbjct: 411 GESTVSCHEMSKMPNLAFTIASKTFVLTPEQYVVKLEQSGQTVCISGFMAFDIPPPRGPL 470
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVFMG YHTVFD GK RIGFAE+A
Sbjct: 471 WILGDVFMGAYHTVFDFGKDRIGFAESA 498
>gi|449466825|ref|XP_004151126.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 513
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/509 (60%), Positives = 398/509 (78%), Gaps = 7/509 (1%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
KL +V + ++ AS++G RIGLK+R+ ++ A++I KE V
Sbjct: 6 KLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKY 65
Query: 64 R--HRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD+ DI+ LKN+++AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKC FS+
Sbjct: 66 QPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC-FSV 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS+YKS++S+TY + GKS I YG+G+ISG+FS+DNV+VGD++VK Q FIEATRE
Sbjct: 125 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREP 184
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GGEIV
Sbjct: 185 SLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIV 244
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLAGPT
Sbjct: 245 FGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPT 304
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
++T++NHAIG GVVS ECK VV++YG+ I +L++ P+K+C +GLCAF+G VS
Sbjct: 305 TIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVS 364
Query: 361 TGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GI++VV+ + + +C+ACEMAVVW Q+QLK+++T++++L+YI+ LC+ LP+P
Sbjct: 365 MGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKEEKTQDQILNYIDGLCEKLPSP 424
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES+IDCD + T+P++SFTIG K+F L PEQY+LK EG CISGF A D+PPPRGP
Sbjct: 425 MGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPPRGP 484
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 485 LWILGDVFMGSYHTVFDYGNSRVGFAEAA 513
>gi|12231176|dbj|BAB20971.1| aspartic proteinase 3 [Nepenthes alata]
Length = 507
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/511 (63%), Positives = 400/511 (78%), Gaps = 14/511 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+ S+F +L L+ S++GL RIGLKK+ D ++ AAR+ +E G A S +R
Sbjct: 1 MPSLFVFIILLP-LVFSDSNDGLLRIGLKKKIFDQNNRIAARLETEE---GEARRSSLRK 56
Query: 66 -----RLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
LG+ +E DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 57 YYLHGNLGNPEETDIVALKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS 116
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+ CYFH++YKS S++Y + GKS +I+YG+G+ISGFFS+DNV+VGD+ VK Q FIEA+RE
Sbjct: 117 VPCYFHAKYKSSISSSYKKNGKSADIHYGTGAISGFFSEDNVQVGDLAVKAQEFIEASRE 176
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TFL+A+FDGI+GLGF+EI+VG+A PVW NMV QGLV E VFSFWLNR EEGGEI
Sbjct: 177 PSVTFLVAKFDGILGLGFQEISVGNATPVWYNMVNQGLVKEPVFSFWLNRKVGEEEGGEI 236
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDP HFKG H+YVPVT KGYWQF++GD+LI ++T CEGGC+AI DSGTSLLAGP
Sbjct: 237 VFGGVDPNHFKGTHSYVPVTHKGYWQFDMGDVLIDGKATEYCEGGCSAIADSGTSLLAGP 296
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVT INHAIG GVVS ECK VVSQYG I DLL++ + PEK+C QIGLC F+G V
Sbjct: 297 TSVVTMINHAIGATGVVSEECKAVVSQYGQTIMDLLLAEVSPEKICSQIGLCTFDGTRGV 356
Query: 360 STGIKTVVEKEN--VSAG--DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 415
S GIK+VV+KEN S+G A+C ACEMAVVW+++QL+Q QT+ +L+Y+N+LCD LP
Sbjct: 357 SIGIKSVVDKENNGKSSGILRDALCPACEMAVVWMKSQLEQNQTQNLILNYVNDLCDQLP 416
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
+PMGES +DC RI +M VS TIG K+F+L PEQYIL+ GEG A CISGF A D+PPP
Sbjct: 417 SPMGESAVDCARISSMATVSSTIGGKVFDLRPEQYILRVGEGPAAQCISGFTAMDIPPPG 476
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GPLWILGD+ MG YHTVFD G LR+GFAEAA
Sbjct: 477 GPLWILGDILMGRYHTVFDYGNLRVGFAEAA 507
>gi|22330379|ref|NP_176419.2| phytepsin [Arabidopsis thaliana]
gi|79320483|ref|NP_001031219.1| phytepsin [Arabidopsis thaliana]
gi|75331143|sp|Q8VYL3.1|APA2_ARATH RecName: Full=Aspartic proteinase A2; AltName: Full=Aspartic
protease 57; Short=AtASP57; Flags: Precursor
gi|17979428|gb|AAL49856.1| putative aspartic protease [Arabidopsis thaliana]
gi|23297031|gb|AAN13225.1| putative aspartic protease [Arabidopsis thaliana]
gi|222424000|dbj|BAH19961.1| AT1G62290 [Arabidopsis thaliana]
gi|332195825|gb|AEE33946.1| phytepsin [Arabidopsis thaliana]
gi|332195826|gb|AEE33947.1| phytepsin [Arabidopsis thaliana]
Length = 513
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/486 (61%), Positives = 380/486 (78%), Gaps = 5/486 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG--DSDEDILPLKNFMD 82
++G R+GLKK +LD ++ A R K+ + + + LG D DI+PLKN++D
Sbjct: 27 NDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLRSYNNNLGGDSGDADIVPLKNYLD 86
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SCYFH++YKS +S+TY + GK
Sbjct: 87 AQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCYFHAKYKSSRSSTYKKSGKR 146
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I+YGSGSISGFFS D V VGD+VVKDQ FIE T E LTFL+A+FDG++GLGF+EIAV
Sbjct: 147 AAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIETTSEPGLTFLVAKFDGLLGLGFQEIAV 206
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++G
Sbjct: 207 GNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRG 266
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQF++G++LI +STG C GC+AI DSGTSLLAGPT VV IN AIG GVVS +CK
Sbjct: 267 YWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTAVVAMINKAIGASGVVSQQCKT 326
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAV 379
VV QYG I DLL++ P+K+C QIGLCA++G VS GI++VV+KEN + A
Sbjct: 327 VVDQYGQTILDLLLAETQPKKICSQIGLCAYDGTHGVSMGIESVVDKENTRSSSGLRDAG 386
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
C ACEMAVVW+Q+QL+Q T+E++++YINE+C+ +P+P GES +DC ++ MP VSFTIG
Sbjct: 387 CPACEMAVVWIQSQLRQNMTQERIVNYINEICERMPSPNGESAVDCSQLSKMPTVSFTIG 446
Query: 440 DKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLR 499
K+F+L+PE+Y+LK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G +
Sbjct: 447 GKVFDLAPEEYVLKIGEGPVAQCISGFTALDIPPPRGPLWILGDVFMGKYHTVFDFGNEQ 506
Query: 500 IGFAEA 505
+GFAEA
Sbjct: 507 VGFAEA 512
>gi|302144105|emb|CBI23210.3| unnamed protein product [Vitis vinifera]
Length = 429
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/429 (69%), Positives = 363/429 (84%), Gaps = 3/429 (0%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ CYFHS+YKS +S+TY + G
Sbjct: 1 MDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
KS +I+YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+EI
Sbjct: 61 KSADIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+VG+AVPVW NMV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT+
Sbjct: 121 SVGNAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQ 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQF++G++LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS EC
Sbjct: 181 KGYWQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSAGDS 377
K VV+QYG+ I DLL+S P+K+C QIGLC F+G V GI++VV+++N S
Sbjct: 241 KTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGIESVVDEKNGDKSSGVHD 300
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
A CSACEMAVVW+Q+QL+Q QTKE++L Y+NELCD LP+PMGES +DC ++ +MPNVS T
Sbjct: 301 AGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSPMGESAVDCLQLSSMPNVSLT 360
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
IG K+F+LS +Y+LK GEG A CISGF+A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 361 IGGKVFDLSANEYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILGDVFMGRYHTVFDYGN 420
Query: 498 LRIGFAEAA 506
+R+GFAEAA
Sbjct: 421 MRVGFAEAA 429
>gi|297837199|ref|XP_002886481.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332322|gb|EFH62740.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 301/499 (60%), Positives = 384/499 (76%), Gaps = 5/499 (1%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS- 70
+W L + ++G R+GLKK +LD ++ A R K+ + + + LG
Sbjct: 14 VWFLLFFTVSSQRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLPSYNNNLGSDS 73
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
D DI+PLKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SC+FH+++K
Sbjct: 74 GDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCFFHAKFK 133
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S+TY + GK I+YGSGSISGFFS D V VGD+VVKDQ FIEAT E LTFL+A+F
Sbjct: 134 SSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIEATSEPGLTFLVAKF 193
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DG++GLGF+EIAVG+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF
Sbjct: 194 DGLLGLGFQEIAVGNATPVWYNMLKQGLIERPVFSFWLNRDPKSEEGGEIVFGGVDPKHF 253
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG+HT+VPVT++GYWQF++G++LI STG C GC+AI DSGTSLLAGPT V+ IN A
Sbjct: 254 KGEHTFVPVTQRGYWQFDMGEVLIAGDSTGYCGSGCSAIADSGTSLLAGPTAVIAMINKA 313
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK 369
IG GVVS +CK VV QYG I DLL++ P+K+C QIGLCAF+G VS GI++VV+K
Sbjct: 314 IGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAFDGTHGVSMGIESVVDK 373
Query: 370 ENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
EN + A C ACEMAV+W+Q+QL+Q T+E++++YINE+C+ +P+P GES +DC
Sbjct: 374 ENTRSSSGLRDAGCPACEMAVMWIQSQLRQNMTQERIVNYINEICERMPSPNGESAVDCS 433
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ MP VSFTIG K+F+L+PE+Y+LK GEG CISGF A D+PPPRGPLWILGDVFM
Sbjct: 434 QLSKMPTVSFTIGGKVFDLAPEEYVLKIGEGPVAQCISGFTALDVPPPRGPLWILGDVFM 493
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G YHTVFD G ++GFAEA
Sbjct: 494 GKYHTVFDFGNEQVGFAEA 512
>gi|226506070|ref|NP_001150729.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195641348|gb|ACG40142.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 518
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/485 (62%), Positives = 379/485 (78%), Gaps = 5/485 (1%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKERY-MGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
GL R+ LKK+ +D ++ AAR++ +ER + G + + GD D D++ L + +AQY
Sbjct: 34 GLVRVALKKQPVDQNARVAARLSAEERQRLLLRGANALGSAGGDDDSDVIALNXYXNAQY 93
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + GK I
Sbjct: 94 FGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPAAI 153
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+G+I+GFFS+D+V++GD+ V DQ FIEAT+E LTF++A+FDGI+GLGF+EI+VG+A
Sbjct: 154 QYGTGAIAGFFSEDSVKLGDLDVNDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVGNA 213
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PVW NMV+QGL+S+ VFSFW NR EGGEIVFGG+D H+KG HTYVPVT+KGYWQ
Sbjct: 214 TPVWYNMVKQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDSSHYKGDHTYVPVTQKGYWQ 273
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
F +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVS
Sbjct: 274 FNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVS 333
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCS 381
QYG I DLL++ P K+C Q+GLC F+G VSTGI++VV+ K N +C+
Sbjct: 334 QYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSTGIRSVVDDKAGKSNGGLKSDPMCN 393
Query: 382 ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDK 441
ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K
Sbjct: 394 ACEMAVVWMQNQLAQNKTQELILTYINQLCERLPSPMGESAVDCASLGSMPDIAFTIGGK 453
Query: 442 IFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIG 501
F L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD KLR+G
Sbjct: 454 KFKLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYXKLRVG 513
Query: 502 FAEAA 506
FAE+A
Sbjct: 514 FAESA 518
>gi|109675118|gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
Length = 508
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/491 (60%), Positives = 380/491 (77%), Gaps = 6/491 (1%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
++ S++GL R+G+KKR+LD +N A A + +GDSD DI+ LK
Sbjct: 21 MVFSVSNDGLIRVGIKKRKLD--QINQAFGGIDSNGANSARTYHLGGNIGDSDTDIIALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N++DAQYFGEI IGSPPQ F+VIFDTGSSNLWVPS++CYFS++CY H +YKS S+TY +
Sbjct: 79 NYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARCYFSLACYLHPKYKSSHSSTYKK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G S I YG+GSISG+FS DNV+VGD++VKDQ FIEATRE +TFL A+FDGI+GLGF+
Sbjct: 139 NGTSAAIRYGTGSISGYFSNDNVKVGDLIVKDQDFIEATREPGITFLAAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VG +VPVW NMV QGLV + VFSFW NR+ EEGGE+VFGGVDP HFKGKHTYVPV
Sbjct: 199 EISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELVFGGVDPNHFKGKHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T KGYWQF++GD+L+G ++TG C GGC+AI DSGTSLLAGPT ++T+INH IG GVVS
Sbjct: 259 THKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPTTIITQINHVIGASGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA---G 375
ECK +V++YG I DLL S P+K+C QIGLC+ +G+ VS I++VV+K N ++ G
Sbjct: 319 ECKSLVTEYGKTILDLLESKAAPQKICSQIGLCSSDGSRDVSMIIESVVDKHNGASNGLG 378
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
D +C CEMAV+W+QNQ+++ +T + + Y+N+LCD LP+PMGES +DC + +MPNVS
Sbjct: 379 DE-MCRVCEMAVIWMQNQMRRNETADSIYDYVNQLCDRLPSPMGESAVDCSSLASMPNVS 437
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
FT+G++ F L+P+QY+L+ GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD
Sbjct: 438 FTVGNQTFGLTPQQYVLQVGEGPVAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDY 497
Query: 496 GKLRIGFAEAA 506
G R+GFAEAA
Sbjct: 498 GNSRVGFAEAA 508
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/487 (64%), Positives = 392/487 (80%), Gaps = 12/487 (2%)
Query: 28 LRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHRLGDSDE-DILPLKNFMD 82
L R+GLKKR+LD SL + KE G+ + LG+SD+ DI+ LKN+M+
Sbjct: 30 LLRVGLKKRKLDQINRFSSLYGCK--GKESINPAIRKYGLGNGLGNSDDADIISLKNYMN 87
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQYFGEIGIG+PPQ F++IFDTGSSNLWVPS+KCYFSI+CYFHS+YKS S++YT+ GKS
Sbjct: 88 AQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAKCYFSIACYFHSKYKSSLSSSYTKNGKS 147
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
EI+YG+G+ISGFFSQD+V++GD+VV++Q FIEATRE S+TF+ A+FDGI+GLGF+EI+V
Sbjct: 148 AEIHYGTGAISGFFSQDHVKLGDLVVENQDFIEATREPSITFVAAKFDGILGLGFQEISV 207
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+AVPVW NMV+QGLV+E VFSFWLNR+ EEGGEIVFGGVDP H+KG+HT+VPVT KG
Sbjct: 208 GNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEEGGEIVFGGVDPNHYKGEHTFVPVTHKG 267
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQF++ D+L+G ++TG C GGC+AI DSGTSLLAGPT +V +INHAIG GVVS ECK
Sbjct: 268 YWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIVAQINHAIGASGVVSQECKA 327
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS----A 378
VV+QYG I D+L+S P+K+C QIGLC F+G VS GIK+VV+ NV S A
Sbjct: 328 VVAQYGTAILDMLISETQPKKICSQIGLCTFDGKRGVSVGIKSVVDM-NVDGSSSGLQDA 386
Query: 379 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 438
C+ACEM VVW+QNQLKQ QT+E++L+Y+NELC+ LP+PMGES +DC + +MP VSFT+
Sbjct: 387 TCTACEMTVVWMQNQLKQNQTEERILNYVNELCNRLPSPMGESAVDCSSLSSMPGVSFTV 446
Query: 439 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 498
G K+F+L PEQYIL+ GEG+A CISGF A D+ PP GPLWILGD+FMG YHTVFD G +
Sbjct: 447 GGKVFDLLPEQYILQVGEGVATQCISGFTALDVAPPLGPLWILGDIFMGQYHTVFDYGNM 506
Query: 499 RIGFAEA 505
R+GFAEA
Sbjct: 507 RVGFAEA 513
>gi|449503193|ref|XP_004161880.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 639 bits (1648), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/512 (59%), Positives = 399/512 (77%), Gaps = 10/512 (1%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
KL +V + ++ AS++G RIGLK+R+ ++ A++I KE V
Sbjct: 6 KLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKY 65
Query: 64 R--HRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD+ DI+ LKN+++AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKC FS+
Sbjct: 66 QPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC-FSV 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQV---FIEAT 177
+C HS+YKS++S+TY + GKS I YG+G+ISG+FS+DNV+VGD++VK++ FIEAT
Sbjct: 125 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKNRSLFDFIEAT 184
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
RE SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GG
Sbjct: 185 REPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGG 244
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
EIVFGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLA
Sbjct: 245 EIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLA 304
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPT ++T++NHAIG GVVS ECK VV++YG+ I +L++ P+K+C +GLCAF+G
Sbjct: 305 GPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGER 364
Query: 358 YVSTGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
VS GI++VV+ + + +C+ACEMAVVW Q+QLK+++T++++L+YI+ LC+ L
Sbjct: 365 GVSMGIESVVDNTTQKSSNGLRDVMCNACEMAVVWAQSQLKEEKTQDQILNYIDGLCEKL 424
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P+PMGES+IDCD + T+P++SFTIG K+F L PEQY+LK EG CISGF A D+PPP
Sbjct: 425 PSPMGESVIDCDSLSTLPSISFTIGGKVFELKPEQYVLKVTEGPVTECISGFAALDVPPP 484
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
RGPLWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 485 RGPLWILGDVFMGSYHTVFDYGNSRVGFAEAA 516
>gi|223929912|gb|ACN24614.1| aspartic acid protease [Phaseolus vulgaris]
Length = 513
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 308/506 (60%), Positives = 390/506 (77%), Gaps = 11/506 (2%)
Query: 10 FCLWVLASCLLLPASS----NGLRRIGLKKRRLDLHSLNAARI-TRKERYMGGAGVSGVR 64
+CL+V + LLL A S +GLRRIGLKK +LD ++ AARI ++ + + ++
Sbjct: 10 WCLFV--TTLLLSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSKDDSFRASIRKFHLQ 67
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ G + D DI+ LKN++DAQYFGEI IG+ PQ F+VIFDTGSSNLWVPSS C FS++CY
Sbjct: 68 NNFGGTEDTDIVALKNYLDAQYFGEIAIGTSPQKFTVIFDTGSSNLWVPSSLCTFSVACY 127
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH++Y+S KS+TY + G + I YG+G+ISGFFS D+V VGD+VVK Q FIEATRE +
Sbjct: 128 FHAKYRSSKSSTYKKNGTAAAIQYGTGAISGFFSYDSVRVGDIVVKSQEFIEATREPGVV 187
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFW NR P+ EEGGEIVFGG
Sbjct: 188 FLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWFNRKPEEEEGGEIVFGG 247
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDP H+KGKHTYVPVT+KGYW+F++GD+LIG + TG C GC AI DSGTSLLAGPT ++
Sbjct: 248 VDPAHYKGKHTYVPVTRKGYWRFDMGDVLIGGKPTGYCADGCLAIADSGTSLLAGPTTII 307
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG G++S ECK VV++YG I +LL++ P+K+C QIGLC F+G + GI
Sbjct: 308 TMINHAIGAAGIMSQECKTVVAEYGQTILNLLLAETQPKKICSQIGLCTFDGTRGIDMGI 367
Query: 364 KTVVE---KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
+VV+ +++ A CSACEMAVVW+QNQL + QT++++LSYIN+LCD +P+PMGE
Sbjct: 368 ASVVDEIARKSSGGLHDAACSACEMAVVWMQNQLSRNQTQDQILSYINQLCDKMPSPMGE 427
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S ID I ++P VSFTIG + F+L PE+YILK GEG CISGF A D+PPPRGPLWI
Sbjct: 428 SSIDRGNISSLPVVSFTIGGRTFDLLPEEYILKVGEGPVAQCISGFTAIDIPPPRGPLWI 487
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD G LR+GFA+AA
Sbjct: 488 LGDVFMGRYHTVFDFGNLRVGFADAA 513
>gi|2811025|sp|O04057.1|ASPR_CUCPE RecName: Full=Aspartic proteinase; Flags: Precursor
gi|1944181|dbj|BAA19607.1| aspartic endopeptidase [Cucurbita pepo]
Length = 513
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 321/506 (63%), Positives = 406/506 (80%), Gaps = 8/506 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
+ CL++L S ++ ++SN GL R+GLKK +LD + AAR+ K+ + A +
Sbjct: 9 AFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARVESKDAEILKAAFRKYNPK 68
Query: 67 --LGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWV +C FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWV-LCECLFSVACH 127
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S++Y + G S I YG+G++SGFFS DNV+VGD+VVK+QVFIEATRE SLT
Sbjct: 128 FHARYKSSRSSSYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKEQVFIEATREPSLT 187
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG+AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 188 FLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGG 247
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKH++GKHTYVPVT+KGYWQF++GD+LI + TG C+GGC+AI DSGTSLLAGPTPV+
Sbjct: 248 VDPKHYRGKHTYVPVTQKGYWQFDMGDVLIDGEPTGFCDGGCSAIADSGTSLLAGPTPVI 307
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG +GVVS +CK VV+QYG I DLL+S P+K+C QI LC F+G VS GI
Sbjct: 308 TMINHAIGAKGVVSQQCKAVVAQYGQTIMDLLLSEADPKKICSQINLCTFDGTRGVSMGI 367
Query: 364 KTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGE 420
++VV++ + DS +CS CEM VVW+QNQL+Q QTKE++++YINELCD +P+PMG+
Sbjct: 368 ESVVDENAGKSSDSLHDGMCSVCEMTVVWMQNQLRQNQTKERIINYINELCDRMPSPMGQ 427
Query: 421 SIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWI 480
S +DC ++ +MP VSFTIG KIF+L+PE+YILK GEG CISGF AFD+PPPRGPLWI
Sbjct: 428 SAVDCGQLSSMPTVSFTIGGKIFDLAPEEYILKVGEGPVAQCISGFTAFDIPPPRGPLWI 487
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVFMG YHTVFD GKLR+G AEAA
Sbjct: 488 LGDVFMGRYHTVFDFGKLRVGSAEAA 513
>gi|147780252|emb|CAN65745.1| hypothetical protein VITISV_037763 [Vitis vinifera]
Length = 504
Score = 635 bits (1639), Expect = e-179, Method: Compositional matrix adjust.
Identities = 308/502 (61%), Positives = 383/502 (76%), Gaps = 34/502 (6%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q L + FCLW L + LL ASS+GL RIGLKK RLD + + AAR+ R+ + +GG V
Sbjct: 3 MRQGYLWAAFCLWAL-TFPLLQASSDGLVRIGLKKWRLDYNRIRAARMARRAKSIGGV-V 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD + + L+N+MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI
Sbjct: 61 KSMYQGLGDSDGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSI 120
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FHS+Y SR S T T+ C + VFIEATRE
Sbjct: 121 ACLFHSKYNSRLSTTSTK----CHFS-------------------------VFIEATREA 151
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL F+L +FDGI+GLGF EI VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIV
Sbjct: 152 SLVFVLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIV 211
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HFKG+HTY +T+KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT
Sbjct: 212 FGGVDKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPT 271
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+VTEINHAIG EG+VS ECK VVSQYG++IWDLL+S + P+ VC QIGLC FNG++ S
Sbjct: 272 AIVTEINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQIES 331
Query: 361 TGIKTVVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
IKTVVE+E+ G+ C+ACEM V+W+QNQLKQ++TKE + SY+ ELC SLP+P
Sbjct: 332 PRIKTVVEEEDARGTKVGNEVWCTACEMTVIWIQNQLKQRKTKEIIFSYVTELCQSLPSP 391
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES++DC R+P MP+V+FTI DK F L+P++Y+LKTGEGI VC+SGF+A D+PPPRGP
Sbjct: 392 MGESVVDCGRVPYMPDVTFTIADKHFTLTPKEYVLKTGEGITTVCLSGFIALDVPPPRGP 451
Query: 478 LWILGDVFMGVYHTVFDSGKLR 499
LWILGD+FMGVYHTVFD G L+
Sbjct: 452 LWILGDIFMGVYHTVFDYGNLQ 473
>gi|20800441|gb|AAB03843.2| aspartic proteinase [Vigna unguiculata]
gi|33339734|gb|AAQ14346.1| aspartic proteinase [Vigna unguiculata]
Length = 513
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 307/509 (60%), Positives = 389/509 (76%), Gaps = 9/509 (1%)
Query: 7 RSVFCLWVLASCLLLPASS----NGLRRIGLKKRRLDLHSLNAARI-TRKERYMGGAGVS 61
++V L + + LL A S +GLRRIGLKK +LD ++ AARI + + +
Sbjct: 5 KNVISLCLFVTTLLFSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSNDDSFRASIRKF 64
Query: 62 GVRHRL-GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+++ G + DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPSS+C FS+
Sbjct: 65 HLQNNFAGTGETDIVALKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSRCTFSL 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFH++Y+S +S+TY G + I YG+G+I+GFFS DNV VGD+VVK+Q FIEATRE
Sbjct: 125 ACYFHAKYRSGRSSTYRRNGTAAAIQYGTGAIAGFFSYDNVRVGDIVVKNQEFIEATREP 184
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ FL A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFWLNR + EEGGE+V
Sbjct: 185 GVVFLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWLNRKTEEEEGGELV 244
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVT+KGYWQF++GD+LIG + TG C GGCAAI DSGTSLLAGPT
Sbjct: 245 FGGVDPAHYKGEHTYVPVTRKGYWQFDMGDVLIGGKPTGYCAGGCAAIADSGTSLLAGPT 304
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
++T INHAIG GV+S ECK VV++YG I +LL++ P+K+C QIGLC F+G V
Sbjct: 305 AIITMINHAIGASGVMSQECKTVVAEYGQTILNLLLAETQPKKICSQIGLCTFDGTRGVD 364
Query: 361 TGIKTVV-EKENVSAG--DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GI++VV E S+G A CSACEMAVVWVQNQL + QT++++LSY+N+LCD +P+P
Sbjct: 365 MGIESVVDENARKSSGGLHDAGCSACEMAVVWVQNQLSRNQTQDQILSYVNQLCDKMPSP 424
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES + C I ++P VSFTIG + F+L PE+YILK GEG CISGF A D+ PPRGP
Sbjct: 425 MGESSVGCGDISSLPVVSFTIGGRTFDLRPEEYILKVGEGPVAQCISGFTAIDIAPPRGP 484
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD G R+GFAEAA
Sbjct: 485 LWILGDVFMGPYHTVFDFGNQRVGFAEAA 513
>gi|148906206|gb|ABR16259.1| unknown [Picea sitchensis]
Length = 509
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 302/496 (60%), Positives = 380/496 (76%), Gaps = 17/496 (3%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-------GVRHRLGDSDE--- 72
A+++ L RI LKK+ LD +L AARI +E G+S G+R L S+
Sbjct: 19 AANDCLARIELKKKGLDQKTLQAARIVARE-----GGLSNEVNRKYGLRGGLSYSESARG 73
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+ +PLKN++DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPS+KCY SI+CYFHS+YK+ +
Sbjct: 74 EYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHSKYKASQ 133
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S++Y GK I YGSGS+SG+ QD+V GD+VVKDQVF E T+E LTFL A+FDGI
Sbjct: 134 SSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF++I+VG+ VPVW NMV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGK
Sbjct: 194 LGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
HTYVPVT++GYWQF +GD LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG
Sbjct: 254 HTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGA 313
Query: 313 EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN- 371
G+ S ECK VVSQYGDLI +LL++ P+KVC QIGLC +G V I +V+EK N
Sbjct: 314 SGLASQECKSVVSQYGDLIMELLMAQTNPQKVCSQIGLCLSDGTRDVGMRIASVLEKGNE 373
Query: 372 -VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
S S +C+ACEMAVVW +NQ+ + +K+++++Y+N+LCD LPNP G++ +DC+ + +
Sbjct: 374 ATSTSSSGMCAACEMAVVWAKNQIARNASKDQIMTYLNQLCDRLPNPNGQAAVDCNNLSS 433
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
MP VSFTIGD+ F+L+P+QYILK GEG A CISGFM D+PPP GP+WILGDVFMGVYH
Sbjct: 434 MPTVSFTIGDRSFDLTPDQYILKVGEGSAAQCISGFMGLDVPPPMGPIWILGDVFMGVYH 493
Query: 491 TVFDSGKLRIGFAEAA 506
TVFD G +R+GF EAA
Sbjct: 494 TVFDFGNMRVGFTEAA 509
>gi|351724625|ref|NP_001237064.1| aspartic proteinase 1 precursor [Glycine max]
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 634 bits (1636), Expect = e-179, Method: Compositional matrix adjust.
Identities = 314/516 (60%), Positives = 400/516 (77%), Gaps = 12/516 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMG 56
M ++ V CL L S LL+ A + GLRRIGLKK +LD + AAR+ K+
Sbjct: 1 MGNRMNAIVLCL--LVSTLLVSAVYCAPNAGLRRIGLKKIKLDPKNRLAARVGSKDVDSF 58
Query: 57 GAGVS--GVRHRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS 113
A + +++ G ++E DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPS
Sbjct: 59 RASIRQFHLQNNFGGTEETDIVALKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPS 118
Query: 114 SKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
SKC FS++CYFH++YKS KS+T+ + G + I YG+G+ISGFFS D+V VG++VVK+Q F
Sbjct: 119 SKCTFSVACYFHAKYKSSKSSTFKKNGTAAAIQYGTGAISGFFSYDSVRVGEIVVKNQEF 178
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
IEATRE +TFL A+FDGI+GLGF+EI+VG+A PVW NMV+QGL+ E VFSFW NR+P+
Sbjct: 179 IEATREPGVTFLAAKFDGILGLGFQEISVGNAAPVWYNMVDQGLLKEPVFSFWFNRNPEE 238
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+LIG + TG C GC+AI DSGT
Sbjct: 239 EEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAIADSGT 298
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
SLLAGPT V+T INHAIG GV+S ECK +V++YG I DLL++ P+K+C +IGLCAF
Sbjct: 299 SLLAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTILDLLLAETQPKKICSRIGLCAF 358
Query: 354 NGAEYVSTGIKTVV---EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINEL 410
+G V GIK+VV E++++ A C ACEMAVVW+QNQL + QT++++LSYIN+L
Sbjct: 359 DGTHGVDVGIKSVVDENERKSLGGHHGAACPACEMAVVWMQNQLSRNQTQDQILSYINQL 418
Query: 411 CDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFD 470
CD +P+PMGES +DC I ++P VSFTIG + F+LSPE+Y+LK GEG CISGF A D
Sbjct: 419 CDKMPSPMGESAVDCGNISSLPVVSFTIGGRTFDLSPEEYVLKVGEGPVAQCISGFTAID 478
Query: 471 LPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+PPPRGPLWILGDVFMG YHTVFD GKLR+GFA+AA
Sbjct: 479 IPPPRGPLWILGDVFMGRYHTVFDFGKLRVGFADAA 514
>gi|297736824|emb|CBI26025.3| unnamed protein product [Vitis vinifera]
Length = 500
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 302/511 (59%), Positives = 379/511 (74%), Gaps = 19/511 (3%)
Query: 1 MEQKLLRSVFCL-WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M K + CL W A CL L SS+GL RIGLKK+ LDL L+AARITR +
Sbjct: 1 MRLKYILVANCLLWAWACCLALDDSSDGLVRIGLKKKPLDLARLHAARITRGNGFHA--- 57
Query: 60 VSGVRHRLGDSDED-----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
LG D++ + LKN+MDAQY+GEIGIGSPPQ FSV+FDTGSSNLWVPSS
Sbjct: 58 -----QGLGKVDDNYPKANTVYLKNYMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSS 112
Query: 115 KCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
KCYFSI+CYFH+RY++ S TY++ G+ C+INYGSGSISGFFSQD+V++G++V+K+QVF
Sbjct: 113 KCYFSIACYFHARYRAVLSRTYSKNGRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFT 172
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT+EG F LA+FDGI+GLGF+ +VG P+W NMV+Q LVS E+ SFWLNRDP A+
Sbjct: 173 EATKEGLFAFSLAQFDGILGLGFQNASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAK 232
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++FGGVD +HF G HT+VP+T+K YWQ E+GDILI STG CEGGCAAIVD+GTS
Sbjct: 233 IGGEVIFGGVDWRHFMGDHTFVPITRKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
++AGPT VVT+INHAIG EG+VS CK VV++YG LIW LVSG PE VC IGLCA+N
Sbjct: 293 MIAGPTTVVTQINHAIGAEGIVSFNCKNVVNKYGRLIWQFLVSGFQPENVCSDIGLCAYN 352
Query: 355 GAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
G + S G++TV+ GD+A C+ CEM W+Q QLK+ + KEKV Y+NELC++L
Sbjct: 353 GTKNASAGMETVIGN-----GDNAACTFCEMIAFWIQVQLKEHKAKEKVFQYVNELCENL 407
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
PNP G+ ++CD + TMP +SF IGDK F L+ EQY LK VC+SGF A D+P P
Sbjct: 408 PNPGGKDFVNCDALATMPVISFAIGDKYFPLTAEQYTLKVEVNCTTVCLSGFTALDVPRP 467
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLW+LGDVF+G YHT+FD G L++GFA++
Sbjct: 468 DGPLWVLGDVFLGAYHTIFDFGNLQVGFAKS 498
>gi|1665867|emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
Length = 509
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 310/495 (62%), Positives = 388/495 (78%), Gaps = 17/495 (3%)
Query: 21 LPASSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
AS+ GL R+GLKKR++D H + RK+ GG+ L DSD DI
Sbjct: 23 FSASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARKDFGFGGS--------LRDSDSDI 74
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+ LKN+MDAQY+GEIGIGSP Q F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+
Sbjct: 75 IELKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+G
Sbjct: 135 TYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGVTFLAAKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+VG +VPVW NMV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHT
Sbjct: 195 LGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF +GD+LI +++TG C GCAAI DSGTSLLAGPT ++T+INHAIG +G
Sbjct: 255 YVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPTAIITQINHAIGAKG 314
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN-VS 373
V+S +CK +V QYG I ++L+S P+K+C Q+ LC F+GA VS+ I++VV+K N S
Sbjct: 315 VMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKS 374
Query: 374 AG--DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
+G +C+ CEMAVVW+QNQ+K+ QT++ +++Y+NELCD LP+PMGES +DC+ + +M
Sbjct: 375 SGGVHDEMCTFCEMAVVWMQNQIKRNQTEDNIINYVNELCDRLPSPMGESAVDCNDLSSM 434
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
PN++FTIG K+F L PEQYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHT
Sbjct: 435 PNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGQYHT 494
Query: 492 VFDSGKLRIGFAEAA 506
VFD GKLR+GFAEAA
Sbjct: 495 VFDYGKLRVGFAEAA 509
>gi|356522015|ref|XP_003529645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 514
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 315/490 (64%), Positives = 387/490 (78%), Gaps = 6/490 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGDSDE-DILPLKN 79
A ++GLRRIGLKK +LD + AARI K+ A + +++ G S+E DI+ LKN
Sbjct: 25 APNDGLRRIGLKKIKLDPKNRLAARIGSKDVDSFRASIRKFHLQNNFGGSEETDIVALKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPSSKC FS++CYFH++YKS KS+TY +
Sbjct: 85 YLDAQYYGEIAIGTSPQKFTVIFDTGSSNLWVPSSKCTFSVACYFHAKYKSSKSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + I YG+G+ISGFFS D+V VGD+ VK+Q FIEATRE +TFL A+FDGI+GLGF+E
Sbjct: 145 GTAAAIQYGTGAISGFFSYDSVRVGDIFVKNQEFIEATREPGVTFLAAKFDGILGLGFQE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGL+ E VFSFW NR P+ EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVDQGLIKEPVFSFWFNRKPEEEEGGEIVFGGVDPAHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LIG + TG C GC+AI DSGTSLLAGPT V+T INHAIG GV+S E
Sbjct: 265 RKGYWQFDMGDVLIGGKPTGYCADGCSAIADSGTSLLAGPTTVITMINHAIGASGVMSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG--D 376
CK VV++YG I DLL+S P+K+C +IGLCAF+G V GIK+VV E E S+G
Sbjct: 325 CKTVVAEYGQTILDLLLSETQPKKICSRIGLCAFDGTRGVDVGIKSVVDENERKSSGGHH 384
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
A C ACEMAVVW+QNQL + QT++++LSYIN+LCD +P+PMGES +DC I ++P VSF
Sbjct: 385 GAACPACEMAVVWMQNQLSRNQTQDQILSYINQLCDKMPSPMGESAVDCGNISSLPVVSF 444
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
TIG + F LSPE+YILK GEG CISGF A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 445 TIGGRTFELSPEEYILKVGEGPVAQCISGFTAIDIPPPRGPLWILGDVFMGRYHTVFDFG 504
Query: 497 KLRIGFAEAA 506
K R+GFA+AA
Sbjct: 505 KQRVGFADAA 514
>gi|556819|emb|CAA57510.1| cyprosin [Cynara cardunculus]
Length = 509
Score = 632 bits (1631), Expect = e-179, Method: Compositional matrix adjust.
Identities = 306/492 (62%), Positives = 387/492 (78%), Gaps = 17/492 (3%)
Query: 24 SSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
S+ GL R+GLKKR++D H ++ RK+ GGA L DS DI+ L
Sbjct: 26 SNGGLLRVGLKKRKVDQINQLSGHGVSMEAKARKDFGFGGA--------LRDSGSDIIAL 77
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+TY
Sbjct: 78 KNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTYK 137
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+GLGF
Sbjct: 138 KNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGF 197
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VG +VP+W NMV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVP
Sbjct: 198 QEISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVP 257
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++GD+LI +++TG C GCAAI DSGTSLLAGPT ++TEINHAIG +GV+S
Sbjct: 258 VTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHAIGAKGVMS 317
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN---VSA 374
+CK +VSQYG + ++L+S P+K+C Q+ LC F+GA S+ I++VV++ N S
Sbjct: 318 QQCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGARDASSIIESVVDENNGKSSSG 377
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
+C+ CEMAVVW+QNQ+K+ +T++ +++Y+NELCD LP+PMGES +DC+ + +MPN+
Sbjct: 378 VHDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSMPNI 437
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FTIG K+F L PEQYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHTVFD
Sbjct: 438 AFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHTVFD 497
Query: 495 SGKLRIGFAEAA 506
GKLR+GFAEAA
Sbjct: 498 YGKLRVGFAEAA 509
>gi|1168536|sp|P42210.1|ASPR_HORVU RecName: Full=Phytepsin; AltName: Full=Aspartic proteinase;
Contains: RecName: Full=Phytepsin 32 kDa subunit;
Contains: RecName: Full=Phytepsin 29 kDa subunit;
Contains: RecName: Full=Phytepsin 16 kDa subunit;
Contains: RecName: Full=Phytepsin 11 kDa subunit; Flags:
Precursor
gi|18904|emb|CAA39602.1| aspartic proteinase [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 630 bits (1624), Expect = e-178, Method: Compositional matrix adjust.
Identities = 309/492 (62%), Positives = 387/492 (78%), Gaps = 8/492 (1%)
Query: 20 LLPASSN--GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
+LPA+S GL RI LKKR +D +S A ++ E +G + +R + + DI+ L
Sbjct: 20 VLPAASEAEGLVRIALKKRPIDRNSRVATGLSGGEEQPLLSGANPLRS---EEEGDIVAL 76
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY HSRYK+ S+TY
Sbjct: 77 KNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYK 136
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ GK I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF
Sbjct: 137 KNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGF 196
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VG AVPVW M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVP
Sbjct: 197 KEISVGKAVPVWYKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVP 256
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 257 VTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVS 316
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD- 376
ECK +VSQYG I DLL++ P+K+C Q+GLC F+G VS GI++VV+ E V +
Sbjct: 317 QECKTIVSQYGQQILDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGL 376
Query: 377 --SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
+CSACEMAVVW+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++
Sbjct: 377 RADPMCSACEMAVVWMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCGSLGSMPDI 436
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTIG K F L PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD
Sbjct: 437 EFTIGGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFD 496
Query: 495 SGKLRIGFAEAA 506
GKLRIGFA+AA
Sbjct: 497 YGKLRIGFAKAA 508
>gi|425892460|gb|AFB73927.2| preprocirsin [Cirsium vulgare]
Length = 509
Score = 629 bits (1623), Expect = e-178, Method: Compositional matrix adjust.
Identities = 306/495 (61%), Positives = 387/495 (78%), Gaps = 17/495 (3%)
Query: 21 LPASSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
+ S++GL R+GLKKR++D H + RK+ GG L DSD DI
Sbjct: 23 ISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARKDFGFGGT--------LRDSDSDI 74
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+ LKN+MDAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+
Sbjct: 75 IALKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+G
Sbjct: 135 TYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+VG +VPVW NMV QGLV E VFSFW NR+ + EEGGE+VFGGVDP HFKGKHT
Sbjct: 195 LGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF +GD+LI +++TG C GCAAI DSGTSLLAGPT ++TEINHA G +G
Sbjct: 255 YVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKG 314
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA 374
V+S +CK +VSQYG I ++L+S P+K+C Q+ LC F+GA VS+ I++VV+K N +
Sbjct: 315 VMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDVSSIIESVVDKNNGKS 374
Query: 375 GDSA---VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
A +C+ CEMAVVW+QNQ+K+ +T++ +++Y+NELCD LP+PMGES +DC+ + +M
Sbjct: 375 SGGANDEMCTFCEMAVVWMQNQIKRNETEDNIINYVNELCDRLPSPMGESAVDCNSLSSM 434
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
PN++FTIG K+F L PEQYILK GEG A CISGF A D+ PPRGPLWILGDVFMG YHT
Sbjct: 435 PNIAFTIGGKVFELCPEQYILKIGEGEAAQCISGFTAMDVAPPRGPLWILGDVFMGRYHT 494
Query: 492 VFDSGKLRIGFAEAA 506
VFD GK R+GFAEAA
Sbjct: 495 VFDYGKSRVGFAEAA 509
>gi|224068986|ref|XP_002302872.1| predicted protein [Populus trichocarpa]
gi|222844598|gb|EEE82145.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 629 bits (1623), Expect = e-178, Method: Compositional matrix adjust.
Identities = 300/502 (59%), Positives = 391/502 (77%), Gaps = 9/502 (1%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLN----AARITRKERYMGGAGVSGVR--HRL 67
+L+ ++L A +GL RIGLKK++LD + +E G + + + + +
Sbjct: 4 LLSFPVVLSARDDGLMRIGLKKKKLDHLGRRVVPGSVNFIPEEEGGGASKPAATKKYYNI 63
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
G+++ DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+
Sbjct: 64 GETEADIVALKNYLDAQYYGEITIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYFHSK 123
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
YKS S TY + G S I YG+GSISGFFSQD+VEVGD+VVK+Q FIEAT+E +TFL +
Sbjct: 124 YKSSASTTYVKNGTSAAIQYGTGSISGFFSQDSVEVGDLVVKNQGFIEATKEPGVTFLAS 183
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EI+VG+AVPVW NMV QGLV E+VFSFWLNR+ + EEGGEIVFGGVDP
Sbjct: 184 KFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEKVFSFWLNRNVEGEEGGEIVFGGVDPN 243
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KG+HTYVPVT KGYWQF++GD+LIG ++TG+C GGC AI DSGTSLLAGPT V+T+IN
Sbjct: 244 HYKGEHTYVPVTHKGYWQFDMGDLLIGTETTGLCAGGCKAIADSGTSLLAGPTTVITQIN 303
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV 367
+AIG G+VS ECK VV+QYG +I ++LV+ P KVC QI C F+G + VS I++VV
Sbjct: 304 NAIGASGIVSEECKTVVAQYGKIILEMLVAQAQPRKVCSQISFCTFDGTQGVSMNIESVV 363
Query: 368 EKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
E+ + + D A+C+ACEM VVW++N+L+ T++++L Y+N LCD LP+P GES ++
Sbjct: 364 EENSDKSSDGLHDAMCTACEMMVVWMENRLRLNDTEDQILDYVNNLCDRLPSPNGESAVE 423
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C + +MP++SF IG K+F LSPEQY+LK GEG++ CISGF A D+PPP GPLWILGDV
Sbjct: 424 CSSLSSMPSISFEIGGKLFELSPEQYVLKVGEGVSAQCISGFTALDVPPPHGPLWILGDV 483
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
FMG YHTVFD G L +GFA+AA
Sbjct: 484 FMGRYHTVFDYGNLTVGFADAA 505
>gi|356556454|ref|XP_003546541.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 505
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 300/510 (58%), Positives = 379/510 (74%), Gaps = 9/510 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNG-LRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M+ K L C+W + S++G L RIGLK+R LDL L AARI + G
Sbjct: 1 MDFKYLLVGMCVWAWFGSITFATSNDGRLMRIGLKRRTLDLQCLKAARIKEAGHHRDLGG 60
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+ DEDI+ LKN++DAQYFGEI IGSPPQ F+V+FDTGSSNLWVPSSKC FS
Sbjct: 61 VN-----RNCCDEDIVYLKNYLDAQYFGEISIGSPPQYFNVVFDTGSSNLWVPSSKCIFS 115
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CYFHS+Y+S+ S+TYTEIG C+I YG GSI GFFSQDNV+VGD+++KDQ F E TRE
Sbjct: 116 IACYFHSKYRSKISSTYTEIGIPCKIPYGQGSIFGFFSQDNVQVGDIIIKDQEFAEITRE 175
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL FDGI+GLGF++ +VG PVW NM+E GL+S ++FS WLN+DP E GGEI
Sbjct: 176 GSLALPALPFDGILGLGFQDTSVGKVTPVWYNMLEGGLISHKIFSLWLNQDPSEEMGGEI 235
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGG+D +HF+G+HTYVP+++KGYWQ +LGDIL+ N STG+CEGGCAA+VDSGTSL+AGP
Sbjct: 236 VFGGIDYRHFRGEHTYVPLSQKGYWQIDLGDILLANNSTGLCEGGCAAVVDSGTSLIAGP 295
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVT+INHAIG EG S ECK ++ YGD IW+ L++GL P+ +C IG C+ N +
Sbjct: 296 TTVVTQINHAIGAEGYTSFECKSILHNYGDSIWESLIAGLYPDIICSAIGFCSNNEFNTM 355
Query: 360 STGIKTVVEKENVSAG---DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPN 416
IKTVV ++ + +S CS C M V+W+Q QLKQ KEKVL Y++ELC+ LPN
Sbjct: 356 DDVIKTVVHNQSWNRSQTRESPFCSFCNMIVLWIQVQLKQSNVKEKVLKYVDELCEKLPN 415
Query: 417 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 476
P G+S I+C+RI TMP+++FTIG+K F LSPEQY+L+ EG + VC GF+A D+PPP+G
Sbjct: 416 PPGQSFINCNRIATMPHITFTIGNKSFPLSPEQYVLRVEEGCSTVCYGGFVAIDVPPPQG 475
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLW+LG +F+G YHTVFD G LRIGFAEAA
Sbjct: 476 PLWVLGSIFLGAYHTVFDYGNLRIGFAEAA 505
>gi|359477267|ref|XP_002275241.2| PREDICTED: aspartic proteinase [Vitis vinifera]
Length = 502
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 301/513 (58%), Positives = 378/513 (73%), Gaps = 21/513 (4%)
Query: 1 MEQKLLRSVFCL-WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M K + CL W A CL L SS+GL RIGLKK+ LDL L+AARITR +
Sbjct: 1 MRLKYILVANCLLWAWACCLALDDSSDGLVRIGLKKKPLDLARLHAARITRGNGFHA--- 57
Query: 60 VSGVRHRLGDSDED-----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
LG D++ + LKN+MDAQY+GEIGIGSPPQ FSV+FDTGSSNLWVPSS
Sbjct: 58 -----QGLGKVDDNYPKANTVYLKNYMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSS 112
Query: 115 KCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
KCYFSI+CYFH+RY++ S TY++ G+ C+INYGSGSISGFFSQD+V++G++V+K+QVF
Sbjct: 113 KCYFSIACYFHARYRAVLSRTYSKNGRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFT 172
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT+EG F LA+FDGI+GLGF+ +VG P+W NMV+Q LVS E+ SFWLNRDP A+
Sbjct: 173 EATKEGLFAFSLAQFDGILGLGFQNASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAK 232
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++FGGVD +HF G HT+VP+T+K YWQ E+GDILI STG CEGGCAAIVD+GTS
Sbjct: 233 IGGEVIFGGVDWRHFMGDHTFVPITRKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
++AGPT VVT+INHAIG EG+VS CK VV++YG LIW LVSG PE VC IGLCA+N
Sbjct: 293 MIAGPTTVVTQINHAIGAEGIVSFNCKNVVNKYGRLIWQFLVSGFQPENVCSDIGLCAYN 352
Query: 355 GAEYVS--TGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCD 412
G + G++TV+ GD+A C+ CEM W+Q QLK+ + KEKV Y+NELC+
Sbjct: 353 GTKNARQGAGMETVIGN-----GDNAACTFCEMIAFWIQVQLKEHKAKEKVFQYVNELCE 407
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+LPNP G+ ++CD + TMP +SF IGDK F L+ EQY LK VC+SGF A D+P
Sbjct: 408 NLPNPGGKDFVNCDALATMPVISFAIGDKYFPLTAEQYTLKVEVNCTTVCLSGFTALDVP 467
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLW+LGDVF+G YHT+FD G L++GFA++
Sbjct: 468 RPDGPLWVLGDVFLGAYHTIFDFGNLQVGFAKS 500
>gi|509163|emb|CAA48939.1| cyprosin [Cynara cardunculus]
Length = 474
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 294/483 (60%), Positives = 376/483 (77%), Gaps = 18/483 (3%)
Query: 33 LKKRRLDL------HS-LNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
LKKR++++ H+ N A RK GVR DSD +++ LKN+MDAQY
Sbjct: 1 LKKRKVNILNHPGEHAGSNDANARRK---------YGVRGNFRDSDGELIALKNYMDAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C FHS+Y+S S TY + GKS I
Sbjct: 52 FGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAI 111
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GSISGFFSQD+V++GD++VK+Q FIEAT+E +TFL A+FDGI+GLGF+EI+VGDA
Sbjct: 112 QYGTGSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDA 171
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
VPVW M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQ
Sbjct: 172 VPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQ 231
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
FE+GD+LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V
Sbjct: 232 FEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVD 291
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE--NVSAGDSAVCSAC 383
QYG + ++L+S PEK+C Q+ LC+F+G+ S I++VV+K S +C+ C
Sbjct: 292 QYGKSMIEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDKSKGKSSGLHDEMCTMC 351
Query: 384 EMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIF 443
+MAVVW+QNQ++Q +T+E +++Y+++LC+ LP+PMGES +DC + +MPN++FT+G K F
Sbjct: 352 QMAVVWMQNQIRQNETEENIINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTF 411
Query: 444 NLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
NLSPEQY+LK GEG CISGF A D+ PP GPLWILGDVFMG YHTVFD G LR+GFA
Sbjct: 412 NLSPEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDYGNLRVGFA 471
Query: 504 EAA 506
EAA
Sbjct: 472 EAA 474
>gi|357480353|ref|XP_003610462.1| Aspartic proteinase [Medicago truncatula]
gi|355511517|gb|AES92659.1| Aspartic proteinase [Medicago truncatula]
Length = 519
Score = 626 bits (1615), Expect = e-177, Method: Compositional matrix adjust.
Identities = 310/522 (59%), Positives = 397/522 (76%), Gaps = 19/522 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARIT----- 49
M KL V CL L S LL+ A S +GLRRI LKK +LD ++ AA
Sbjct: 1 MGNKLHVIVLCL--LVSTLLISAVSIAASSSDGLRRIALKKIQLDRNNKLAAAAAAAAGG 58
Query: 50 RKERYMGGAGVSGVRHRLGDS--DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
R+ + S ++ L ++ + DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSS
Sbjct: 59 RRTKDTDSLQSSIRKYNLANNYQETDIVALKNYLDAQYYGEISIGTSPQKFTVIFDTGSS 118
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPSSKC FS++CYFH++YKS KS TY + G + I YG+G+ISGFFS D+V+VGD+V
Sbjct: 119 NLWVPSSKCTFSVACYFHAKYKSTKSTTYRKNGTAAAIQYGTGAISGFFSYDSVKVGDIV 178
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
VK+Q FIEAT+E +TFL+A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFWL
Sbjct: 179 VKNQEFIEATKEPGVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIQEPVFSFWL 238
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
NR P+ EEGGEIVFGGVDP H+KG HTYVPV +KGYWQF++GD+ I +STG C GC+A
Sbjct: 239 NRKPEEEEGGEIVFGGVDPAHYKGNHTYVPVKRKGYWQFDMGDVTIDGKSTGYCVDGCSA 298
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQ 347
I DSGTSLLAGPT V+T INHAIG GVVS ECK +V++YG I +LL++ P+K+C +
Sbjct: 299 IADSGTSLLAGPTTVITMINHAIGASGVVSKECKTIVAEYGQTILNLLLAEAQPKKICSE 358
Query: 348 IGLCAFNGAEYVSTGIKTVV---EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVL 404
IGLC F+G V I++VV E+++ S A CSACEMAVVW+QNQL+Q +T++++L
Sbjct: 359 IGLCTFDGTHGVDLAIESVVDGNERKSSSGLHGASCSACEMAVVWMQNQLRQNKTQDQIL 418
Query: 405 SYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCIS 464
+YIN LCD +P+PMGES +DC+ I ++P +SFTIG + F+L+PE+YI K GEG A CIS
Sbjct: 419 TYINNLCDKMPSPMGESSVDCENISSLPVISFTIGGRTFDLAPEEYI-KVGEGPAAQCIS 477
Query: 465 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GF+A D+PPPRGP+WILGD+FMG YHTVFD GK R+GFAEAA
Sbjct: 478 GFVAIDVPPPRGPIWILGDIFMGRYHTVFDFGKSRVGFAEAA 519
>gi|297809619|ref|XP_002872693.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318530|gb|EFH48952.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 289/503 (57%), Positives = 386/503 (76%), Gaps = 9/503 (1%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G+
Sbjct: 8 FLLVFLLSCLILISTALCERKGDGTIRIGLKKRKLDRSNRLASQLFLKNR---GSWSPKD 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D++ D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 65 YFRLNDANADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 124
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 125 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATTEPGIT 184
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV + VFSFWLNR+P +EGGEIVFGG
Sbjct: 185 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKDPVFSFWLNRNPQDQEGGEIVFGG 244
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HTYVPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 245 VDPKHFKGEHTYVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 304
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG +G+VS ECK VV QYG + + L++ P+KVC QIG+CA++G VS I
Sbjct: 305 TMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPKKVCSQIGVCAYDGTHSVSMDI 364
Query: 364 KTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESII 423
++VV+ + A+CSACEMA VW++++L Q QT+E++L+Y ELC+ +P +S +
Sbjct: 365 QSVVDDGTSGLLNQAMCSACEMAAVWMESELTQNQTQERILAYAAELCNHIPTKNQQSAV 424
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC+R+ +MP VSF+IG + F+LSP+ YI K G+G+ C SGF A D+PPPRGPLWILGD
Sbjct: 425 DCERVSSMPIVSFSIGGRTFDLSPQDYIFKIGDGVESQCTSGFTAMDIPPPRGPLWILGD 484
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
+FMG YHTVFD GK R+GFA+AA
Sbjct: 485 IFMGPYHTVFDYGKARVGFAKAA 507
>gi|4589716|dbj|BAA76870.1| aspartic proteinase [Helianthus annuus]
Length = 509
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 292/495 (58%), Positives = 380/495 (76%), Gaps = 17/495 (3%)
Query: 21 LPASSNGLRRIGLKKR------RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
++ GL R+GLKKR R+ H L+ R+ G L +S+ D+
Sbjct: 23 FSSTKGGLLRVGLKKRKTNQFNRVSEHGLSMEGTDRRNF--------GFYDTLRNSEGDV 74
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+ LKN+MDAQYFGEIGIG+PPQ F+V+FDTGS+NLWVPSSKC+ S++C FH +YK+ +S+
Sbjct: 75 IVLKNYMDAQYFGEIGIGTPPQKFTVVFDTGSANLWVPSSKCFLSVACLFHQKYKASRSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + I YG+G+ISG FS+D+V++GD+VVK+Q FIEATRE +TFL A+FDGI+G
Sbjct: 135 TYKKNGTAAAIQYGTGAISGVFSRDSVKLGDLVVKEQDFIEATREPGITFLAAKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+++I+VG AVPVW NMV QGLV E VFSFW NR EEGGE+VFGGVDP HFKGKHT
Sbjct: 195 LGYQDISVGKAVPVWYNMVNQGLVQEPVFSFWFNRHTGEEEGGELVFGGVDPNHFKGKHT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF++GD+LIG+++TG C GGCAAI DSGTSLLAGPT ++T+INHAIG G
Sbjct: 255 YVPVTQKGYWQFDMGDVLIGDKTTGFCSGGCAAIADSGTSLLAGPTTIITQINHAIGAAG 314
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--V 372
V+S +CK +V QYG I ++L+S P+K+C ++ LC F+G+ VS+ I++VV+K N
Sbjct: 315 VMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSRMNLCTFDGSRDVSSIIESVVDKNNGKS 374
Query: 373 SAG-DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
SAG + +C+ CEMAVVW+Q+QLK+ QT++ +++Y+NELCD +P+PMGES +DC + M
Sbjct: 375 SAGLNDGICAFCEMAVVWMQSQLKRNQTEDSIINYVNELCDRIPSPMGESAVDCQTLSNM 434
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
PN++FTIG K F+L+PEQYILK GEG CISGF A D+ PP GPLWI GDVFMG YHT
Sbjct: 435 PNIAFTIGGKTFDLTPEQYILKVGEGEVAQCISGFTALDVAPPHGPLWIHGDVFMGQYHT 494
Query: 492 VFDSGKLRIGFAEAA 506
VFD GK R+GFAEAA
Sbjct: 495 VFDFGKSRVGFAEAA 509
>gi|255556616|ref|XP_002519342.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
gi|223541657|gb|EEF43206.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
Length = 500
Score = 625 bits (1612), Expect = e-176, Method: Compositional matrix adjust.
Identities = 299/504 (59%), Positives = 380/504 (75%), Gaps = 19/504 (3%)
Query: 10 FCLWVLASCL---LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
F ++A CL L +SS+ L +IGLKKRRLDL+S+NAARIT ++
Sbjct: 9 FRFLLVALCLGAWLGASSSSRLVKIGLKKRRLDLYSINAARIT----------IADASAS 58
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
G D++ LKN++D QY+GE+ IGSPPQ F+V+FDTGSSNLWVPSSKC SI+CYFHS
Sbjct: 59 FGWPKADVVYLKNYLDTQYYGEVAIGSPPQTFTVVFDTGSSNLWVPSSKCVLSITCYFHS 118
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+++++ S TYT+IG C+I+YGSGSISGFFSQD V++GD V+DQ F+E TREG L FL
Sbjct: 119 KFRAKMSRTYTKIGLPCKIDYGSGSISGFFSQDYVKLGDATVRDQEFVEVTREGLLAFLG 178
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+FDGI+GLGF+EI VG A PVW NMV QG V++++FS WLNRDP A GGEIVFGG+D
Sbjct: 179 TQFDGILGLGFQEITVGQATPVWYNMVRQGHVNQKLFSLWLNRDPTAGMGGEIVFGGLDW 238
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+HF+G+HTYVPVT+KGYWQ E+GD+ I +STG+CE GCAAIVDSGTS +AGPT +VT+I
Sbjct: 239 RHFRGEHTYVPVTEKGYWQIEVGDVFIAKKSTGMCEYGCAAIVDSGTSFIAGPTTIVTQI 298
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV 366
NHAIG +G+VS ECK VV+++GDLIW+ L+SGL PE VC IGLC +N T IKT
Sbjct: 299 NHAIGAQGIVSLECKSVVTKFGDLIWESLISGLRPEIVCVDIGLCVYNNNS--RTVIKTK 356
Query: 367 VEKEN----VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 422
+ + S +SA+C+ CEM V W+Q QLKQ++ +EK+ Y++ELC+ LP+PMG+S
Sbjct: 357 ADDRDGDKSSSLDESALCTFCEMIVFWIQVQLKQQKAEEKIFKYVDELCEKLPDPMGKSF 416
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
IDC I MP V+F IG+K F LSPEQY++K E +C+SGF A D+PPP+GPLWILG
Sbjct: 417 IDCGDITNMPYVTFIIGNKSFPLSPEQYVVKVEEKYGTICLSGFTALDVPPPQGPLWILG 476
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVF+G YHTVFD G LRIGFA AA
Sbjct: 477 DVFLGAYHTVFDFGNLRIGFARAA 500
>gi|73912433|dbj|BAE20413.1| aspartic proteinase [Triticum aestivum]
Length = 508
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 306/490 (62%), Positives = 385/490 (78%), Gaps = 10/490 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARIT-RKERYMGGAGVSGVRHRLGDSDE-DILPLKNF 80
+ + GL RI LKKR +D +S A ++ R+E ++ G G + L +E DI+ LKN+
Sbjct: 23 SEAEGLVRIALKKRAIDRNSRVAKSLSDREEVHLLG----GASNTLPSEEEGDIVSLKNY 78
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY H+RYK+ S+TY + G
Sbjct: 79 MNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHARYKAGASSTYKKNG 138
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF+EI
Sbjct: 139 KPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGVTFLVAKFDGILGLGFKEI 198
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+VG AVPVW NMVEQGL+S+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVP T
Sbjct: 199 SVGKAVPVWYNMVEQGLISDPVFSFWLNRHADDEGEGGEIIFGGMDPKHYVGEHTYVPAT 258
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS E
Sbjct: 259 QKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQE 318
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAG---D 376
CK +VSQYG I DLL++ P+KVC Q+GLC F+G VS GI++VV+ E V +
Sbjct: 319 CKTIVSQYGQQILDLLLAETQPKKVCSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLHT 378
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
+CSACEMAVVW+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++ F
Sbjct: 379 DPMCSACEMAVVWMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCASLGSMPDIEF 438
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
TI K F L PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD G
Sbjct: 439 TISGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYG 498
Query: 497 KLRIGFAEAA 506
KLR+GFA+AA
Sbjct: 499 KLRVGFAKAA 508
>gi|5822248|pdb|1QDM|A Chain A, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822249|pdb|1QDM|B Chain B, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822250|pdb|1QDM|C Chain C, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase
Length = 478
Score = 622 bits (1604), Expect = e-175, Method: Compositional matrix adjust.
Identities = 303/480 (63%), Positives = 379/480 (78%), Gaps = 6/480 (1%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEI 89
RI LKKR +D +S A ++ E +G + +R + + DI+ LKN+M+AQYFGEI
Sbjct: 2 RIALKKRPIDRNSRVATGLSGGEEQPLLSGANPLRS---EEEGDIVALKNYMNAQYFGEI 58
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
G+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY HSRYK+ S+TY + GK I YG+
Sbjct: 59 GVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYKKNGKPAAIQYGT 118
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF+EI+VG AVPVW
Sbjct: 119 GSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFKEISVGKAVPVW 178
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVPVT+KGYWQF++G
Sbjct: 179 YKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQFDMG 238
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 329
D+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK +VSQYG
Sbjct: 239 DVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQ 298
Query: 330 LIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGD---SAVCSACEMA 386
I DLL++ P+K+C Q+GLC F+G VS GI++VV+ E V + +CSACEMA
Sbjct: 299 QILDLLLAETQPKKICSQVGLCTFDGTRGVSAGIRSVVDDEPVKSNGLRADPMCSACEMA 358
Query: 387 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 446
VVW+QNQL Q +T++ +L Y+N+LC+ LP+PMGES +DC + +MP++ FTIG K F L
Sbjct: 359 VVWMQNQLAQNKTQDLILDYVNQLCNRLPSPMGESAVDCGSLGSMPDIEFTIGGKKFALK 418
Query: 447 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PE+YILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVFD GKLRIGFA+AA
Sbjct: 419 PEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGPYHTVFDYGKLRIGFAKAA 478
>gi|115439013|ref|NP_001043786.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|113533317|dbj|BAF05700.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|215701483|dbj|BAG92907.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188796|gb|EEC71223.1| hypothetical protein OsI_03158 [Oryza sativa Indica Group]
gi|222618996|gb|EEE55128.1| hypothetical protein OsJ_02912 [Oryza sativa Japonica Group]
gi|385717674|gb|AFI71272.1| unnamed protein [Oryza sativa Japonica Group]
Length = 522
Score = 620 bits (1599), Expect = e-175, Method: Compositional matrix adjust.
Identities = 299/493 (60%), Positives = 371/493 (75%), Gaps = 13/493 (2%)
Query: 27 GLRRIGLKKRRLD--------LHSLNAARI-TRKERYMGGAGVSGVRHRLGDSDE-DILP 76
G+ RI LKKR++D L +A R+ R+ ++ + E DI+
Sbjct: 30 GVVRIALKKRQVDETGRVGGHLAGEDAQRLLARRHGFLTNDAARAASRKARAEAEGDIVA 89
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+++AQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKC+ SI+CYFHSRYK+ +S+TY
Sbjct: 90 LKNYLNAQYYGEIAIGTPPQMFTVIFDTGSSNLWVPSSKCHLSIACYFHSRYKAGQSSTY 149
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GK I+YG+G+ISG+FSQD+V+VGDV VK+Q FIEATRE S+TF++A+FDGI+GLG
Sbjct: 150 KKNGKPASIHYGTGAISGYFSQDSVKVGDVAVKNQDFIEATREPSITFMVAKFDGILGLG 209
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG+AVP+W NMV QGLV + VFSFW NR D +GGEIVFGG+DP H+KG HTYV
Sbjct: 210 FKEISVGNAVPIWYNMVRQGLVVDPVFSFWFNRHADEGQGGEIVFGGIDPNHYKGNHTYV 269
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF +GD+LIG STG C GCAAI DSGTSLL GPT ++T+IN IG GVV
Sbjct: 270 PVTRKGYWQFNMGDVLIGGNSTGFCAAGCAAIADSGTSLLTGPTAIITQINEKIGATGVV 329
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-NVSAG 375
S ECK VVSQYG I D L + P KVC +GLC F+G VS GI++VV+ E S+G
Sbjct: 330 SQECKAVVSQYGQQILDQLRAETKPAKVCSSVGLCTFDGTHGVSAGIRSVVDDEVGKSSG 389
Query: 376 --DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPN 433
SA+C+ACE AVVW+ QL Q QT++ VL YI++LCD LP+PMGES +DC + +MP+
Sbjct: 390 PFSSAMCNACETAVVWMHTQLAQNQTQDLVLQYIDQLCDRLPSPMGESSVDCSSLASMPD 449
Query: 434 VSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVF 493
++FTIG F L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMG YHTVF
Sbjct: 450 IAFTIGGNKFVLKPEQYILKVGEGTATQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVF 509
Query: 494 DSGKLRIGFAEAA 506
D G L++GFAEAA
Sbjct: 510 DYGNLKVGFAEAA 522
>gi|1169175|sp|P40782.2|CYPR1_CYNCA RecName: Full=Cyprosin; Flags: Precursor
gi|1585067|prf||2124255A cyprosin
Length = 473
Score = 620 bits (1598), Expect = e-175, Method: Compositional matrix adjust.
Identities = 293/482 (60%), Positives = 374/482 (77%), Gaps = 17/482 (3%)
Query: 33 LKKRRLDL------HS-LNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
LKKR++++ H+ N A RK GVR DSD +++ LKN+MDAQY
Sbjct: 1 LKKRKVNILNHPGEHAGSNDANARRK---------YGVRGNFRDSDGELIALKNYMDAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C FHS+Y+S S TY + GKS I
Sbjct: 52 FGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAI 111
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GSISGFFSQD+V++GD++VK+Q FIEAT+E +TFL A+FDGI+GLGF+EI+VGDA
Sbjct: 112 QYGTGSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDA 171
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
VPVW M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQ
Sbjct: 172 VPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQ 231
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
FE+GD+LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V
Sbjct: 232 FEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVD 291
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAGDSAVCSACE 384
QYG + ++L+S PEK+C Q+ LC+F+G+ S I++VV+K + S+G C C
Sbjct: 292 QYGKSMIEMLLSEEQPEKICSQMKLCSFDGSHDTSMIIESVVDKSKGKSSGLPMRCVPCA 351
Query: 385 MAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFN 444
VVW+QNQ++Q +T+E +++Y+++LC+ LP+PMGES +DC + +MPN++FT+G K FN
Sbjct: 352 RWVVWMQNQIRQNETEENIINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTFN 411
Query: 445 LSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
LSPEQY+LK GEG CISGF A D+ PP GPLWILGDVFMG YHTVFD G LR+GFAE
Sbjct: 412 LSPEQYVLKVGEGATAQCISGFTAMDVAPPHGPLWILGDVFMGQYHTVFDYGNLRVGFAE 471
Query: 505 AA 506
AA
Sbjct: 472 AA 473
>gi|15233518|ref|NP_192355.1| phytepsin [Arabidopsis thaliana]
gi|75338508|sp|Q9XEC4.1|APA3_ARATH RecName: Full=Aspartic proteinase A3; Flags: Precursor
gi|4773885|gb|AAD29758.1|AF076243_5 putative aspartic protease [Arabidopsis thaliana]
gi|13937238|gb|AAK50111.1|AF372974_1 AT4g04460/T26N6_7 [Arabidopsis thaliana]
gi|7267203|emb|CAB77914.1| putative aspartic protease [Arabidopsis thaliana]
gi|332656990|gb|AEE82390.1| phytepsin [Arabidopsis thaliana]
Length = 508
Score = 620 bits (1598), Expect = e-175, Method: Compositional matrix adjust.
Identities = 288/503 (57%), Positives = 383/503 (76%), Gaps = 8/503 (1%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G
Sbjct: 8 FLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNR--GSHWSPKH 65
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D + D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 66 YFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 125
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 126 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATSEPGIT 185
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV E +FSFWLNR+P EGGEIVFGG
Sbjct: 186 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGG 245
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HT+VPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 246 VDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 305
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG +G+VS ECK VV QYG + + L++ P+KVC QIG+CA++G + VS GI
Sbjct: 306 TMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPKKVCSQIGVCAYDGTQSVSMGI 365
Query: 364 KTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESII 423
++VV+ + A+CSACEMA VW++++L Q QT+E++L+Y ELCD +P +S +
Sbjct: 366 QSVVDDGTSGLLNQAMCSACEMAAVWMESELTQNQTQERILAYAAELCDHIPTQNQQSAV 425
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC R+ +MP V+F+IG + F+L+P+ YI K GEG+ C SGF A D+ PPRGPLWILGD
Sbjct: 426 DCGRVSSMPIVTFSIGGRSFDLTPQDYIFKIGEGVESQCTSGFTAMDIAPPRGPLWILGD 485
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
+FMG YHTVFD GK R+GFA+AA
Sbjct: 486 IFMGPYHTVFDYGKGRVGFAKAA 508
>gi|334186351|ref|NP_001190671.1| phytepsin [Arabidopsis thaliana]
gi|332656991|gb|AEE82391.1| phytepsin [Arabidopsis thaliana]
Length = 504
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 288/503 (57%), Positives = 382/503 (75%), Gaps = 12/503 (2%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G
Sbjct: 8 FLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNR--GSHWSPKH 65
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D + D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 66 YFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 125
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 126 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATSEPGIT 185
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV E +FSFWLNR+P EGGEIVFGG
Sbjct: 186 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGG 245
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HT+VPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 246 VDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 305
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
T INHAIG +G+VS ECK VV QYG +++ LL +KVC QIG+CA++G + VS GI
Sbjct: 306 TMINHAIGAQGIVSRECKAVVDQYG----KTMLNSLLAQKVCSQIGVCAYDGTQSVSMGI 361
Query: 364 KTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESII 423
++VV+ + A+CSACEMA VW++++L Q QT+E++L+Y ELCD +P +S +
Sbjct: 362 QSVVDDGTSGLLNQAMCSACEMAAVWMESELTQNQTQERILAYAAELCDHIPTQNQQSAV 421
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC R+ +MP V+F+IG + F+L+P+ YI K GEG+ C SGF A D+ PPRGPLWILGD
Sbjct: 422 DCGRVSSMPIVTFSIGGRSFDLTPQDYIFKIGEGVESQCTSGFTAMDIAPPRGPLWILGD 481
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
+FMG YHTVFD GK R+GFA+AA
Sbjct: 482 IFMGPYHTVFDYGKGRVGFAKAA 504
>gi|388517285|gb|AFK46704.1| unknown [Medicago truncatula]
Length = 510
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 304/510 (59%), Positives = 383/510 (75%), Gaps = 24/510 (4%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSL----------NAARITRKERYMGGAG 59
CLW L L+ A + GLRRIGLKK +L+ +L ++ R + +GGAG
Sbjct: 12 LCLWTLLFSLVSCAPNEGLRRIGLKKNKLEPKNLLGSKGCESSWSSIRNYASKNILGGAG 71
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+ D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSN WVPS KCYFS
Sbjct: 72 -----------EADVVALKNYLDAQYYGEISIGTPPQTFTVIFDTGSSNTWVPSVKCYFS 120
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
++C H++YKS +S+TY G I YG+G++SGFFS DNV+VGDVVVKD FIEATRE
Sbjct: 121 LACLVHAKYKSSQSSTYKPNGTHAAIQYGTGAVSGFFSYDNVKVGDVVVKDVEFIEATRE 180
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
LTF+ A+FDG++GLGF+EI+VG+AVP+W MV+QGLV + VFSFWLNR+P+ E+GGE+
Sbjct: 181 PGLTFVAAKFDGLLGLGFQEISVGNAVPIWYKMVKQGLVKDPVFSFWLNRNPNEEQGGEL 240
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDP HFKG+HTYVPVT+KGYWQF +GD+LI + TG C C+AI DSGTSLLAGP
Sbjct: 241 VFGGVDPAHFKGEHTYVPVTRKGYWQFAMGDVLIDGKPTGYCANDCSAIADSGTSLLAGP 300
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T V+T IN AIG GV S EC+ VV QYG I LLV+ P+KVC QIGLC F+G + +
Sbjct: 301 TTVITMINQAIGASGVYSQECRTVVDQYGHSILQLLVAEAQPKKVCSQIGLCTFDGTQGI 360
Query: 360 STGIKTVVEK-ENVSAG--DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPN 416
S GI++VVE+ + +S+G A C CEMAVVW+QNQLKQ QT+E++++Y + LCD +PN
Sbjct: 361 SMGIQSVVEQTDRISSGGHQDATCFVCEMAVVWMQNQLKQNQTEERIINYADSLCDKMPN 420
Query: 417 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 476
P+G+S +DC +I +MP VSFTIG K F+L+PE+YILK GEG A CISGF A D+PPPRG
Sbjct: 421 PLGQSSVDCAKISSMPKVSFTIGGKKFDLAPEEYILKVGEGAAAQCISGFTALDVPPPRG 480
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLWI GD+FMG YHTVFD GKLR+GFAEAA
Sbjct: 481 PLWIPGDIFMGRYHTVFDYGKLRVGFAEAA 510
>gi|449433980|ref|XP_004134774.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449526063|ref|XP_004170034.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 297/492 (60%), Positives = 378/492 (76%), Gaps = 10/492 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE-----DILPL 77
AS+ G RIGLKK + D +S A + K+ G+ V G ++ G++ E DI+PL
Sbjct: 27 ASNEGFLRIGLKKIKYDQNSRFKALLESKKGEFLGSSV-GKHNQWGNNLEESKNADIVPL 85
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+FH++Y+S +S+TY
Sbjct: 86 KNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYK 145
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G S I YGSG+ISGFFS DNV+VGDV+V++Q IEAT ++TF+ A+FDGI+GLGF
Sbjct: 146 RNGTSAAIQYGSGAISGFFSYDNVQVGDVIVRNQELIEATSMSTMTFMAAKFDGILGLGF 205
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EIA G AVPVW NMV+Q LV E+VFSFWLNR+ + +EGGE+VFGGVDPKHFKG+HTYVP
Sbjct: 206 QEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVP 265
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT KGYWQF++GDILIG ++T C GGC+AI DSGTSLLAGP+ +V IN AIG V
Sbjct: 266 VTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAH 325
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV-EKENVSAG- 375
ECK +VSQYG I DLL++ PEK+C +IG+C F+ VS I+ VV +K+ S+G
Sbjct: 326 PECKAIVSQYGRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVVSDKDGRSSGG 385
Query: 376 -DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
A+CSACEMAV+W+Q++LKQ +T+E ++ +NELCD N E+++DC RI MPNV
Sbjct: 386 FSEAMCSACEMAVLWIQDELKQNKTQEDIIENVNELCDRGLN-QDETLVDCGRISQMPNV 444
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
SFTIGD++F L+ + YILK GEG A CISGF+ FD+PPPRGPLWILGDVFMG YHTVFD
Sbjct: 445 SFTIGDRLFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFD 504
Query: 495 SGKLRIGFAEAA 506
GK R+GFAEAA
Sbjct: 505 FGKARVGFAEAA 516
>gi|226532912|ref|NP_001146573.1| hypothetical protein [Zea mays]
gi|219887869|gb|ACL54309.1| unknown [Zea mays]
gi|413917600|gb|AFW57532.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 494
Score = 609 bits (1571), Expect = e-172, Method: Compositional matrix adjust.
Identities = 277/442 (62%), Positives = 356/442 (80%), Gaps = 3/442 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
RLG S +PL ++++ QY+G +GIG+PPQNF+VIFDTGSSNLWVPSS+CYFSI+CY H
Sbjct: 55 RLGASGGGDVPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYLH 114
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKS KS+TY G++C+I YGSGSI+GFFS D+V VGD+ VK+Q FIE TRE S+TF+
Sbjct: 115 HRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITFI 174
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGV 244
+ +FDGI+GLG+ EI+VG A P+W +M EQ L++E+VFSFWLNR PD A GGE+VFGGV
Sbjct: 175 IGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGV 234
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP HF G HTYVPV++KGYWQF++GD+LI STG C GCAAIVDSGTSLLAGPT ++
Sbjct: 235 DPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIA 294
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
++N AIG +G++S ECK VVSQYG++I D+L++ P++VC Q+GLC F+GA VS GI+
Sbjct: 295 QVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIE 354
Query: 365 TVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+VV KEN+ G +CSAC+MAVVW++NQL++ +TKE +L Y N+LC+ LP+P GES +
Sbjct: 355 SVVGKENL--GSDVMCSACQMAVVWIENQLRENKTKELILQYANQLCERLPSPNGESTVS 412
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C I MP+++FTI +K F L+P+QYI+K +G VCISGFMA+D+PPPRGPLWILGDV
Sbjct: 413 CQEISKMPSLAFTIANKTFTLTPQQYIVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDV 472
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
FMG YHTVFD G RIGFAE+A
Sbjct: 473 FMGAYHTVFDFGNDRIGFAESA 494
>gi|223949795|gb|ACN28981.1| unknown [Zea mays]
gi|413917601|gb|AFW57533.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917602|gb|AFW57534.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 509
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 277/442 (62%), Positives = 356/442 (80%), Gaps = 3/442 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
RLG S +PL ++++ QY+G +GIG+PPQNF+VIFDTGSSNLWVPSS+CYFSI+CY H
Sbjct: 70 RLGASGGGDVPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYLH 129
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKS KS+TY G++C+I YGSGSI+GFFS D+V VGD+ VK+Q FIE TRE S+TF+
Sbjct: 130 HRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITFI 189
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGV 244
+ +FDGI+GLG+ EI+VG A P+W +M EQ L++E+VFSFWLNR PD A GGE+VFGGV
Sbjct: 190 IGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGV 249
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP HF G HTYVPV++KGYWQF++GD+LI STG C GCAAIVDSGTSLLAGPT ++
Sbjct: 250 DPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIA 309
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
++N AIG +G++S ECK VVSQYG++I D+L++ P++VC Q+GLC F+GA VS GI+
Sbjct: 310 QVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIE 369
Query: 365 TVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+VV KEN+ G +CSAC+MAVVW++NQL++ +TKE +L Y N+LC+ LP+P GES +
Sbjct: 370 SVVGKENL--GSDVMCSACQMAVVWIENQLRENKTKELILQYANQLCERLPSPNGESTVS 427
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C I MP+++FTI +K F L+P+QYI+K +G VCISGFMA+D+PPPRGPLWILGDV
Sbjct: 428 CQEISKMPSLAFTIANKTFTLTPQQYIVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDV 487
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
FMG YHTVFD G RIGFAE+A
Sbjct: 488 FMGAYHTVFDFGNDRIGFAESA 509
>gi|356545806|ref|XP_003541325.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 495
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 286/509 (56%), Positives = 371/509 (72%), Gaps = 19/509 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
+ LL + C W S + +S +G+ R+ LK+R LD++SLN+ARI ++ GV
Sbjct: 3 FKHLLLVTSVCAW-FVSLAVTTSSGDGVTRVSLKRRSLDINSLNSARIKGVVNHLKADGV 61
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
LKN++DAQYFGEIGIGSPPQ+F V+FDTGSSNLWVPS+KC SI
Sbjct: 62 Y---------------LKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKCVLSI 106
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+Y+S+ SNTYT+IG C+I YG G + GF SQDN+ VGD+++KDQ F E T+EG
Sbjct: 107 ACYFHSKYRSKLSNTYTKIGTPCKIPYGHGHVPGFISQDNLRVGDIIIKDQQFAEITKEG 166
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L FL FDGI+GLGF+ +V PVW NM+EQGLV++++FS WLN+DP A+ GGEIV
Sbjct: 167 PLAFLAMHFDGILGLGFQNKSVRQVTPVWYNMIEQGLVTQKIFSLWLNQDPVAKLGGEIV 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D +HFKG+HTYVP+T+K YWQ E+GDI I N TG+CEGGCAAI+DSGTSL+AGPT
Sbjct: 227 FGGIDWRHFKGEHTYVPLTQKDYWQIEVGDIQIANNPTGLCEGGCAAIIDSGTSLIAGPT 286
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
+VT+INHAIG EG VS ECK ++ YGD IW+ ++SGL PE +C IGLC+ N +
Sbjct: 287 KIVTQINHAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEIICVDIGLCSRNRTFITN 346
Query: 361 TGIKTVVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
I+T V E+ +S +C+ C+M V W+Q QLKQK TKEK+L Y++ELC+ LPNP
Sbjct: 347 DVIETAVYNESWGESRTKESPLCTFCDMIVFWMQVQLKQKNTKEKILKYVDELCEKLPNP 406
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+G++ IDC+ I MP ++FTIG+K F LSPEQY+L+ EG VC GF+ D+P P+GP
Sbjct: 407 VGQTFIDCNDIANMPQITFTIGNKSFPLSPEQYMLRIEEGCNTVCYGGFVPLDVPAPQGP 466
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LW+LGD+F+G YHTVFD G LRIGFAEAA
Sbjct: 467 LWVLGDLFLGAYHTVFDYGNLRIGFAEAA 495
>gi|356565563|ref|XP_003551009.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 494
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 287/505 (56%), Positives = 368/505 (72%), Gaps = 20/505 (3%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L+ + C W S ++ +S +GL R+ LK+R LD+ SLN+A+I ++ GV
Sbjct: 7 LVVTCVCAW-FGSLVVTTSSGDGLMRVSLKRRSLDISSLNSAKIKEVVNHLKADGVY--- 62
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
LKN++DAQYFGEIGIGSPPQ+F V+FDTGSSNLWVPS+KC SI+CYF
Sbjct: 63 ------------LKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKCVLSIACYF 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y+S+ SNTYT+IG C+I YG G I GF SQDN+ VGD+++KDQ F E T+EG L F
Sbjct: 111 HSKYRSKLSNTYTKIGTPCKIPYGRGHIPGFISQDNIRVGDIIIKDQQFAEITKEGPLAF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
L FDGI+GLGF+ +VG PVW NM+EQG VS+++FS WLN+DP A+ GGEIVFGG+
Sbjct: 171 LAMHFDGILGLGFQNKSVGQVTPVWYNMIEQGHVSQKIFSLWLNQDPVAKVGGEIVFGGI 230
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D +HFKG HTYVP+T+K YWQ E+GDILI N TG+CEGGCAAI+DSGTSL+AGPT +VT
Sbjct: 231 DWRHFKGDHTYVPLTQKDYWQIEVGDILIANNPTGLCEGGCAAIIDSGTSLIAGPTKIVT 290
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
+IN AIG EG VS ECK ++ YGD IW+ ++SGL PE +C IGLC+ E S I+
Sbjct: 291 QINRAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEIICVDIGLCSLY-LETCSDVIE 349
Query: 365 TVVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
T E+ +S +C+ C+M V W+Q QLKQK TKEK+L Y++ELC+ LPNP+G++
Sbjct: 350 TATHNESWGESRTKESPLCTFCDMIVFWMQVQLKQKNTKEKILKYVDELCEKLPNPVGQT 409
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
IDC+ I MP ++FTIG+K F LSPEQY+L+ EG VC GF+ D+P P+GPLW+L
Sbjct: 410 FIDCNDIANMPQITFTIGNKSFPLSPEQYMLRIEEGCNTVCYGGFVPLDVPAPQGPLWVL 469
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GD+F+G YHTVFD G LRIGFAEAA
Sbjct: 470 GDLFLGAYHTVFDYGNLRIGFAEAA 494
>gi|357450315|ref|XP_003595434.1| Aspartic proteinase [Medicago truncatula]
gi|355484482|gb|AES65685.1| Aspartic proteinase [Medicago truncatula]
Length = 507
Score = 604 bits (1557), Expect = e-170, Method: Compositional matrix adjust.
Identities = 288/512 (56%), Positives = 373/512 (72%), Gaps = 11/512 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K + V CLW+ + L S++ L RI LKKR LD+ SLN +RI ++ + +
Sbjct: 1 MSLKYMLVVTCLWIWSLSLAYTISNDNLMRISLKKRNLDIQSLNTSRI---KKVIHERDL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
V G +D++ LKN+ D QY+GEIGIGSPPQ F+V+FDTGSSNLWVPSS+C FSI
Sbjct: 58 ESVDTNYGS--KDVVYLKNYFDVQYYGEIGIGSPPQYFNVVFDTGSSNLWVPSSRCIFSI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+Y+S S+TY EIG CEI Y G I GFFSQDNV+VGD+ VKDQ F E TREG
Sbjct: 116 ACYFHSKYRSGISSTYNEIGVPCEIPYDEGYIYGFFSQDNVKVGDINVKDQEFCEITREG 175
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ L FDGI+GLGF++++VG PVW NM+EQG +S++VFS W N+DP AE GGEIV
Sbjct: 176 NFALLALPFDGILGLGFQDVSVGKVTPVWYNMIEQGHISDKVFSLWFNKDPMAEVGGEIV 235
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HF+G HTY P+++KGYWQ E+GDIL+ N +TG+CEGGCAAIVDSGTSL+AGPT
Sbjct: 236 FGGVDKRHFRGDHTYFPISQKGYWQIEVGDILLANNTTGLCEGGCAAIVDSGTSLIAGPT 295
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
VVT+INH IG EG VS ECK +V YG+LIW+ L+SGL PE +C I LC+ NG + ++
Sbjct: 296 GVVTQINHVIGTEGYVSYECKNIVHNYGNLIWESLISGLNPEILCADIRLCSDNGFQRMN 355
Query: 361 TGIKTVVEKENVSAG---DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
I+TVV E+ +S CS C M V+W+Q Q+KQ KEKVL Y++ELC+ LPNP
Sbjct: 356 DVIETVVHNESRDGSPLKESLFCSFCNMVVLWMQVQIKQSNVKEKVLKYVDELCEKLPNP 415
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT---GEGIAEVCISGFMAFDLPPP 474
+G+S I+C + MP+++FT G+K+F LSPEQY+L+ E + VC SGF+A D+P P
Sbjct: 416 VGQSFINCSSVSDMPHITFTFGNKLFPLSPEQYVLRVESDDEDCSPVCYSGFVALDVPSP 475
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+GPLW++GD+F+ YHTVFD LRIGFAE+
Sbjct: 476 QGPLWVVGDIFLQAYHTVFDYANLRIGFAEST 507
>gi|168029783|ref|XP_001767404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681300|gb|EDQ67728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 288/479 (60%), Positives = 356/479 (74%), Gaps = 8/479 (1%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RRI LKK+ + L S+ A +R + L D EDI+ L N++DAQYFGE
Sbjct: 28 RRIALKKKPVTLQSVRNAASRTIQR---AKTFTRSEDELRDG-EDIVALNNYLDAQYFGE 83
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IGIGSPPQ F+VIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I YG
Sbjct: 84 IGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAIQYG 143
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ GF SQD+V +GD+ VK QVF EAT+E LTF++A+FDGI+GLGF+EI+V P
Sbjct: 144 TGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRVTPP 203
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
W NM++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF+L
Sbjct: 204 WYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDL 263
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
GD+ I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV QYG
Sbjct: 264 GDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQYG 323
Query: 329 DLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEMAV 387
D I ++L++ + P KVC +GLC F E GI +VVEK+ S + +C+ CEMAV
Sbjct: 324 DQIVEMLLAQMNPGKVCTTLGLCNFGAGE---PGIASVVEKDQSHSLREDPLCTVCEMAV 380
Query: 388 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 447
VW QNQL Q +TKE++ +Y+N+LC+ LP+P GES +DC+ + +MPNV+FTI +K F L P
Sbjct: 381 VWAQNQLSQNRTKEQIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFELKP 440
Query: 448 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
E+YILK GEG CISGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFAEAA
Sbjct: 441 EEYILKIGEGAEAQCISGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNTRLGFAEAA 499
>gi|40641523|emb|CAE52913.1| putative vacuaolar aspartic proteinase [Physcomitrella patens]
Length = 504
Score = 600 bits (1548), Expect = e-169, Method: Compositional matrix adjust.
Identities = 287/481 (59%), Positives = 358/481 (74%), Gaps = 12/481 (2%)
Query: 29 RRIGLKKRRLDLHSLN--AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYF 86
RRI LKK+ + L S+ A+R ++ + + L D EDI+ L N++DAQYF
Sbjct: 28 RRIALKKKPVTLQSVRNAASRTIQRAKTF-----TRSEDELRDG-EDIVALNNYLDAQYF 81
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
GEIGIGSPPQ F+VIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I
Sbjct: 82 GEIGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAIQ 141
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+ GF SQD+V +GD+ VK QVF EAT+E LTF++A+FDGI+GLGF+EI+V
Sbjct: 142 YGTGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRVT 201
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P W NM++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF
Sbjct: 202 PPWYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQF 261
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+LGD+ I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV Q
Sbjct: 262 DLGDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQ 321
Query: 327 YGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEM 385
YGD I ++L++ + P KVC +GLC F E GI +VVEK+ S + +C+ C M
Sbjct: 322 YGDQIVEMLLAQMNPGKVCTTLGLCNFGAGE---PGIASVVEKDQSHSLREDPLCTVCGM 378
Query: 386 AVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNL 445
AVVW QNQL Q +TKE++ +Y+N+LC+ LP+P GES +DC+ + +MPNV+FTI +K F L
Sbjct: 379 AVVWAQNQLSQNRTKEQIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFEL 438
Query: 446 SPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PE+YILK GEG CISGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFAEA
Sbjct: 439 KPEEYILKIGEGAEAQCISGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNTRLGFAEA 498
Query: 506 A 506
A
Sbjct: 499 A 499
>gi|224106994|ref|XP_002314336.1| predicted protein [Populus trichocarpa]
gi|222863376|gb|EEF00507.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 599 bits (1545), Expect = e-169, Method: Compositional matrix adjust.
Identities = 289/488 (59%), Positives = 371/488 (76%), Gaps = 19/488 (3%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
SS+GL R+GLKKR LDL+S++AARITR + S R S+ +I+ LKN++D
Sbjct: 10 SSDGLARVGLKKRNLDLNSIHAARITRPQ------ATSFARV---TSNAEIVYLKNYLDT 60
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEIGIGSPPQ F+V+FDTGSSNLWVPSSKC SI+CYFHS++ +R S TYT+IG C
Sbjct: 61 QYYGEIGIGSPPQIFTVVFDTGSSNLWVPSSKCLLSITCYFHSKFIARLSRTYTKIGIPC 120
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+I YGSGS+SGF SQD+V+VGD ++ +QV +++EG L L +FDGI+GL F++IAV
Sbjct: 121 KIQYGSGSVSGFLSQDHVKVGDDIIINQVSSASSKEGFLALLGVQFDGILGLAFQDIAVA 180
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM EQG VS++VFS WLNR+P +E GGE+VFGG+D +HFKG HTYVPVT +GY
Sbjct: 181 KATPVWYNMAEQGHVSQKVFSLWLNRNPSSELGGEVVFGGLDWRHFKGDHTYVPVTGRGY 240
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQ ++GDI I N STG+C GGC+AIVDSGTS L+GPT +V +INHAIG G+VS ECK V
Sbjct: 241 WQIQVGDIFIANNSTGLCAGGCSAIVDSGTSFLSGPTRIVAQINHAIGARGIVSLECKEV 300
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE-----NVSAGDSA 378
VS+Y + IWD ++SGL PE +C +GLC +N +T I+TVV+ E +V G A
Sbjct: 301 VSKYWNSIWDSMISGLRPEIICVDVGLCLYNN----NTVIETVVDGEATDRLSVDEG-GA 355
Query: 379 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 438
+C+ CEM V W+Q QLK+K+ KEK+ Y++ELC+ LPNP+G+S I+CD I MP VSFTI
Sbjct: 356 LCTFCEMIVFWIQVQLKEKKAKEKIFHYVDELCERLPNPLGKSFINCDEITAMPYVSFTI 415
Query: 439 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 498
G++ F LSPEQYI++ E A +C+SGF A D+PP +GPLWILGDVF+G YHTVFD G
Sbjct: 416 GNRSFPLSPEQYIVRVEESYATICLSGFAALDMPPRQGPLWILGDVFLGAYHTVFDFGNH 475
Query: 499 RIGFAEAA 506
RIGFA+AA
Sbjct: 476 RIGFAKAA 483
>gi|110162110|emb|CAL07969.1| aspartic proteinase [Cynara cardunculus]
Length = 506
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 280/484 (57%), Positives = 363/484 (75%), Gaps = 6/484 (1%)
Query: 24 SSNGLRRIGLKKRRLD-LHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFM 81
S+ GL R+GLKKR++D L L A + +G A G R L S I+ L N
Sbjct: 26 SNGGLLRVGLKKRKVDRLDQLRAHGV----HMLGNARKDFGFRRTLRVSGSGIVALTNDR 81
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D Y+GEIGIG+PPQNF+VIFDTGSS+LWVPSSKCY S++C H RY+S S+TY G
Sbjct: 82 DTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSSKCYTSLACVIHPRYESGDSSTYKRNGT 141
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+ I YG+G+I GF+SQD+VEVGD+VV+ Q FIE T E FL FDGI+GLGF+EI+
Sbjct: 142 TASIQYGTGAIVGFYSQDSVEVGDLVVEQQDFIETTEEDDTVFLARDFDGILGLGFQEIS 201
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
G AVPVW NMV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+K
Sbjct: 202 AGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRK 261
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQFE+GD+LIG++S+G C GGCAAI DSGTSL+AGPT ++T+IN AIG +GV++ +CK
Sbjct: 262 GYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSLIAGPTAIITQINQAIGAKGVLNQQCK 321
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCS 381
+VSQYG + +L S + P+++C Q+ LC F+GA +V + I++VV+K N + +C+
Sbjct: 322 TLVSQYGKNMIQMLTSEVQPDQICSQMKLCTFDGARHVRSMIESVVDKNNDKSSGDEICT 381
Query: 382 ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDK 441
CEMA+VW+QN++K+ +T++ +++++NELCD LP ESI+DC+ I +MPN +FTIG K
Sbjct: 382 FCEMALVWMQNEIKRNETEDNIINHVNELCDHLPTSSAESIVDCNGISSMPNTAFTIGRK 441
Query: 442 IFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIG 501
+F L+PEQYI K GEG A CISGF A D+ P+GP+WILGD+FMG YHTVFD GKLR+G
Sbjct: 442 LFELTPEQYIFKVGEGEAATCISGFTALDIMSPQGPIWILGDMFMGPYHTVFDYGKLRVG 501
Query: 502 FAEA 505
F EA
Sbjct: 502 FTEA 505
>gi|357511711|ref|XP_003626144.1| Aspartic proteinase [Medicago truncatula]
gi|355501159|gb|AES82362.1| Aspartic proteinase [Medicago truncatula]
Length = 426
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 282/424 (66%), Positives = 351/424 (82%), Gaps = 5/424 (1%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+ RIGL+KR LDLH+++A ++ R+++ G + + H+ SD+ I+PLKN+MDAQYFG
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPMM-LAHK--SSDDAIVPLKNYMDAQYFG 57
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CY H+ YK++KS TY + G SC+I+Y
Sbjct: 58 EIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKNGTSCKISY 117
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GSISG+FSQDNV+VG VVK Q FIEATREGSL+FL +FDGI GLGF+EI+V A+P
Sbjct: 118 GTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQEISVERALP 177
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
VW NM+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E
Sbjct: 178 VWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIE 237
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQY
Sbjct: 238 MGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQY 297
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEM 385
G+LIWDLLVSG+ P VC Q+GLC+ G + S GI+ V +KE +SA D+ +CS+C+M
Sbjct: 298 GELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEMVTDKEQSELSAKDTPLCSSCQM 357
Query: 386 AVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNL 445
V+WVQNQLKQK TKE+V +Y+N+LC+SLP+P GES+I C+ I MPN+SFTIG+K F L
Sbjct: 358 LVLWVQNQLKQKATKERVFNYVNQLCESLPSPSGESVISCNDISKMPNISFTIGNKPFVL 417
Query: 446 SPEQ 449
+PEQ
Sbjct: 418 TPEQ 421
>gi|413946823|gb|AFW79472.1| hypothetical protein ZEAMMB73_587615 [Zea mays]
Length = 488
Score = 593 bits (1529), Expect = e-167, Method: Compositional matrix adjust.
Identities = 279/451 (61%), Positives = 355/451 (78%), Gaps = 6/451 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 42 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 100
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 101 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 159
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 160 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 219
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 220 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 279
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 280 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 339
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 340 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 399
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS I++VVEK+ G C+ACEMAVVW+QNQL++ +TKE +L+Y N+LC+ LP+P
Sbjct: 400 VSNPIESVVEKQK--RGSDLFCTACEMAVVWIQNQLRENKTKELILNYANQLCERLPSPN 457
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQ 449
GES +DC +I MPN++FTI +K F L+PEQ
Sbjct: 458 GESTVDCHQISKMPNLAFTIANKTFTLTPEQ 488
>gi|302820804|ref|XP_002992068.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
gi|300140190|gb|EFJ06917.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
Length = 499
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 283/500 (56%), Positives = 363/500 (72%), Gaps = 15/500 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V G
Sbjct: 11 LLVVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVHG---- 62
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 63 -ANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RYKSRKS+TY S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVD 245
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLLAGPT +V +
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQ 301
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C + I +
Sbjct: 302 INQAIGATGLVSEECKMVVTQYGDLIVELLLAQVTPDKVCAQAGVCTLRND---NPHIAS 358
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V++KEN GD +CS CEMAVVWVQNQL+Q +TK+++ Y+N+LC+ LP+P G+S+++C
Sbjct: 359 VLDKENQKVGDDVLCSVCEMAVVWVQNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVEC 418
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+I ++PNVSFTI ++ F L+P+QYIL+ GEG A C+SGF D+PPP GP+WILGDVF
Sbjct: 419 AKISSLPNVSFTIANQTFELTPKQYILQVGEGAAAQCLSGFTGMDVPPPAGPIWILGDVF 478
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
MGVYHTVFD G RIGFA+A
Sbjct: 479 MGVYHTVFDFGNKRIGFAKA 498
>gi|302761358|ref|XP_002964101.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
gi|300167830|gb|EFJ34434.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
Length = 505
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 283/501 (56%), Positives = 363/501 (72%), Gaps = 11/501 (2%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVRDANSL 66
Query: 67 LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
+ S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 67 TKNGSVPDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLH 126
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKSRKS+TY G S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 127 HRYKSRKSSTYKPDGTSIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFL 186
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGV 244
+A+FDGI+GLGF+ I+ G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV
Sbjct: 187 VAKFDGILGLGFKAISKGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGV 246
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ FKGKH Y PVT++GYWQF +GD+ + QSTG C GCAAI DSGTSLLAGPT +V
Sbjct: 247 NKDRFKGKHVYTPVTREGYWQFNMGDVAVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVA 306
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
+IN AIG G+VS ECK+VV+QYGDLI +LL++ + P++VC Q G+C+ + I
Sbjct: 307 QINQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDRVCAQAGVCSLRND---NPHIA 363
Query: 365 TVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+V++KEN GD +CS CEMAVVWVQNQL+Q +TK+++ Y+N+LC+ LP+P G+S+++
Sbjct: 364 SVLDKENQKVGDDVLCSVCEMAVVWVQNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVE 423
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C +I ++PNVSFTI ++ F L+P+QYIL+ GEG A CISGF D+PPP GP+WILGDV
Sbjct: 424 CAKISSLPNVSFTIANQTFELTPKQYILQVGEGAAAQCISGFTGMDVPPPAGPIWILGDV 483
Query: 485 FMGVYHTVFDSGKLRIGFAEA 505
FMGVYHTVFD G RIGFA+A
Sbjct: 484 FMGVYHTVFDFGNKRIGFAKA 504
>gi|75338567|sp|Q9XFX4.1|CARDB_CYNCA RecName: Full=Procardosin-B; Contains: RecName: Full=Cardosin-B
heavy chain; AltName: Full=Cardosin-B 34 kDa subunit;
Contains: RecName: Full=Cardosin-B light chain; AltName:
Full=Cardosin-B 14 kDa subunit; Flags: Precursor
gi|4582534|emb|CAB40349.1| preprocardosin B [Cynara cardunculus]
Length = 506
Score = 584 bits (1505), Expect = e-164, Method: Compositional matrix adjust.
Identities = 275/484 (56%), Positives = 359/484 (74%), Gaps = 6/484 (1%)
Query: 24 SSNGLRRIGLKKRRLD-LHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFM 81
S+ GL R+GLKKR++D L L A + +G A G R L DS I+ L N
Sbjct: 26 SNGGLLRVGLKKRKVDRLDQLRAHGV----HMLGNARKDFGFRRTLSDSGSGIVALTNDR 81
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D Y+GEIGIG+PPQNF+VIFDTGSS+LWVPS+KC S++C H RY S S+TY G
Sbjct: 82 DTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSTKCDTSLACVIHPRYDSGDSSTYKGNGT 141
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+ I YG+G+I GF+SQD+VEVGD+VV+ Q FIE T E FL + FDGI+GLGF+EI+
Sbjct: 142 TASIQYGTGAIVGFYSQDSVEVGDLVVEHQDFIETTEEDDTVFLKSEFDGILGLGFQEIS 201
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
G AVPVW NMV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+K
Sbjct: 202 AGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRK 261
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQFE+GD+LIG++S+G C GGCAAI DSGTS AGPT ++T+IN AIG +GV++ +CK
Sbjct: 262 GYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSFFAGPTAIITQINQAIGAKGVLNQQCK 321
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCS 381
+V QYG + +L S + P+K+C + LC F+GA V + I++VV+K N + +C+
Sbjct: 322 TLVGQYGKNMIQMLTSEVQPDKICSHMKLCTFDGAHDVRSMIESVVDKNNDKSSGGEICT 381
Query: 382 ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDK 441
CEMA+V +QN++K+ +T++ +++++NE+CD LP ESI+DC+ I +MPN++FTIG K
Sbjct: 382 FCEMALVRMQNEIKRNETEDNIINHVNEVCDQLPTSSAESIVDCNGISSMPNIAFTIGSK 441
Query: 442 IFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIG 501
+F ++PEQYI K GEG A CISGF A D+ P+GP+WILGD+FMG YHTVFD GKLR+G
Sbjct: 442 LFEVTPEQYIYKVGEGEAATCISGFTALDIMSPQGPIWILGDMFMGPYHTVFDYGKLRVG 501
Query: 502 FAEA 505
FAEA
Sbjct: 502 FAEA 505
>gi|357130655|ref|XP_003566963.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 520
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 287/496 (57%), Positives = 366/496 (73%), Gaps = 10/496 (2%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAAR-----ITRKERYMGGAGVSGVRHRLGDSDED- 73
LL A + GL R+ LKK +D H L A + R+ ++ +G + + +
Sbjct: 26 LLAAPAEGLVRVALKKHPVDEHGLAAGEEAQRLLLRRYGHVFNDASAGASSKPSTAAKGG 85
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
+ LKN ++AQY+GE+GIG+PPQNF+VIFDTGS+NLWVPSS CYFSI+CYFH RY + +S
Sbjct: 86 SVTLKNCLNAQYYGEVGIGTPPQNFTVIFDTGSANLWVPSSNCYFSIACYFHPRYNAGQS 145
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
TY + GK EI+YG+G+ISG+ SQD+V+VG VVVK Q FIEAT E S+TF+ +FDGI+
Sbjct: 146 KTYKKNGKHVEIHYGTGAISGYLSQDSVQVGGVVVKKQDFIEATGEPSITFMFGKFDGIL 205
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GLGF+E+ +P+W NMV QGLV + +FSFW NR +GGEIVFGG+DP H KG H
Sbjct: 206 GLGFKEMLYLSVLPIWYNMVSQGLVGDLIFSFWFNRHAGEGQGGEIVFGGIDPSHHKGNH 265
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
TYVPV KKGYWQF++ D+LIG STG C+ GCAA+ DSGTSLL+GPT +VT+IN IG
Sbjct: 266 TYVPVPKKGYWQFDMSDVLIGGNSTGFCKDGCAAMADSGTSLLSGPTAIVTQINKKIGAT 325
Query: 314 GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS 373
GVVS ECK VVSQYG I DLL+ +K+C +GLC F+GA VS GI++VV+ +
Sbjct: 326 GVVSQECKAVVSQYGKQILDLLLK-YSRKKICSSVGLCTFDGAHGVSAGIQSVVDDKVWG 384
Query: 374 AGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
+ D C+ CEMAVVW+Q+QL Q QT+E VL YIN+LCDS P+PMGES +DC+R+ +
Sbjct: 385 SNDIFSKVTCNMCEMAVVWMQHQLAQNQTQEFVLQYINQLCDSFPSPMGESSVDCNRLAS 444
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
MP+++F+IG K F L+PEQYILK GEG+A CISGF A D+PPPRGPLWILGD+FMG YH
Sbjct: 445 MPDIAFSIGGKQFVLTPEQYILKVGEGVATQCISGFTAVDIPPPRGPLWILGDIFMGAYH 504
Query: 491 TVFDSGKLRIGFAEAA 506
TVFD G L++GFAEAA
Sbjct: 505 TVFDYGNLKVGFAEAA 520
>gi|168033581|ref|XP_001769293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679399|gb|EDQ65847.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 285/479 (59%), Positives = 351/479 (73%), Gaps = 22/479 (4%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RRI LKK+ +DL S+ +A +R AG S R GD+ + L N+MDAQYFGE
Sbjct: 28 RRIPLKKKSIDLQSVRSAAARTLQRANALAG-SANSLRGGDA----VDLNNYMDAQYFGE 82
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IGIGSPPQ FSVIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I YG
Sbjct: 83 IGIGSPPQPFSVIFDTGSSNLWVPSAKCYLSLACYFHRRYKSSKSSTYKEDGTSFAIQYG 142
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ GF SQD+V +GD+ VK QVF EAT+E +TF+ A+FDGI+GLGF+EI+V PV
Sbjct: 143 TGSMEGFLSQDDVTLGDLTVKWQVFAEATKEPGVTFVSAKFDGILGLGFKEISVDRVTPV 202
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
W NM++QGLV E VFSFWLNRD D +GGE+VFGGVDP HFKG+HTY PVT+KGYWQF+L
Sbjct: 203 WYNMLDQGLVKEPVFSFWLNRDSDESDGGELVFGGVDPDHFKGEHTYTPVTRKGYWQFDL 262
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
GD GC+AI DSGTSLLAGP+ +V EIN AIG G+VS +CK+VV QYG
Sbjct: 263 GD-------------GCSAIADSGTSLLAGPSGIVAEINQAIGATGIVSQQCKMVVQQYG 309
Query: 329 DLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENV-SAGDSAVCSACEMAV 387
+ I ++LV+ + P KVC +GLC E GI +V+EKE V S C+ CEMA+
Sbjct: 310 EQIVEMLVAQMNPGKVCASLGLCQLAAGE---PGIASVLEKEEVHSLHADPRCTVCEMAL 366
Query: 388 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 447
VW QNQL+ +TKE++ +Y+N+LC+ LP+P GES +DC+ + MPNV FTI K F L+P
Sbjct: 367 VWAQNQLRMNRTKEEIDAYLNQLCERLPSPNGESAVDCNALSYMPNVGFTIAGKSFELTP 426
Query: 448 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
EQYILK GEG + C+SGF+ D+PPP GPLWILGDVFMGVYHTVFD G R+GFA+AA
Sbjct: 427 EQYILKIGEGPEKQCVSGFLGLDVPPPAGPLWILGDVFMGVYHTVFDFGNSRLGFAKAA 485
>gi|87241358|gb|ABD33216.1| Peptidase A1, pepsin [Medicago truncatula]
Length = 396
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/391 (69%), Positives = 329/391 (84%), Gaps = 2/391 (0%)
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
++CY H+ YK++KS TY + G SC+I+YG+GSISG+FSQDNV+VG VVK Q FIEAT
Sbjct: 6 LQLACYTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEAT 65
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
REGSL+FL +FDGI GLGF+EI+V A+PVW NM+EQ L+ E+VFSFWLN +P+A++GG
Sbjct: 66 REGSLSFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGG 125
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLA
Sbjct: 126 ELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLA 185
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPTPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P VC Q+GLC+ G +
Sbjct: 186 GPTPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPGDVCSQVGLCSIRGDQ 245
Query: 358 YVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLP 415
S GI+ V +KE +SA D+ +CS+C+M V+WVQNQLKQK TKE+V +Y+N+LC+SLP
Sbjct: 246 SNSAGIEMVTDKEQSELSAKDTPLCSSCQMLVLWVQNQLKQKATKERVFNYVNQLCESLP 305
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
+P GES+I C+ I MPN+SFTIG+K F L+PEQYIL+TGEGI +VC+SGF+AFD+PPP+
Sbjct: 306 SPSGESVISCNDISKMPNISFTIGNKPFVLTPEQYILRTGEGITQVCLSGFIAFDVPPPK 365
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GPLWILGDVFM YHTVFD G L++GFAEAA
Sbjct: 366 GPLWILGDVFMRAYHTVFDYGNLQVGFAEAA 396
>gi|302761354|ref|XP_002964099.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
gi|300167828|gb|EFJ34432.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
Length = 497
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 283/500 (56%), Positives = 359/500 (71%), Gaps = 17/500 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V G
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVHG---- 62
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 63 -ANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RYKSRKS+TY S I YGSG ++GFFS D V +GDVVVKDQ F E+T E L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGSGQMAGFFSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVD 245
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLL GPT +V +
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQ 301
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C + I +
Sbjct: 302 INQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDKVCAQAGVCTLRND---NPHIAS 358
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V++KEN GD +CS CEMAVV VQNQL+Q TK+++ +N+LC+ LP+P G+S +DC
Sbjct: 359 VLDKENQKVGDHGLCSVCEMAVVSVQNQLRQNPTKQQI--DLNQLCERLPSPNGQSFVDC 416
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+I ++PNVSFTI +++F L+P+QYIL+ GEG A CISGF D+ PP GP+WILGDVF
Sbjct: 417 AKISSLPNVSFTIANQMFELTPKQYILQVGEGAAAQCISGFTGMDVAPPAGPIWILGDVF 476
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
MGVYHTVFD G RIGFA+A
Sbjct: 477 MGVYHTVFDFGNKRIGFAKA 496
>gi|418731269|gb|AFX67029.1| aspartic protease, partial [Solanum tuberosum]
Length = 372
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 265/372 (71%), Positives = 319/372 (85%), Gaps = 2/372 (0%)
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE S+TF++A+FDGI+GLG
Sbjct: 1 TRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATREPSITFIVAKFDGILGLG 60
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HTYV
Sbjct: 61 FQEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYV 120
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGPT +VT+INHAIG EG+V
Sbjct: 121 PLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIV 180
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSA 374
S ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++VS+ I+TVVE+E S
Sbjct: 181 SMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNIRTVVERETEGSSV 240
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ +C+ACEMAVVW+QNQLKQ TKEKVL Y+N+LC+ +P+PMGES IDC+ I +MP++
Sbjct: 241 GEAPLCTACEMAVVWMQNQLKQAGTKEKVLEYVNQLCEKIPSPMGESTIDCNSISSMPDI 300
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
SFTI DK F L+PEQYILKTGEG+A +C+SGF A D+PPPRGPLWILGDVFMG YHTVFD
Sbjct: 301 SFTIKDKAFVLTPEQYILKTGEGVATICVSGFAALDVPPPRGPLWILGDVFMGPYHTVFD 360
Query: 495 SGKLRIGFAEAA 506
GK ++GFAEAA
Sbjct: 361 YGKSQVGFAEAA 372
>gi|3551952|gb|AAC34854.1| senescence-associated protein 4 [Hemerocallis hybrid cultivar]
Length = 517
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 281/481 (58%), Positives = 356/481 (74%), Gaps = 18/481 (3%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE------RYMGGAGVSGVRH 65
L +L L L AS+ GL RI LKK+ D S ++R++ E RY G+R
Sbjct: 14 LSMLVFQLALSASAEGLVRINLKKKPFDEKSRVSSRLSADEDEPLKARY-------GLRG 66
Query: 66 RLGDSDE--DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
L D + DI+ LKN+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C
Sbjct: 67 GLNDGADSTDIISLKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACL 126
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++YKS +S+TY + GK I+YG+G+I+G+FS+D+VE+GD VVK Q FIEAT+E +T
Sbjct: 127 LHTKYKSGRSSTYHKNGKPAAIHYGTGAIAGYFSEDHVELGDFVVKGQEFIEATKEPGVT 186
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDGI+GLGF+EI+VG AVP+W NMVEQGLV E VFSFWLNR + EGGEIVFGG
Sbjct: 187 FLVAKFDGILGLGFKEISVGGAVPLWYNMVEQGLVKEAVFSFWLNRKSEDGEGGEIVFGG 246
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDP H KG+H YVPVT+KGYWQF++GD+L+G QSTG CEGGCAAI DSGTSL+AGPT V+
Sbjct: 247 VDPSHHKGEHVYVPVTQKGYWQFDMGDVLVGGQSTGFCEGGCAAIADSGTSLIAGPTTVI 306
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
TEINH IG GVVS ECK VV QYG I D+L++ P K+C QIGLC F+G VS GI
Sbjct: 307 TEINHKIGAAGVVSQECKAVVQQYGQQILDMLIAQTQPMKICSQIGLCTFDGTRGVSMGI 366
Query: 364 KTVVEKE-NVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 422
++VV + S A+CSACEMAVVW+QNQ+K +T++ +L+YIN+LC+ LP+PMGES
Sbjct: 367 ESVVNGNVDKSVASDAMCSACEMAVVWMQNQIKHNKTQDLILNYINQLCERLPSPMGESA 426
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP--LWI 480
+DC + TMP++SFTIG K F+L+ EQY+LK GEG A CI RG W+
Sbjct: 427 VDCSVLSTMPSISFTIGGKQFDLTAEQYVLKVGEGPAAQCIKWIHCLGHSSSRGHSGYWV 486
Query: 481 L 481
+
Sbjct: 487 M 487
>gi|357135633|ref|XP_003569413.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 560
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 267/452 (59%), Positives = 339/452 (75%), Gaps = 6/452 (1%)
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
G+ G R + D ++I+PLKN+M+AQYFG+IG+G PPQNF+V+FDTGSSN+WVPS+KC F
Sbjct: 111 GIRGNR-SVHDGQQNIIPLKNYMNAQYFGQIGVGCPPQNFTVVFDTGSSNIWVPSAKCIF 169
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S++CYFH +Y SR S+TY E G I+YGSG+I GF+S+D V +G++VVK+Q FIE T
Sbjct: 170 SLACYFHPKYVSRWSSTYKENGTPASIHYGSGAIYGFYSEDQVTIGNLVVKNQEFIETTY 229
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E TFL A+FDGI+GLGF+EI+V + PVW NM++QGLV E+ FSFWLNRD + EGGE
Sbjct: 230 EHGFTFLAAKFDGILGLGFKEISVEGSDPVWYNMIDQGLVKEKSFSFWLNRDANDGEGGE 289
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
IVFGG DPKH+KG HTY VT+K YWQFE+GD LIG +STG+C GCAAI DSGTSL+AG
Sbjct: 290 IVFGGSDPKHYKGSHTYTRVTRKAYWQFEMGDFLIGGKSTGICVDGCAAIADSGTSLIAG 349
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLL-VSGLLPEKVCQQIGLCAFNGAE 357
P V+ +IN IG GV + ECK VV+ YG + +LL P +VC +IGLC F+G
Sbjct: 350 PVAVIAQINEKIGANGVANEECKQVVAGYGQQMIELLEAKQTAPAQVCSKIGLCTFDGTR 409
Query: 358 YVSTGIKTVV-EKENVSAGD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
VS GIK+VV E + + G A C+ACEMAV W+Q++ +TKE L Y+N LCD +
Sbjct: 410 AVSAGIKSVVGEAQKTALGGMFDATCNACEMAVTWMQSEFVHNRTKEDTLEYVNRLCDHM 469
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P+P+G S +DC I ++ +VSF+IG KIF L PEQYILK G+G CISGF A D+PPP
Sbjct: 470 PSPVGSS-VDCRHIDSLQSVSFSIGGKIFELKPEQYILKVGDGFMARCISGFTALDIPPP 528
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GPLWILGDVFMG YHT+FD GK+R+GFAE+A
Sbjct: 529 VGPLWILGDVFMGAYHTIFDYGKMRVGFAESA 560
>gi|302761356|ref|XP_002964100.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
gi|300167829|gb|EFJ34433.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
Length = 503
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 278/501 (55%), Positives = 357/501 (71%), Gaps = 13/501 (2%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVRDANSL 66
Query: 67 LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
+ S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 67 TKNGSVPDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLH 126
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKSRKS+TY S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 127 RRYKSRKSSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFL 186
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGV 244
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV
Sbjct: 187 FAKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGV 246
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLL GPT +V
Sbjct: 247 NKDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVA 306
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
+IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C + I
Sbjct: 307 QINQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDKVCAQAGVCTLRND---NPHIA 363
Query: 365 TVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+V++KEN GD +CS CEMAVV VQNQL+Q TK+++ +N+LC+ LP+P G+S+++
Sbjct: 364 SVLDKENQKVGDDVLCSVCEMAVVSVQNQLRQNPTKQQI--DLNQLCERLPSPNGQSLVE 421
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C +I ++PNVSFTI +++F L+P+QYIL+ GEG A CISGF D+ PP P+WILGDV
Sbjct: 422 CAKISSLPNVSFTIANQMFELTPKQYILQVGEGAAAQCISGFTGMDVAPPAVPIWILGDV 481
Query: 485 FMGVYHTVFDSGKLRIGFAEA 505
FMGVYHTVFD G RIGFA+A
Sbjct: 482 FMGVYHTVFDFGNKRIGFAKA 502
>gi|356542078|ref|XP_003539498.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase oryzasin-1-like
[Glycine max]
Length = 449
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 257/439 (58%), Positives = 340/439 (77%), Gaps = 19/439 (4%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
D I+ LKN+M+AQYFGEIGIG+ PQ F+VIFDTGSSNLWVPSSKCYFS++CY HSRYKS
Sbjct: 27 DTSIIRLKNYMNAQYFGEIGIGTLPQKFTVIFDTGSSNLWVPSSKCYFSVACYLHSRYKS 86
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+S+T + G S EI+YG+G ISGFF+QD+V+V D+VV DQ FIEATR
Sbjct: 87 SQSSTCNKNGSSAEIHYGTGHISGFFTQDHVKVXDLVVYDQDFIEATR------------ 134
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
+GF+EI+VG+A P+W NM+ Q +++ VFSFWLNR+ + E+GG+IVFGG+D H+K
Sbjct: 135 ----VGFQEISVGNAAPIWYNMLNQHFLTQPVFSFWLNRNTNEEQGGQIVFGGIDSDHYK 190
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+HTYVPVT+KGYWQ E+GD+LI ++TG+C C AIVDSGTSLLAGPT V+ +INHAI
Sbjct: 191 GEHTYVPVTQKGYWQIEIGDVLINGKTTGLCAAKCLAIVDSGTSLLAGPTGVIAQINHAI 250
Query: 311 GGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK- 369
G G+VS ECK +V+QYG I D L++ LP+++C QIGLC F+G + VS GI++VV+K
Sbjct: 251 GAVGIVSQECKALVAQYGKTILDKLINEALPQQICSQIGLCTFDGTQGVSIGIQSVVDKN 310
Query: 370 --ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
+ + A C+ACEMA VW++N+L+ +T++++L + N LCD +P+P GES+++C+
Sbjct: 311 IXRTSCSWNDAGCTACEMAAVWMKNRLRLNETEDQILDHANALCDLVPSPKGESVVECNT 370
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
+ MPNVSFTIG ++F LSPEQYILK G+G CISGF+A D+ PPRGPLWILGD+FMG
Sbjct: 371 LSEMPNVSFTIGGEVFELSPEQYILKVGKGATAQCISGFIALDIAPPRGPLWILGDIFMG 430
Query: 488 VYHTVFDSGKLRIGFAEAA 506
YHTVFD G +++GFAE+A
Sbjct: 431 SYHTVFDYGNMKVGFAESA 449
>gi|326510801|dbj|BAJ91748.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 262/455 (57%), Positives = 347/455 (76%), Gaps = 12/455 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW ++ + ASS +GL RI L KR L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWAISCAVPHHASSRDGLLRINLNKRSLTHKSLAAAKAARQ------- 53
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPSSKCYF
Sbjct: 54 -YGALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +Y+S +S TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FIEATR
Sbjct: 113 SIACYLHPKYRSSRSTTYKADGENCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F+L +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFILGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I ++L++ P+KVC QIGLC F+G +
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILEMLIAQTQPQKVCSQIGLCLFDGTQS 352
Query: 359 VSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM 418
VS GI+++V KENV G +C+ACEMAVVW++NQL++ +TKE +L Y N+LC+ LP+P
Sbjct: 353 VSNGIESIVGKENV--GSDLMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPN 410
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 453
GES + C + MPN++F I +K F L+PEQ + +
Sbjct: 411 GESTVSCHEMSKMPNLAFAIANKTFVLTPEQVLFR 445
>gi|255567717|ref|XP_002524837.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223535897|gb|EEF37557.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 456
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 256/446 (57%), Positives = 338/446 (75%), Gaps = 9/446 (2%)
Query: 12 LWVLASCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
LW+ + LLLP ++ L R+GLKK++ D ++ A + KE A
Sbjct: 8 LWI-SFVLLLPVVFSLHNDALVRVGLKKKKFDQVNIPAGTVDFKEGEAMRAATKKYNLVE 66
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLW+PSSKCYFS++CYFHS+
Sbjct: 67 NSDDVDIVELKNYLDAQYYGEIAIGTPPQTFTVIFDTGSSNLWIPSSKCYFSVACYFHSK 126
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
YK+ +S+TY + G S I YG+GSISGFFSQDNV+VGD+V+++Q FIEAT+E +TFL A
Sbjct: 127 YKASESSTYQKNGTSAAIRYGTGSISGFFSQDNVKVGDLVIRNQDFIEATKEPGVTFLAA 186
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EI+VG A+PVW NMV +GLV E+VFSFWLNR+ AEEGGEIVFGG+DP
Sbjct: 187 KFDGILGLGFQEISVGKAIPVWYNMVNEGLVKEQVFSFWLNRNVQAEEGGEIVFGGMDPN 246
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KG+HTYVPVT+KGYWQF++G++LIGN+ TG+C GC AI DSGTSLLAGPT V+T+IN
Sbjct: 247 HYKGQHTYVPVTQKGYWQFDMGEVLIGNEITGLCADGCKAIADSGTSLLAGPTTVITQIN 306
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVV 367
HAIG G+VS ECK VV QYG I ++L + P+K+C QIG C F+G + VST I++VV
Sbjct: 307 HAIGASGIVSQECKTVVEQYGKFILEMLTAQAQPQKICSQIGFCTFDGTQGVSTNIESVV 366
Query: 368 EKENVSAGD----SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESII 423
+K +A D + C+ CEM VVW+QN+L+ +T +++L+Y+N+LCD LP+P GES +
Sbjct: 367 DKSKETASDGLQQDSACTVCEMIVVWMQNRLRLNETVDQILNYVNKLCDRLPSPNGESAV 426
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQ 449
DC + +MP VSFTIG K F L+ +Q
Sbjct: 427 DCSSLSSMPIVSFTIGGKAFKLTADQ 452
>gi|168031065|ref|XP_001768042.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680680|gb|EDQ67114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 455
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 261/459 (56%), Positives = 329/459 (71%), Gaps = 7/459 (1%)
Query: 50 RKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNL 109
R ++ GG GV S D + L N++DAQY+G I IG+P Q F+V+FDTGSSNL
Sbjct: 2 RAQQARGGTRGQGV-----GSGGDEVALVNYLDAQYYGVIEIGTPKQEFTVVFDTGSSNL 56
Query: 110 WVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
WVPS+KCY S++C+FH RYK+RKS+TY + G I YG+GS+ GF S D+V +GD+ VK
Sbjct: 57 WVPSAKCYLSLACFFHHRYKARKSSTYKQDGTPFAIQYGTGSMEGFLSIDDVTLGDLTVK 116
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
QVF EAT+E +TFL A DGI+GLGF+EI+V D PVW NM+ Q LV E VFSFWLNR
Sbjct: 117 AQVFAEATKEPGVTFLAAEMDGILGLGFKEISVNDVNPVWYNMLYQKLVQEPVFSFWLNR 176
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
D + E+GGE+V GGVDP HFKG HTY PVT+ GYWQF++GD+L+ QSTG C GGCAAI
Sbjct: 177 DVEGEKGGELVLGGVDPHHFKGNHTYTPVTRLGYWQFDMGDVLLDGQSTGFCAGGCAAIA 236
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLL-PEKVCQQI 348
DSGTSLLAGPT +V EIN+AIG G++S ECKLVV QY D I +L+S LL P K+C +
Sbjct: 237 DSGTSLLAGPTGIVAEINYAIGATGIISGECKLVVDQYADFIIQMLMSKLLTPLKICAKA 296
Query: 349 GLCAF-NGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYI 407
G C G + I +V+EK G+ C CEM V+W QNQL++ T+ ++ ++
Sbjct: 297 GACLVEEGTSTRNPNIASVLEKHENDLGNGVTCVFCEMVVIWAQNQLRKNGTQAQIKEHL 356
Query: 408 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 467
N+LC+ LPNP GES++DC+ + +MP+VSFTI F L+PEQY+LK GEG C SGF+
Sbjct: 357 NQLCERLPNPNGESMVDCNSLSSMPDVSFTISGTTFKLTPEQYVLKVGEGDDAQCTSGFL 416
Query: 468 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
D+PPP GPLWILGDVFMG YHTVFD G R+GFA AA
Sbjct: 417 GIDIPPPAGPLWILGDVFMGAYHTVFDFGNQRLGFALAA 455
>gi|222424506|dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]
Length = 389
Score = 546 bits (1407), Expect = e-153, Method: Compositional matrix adjust.
Identities = 253/390 (64%), Positives = 315/390 (80%), Gaps = 5/390 (1%)
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +YKS +S+TY + GK+ I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E
Sbjct: 1 ACLLHPKYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEP 60
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF++A+FDGI+GLGF+EI+VG A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+V
Sbjct: 61 GITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELV 120
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP HFKGKHTYVPVT+KGYWQF++GD+LIG TG CE GC+AI DSGTSLLAGPT
Sbjct: 121 FGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPT 180
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS 360
++T INHAIG GVVS +CK VV QYG I DLL+S P+K+C QIGLC F+G VS
Sbjct: 181 TIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVS 240
Query: 361 TGIKTVVEKENVS----AGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPN 416
GI++VV+KEN GD+A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+
Sbjct: 241 MGIESVVDKENAKLSNGVGDAA-CSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPS 299
Query: 417 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 476
PMGES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PPRG
Sbjct: 300 PMGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRG 359
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 360 PLWILGDVFMGKYHTVFDFGNEQVGFAEAA 389
>gi|293335451|ref|NP_001169605.1| uncharacterized protein LOC100383486 precursor [Zea mays]
gi|224030337|gb|ACN34244.1| unknown [Zea mays]
Length = 556
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 282/559 (50%), Positives = 369/559 (66%), Gaps = 56/559 (10%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS----NGLRRIGLKKRRL------DL---------- 40
M ++ + F L + S LPASS +GL RI LKKR + DL
Sbjct: 1 MGRRTCGTAFILLYVLSTSTLPASSSNTGDGLIRIPLKKRSIMDTIYGDLLPKPSAPEEK 60
Query: 41 --------------------HSL--NAARITRKERYM---GGAGVSGVRHRLGDSDED-- 73
H + AA R+ RY GAG G RL D +
Sbjct: 61 EKQAVDDPVRDAIARARERQHEMLVQAAATERRRRYYWSYSGAGGKGNGSRLHDGGQGEG 120
Query: 74 -----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y
Sbjct: 121 SGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKY 180
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
SR+S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+
Sbjct: 181 DSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGFTFLLAK 240
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+EI+V ++PVW NMV Q LV++ VFSFWLNR+P EGGEIVFGG D +H
Sbjct: 241 FDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQH 300
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+KG HTY VT+KGYWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN
Sbjct: 301 YKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINE 360
Query: 309 AIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVS-TGIKTVV 367
IG GVV+ ECK VV+ YG I LL + P +VC ++GLC F+G VS GI++V
Sbjct: 361 QIGAAGVVNQECKQVVAGYGLQIAGLLEAQTPPSEVCSKVGLCTFDGTRGVSAAGIESV- 419
Query: 368 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
+V A+C+ACE+ V W Q++L ++ E L Y++ LC+S+P+P+G S +DC R
Sbjct: 420 -PGSVDGMAEALCNACEIVVFWTQSELSPNRSNEGTLEYVDRLCESMPDPVG-SRVDCGR 477
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
+ ++ V+F+IG + F L P+QY+LK GEG A CISGF A D+PPP GPLWILGDVFMG
Sbjct: 478 VGSLQTVAFSIGGRAFELRPDQYVLKVGEGFAAHCISGFTALDVPPPVGPLWILGDVFMG 537
Query: 488 VYHTVFDSGKLRIGFAEAA 506
YHT+FD GK+RIGFA++A
Sbjct: 538 AYHTIFDYGKMRIGFADSA 556
>gi|414881317|tpg|DAA58448.1| TPA: hypothetical protein ZEAMMB73_088821 [Zea mays]
Length = 557
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 283/565 (50%), Positives = 371/565 (65%), Gaps = 67/565 (11%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS----NGLRRIGLKKRRL------DL---------- 40
M ++ + F L + S LPASS +GL RI LKKR + DL
Sbjct: 1 MGRRTCGTAFILLYVLSTSTLPASSSNTGDGLIRIPLKKRSIMDTIYGDLLPKPSAPEEK 60
Query: 41 --------------------HSL--NAARITRKERYM---GGAGVSGVRHRLGDSDED-- 73
H + AA R+ RY GAG G RL D +
Sbjct: 61 EKQAVDDPVRDAIARARERQHEMLVQAAATERRRRYYWSYSGAGGKGNGSRLHDGGQGEG 120
Query: 74 -----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y
Sbjct: 121 SGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKY 180
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
SR+S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+
Sbjct: 181 DSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGFTFLLAK 240
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+EI+V ++PVW NMV Q LV++ VFSFWLNR+P EGGEIVFGG D +H
Sbjct: 241 FDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQH 300
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+KG HTY VT+KGYWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN
Sbjct: 301 YKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINE 360
Query: 309 AIGGEGVVSAECKLVVSQYGDLIWDLLVSGLL------PEKVCQQIGLCAFNGAEYVS-T 361
IG GVV+ ECK VV+ YG L ++GLL P +VC ++GLC F+G VS
Sbjct: 361 QIGAAGVVNQECKQVVAGYG-----LQIAGLLEAQQTPPSEVCSKVGLCTFDGTRGVSAA 415
Query: 362 GIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
GI++V +V A+C+ACE+ V W Q++L ++ E L Y++ LC+S+P+P+G S
Sbjct: 416 GIESV--PGSVDGMAEALCNACEIVVFWTQSELSPNRSNEGTLEYVDRLCESMPDPVG-S 472
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+DC R+ ++ V+F+IG + F L P+QY+LK GEG A CISGF A D+PPP GPLWIL
Sbjct: 473 RVDCGRVGSLQTVAFSIGGRAFELRPDQYVLKVGEGFAAHCISGFTALDVPPPVGPLWIL 532
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHT+FD GK+RIGFA++A
Sbjct: 533 GDVFMGAYHTIFDYGKMRIGFADSA 557
>gi|302756359|ref|XP_002961603.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
gi|300170262|gb|EFJ36863.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
Length = 423
Score = 540 bits (1390), Expect = e-151, Method: Compositional matrix adjust.
Identities = 249/426 (58%), Positives = 321/426 (75%), Gaps = 3/426 (0%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS KC S SC+FH RYK+ +S+TY G
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRYKAGQSSTYKPNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
S I YGSGS+SGF S D+V +G + VK +VF EAT E LTF+ A+FDGI+GLGF+ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
A VP+W ++VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+GYW+ +GD+LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS EC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVC 380
KL+V QYG++I +LL++ + P+KVC Q+G+C+ E I +V++KE + C
Sbjct: 241 KLIVDQYGNIIINLLLAQVSPDKVCSQLGVCSATRNE---PDIASVLDKEREGIDNDLAC 297
Query: 381 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 440
ACE AV+W++NQL++ +++E+++SY++ELC LP+P GES +DC + MP +SFTI +
Sbjct: 298 EACERAVIWIENQLRKNRSREEIVSYLDELCSRLPSPNGESAVDCSSVSRMPKISFTIAN 357
Query: 441 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
+ + LSPEQYILK G+G + C+SGF+ D+P P GPLWILGD+FMGVYHTVFD G ++
Sbjct: 358 RNYELSPEQYILKIGDGNKKQCLSGFIGLDVPAPAGPLWILGDIFMGVYHTVFDFGNKQV 417
Query: 501 GFAEAA 506
GFA AA
Sbjct: 418 GFAPAA 423
>gi|56182674|gb|AAV84086.1| aspartic proteinase 12 [Fagopyrum esculentum]
Length = 387
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 250/388 (64%), Positives = 315/388 (81%), Gaps = 5/388 (1%)
Query: 103 DTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVE 162
DTGSSNLWVPS+KCYFSI+C+FHS+YKS KS T+ + G S I YG+G+ISGFFS+DNV+
Sbjct: 1 DTGSSNLWVPSAKCYFSIACFFHSKYKSSKSITHVKNGTSAAIRYGTGAISGFFSRDNVK 60
Query: 163 VGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
+GD+VV++Q FIEATRE S+TF+ A+FDGI+GLGF+EI+VG AVPVW NM++QGL+SE V
Sbjct: 61 IGDLVVENQEFIEATREPSITFIAAKFDGILGLGFQEISVGKAVPVWYNMIDQGLISEPV 120
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
FSFW NR+ + EEGGE+VFGG+DP HF+G+HTYVPVT+KGYWQF++ D+LI STG C
Sbjct: 121 FSFWFNRNAEEEEGGELVFGGIDPDHFRGQHTYVPVTQKGYWQFDMDDVLIDGMSTGFCA 180
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPE 342
GGCAAI DSGTSLLAGP VV +INHAIG G+VS ECK VV++YG I ++L+S P
Sbjct: 181 GGCAAIADSGTSLLAGPMAVVAQINHAIGATGIVSQECKTVVAEYGKEIIEMLLSEAQPL 240
Query: 343 KVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAV----CSACEMAVVWVQNQLKQKQ 398
K+C Q+GLC F+G VS GI++VV+K NV ++ C ACEMAVVW+QN+L Q Q
Sbjct: 241 KICSQVGLCTFDGTRGVSMGIESVVDK-NVXKSSGSLKEXKCVACEMAVVWIQNRLIQNQ 299
Query: 399 TKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI 458
T+E +L Y N+LC+ LP+PMGES +DC + T+P+VSFTIG K F+L+PEQY+L+ GEG
Sbjct: 300 TEELILDYANQLCERLPSPMGESAVDCSSLSTLPDVSFTIGGKTFDLAPEQYVLQVGEGP 359
Query: 459 AEVCISGFMAFDLPPPRGPLWILGDVFM 486
A CISGF+A D+PPPRGPLWILGDVFM
Sbjct: 360 AAQCISGFIALDVPPPRGPLWILGDVFM 387
>gi|302775562|ref|XP_002971198.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
gi|300161180|gb|EFJ27796.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
Length = 423
Score = 536 bits (1382), Expect = e-150, Method: Compositional matrix adjust.
Identities = 248/426 (58%), Positives = 320/426 (75%), Gaps = 3/426 (0%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS KC S SC+FH R+K+ +S+TY G
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRFKAGQSSTYKPNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
S I YGSGS+SGF S D+V +G + VK +VF EAT E LTF+ A+FDGI+GLGF+ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
A VP+W ++VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+GYW+ +GD+LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS EC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVC 380
KL+V QYG++I +LL++ + P+KVC Q+G+C+ E I +V++KE + C
Sbjct: 241 KLIVDQYGNIIINLLLAQVSPDKVCSQLGVCSATRNE---PDIASVLDKEREGIDNDLAC 297
Query: 381 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 440
ACE AV+W++NQL++ +++E+++SY++ELC LP+P GES +DC + MP +SFTI +
Sbjct: 298 EACERAVIWIENQLRKNRSREEIVSYLDELCSRLPSPNGESAVDCSSVSRMPKISFTIAN 357
Query: 441 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
+ LSPEQYILK G+G + C+SGF+ D+P P GPLWILGD+FMGVYHTVFD G ++
Sbjct: 358 HNYELSPEQYILKIGDGNKKQCLSGFIGLDVPAPAGPLWILGDIFMGVYHTVFDFGNKQV 417
Query: 501 GFAEAA 506
GFA AA
Sbjct: 418 GFALAA 423
>gi|75267434|sp|Q9XFX3.1|CARDA_CYNCA RecName: Full=Procardosin-A; Contains: RecName: Full=Cardosin-A
intermediate form 35 kDa subunit; Contains: RecName:
Full=Cardosin-A heavy chain; AltName: Full=Cardosin-A 31
kDa subunit; Contains: RecName: Full=Cardosin-A
intermediate form 30 kDa subunit; Contains: RecName:
Full=Cardosin-A light chain; AltName: Full=Cardosin-A 15
kDa subunit; Flags: Precursor
gi|4581209|emb|CAB40134.1| preprocardosin A [Cynara cardunculus]
Length = 504
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 274/498 (55%), Positives = 346/498 (69%), Gaps = 10/498 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L+ L S + S +GL RIGLKKR++D ++ R R G R + DS
Sbjct: 14 LFYLLSPTVFSVSDDGLIRIGLKKRKVD--RIDQLRGRRALMEGNARKDFGFRGTVRDSG 71
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSKC S +C HS Y+S
Sbjct: 72 SAVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYESS 131
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY E G I YG+GSI+GFFSQD+V +GD+VVK+Q FIEAT E FL FDG
Sbjct: 132 DSSTYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFDG 191
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL F+ I+V PVW NM+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G
Sbjct: 192 ILGLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRG 247
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 248 DHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 307
Query: 312 GEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN 371
GV++ +CK VVS+YG I ++L S + P+K+C + LC F+GA VS+ I++VV+K N
Sbjct: 308 ANGVMNQQCKTVVSRYGRDIIEMLRSKIQPDKICSHMKLCTFDGARDVSSIIESVVDKNN 367
Query: 372 VSAG---DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
+ +C+ CEMAVVW+QN++KQ +T++ +++Y NELC+ L E +DC+ +
Sbjct: 368 DKSSGGIHDEMCTFCEMAVVWMQNEIKQSETEDNIINYANELCEHLSTSSEELQVDCNTL 427
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 488
+MPNVSFTIG K F L+PEQYILK G+G A CISGF A D GPLWILGDVFM
Sbjct: 428 SSMPNVSFTIGGKKFGLTPEQYILKVGKGEATQCISGFTAMD-ATLLGPLWILGDVFMRP 486
Query: 489 YHTVFDSGKLRIGFAEAA 506
YHTVFD G L +GFAEAA
Sbjct: 487 YHTVFDYGNLLVGFAEAA 504
>gi|218196057|gb|EEC78484.1| hypothetical protein OsI_18377 [Oryza sativa Indica Group]
Length = 389
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 242/380 (63%), Positives = 314/380 (82%), Gaps = 2/380 (0%)
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+ +S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S+TF++
Sbjct: 12 QIQSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFII 71
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VFGG+DP
Sbjct: 72 GKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDP 131
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
KH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT +V ++
Sbjct: 132 KHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQV 191
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV 366
NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G VS GI++V
Sbjct: 192 NHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGIESV 251
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
V+KEN+ G A+CS CEMAVVW++NQL++ +TKE +L+Y N+LC+ LP+P GES + C
Sbjct: 252 VDKENL--GSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTVSCH 309
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+I MPN++FTI +K F L+PEQYI+K +G VCISGFMAFD+PPPRGPLWILGDVFM
Sbjct: 310 QISKMPNLAFTIANKTFILTPEQYIVKLEQGGQTVCISGFMAFDIPPPRGPLWILGDVFM 369
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G YHTVFD GK RIGFA++A
Sbjct: 370 GAYHTVFDFGKDRIGFAKSA 389
>gi|242053731|ref|XP_002456011.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
gi|241927986|gb|EES01131.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
Length = 567
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 251/439 (57%), Positives = 328/439 (74%), Gaps = 11/439 (2%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y S +
Sbjct: 135 NIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKYDSSQ 194
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+FDGI
Sbjct: 195 SSTYKPNGTPASIHYGTGGIAGFYSQDEVTVGNLVVQNQEFIEATHEPGFTFLLAKFDGI 254
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKG 251
+GL F+EI+V +VPVW NMV Q LV + VFSFWLNR+P D EEGGEIVFGG D +H+KG
Sbjct: 255 LGLAFQEISVEGSVPVWYNMVNQSLVPQPVFSFWLNRNPFDGEEGGEIVFGGSDEQHYKG 314
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
HTY VT+K YWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN IG
Sbjct: 315 SHTYTRVTRKAYWQFEMGDFLIGERSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIG 374
Query: 312 GEGVVSAECKLVVSQYG-DLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKE 370
GVV+ ECK VV+ YG +++ L P +VC +IGLC +G VS GI++V
Sbjct: 375 AAGVVNHECKQVVAGYGLEMVELLKAQQTPPSQVCSKIGLCTLDGTHGVSAGIESV---- 430
Query: 371 NVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
+GD A+C+ACEM V W+Q++ +TKE L Y++ LC+++P+P+G S +DC
Sbjct: 431 -SGSGDGMSEAICNACEMIVFWMQSEFNTNKTKEGTLEYVDRLCENMPDPVG-SHVDCRH 488
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
I ++ V+F+IG + F L P+QYIL+ GEG A CISGF A D+PPP GPLWILGDVFMG
Sbjct: 489 IGSLQTVAFSIGGRAFELRPDQYILRVGEGFAAHCISGFTALDIPPPIGPLWILGDVFMG 548
Query: 488 VYHTVFDSGKLRIGFAEAA 506
YHT+FD GK+RIGFA++A
Sbjct: 549 AYHTIFDYGKMRIGFADSA 567
>gi|356547093|ref|XP_003541952.1| PREDICTED: LOW QUALITY PROTEIN: cyprosin-like, partial [Glycine
max]
Length = 470
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 267/491 (54%), Positives = 346/491 (70%), Gaps = 35/491 (7%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
++L SNG+ R+GL+K + D +++ GG S D I+ LK
Sbjct: 12 VVLSGPSNGIIRVGLEKNKFD----------QRKTPFGGYENS--------DDTSIIRLK 53
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIGIG+P Q F+VIFDTGSSNLWVPSSKCYFS++CY HSRYKS +S+T +
Sbjct: 54 NYMNAQYFGEIGIGTP-QKFTVIFDTGSSNLWVPSSKCYFSVACYLHSRYKSSQSSTQNK 112
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G S EI YG+G ISGFFSQD V+VGD++V TR LL +I L F+
Sbjct: 113 NGSSAEIRYGTGQISGFFSQDYVKVGDLIV-------LTR----XILLNEHFCVI-LQFK 160
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+VG P+W NM+ Q L+++ VFSFWLNR+ D ++GG+IVFGGVD H+ G+HTYVPV
Sbjct: 161 SISVGKVSPIWYNMLNQHLLAQPVFSFWLNRNTDEKQGGQIVFGGVDSDHYXGEHTYVPV 220
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T KGYWQ E+GD+LI ++T C C+AI DSGTSLLAGPT + +INHAIG GVV+
Sbjct: 221 THKGYWQTEIGDVLIDRKTTEFCASKCSAIDDSGTSLLAGPTGAIAQINHAIGAVGVVNQ 280
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK--ENVS-AG 375
ECK VV+QYG I D L++ LP++VC Q LC F+G + VS GI++VV+K E S +
Sbjct: 281 ECKAVVAQYGKTILDKLINEALPQQVCSQX-LCTFDGTKGVSMGIQSVVDKTIEKTSYSW 339
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
+ A C+ACEMAVVW++N L+ +T++++L Y N LCD LP+P GES+++C + MPNVS
Sbjct: 340 NDAGCTACEMAVVWIKNPLRLNETEDQILDYANALCDMLPSPNGESVVECSTLSEMPNVS 399
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
FTIG K+F LSPEQYILK G+G CI GF+A D+ PPRGPLWILGD+FMG YHTVF
Sbjct: 400 FTIGGKVFELSPEQYILKVGKGATAQCIRGFIALDIAPPRGPLWILGDIFMGRYHTVFFY 459
Query: 496 GKLRIGFAEAA 506
G ++GFAE+A
Sbjct: 460 GNKKVGFAESA 470
>gi|413946558|gb|AFW79207.1| hypothetical protein ZEAMMB73_486493 [Zea mays]
Length = 382
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 244/380 (64%), Positives = 300/380 (78%), Gaps = 5/380 (1%)
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+K+ TY GK I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FD
Sbjct: 4 KKTKTYMS-GKPAAIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFD 62
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLGF+EI+VG+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+K
Sbjct: 63 GILGLGFQEISVGNATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYK 122
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G HT+VPVT+KGYWQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN I
Sbjct: 123 GDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKI 182
Query: 311 GGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE-- 368
G GVVS ECK VVSQYG I DLL++ P K+C Q+GLC F+G VS GI++VV+
Sbjct: 183 GAAGVVSQECKTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDE 242
Query: 369 --KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
K N +C+ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC
Sbjct: 243 AGKSNGGLKSDPMCNACEMAVVWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCG 302
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+ +MP+++FTIG K F L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFM
Sbjct: 303 SLASMPDIAFTIGGKKFKLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFM 362
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
GVYHTVFD GKLR+GFAE+A
Sbjct: 363 GVYHTVFDYGKLRVGFAESA 382
>gi|148910494|gb|ABR18322.1| unknown [Picea sitchensis]
Length = 471
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 247/422 (58%), Positives = 314/422 (74%), Gaps = 17/422 (4%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-------GVRHRLGDSDE--- 72
A+++ L RI LKK+ LD +L AARI +E G+S G+R L S+
Sbjct: 19 AANDCLARIELKKKGLDQKTLQAARIVARE-----GGLSNEVNRKYGLRGGLSYSESARG 73
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+ +PLKN++DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPS+KCY SI+CYFHS+YK+ +
Sbjct: 74 EYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHSKYKASQ 133
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S++Y GK I YGSGS+SG+ QD+V GD+VVKDQVF E T+E LTFL A+FDGI
Sbjct: 134 SSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF++I+VG+ VPVW NMV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGK
Sbjct: 194 LGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
HTYVPVT++GYWQF +GD LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG
Sbjct: 254 HTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGA 313
Query: 313 EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN- 371
G+ S ECK VVSQYGDLI +LL++ P+KVC QIGLC +G V I +V+EK N
Sbjct: 314 SGLASQECKSVVSQYGDLIMELLMAQTNPQKVCSQIGLCLSDGTRDVGMRIASVLEKGNE 373
Query: 372 -VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
S S +C+ACEMAVVW +NQ+ + +K+++++Y+N+LCD LPNP G++ +DC
Sbjct: 374 ATSTSSSGMCAACEMAVVWAKNQIARNASKDQIMTYLNQLCDRLPNPNGQAAVDCKTYQA 433
Query: 431 MP 432
P
Sbjct: 434 CP 435
>gi|2160151|gb|AAB60773.1| Strong similarity to Brassica aspartic protease (gb|X77260)
[Arabidopsis thaliana]
Length = 433
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/404 (58%), Positives = 304/404 (75%), Gaps = 18/404 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG--DSDEDILPLKNFMD 82
++G R+GLKK +LD ++ A R K+ + + + LG D DI+PLKN++D
Sbjct: 27 NDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLRSYNNNLGGDSGDADIVPLKNYLD 86
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SCYFH++YKS +S+TY + GK
Sbjct: 87 AQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCYFHAKYKSSRSSTYKKSGKR 146
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I+YGSGSISGFFS D V VGD+VVKDQ FIE T E LTFL+A+FDG++GLGF+EIAV
Sbjct: 147 AAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIETTSEPGLTFLVAKFDGLLGLGFQEIAV 206
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++G
Sbjct: 207 GNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRG 266
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT-------------PVVTEINHA 309
YWQF++G++LI +STG C GC+AI DSGTSLLAGPT VV IN A
Sbjct: 267 YWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTVSKYHEFIVLFQLAVVAMINKA 326
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK 369
IG GVVS +CK VV QYG I DLL++ P+K+C QIGLCA++G VS GI++VV+K
Sbjct: 327 IGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDGTHGVSMGIESVVDK 386
Query: 370 ENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINEL 410
EN + A C ACEMAVVW+Q+QL+Q T+E++++YINE+
Sbjct: 387 ENTRSSSGLRDAGCPACEMAVVWIQSQLRQNMTQERIVNYINEV 430
>gi|384245845|gb|EIE19337.1| putative aspartic protease [Coccomyxa subellipsoidea C-169]
Length = 508
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 243/513 (47%), Positives = 334/513 (65%), Gaps = 16/513 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K+ R+ F + S LL + R+ LKKR LD + A + R V
Sbjct: 2 MGTKMKRAGFLSLLCLSIGLLAQAQQSPLRVPLKKRTLDAEQVRATQTALHAR-----NV 56
Query: 61 SGVRHRL-GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
V + L G+ +E +PL +F+DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPSS+C YF
Sbjct: 57 RNVANALRGEPEEADIPLLDFLDAQYYGEIGLGTPEQKFTVVFDTGSSNLWVPSSQCSYF 116
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
++C H+++ + KS TY G I YGSGS+SGFFS D + +G + V++Q F EAT+
Sbjct: 117 DLACLLHNKFYASKSRTYQANGTDFAIQYGSGSLSGFFSTDVLSLGSLNVQNQTFAEATK 176
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F+ A+FDGI+GL F EI++G+ P + NMV+QGLV E VFSFWLNR+ + GGE
Sbjct: 177 EPGLAFVAAKFDGILGLAFPEISIGEVTPPFQNMVQQGLVPEPVFSFWLNRNDPSGPGGE 236
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVDP H+ G+H +V VT++ YWQF+LG I + ++ C GC AI DSGTSL+ G
Sbjct: 237 LVLGGVDPSHYTGEHLWVNVTRRAYWQFDLGGISVPGTNS-PCADGCQAIADSGTSLIVG 295
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN---- 354
P+ + EIN AIG +GV+ AEC+ +V QY I ++S L E+VC IGLC+ +
Sbjct: 296 PSDEIAEINRAIGAKGVLPAECRELVRQYVPEIMKAVIS-LPEEQVCGAIGLCSASSLHR 354
Query: 355 -GAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDS 413
GA + + +VE E + A D VC CEMAV +V+ L +T+E+++ ++ LCD+
Sbjct: 355 GGAAKAAASRRLLVEDEALGAPDP-VCQFCEMAVSYVKIALANHETQEQIIGQLDGLCDT 413
Query: 414 LP-NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
L ++++DC+ IP+MP V+FTI K F LS E Y+L+ G A C+SGFM DLP
Sbjct: 414 LAIFSSSQALVDCEAIPSMPPVTFTIAGKKFTLSAEDYVLQVSAGGATQCVSGFMGLDLP 473
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVFMG YHTVFD G R+GFA++
Sbjct: 474 PPAGPLWILGDVFMGAYHTVFDVGNERVGFADS 506
>gi|12231180|dbj|BAB20973.1| aspartic proteinase 5 [Nepenthes alata]
Length = 358
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 237/339 (69%), Positives = 278/339 (82%), Gaps = 3/339 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L +FC L SC S++GL RIGLK++ D +S+ A RI RK G+
Sbjct: 1 MGHRNLWVIFCFCALISCFF-STSADGLVRIGLKRQFSDSNSIRAVRIARKAGM--NQGL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
++ GDSD DI+ LKN++DAQY+GEIGIGSPPQ FSVIFDTGSSNLWVPSSKCYFS+
Sbjct: 58 KRFQYSFGDSDTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSV 117
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IGKSCEI+YGSGSISGFFSQD VEVG++ VK+QVFIEA+RE
Sbjct: 118 ACYFHSKYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREK 177
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+EI+VGD VPVW NMVEQGLVSE+VFSFW NRDP A+ GGEIV
Sbjct: 178 SLTFALAKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D KHF G+H YVP+T+KGYWQFE+G+ LIGN STG C GGC AIVDSGTSLLAGP
Sbjct: 238 FGGIDEKHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPM 297
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGL 339
VVTE+NHAIG EG+ S ECK VV QYGD+IWDLLVSG+
Sbjct: 298 HVVTEVNHAIGAEGIASMECKEVVYQYGDMIWDLLVSGV 336
>gi|218188712|gb|EEC71139.1| hypothetical protein OsI_02961 [Oryza sativa Indica Group]
Length = 540
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 213/379 (56%), Positives = 283/379 (74%), Gaps = 3/379 (0%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
L LKNF++AQYFGEIG+G PPQNF+V+FDTGSSNLWVPS+KC FS++CYFH +Y+SR S+
Sbjct: 144 LALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKCVFSLACYFHRKYESRSSS 203
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G I+YG+GSI G++SQD V +GD+VV +Q FIEAT E LTFL A+FDGI+G
Sbjct: 204 TYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIEATHEPGLTFLAAKFDGILG 263
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+V A PVW NM++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HT
Sbjct: 264 LGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHT 323
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y VT+K YWQFE+GD LIG +STG+C GCA I DSGTSL+AGP + +I+ IG G
Sbjct: 324 YTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATG 383
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA 374
V + ECK VV+++G + +LL P +VC +IGLC +GA +S GI++V+ + + SA
Sbjct: 384 VANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGLCKSDGAHGISDGIESVLGETHKSA 443
Query: 375 GD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
+ A C+ACEMAV W+Q++ Q TKE L Y N+LC ++P+P+G S +DC I +P
Sbjct: 444 DEVSDATCNACEMAVTWMQSEFVQNHTKEGKLEYANQLCGNMPSPVG-SYVDCRHIGHLP 502
Query: 433 NVSFTIGDKIFNLSPEQYI 451
NV+F+IG + F L+PEQ +
Sbjct: 503 NVAFSIGGRAFELTPEQVL 521
>gi|115438741|ref|NP_001043650.1| Os01g0631900 [Oryza sativa Japonica Group]
gi|55297073|dbj|BAD68642.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113533181|dbj|BAF05564.1| Os01g0631900 [Oryza sativa Japonica Group]
Length = 522
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 213/377 (56%), Positives = 282/377 (74%), Gaps = 3/377 (0%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
L LKNF++AQYFGEIG+G PPQNF+V+FDTGSSNLWVPS+KC FS++CYFH +Y+SR S+
Sbjct: 129 LALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKCVFSLACYFHRKYESRSSS 188
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G I+YG+GSI G++SQD V +GD+VV +Q FIEAT E LTFL A+FDGI+G
Sbjct: 189 TYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIEATHEPGLTFLAAKFDGILG 248
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+V A PVW NM++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HT
Sbjct: 249 LGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHT 308
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y VT+K YWQFE+GD LIG +STG+C GCA I DSGTSL+AGP + +I+ IG G
Sbjct: 309 YTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATG 368
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSA 374
V + ECK VV+++G + +LL P +VC +IGLC +GA +S GI++V+ + + SA
Sbjct: 369 VANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGLCKSDGAHGISDGIESVLGETHKSA 428
Query: 375 GD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
+ A C+ACEMAV W+Q++ Q TKE L Y N+LC ++P+P+G S +DC I +P
Sbjct: 429 DEVSDATCNACEMAVTWMQSEFVQNHTKEGKLEYANQLCGNMPSPVG-SYVDCRHIGHLP 487
Query: 433 NVSFTIGDKIFNLSPEQ 449
NV+F+IG + F L+PEQ
Sbjct: 488 NVAFSIGGRAFELTPEQ 504
>gi|307103455|gb|EFN51715.1| hypothetical protein CHLNCDRAFT_59800 [Chlorella variabilis]
Length = 523
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 234/522 (44%), Positives = 328/522 (62%), Gaps = 44/522 (8%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
CL+ A + G ++ L R+L L + + K R + A + ++D + +P+
Sbjct: 13 CLVATAQATGPLKVHL--RKLPLVAEQRQHLKDKHRLVTLAPAA-------ENDAEPVPI 63
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
NFMDAQY+GEIG+GSPPQ+F VIFDTGSSNLWVPSSKC Y S++CY HS+Y + +S+TY
Sbjct: 64 TNFMDAQYYGEIGLGSPPQSFQVIFDTGSSNLWVPSSKCSYLSVACYLHSKYYAERSHTY 123
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
E G+ I YGSG +SGF SQD + +G + V+ QVF EAT E SL F+ ARFDGI+G+G
Sbjct: 124 KEDGREFAIQYGSGQLSGFLSQDTLSMGGLKVEGQVFAEATMEPSLAFIAARFDGILGMG 183
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F EIAVG P + NM++Q L+ E VFSFWLNR + EEGGE+V GGVDP HF G+HT+V
Sbjct: 184 FPEIAVGKVTPPFQNMLQQSLLPEPVFSFWLNRKVEGEEGGELVLGGVDPDHFVGEHTWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT++G+WQF++ + + C+GGC AI D+GTSLL GP V+ IN AIG E V+
Sbjct: 244 PVTRRGFWQFKMDGMEVEGGGE-FCKGGCQAIADTGTSLLVGPPDVIDAINAAIGAEPVL 302
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNG-----------AEY---VSTG 362
+CK +V QY I L++ + P+ VCQ +GLC+ G A+Y +
Sbjct: 303 VEQCKEMVHQYLPEIIK-LINNMPPQAVCQSVGLCSAAGVGEDRRVLSKSAQYRRLLKMY 361
Query: 363 IKTVVEKENVSAGD-----------------SAVCSACEMAVVWVQNQLKQKQTKEKVLS 405
+ +++ ++AG + C C+ V +++ L +T +++
Sbjct: 362 GQQQGQEQPLAAGTGEGEEEAQAGGVGGAAANDSCEMCQFVVQYLKIALANNETMAQIMH 421
Query: 406 YINELCDSLP-NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCIS 464
++ C++ GES++DC + MP+++FT+G K F L PEQY+LK G E C+S
Sbjct: 422 NLDRACETFSFGSGGESVVDCKALHKMPSIAFTVGGKEFVLGPEQYVLKIGSMGEEQCVS 481
Query: 465 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
GFM D+PPP GPLWILGD+F+G YHTVFD G R+GFA+AA
Sbjct: 482 GFMGLDIPPPLGPLWILGDMFIGPYHTVFDYGNERVGFAQAA 523
>gi|145352062|ref|XP_001420378.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580612|gb|ABO98671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 454
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 212/438 (48%), Positives = 285/438 (65%), Gaps = 9/438 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
+ N+MDAQY+GEI IG+P Q F V+FDTGSSNLWVPSSKC + I C H+++ SR S T
Sbjct: 18 VHNYMDAQYYGEIEIGNPRQKFQVVFDTGSSNLWVPSSKCGFLQIPCDLHAKFDSRASET 77
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SGF S+D V+VGD+VV+ Q F EAT+E + FL ++FDGI+GL
Sbjct: 78 YEADGTPFAIQYGSGSLSGFLSKDEVKVGDLVVQGQYFAEATKEPGIAFLFSKFDGILGL 137
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-----AEEGGEIVFGGVDPKHFK 250
GF IAV PV+ NM+EQGLV ++FSFWLNR +E GGE++FGG DP HF
Sbjct: 138 GFDNIAVDKVKPVFYNMMEQGLVENKMFSFWLNRTSTKDGMPSEVGGELIFGGSDPDHFI 197
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINH 308
G+HTY PVT++GYWQ ++ D + +S G C+G GC I D+GTSLLAGPT +V +IN
Sbjct: 198 GEHTYAPVTREGYWQIKMDDFKVDGRSLGACDGDDGCQVIADTGTSLLAGPTEIVNKIND 257
Query: 309 AIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE 368
IG ++ EC+L++ QY + + L E++C IG C +G E + +
Sbjct: 258 YIGAHSMIGEECRLLIDQYAEQFVEDL-ENYSSEQICASIGACDADGVEAMEADDDDDLG 316
Query: 369 KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
K + S C+AC+ V + Q+ L Q T++ +++ + +CD +P+ G + +DCD I
Sbjct: 317 KSSSSFEGQIACTACKTVVNYAQDMLAQNVTEKIIVNEVKRVCDMVPSVGGTASVDCDNI 376
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 488
P MP+V F IG F L+PEQY+LK + C+SGFM D+P P GPLWILGDVF+G
Sbjct: 377 PNMPDVEFVIGGVPFKLTPEQYVLKVYQDGEAQCVSGFMGMDIPKPAGPLWILGDVFLGP 436
Query: 489 YHTVFDSGKLRIGFAEAA 506
YHT FD R+GFA AA
Sbjct: 437 YHTEFDYANRRVGFAPAA 454
>gi|414887123|tpg|DAA63137.1| TPA: hypothetical protein ZEAMMB73_794362 [Zea mays]
Length = 608
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 193/300 (64%), Positives = 234/300 (78%), Gaps = 4/300 (1%)
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD
Sbjct: 309 NMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGD 368
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+L+ +STG C GGCAA+ DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG
Sbjct: 369 VLVDGKSTGFCAGGCAAVADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQ 428
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMA 386
I DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMA
Sbjct: 429 ILDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMA 488
Query: 387 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 446
VVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L
Sbjct: 489 VVWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLK 548
Query: 447 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 549 PEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 608
>gi|255085919|ref|XP_002508926.1| predicted protein [Micromonas sp. RCC299]
gi|226524204|gb|ACO70184.1| predicted protein [Micromonas sp. RCC299]
Length = 557
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 242/560 (43%), Positives = 328/560 (58%), Gaps = 64/560 (11%)
Query: 5 LLRSVFCLWVLASCLLLPA-------SSNGLRRIGLKKRRLD-----LHSLNAARITRKE 52
+LRS+ L+++ + L A S+ L R + KR L ++ AR R E
Sbjct: 4 ILRSIVALFLVCALCLAAAPGASALVESSHLPRAKVHKRALGPPETVKKCVDVARRARYE 63
Query: 53 RYMGGAGVSGVRHRLGDSDEDILP------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGS 106
R+ A + HR D D L + N+MDAQY+G + IG+PPQ+F V+FDTGS
Sbjct: 64 RF--SARLHDEPHR--DPDGPTLAGGTPECISNYMDAQYYGAVSIGTPPQSFLVVFDTGS 119
Query: 107 SNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGD 165
SNLW+PS+KC F I C H +Y+S S+TY +G I YGSGS+SGF SQD V
Sbjct: 120 SNLWIPSAKCSFLQIPCDLHQKYRSGDSSTYKALGDPFAIQYGSGSLSGFLSQDTVTWAG 179
Query: 166 VVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSF 225
+ +KDQVF EAT+E + FL ++FDGI+G+G+ I+V P + N V+QGLV E VFSF
Sbjct: 180 LEIKDQVFAEATKEPGIAFLFSKFDGILGMGWDTISVNGVKPPFYNAVDQGLVVENVFSF 239
Query: 226 WLNR---DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC- 281
WLNR + EGGEIV GGVDP HF G+HT++ VT++GYWQ + D+L+G S G C
Sbjct: 240 WLNRDADEGGDGEGGEIVLGGVDPAHFVGEHTWLNVTREGYWQIAMDDVLLGGVSVGQCG 299
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD-LIWDLLVSGLL 340
+ GCAAIVD+GTSLLAGPT VV +N IG + V+ EC++++ QYGD LI DL +
Sbjct: 300 KKGCAAIVDTGTSLLAGPTKVVEALNKRIGAKSVLGEECRVMIDQYGDELIRDL--AEFS 357
Query: 341 PEKVCQQIGLCAFNGAEYVST--------------------------GIKTVVEKENVSA 374
+C +GLC + ST G VV + +
Sbjct: 358 ATDICTSVGLCGPSSETKTSTSRRRGERRRARLGSSWLEWARGWARVGRDAVVLGSDAAP 417
Query: 375 GD------SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
D +AVC AC AV + ++ L Q T+ +L +CD +P+ GE+ +DCD +
Sbjct: 418 IDADGLEGAAVCQACVYAVDYAKSLLTQNATESIILDEFKSVCDLIPSSGGEAAVDCDAV 477
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILK--TGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
MP+V F +G + F L+P+QY+LK G+G CISGFM D+PPP GPLWILGDVF+
Sbjct: 478 SKMPDVEFVLGGRPFKLTPDQYVLKVDAGQGGPAQCISGFMGLDIPPPAGPLWILGDVFI 537
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G YH+VFD R+G A+AA
Sbjct: 538 GPYHSVFDYDNARVGLADAA 557
>gi|413942271|gb|AFW74920.1| hypothetical protein ZEAMMB73_522985 [Zea mays]
Length = 468
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/300 (64%), Positives = 233/300 (77%), Gaps = 4/300 (1%)
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD
Sbjct: 169 NMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGD 228
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG
Sbjct: 229 VLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQ 288
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMA 386
I DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMA
Sbjct: 289 ILDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEARKSNGGLKSDPMCNACEMA 348
Query: 387 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLS 446
VVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP++ FTIG K F L
Sbjct: 349 VVWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLVSMPDIVFTIGGKKFKLK 408
Query: 447 PEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 409 PEQYILKVGEGQAVQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 468
>gi|459426|emb|CAA54478.1| aspartic protease [Brassica oleracea]
Length = 292
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/294 (65%), Positives = 234/294 (79%), Gaps = 7/294 (2%)
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LVSE FSFWLNR+ D EEGGE+VFGGVDPKHFKG+H YVPVT+KGYWQF++GD+LIG
Sbjct: 2 LVSE--FSFWLNRNADDEEGGELVFGGVDPKHFKGQHIYVPVTQKGYWQFDMGDVLIGGA 59
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLV 336
TG CE GC+AI DSGTSLLAGPT ++T INHAIG GV S +CK VV QYG I DLL+
Sbjct: 60 PTGYCESGCSAIADSGTSLLAGPTTIITMINHAIGASGVASQQCKTVVDQYGQTILDLLL 119
Query: 337 SGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS----AGDSAVCSACEMAVVWVQN 392
S P+K+C QIGLC F+G VS GI++VV+KEN GD+A CSACEMAVVW+Q+
Sbjct: 120 SETQPKKICSQIGLCTFDGKRGVSMGIESVVDKENAKLSNGVGDAA-CSACEMAVVWIQS 178
Query: 393 QLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYIL 452
QL+Q T+E++L Y+NELC +P+PMGES +DC ++ TMP VS TIG K+F+L+P +Y+L
Sbjct: 179 QLRQNMTQERILDYVNELCRRIPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLAPHEYVL 238
Query: 453 KTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
K GEG A CISGF+A D+ PPRGPLWILGDVFMG YHTVFD GK ++GFAEAA
Sbjct: 239 KVGEGAAAQCISGFIALDVAPPRGPLWILGDVFMGKYHTVFDFGKAQVGFAEAA 292
>gi|414871124|tpg|DAA49681.1| TPA: hypothetical protein ZEAMMB73_239621 [Zea mays]
Length = 299
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/299 (64%), Positives = 232/299 (77%), Gaps = 4/299 (1%)
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 1 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 60
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 331
L+ +STG C GGCAAI DSGTSLLAGP ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 61 LVDGKSTGFCAGGCAAIADSGTSLLAGPIAIITEINEKIGAAGVVSQECKTVVSQYGQQI 120
Query: 332 WDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAV 387
DLL++ P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAV
Sbjct: 121 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAV 180
Query: 388 VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 447
VW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F L P
Sbjct: 181 VWMQNQLAQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKLKP 240
Query: 448 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
EQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 241 EQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFAESA 299
>gi|413917603|gb|AFW57535.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917604|gb|AFW57536.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 294
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/296 (62%), Positives = 236/296 (79%), Gaps = 3/296 (1%)
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
M EQ L++E+VFSFWLNR PDA GGE+VFGGVDP HF G HTYVPV++KGYWQF++GD
Sbjct: 1 MQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGVDPAHFSGNHTYVPVSRKGYWQFDMGD 60
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+LI STG C GCAAIVDSGTSLLAGPT ++ ++N AIG +G++S ECK VVSQYG++
Sbjct: 61 LLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIAQVNEAIGADGIISTECKEVVSQYGEM 120
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 390
I D+L++ P++VC Q+GLC F+GA VS GI++VV KEN+ G +CSAC+MAVVW+
Sbjct: 121 ILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGIESVVGKENL--GSDVMCSACQMAVVWI 178
Query: 391 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 450
+NQL++ +TKE +L Y N+LC+ LP+P GES + C I MP+++FTI +K F L+P+QY
Sbjct: 179 ENQLRENKTKELILQYANQLCERLPSPNGESTVSCQEISKMPSLAFTIANKTFTLTPQQY 238
Query: 451 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
I+K +G VCISGFMA+D+PPPRGPLWILGDVFMG YHTVFD G RIGFAE+A
Sbjct: 239 IVKLEQGGQTVCISGFMAYDVPPPRGPLWILGDVFMGAYHTVFDFGNDRIGFAESA 294
>gi|320165710|gb|EFW42609.1| lysosomal aspartic protease [Capsaspora owczarzaki ATCC 30864]
Length = 462
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 205/450 (45%), Positives = 286/450 (63%), Gaps = 31/450 (6%)
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
A ++ R LG + + L NF +AQY+GEI IG+PPQ F V+FDTGSSN WVPS+ C
Sbjct: 37 AAINPNRRSLGANPA--VNLGNFENAQYYGEIEIGTPPQKFKVVFDTGSSNAWVPSATCK 94
Query: 118 FS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+ + C H +Y S KS+TY G + I YGSGS++G+ SQD V + V +QVF EA
Sbjct: 95 ITDLPCDLHKKYHSEKSSTYVANGTTFAIQYGSGSLTGYLSQDTFTVAGLKVTNQVFAEA 154
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA--E 234
T E L F+LARFDG++GLGF+EI+V + VPV+ NMV QGL++ F+FWL+R+ + +
Sbjct: 155 TNEPGLAFVLARFDGLLGLGFQEISVLNVVPVFYNMVAQGLLNSASFAFWLSRNGTSILK 214
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE+V GGVDP H+ G TY+PV+K GYWQF L + +G+ + G G I DSGTS
Sbjct: 215 PGGELVLGGVDPSHYTGAFTYIPVSKPGYWQFALDSVQVGSTTFGANTQG---IADSGTS 271
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGP V +IN IG G+++ EC +++ QY +I + LV L P +C++IG C N
Sbjct: 272 LLAGPVADVKKINAQIGAIGILAEECDMIIEQYEPIIVEGLVQRLDPVTICKEIGSCKAN 331
Query: 355 GAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
S C C++ + + +L +T+ + + + C+ L
Sbjct: 332 A---------------------STSCYTCKLLITALDAELGNNRTQAAIEAALEGQCNRL 370
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISGFMAFDLPP 473
P+P GES++DC ++ TMP +SF +G K F L+P+QY+L+ T EG +E CISGF+ D+PP
Sbjct: 371 PSPDGESLVDCTKLDTMPTISFVLGGKSFPLTPKQYVLEVTSEGQSE-CISGFIGLDVPP 429
Query: 474 PRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
P GPL+ILGDVFMGVY+T FD R+GFA
Sbjct: 430 PLGPLYILGDVFMGVYYTHFDMANKRVGFA 459
>gi|116793748|gb|ABK26865.1| unknown [Picea sitchensis]
Length = 284
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/283 (62%), Positives = 229/283 (80%), Gaps = 4/283 (1%)
Query: 227 LNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCA 286
+NR+ D E+GGEIVFGGVDP HFKG+H Y VT+KGYWQF++GD LI NQSTG C GGCA
Sbjct: 1 MNRNSDEEDGGEIVFGGVDPNHFKGEHEYASVTRKGYWQFDMGDFLIDNQSTGFCAGGCA 60
Query: 287 AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQ 346
AIVDSGTSLLAGP+ ++T+IN+AIG G+VS ECK VVSQYGD+I +LL++ P+K+C
Sbjct: 61 AIVDSGTSLLAGPSGIITQINNAIGASGIVSQECKTVVSQYGDVIMELLMAQTNPKKICS 120
Query: 347 QIGLCAFNGAEYVSTGIKTVV----EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEK 402
QIGLC+++GA V GI +V+ EKE +S+ C+ACEMAVVWVQNQ+ + QTKE+
Sbjct: 121 QIGLCSYDGARDVGIGIASVLEKTHEKETLSSISDGTCTACEMAVVWVQNQIARNQTKEQ 180
Query: 403 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
+++Y+N+LCD LP+P GES++DCD++ +MP VSF+IG+K F+L+P+QYIL+ GEG C
Sbjct: 181 IMTYLNQLCDRLPSPNGESVVDCDQVSSMPTVSFSIGNKTFSLTPDQYILQVGEGSVAQC 240
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+SGFM D+ PP GP+WILGD+FMGVYHTVFD G R+GFAEA
Sbjct: 241 VSGFMGLDVSPPLGPIWILGDIFMGVYHTVFDYGNSRVGFAEA 283
>gi|440803835|gb|ELR24718.1| aspartic proteinase, partial [Acanthamoeba castellanii str. Neff]
Length = 489
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 202/477 (42%), Positives = 279/477 (58%), Gaps = 53/477 (11%)
Query: 35 KRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
+R+ +L + A +KE + GG GV P+ NF+DAQY+GEI IG+P
Sbjct: 59 QRKAELKKVEA---MKKEVFGGGKGVE--------------PISNFLDAQYYGEISIGNP 101
Query: 95 PQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ F+V+ DTGSSNLWVPS +C ++ I+C H +Y KS+TY G + +I YGSG++S
Sbjct: 102 PQYFNVVLDTGSSNLWVPSIQCPWYEIACDLHHKYDHSKSSTYKANGTNFQIQYGSGAMS 161
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S DNV + + K Q+F EA E L F+ A+FDGI+GLGF I+V PVW ++
Sbjct: 162 GFLSADNVVIAGLTAKGQLFAEAVAEPGLAFVAAQFDGILGLGFDTISVDGVPPVWYTLL 221
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q V+E VF+FWLNRDP GGE+V GGVD H+ G TY P+TK+GYWQF D LI
Sbjct: 222 AQSQVAEPVFAFWLNRDPSGISGGELVLGGVDESHYTGDFTYTPITKEGYWQFLAHDFLI 281
Query: 274 GNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIW 332
+S G C GGC AI D+GTSLLAGP+ +V +IN I G++ +EC ++V+QY I
Sbjct: 282 NGKSMGFCPAGGCKAIADTGTSLLAGPSKIVAQINKMINATGILESECDMLVNQYAGQII 341
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 392
++ GL P++VC + LC C C++ V +
Sbjct: 342 QYILQGLQPDQVCSAVNLCP------------------------GGSCQLCKVLVSTIDA 377
Query: 393 QLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI----GDKIFNLSPE 448
L +++++++ + +C GE+ +DC +P++P I G K F L PE
Sbjct: 378 ILGTDPSQQEIVALLKYIC------TGEATVDCKTLPSLPTFDVVIPTANGPKTFTLKPE 431
Query: 449 QYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
YILK G E CISGF+ D+P P GPLWI+GDVF+G Y+T FD G ++GFA A
Sbjct: 432 DYILKQSMGPEETCISGFIGLDIPAPYGPLWIMGDVFLGPYYTKFDFGNKQLGFAVA 488
>gi|412987808|emb|CCO19204.1| cathepsin D (lysosomal aspartyl protease) [Bathycoccus prasinos]
Length = 628
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 232/542 (42%), Positives = 314/542 (57%), Gaps = 69/542 (12%)
Query: 33 LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----------LPLKNFM 81
+K +L + E+Y A S + + +S ED +P+ N+M
Sbjct: 88 MKASKLRAKHAEMKKKQMVEKYTRNAETSLMEDKKMESSEDAAIGGEGGATSSVPIANYM 147
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIG 140
DAQY+G + IG+P Q F V FDTGSSNLWVPSSKC FS I C H +Y S KS +Y G
Sbjct: 148 DAQYYGPVEIGTPGQKFQVCFDTGSSNLWVPSSKCKFSQIPCDAHEKYDSEKSRSYEPNG 207
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVV-VKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
+ I YGSGS+SGF S D V +G+ + +KDQ F EAT+E LTFL A+FDGI+GLGF+E
Sbjct: 208 EDFAIQYGSGSLSGFLSSDTVRLGNSIEIKDQTFAEATKEPGLTFLFAKFDGILGLGFKE 267
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE---EGGEIVFGGVDPKHFKGKHTYV 256
IAV PV+DN V Q V ++ FSFWLNRD D + +GGE+VFGGVD KHF G+H +V
Sbjct: 268 IAVDGVTPVFDNAVAQNQVEKDQFSFWLNRDQDGDGVVDGGELVFGGVDEKHFVGEHVWV 327
Query: 257 PVTKKGYWQFELGDILIG--------NQSTGVCEGGCA---AIVDSGTSLLAGPTPVVTE 305
+TKKGYWQF+L D+ +G N T V AI D+GTSLLAGP+ V+ +
Sbjct: 328 DLTKKGYWQFDLDDVKVGEFSFIDDKNDKTTVSFSSSTKHQAIADTGTSLLAGPSAVIDK 387
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC-------AFNGAEY 358
IN AIG E ++ ECK+ + +YG+ D + + ++C+ + +C A
Sbjct: 388 INDAIGAENLMIQECKIAIKRYGEEFLDDIET-YDSSQICESLNICPAAAETNAIEKEIS 446
Query: 359 VSTGI---------KTVVEKEN-----------------------VSAGDSAVCSACEMA 386
TG+ T EK++ + CSACEMA
Sbjct: 447 EPTGVLATSRKLLMTTREEKKHRGLRGGLSLLGDLFKPSKKNEEKETKKSKVACSACEMA 506
Query: 387 VVWVQNQLKQKQTKEKVLSYINELCDSLP-NPMGESIIDCDRIPTMPNVSFTIGDKIFNL 445
V + + L+ T+ VL+ + ++CD +P P G++ +DC+ I MPN+SFTI K F L
Sbjct: 507 VDYAKELLQANVTRTVVLNELEKVCDFVPAQPGGQAGVDCNAIVEMPNISFTIAGKSFEL 566
Query: 446 SPEQYILKTGEGI-AEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
+P+QY+L+ +G + CISGFM D+P P GPLWILGDVF+G YHTVFD G R+GFA+
Sbjct: 567 TPKQYVLEIDDGQGSNTCISGFMGLDVPKPMGPLWILGDVFLGPYHTVFDHGGSRVGFAK 626
Query: 505 AA 506
AA
Sbjct: 627 AA 628
>gi|449533814|ref|XP_004173866.1| PREDICTED: aspartic proteinase-like, partial [Cucumis sativus]
Length = 290
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 178/282 (63%), Positives = 224/282 (79%), Gaps = 4/282 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV--R 64
+ CL++L S ++ + SN GL R+GLKK LD + AAR+ K+ + A
Sbjct: 9 AFLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYSPN 68
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACH 128
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S+TY + G S I YG+G++SGFFS DNV+VGD+VVK+Q+FIEATRE LT
Sbjct: 129 FHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLT 188
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 189 FLVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGG 248
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGC 285
VDPKH+ GKHTYVPVT+KGYWQF++GD+LI + TG CEGGC
Sbjct: 249 VDPKHYTGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGC 290
>gi|8272388|dbj|BAA96446.1| aspartic endopeptidase [Pyrus pyrifolia]
Length = 273
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 167/273 (61%), Positives = 215/273 (78%), Gaps = 3/273 (1%)
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GEIVFGGVD HFKG+HTYVPVT+KGYWQF++GD+LI +S+G C GC+AI DSGTSLL
Sbjct: 1 GEIVFGGVDSSHFKGEHTYVPVTQKGYWQFDMGDVLIDGESSGFCANGCSAIADSGTSLL 60
Query: 297 AGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGA 356
AGPT VVT+INHAIG GVVS ECK VV QYG I ++L++ P+K+C QIG C F+G
Sbjct: 61 AGPTTVVTQINHAIGASGVVSQECKTVVEQYGKTIIEMLMAKSQPQKICSQIGFCTFDGT 120
Query: 357 EYVSTGIKTVVEKENVSAGD---SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDS 413
VS GI+++V++ D A C+ACEM VV +Q +L++ QT+E++L Y+N+LC+
Sbjct: 121 RGVSPGIESLVDQNPEKQSDGVHDATCAACEMPVVLMQIRLRKNQTEEQILDYVNQLCER 180
Query: 414 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 473
LP+P GES++ CD + ++P+VSFTIG K+F+L+PEQY+LK GEG+A CISGF+A D+ P
Sbjct: 181 LPSPSGESVVQCDSLSSLPSVSFTIGGKVFDLAPEQYVLKVGEGVAAQCISGFIALDVAP 240
Query: 474 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PRGPLWILGD+FMG YHTVFD G L +GFAEAA
Sbjct: 241 PRGPLWILGDIFMGRYHTVFDYGNLSVGFAEAA 273
>gi|413953120|gb|AFW85769.1| hypothetical protein ZEAMMB73_486102 [Zea mays]
Length = 267
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 169/267 (63%), Positives = 207/267 (77%), Gaps = 4/267 (1%)
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
+D H+KG HT+VPVT+KGYWQF +GD+L+ +STG C GGCAA+ DSGTSLLAGPT ++
Sbjct: 1 MDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAMADSGTSLLAGPTAII 60
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGI 363
TEIN IG GVVS ECK VVSQYG I DLL++ P K+C Q+GLC F+G VS GI
Sbjct: 61 TEINEKIGVAGVVSQECKTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 120
Query: 364 KTVVE----KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMG 419
++VV+ K N +C+ACEMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMG
Sbjct: 121 RSVVDDEAGKSNGGLKSDPMCNACEMAVVWMQNQLAQNKTQELILNYINQLCERLPSPMG 180
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
ES +DC + +MP+++FTIG K F L PEQYILK GEG A CISGF A D+PPPRGPLW
Sbjct: 181 ESAVDCGSLASMPDIAFTIGGKKFKLKPEQYILKVGEGQAAQCISGFKAMDIPPPRGPLW 240
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEAA 506
ILGDVFMGVYHTVFD GKLR+GFAE+A
Sbjct: 241 ILGDVFMGVYHTVFDYGKLRVGFAESA 267
>gi|413934460|gb|AFW69011.1| hypothetical protein ZEAMMB73_821214 [Zea mays]
Length = 324
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 163/265 (61%), Positives = 199/265 (75%), Gaps = 4/265 (1%)
Query: 219 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 278
S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+L+ +ST
Sbjct: 60 SDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKST 119
Query: 279 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSG 338
G C GGCAA+ DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG I DLL++
Sbjct: 120 GFCAGGCAAMADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQILDLLLAE 179
Query: 339 LLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSACEMAVVWVQNQL 394
P K+C Q+GLC F+G VS GI++VV+ K N +C+ACEMAVVW+QNQL
Sbjct: 180 TQPAKICSQVGLCTFDGTHGVSAGIRSVVDDEAGKSNGGLKSDPMCNACEMAVVWMQNQL 239
Query: 395 KQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT 454
Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP++ FTIG K F L PEQYILK
Sbjct: 240 AQNKTQELILNYINQLCERLPSPMGESAVDCGSLASMPDIVFTIGGKKFKLKPEQYILKV 299
Query: 455 GEGIAEVCISGFMAFDLPPPRGPLW 479
GEG A CISGF A D+PPPRGPLW
Sbjct: 300 GEGQAAQCISGFTAMDIPPPRGPLW 324
>gi|303285091|ref|XP_003061836.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457166|gb|EEH54466.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 338 bits (867), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 187/493 (37%), Positives = 275/493 (55%), Gaps = 73/493 (14%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L R+ L KR +D +++A R+ A ++ + G + + + N+MDAQYFG
Sbjct: 54 LPRVSLSKRVVDARAVHA-RVVATRANEANARLNSM---YGADADARVSITNYMDAQYFG 109
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEIN 146
+ IG+PPQ+F V+FDTGSSNLWVPSSKC F+ I C H +Y ++ S+T+ + G I
Sbjct: 110 AVSIGTPPQSFDVVFDTGSSNLWVPSSKCKFTQIPCDLHHKYDAKASSTHAQNGTDFAIQ 169
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YGSGS+SGF S D V G + + Q F EATRE L F+ A+FDGI+G+G+ I+V V
Sbjct: 170 YGSGSLSGFLSADVVGWGGLEIASQTFAEATREPGLAFMFAKFDGILGMGWDTISVDKVV 229
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRD---PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P + N QGLV ++VFSFWLNRD PD GGE+V GGVDP H+ G+H ++PVT++GY
Sbjct: 230 PPFYNAYAQGLVPDDVFSFWLNRDESHPDG-PGGELVLGGVDPAHYVGEHAWLPVTREGY 288
Query: 264 WQFELGDILIGNQSTGVCE--GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
WQ + D+++ S G C+ GCAAI+D+GTSLLAGP V+ +IN IG +++ EC+
Sbjct: 289 WQVRMDDVIVDGASAGECDETDGCAAILDTGTSLLAGPKDVIEKINAKIGARPILNEECR 348
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA-----------------------FNGAEY 358
+++ QYG+ + D V P+ +C GLC +
Sbjct: 349 VMIEQYGEELID-DVKKFGPKAICVSAGLCHEKTERQPPQRPASSSPFDILGRLAKKSRA 407
Query: 359 VSTGIKTVVEKEN-----------VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYI 407
++ + V+E +A A C ACEMAV + Q+ +K T+ +L+ +
Sbjct: 408 RASVTRRVLEGRRGRLWADAAADADAASQPASCRACEMAVAYAQSLIKTNVTRALILNEL 467
Query: 408 NELCDSLPNPMGESI---------------------------IDCDRIPTMPNVSFTIGD 440
LCD +P+ GE++ +DCD + MP+VSF +G
Sbjct: 468 KSLCDHIPSKGGEAVRRLPVRPSFVRHVSLTDTRAPDSSSKGVDCDAVDAMPDVSFVLGG 527
Query: 441 KIFNLSPEQYILK 453
K + L+P QY+L+
Sbjct: 528 KAWTLTPRQYVLR 540
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 34/47 (72%), Positives = 38/47 (80%)
Query: 459 AEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
AE C+SGFM D+PPP GPLWILGDVF+G YHTVFD G R+G AEA
Sbjct: 600 AEQCVSGFMGLDVPPPAGPLWILGDVFIGPYHTVFDHGNARVGIAEA 646
>gi|413948512|gb|AFW81161.1| hypothetical protein ZEAMMB73_941917 [Zea mays]
Length = 243
Score = 328 bits (842), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 155/243 (63%), Positives = 188/243 (77%), Gaps = 4/243 (1%)
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQY
Sbjct: 1 MGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQY 60
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVE----KENVSAGDSAVCSAC 383
G I DLL++ P K+C Q+GLC F+G VSTGI++VV+ K N +C+AC
Sbjct: 61 GQQILDLLLAETQPAKICSQVGLCTFDGTHGVSTGIRSVVDDKAGKSNGGLKSDPMCNAC 120
Query: 384 EMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIF 443
EMAVVW+QNQL Q +T+E +L+YIN+LC+ LP+PMGES +DC + +MP+++FTIG K F
Sbjct: 121 EMAVVWMQNQLAQNKTQELILTYINQLCERLPSPMGESAVDCASLGSMPDIAFTIGGKKF 180
Query: 444 NLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
L PEQYILK GEG A CISGF A D+PPPRGPLWILGDVFMGVYHTVFD GKLR+GFA
Sbjct: 181 KLKPEQYILKVGEGQAAQCISGFTAMDIPPPRGPLWILGDVFMGVYHTVFDYGKLRVGFA 240
Query: 504 EAA 506
E+A
Sbjct: 241 ESA 243
>gi|33352213|emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 317 bits (813), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 166/362 (45%), Positives = 230/362 (63%), Gaps = 16/362 (4%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M + + ++ L +++ L + A G+ R+ L+K + L +L R Y+
Sbjct: 1 MARSYVPALIALAAVSALLGVAAEQQAGMLRVTLRKTEM-LTTLG-----RPRPYL---- 50
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
G + LG SD+ + LKNFMDAQY+GEIG+G+PPQ F+VIFDTGS+NLWVPSSKC F
Sbjct: 51 -LGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFNVIFDTGSANLWVPSSKCALF 109
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+I+C H +Y + KS TY G I YG+GS+ G+ SQD + G + +KDQ F EA
Sbjct: 110 NIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQDVLTWGGLTIKDQGFAEAIN 169
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF+ A+FDGI+G+GF I+V P + +VE+G ++ VFSFWLNRDP+A GGE
Sbjct: 170 EPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGE 229
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GG+DP HF G+HT+VPVT++GYWQF + + +G S +C GCAAI D+GTSL+AG
Sbjct: 230 LVLGGIDPTHFTGEHTWVPVTRQGYWQFTMEGLDLGPGSQKMCAKGCAAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLP-EKVCQQIGLCAFNGAE 357
P+ V +NHAIG +SA+C+ +V Y I L LP ++VC IGLC A
Sbjct: 290 PSDEVAALNHAIGATSALSAQCRQLVRDYLPQIIAQLHD--LPLDQVCASIGLCPMAAAS 347
Query: 358 YV 359
+
Sbjct: 348 TI 349
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 69/135 (51%), Positives = 94/135 (69%), Gaps = 4/135 (2%)
Query: 373 SAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
SAGDS VCS C+ AV +++ L+ T E++ + +LCD + + G S++DCD+I T+P
Sbjct: 447 SAGDSVVCSFCQTAVAYIKIALQSNSTIEQIADAVGQLCDQV-SFGGPSVVDCDKISTLP 505
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEV-CISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
+SF IG ++F L PEQY+L+ G E+ CISGFM D+P GPLWILGD+F+G YHT
Sbjct: 506 VISFNIGGRVFPLRPEQYVLQLDAGGGEMQCISGFMGLDVP--AGPLWILGDIFLGAYHT 563
Query: 492 VFDSGKLRIGFAEAA 506
VFD G R+GFA AA
Sbjct: 564 VFDYGAARLGFANAA 578
>gi|510880|emb|CAA56373.1| putative aspartic protease [Brassica oleracea]
Length = 255
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 154/250 (61%), Positives = 197/250 (78%), Gaps = 12/250 (4%)
Query: 14 VLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSGVRHRLGD 69
+++ L L AS+ +G R+GLKK +LD S AAR+ K+ + + G G LGD
Sbjct: 13 IVSFLLFLSASAERNDGTFRVGLKKLKLDRKSRIAARVGSKQLKPLRGYG-------LGD 65
Query: 70 S-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
S D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+C FHS+Y
Sbjct: 66 SGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIACLFHSKY 125
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
KS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF+LA+
Sbjct: 126 KSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVLAK 185
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLGF+EI+VG+A PVW NM++QGL E VFSFWLNR+ + EEGGE+VFGGVDP H
Sbjct: 186 FDGILGLGFQEISVGNAAPVWYNMLKQGLYKEPVFSFWLNRNAEDEEGGELVFGGVDPNH 245
Query: 249 FKGKHTYVPV 258
+KG+H YVPV
Sbjct: 246 YKGEHIYVPV 255
>gi|302840660|ref|XP_002951885.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
gi|300262786|gb|EFJ46990.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
Length = 559
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 218/340 (64%), Gaps = 20/340 (5%)
Query: 14 VLASCLLLPASSNG-LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
VL +C +L + +G L R+ LKK++L L A R Y+ + LG +
Sbjct: 14 VLVACTVLASGDSGALHRVQLKKKQLSL-----ATYGRPRPYL--------NNMLGYGGD 60
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSR 131
+PL NFMDAQY+GE+ +G+P Q F VIFDTGSSNLWVPSSKC +F+I+C H RY +
Sbjct: 61 --VPLHNFMDAQYYGEVSLGTPQQYFQVIFDTGSSNLWVPSSKCSFFNIACRLHRRYYAA 118
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S TY G + I YGSGS+ GF S+D + G + V +Q F EA E LTF+ A+FDG
Sbjct: 119 RSKTYKANGTAFSIQYGSGSLDGFISEDILGWGGLAVPEQGFAEAVNEPGLTFVAAKFDG 178
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+GF I+V VP + +V+ GL+SE VFSFWLNRD A GGE+V GGVDP HF G
Sbjct: 179 ILGMGFPAISVSGVVPPFTRLVDSGLLSEPVFSFWLNRDSSAAVGGELVLGGVDPAHFTG 238
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+HT+V VT++GYWQF L I +G+Q +C GC AI D+GTSL+AGP V INHAIG
Sbjct: 239 EHTWVDVTRRGYWQFNLDGIHLGSQR--LCTQGCPAIADTGTSLIAGPVDEVAAINHAIG 296
Query: 312 GEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
+SA+C+ +V +Y I L L ++VC IGLC
Sbjct: 297 ATSALSAQCRTLVREYLPEIVAAL-HNLPLDQVCASIGLC 335
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 85/135 (62%), Gaps = 4/135 (2%)
Query: 373 SAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
+ GDS CS C+ AV +++ L+ T E++ + LCD + + G S++DC ++ +P
Sbjct: 428 TTGDSVACSFCQTAVQYIRIALESNATIEQIADAVGNLCDQV-SFGGPSVVDCTKLSKLP 486
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAE-VCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
+ +G + F L PEQY+L+ G E C+SGFM D+P GPLWILGD+F+G YHT
Sbjct: 487 ILELEVGGRTFPLRPEQYVLRVDAGGGEEQCVSGFMGLDVP--VGPLWILGDIFLGAYHT 544
Query: 492 VFDSGKLRIGFAEAA 506
VFD G R+GFA AA
Sbjct: 545 VFDYGGSRLGFAVAA 559
>gi|4389326|pdb|1B5F|A Chain A, Native Cardosin A From Cynara Cardunculus L.
gi|6729875|pdb|1B5F|C Chain C, Native Cardosin A From Cynara Cardunculus L
Length = 239
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 151/238 (63%), Positives = 180/238 (75%), Gaps = 4/238 (1%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSKC S +C HS Y+S S
Sbjct: 4 VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYESSDS 63
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY E G I YG+GSI+GFFSQD+V +GD+VVK+Q FIEAT E FL FDGI+
Sbjct: 64 STYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFDGIL 123
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GL F+ I+V PVW NM+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G H
Sbjct: 124 GLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRGDH 179
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
TYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 180 TYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 237
>gi|291223847|ref|XP_002731917.1| PREDICTED: putative gut cathepsin D-like aspartic protease-like
[Saccoglossus kowalevskii]
Length = 389
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 209/309 (67%), Gaps = 7/309 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
+ L CLL + GL+RI L K R L+ +T K+ + G+ +++ G
Sbjct: 1 MRTLLICLLFVGLACGLQRIHLHKFRSVRRQLSDVGVTIKDLALSGS----LKYTQGAPI 56
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKS 130
++L KN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS KC + I+C FH +Y S
Sbjct: 57 PEVL--KNYLDAQYYGEIGLGTPQQKFNVVFDTGSSNLWVPSKKCPITDIACLFHKKYDS 114
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G EI YGSGS+ GF S+D++ + DVV K Q F EAT+E L F+ A+FD
Sbjct: 115 TKSSTYKVNGTKFEIQYGSGSMEGFLSEDSIAISDVVAKSQTFAEATKEPGLAFVAAKFD 174
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+G+ +I+V VPV DNM++Q L+ + VFSF+L+R+ + +GGE+ GG DPK++
Sbjct: 175 GILGMGYPQISVDGVVPVIDNMIQQQLIEKPVFSFYLDRNVNDSQGGELFLGGSDPKYYT 234
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G TYVPVT+KGYWQF++ I +G ++ C+GGC AI D+GTSL+AGPT V IN AI
Sbjct: 235 GNFTYVPVTRKGYWQFKMDGITLGGSASQFCKGGCQAIADTGTSLIAGPTEEVQAINKAI 294
Query: 311 GGEGVVSAE 319
G +VS E
Sbjct: 295 GATPIVSGE 303
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 42/99 (42%), Positives = 67/99 (67%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ + P GE +++C++I ++P+++F + +K F L YI++ + +C+SGF
Sbjct: 290 INKAIGATPIVSGEYMVNCNKIDSLPDITFVLNNKPFILKGRDYIMQVSQSGVTLCLSGF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GP+WILGDVF+G ++T FD G R+GFA A
Sbjct: 350 MGMDIPPPMGPIWILGDVFIGRFYTEFDRGNDRVGFATA 388
>gi|218944225|gb|ACL13150.1| cathepsin D [Azumapecten farreri]
Length = 396
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 203/315 (64%), Gaps = 12/315 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
V C++ L + + A S+ L RI L K R L + + K RY G+S
Sbjct: 3 VLCIFALLAVI---ACSSALHRIKLHRVKTVRRSLQEVGTSINLLKNRY---TGLSDRNG 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
RL D + PL N++DAQY+G I IG+P Q F V+FDTGSSNLWVPS KC S I+C
Sbjct: 57 RLLGPDPE--PLSNYLDAQYYGAIQIGTPAQEFKVVFDTGSSNLWVPSKKCKLSDIACLL 114
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S KS+TY + G EI YG+GS++GF S D+V +GD+ VK Q F EA + +TF
Sbjct: 115 HNKYDSTKSSTYKQNGTHFEIRYGTGSLTGFLSTDSVTIGDITVKGQTFAEAITQPGITF 174
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+G+ I+V VPV+ NMV+Q LV VFSF+L+RDPDA GGE++ GG
Sbjct: 175 VAAKFDGILGMGYDTISVDHVVPVFYNMVQQKLVDSPVFSFYLDRDPDASAGGELIIGGS 234
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DPKH+ G +Y P+TKKGYWQF++ I +G +++ C GGC+AI D+GTSLL GPT V
Sbjct: 235 DPKHYSGNFSYAPITKKGYWQFDMAGIQVGGKASAYCNGGCSAIADTGTSLLVGPTAEVQ 294
Query: 305 EINHAIGGEGVVSAE 319
++N IG E
Sbjct: 295 QLNKQIGATPFAGGE 309
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 73/129 (56%), Gaps = 2/129 (1%)
Query: 377 SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSF 436
SA C+ A+ L T E + +N+ + P GE +DCD+I ++P +SF
Sbjct: 268 SAYCNGGCSAIADTGTSLLVGPTAE--VQQLNKQIGATPFAGGEYTVDCDKISSLPPISF 325
Query: 437 TIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
I ++F L YILK + +C+SGF D+P P GPLWILGDVF+G +++ FD G
Sbjct: 326 MIDKQLFTLQGSDYILKVTQQGQTICLSGFAGIDVPAPLGPLWILGDVFLGKFYSEFDLG 385
Query: 497 KLRIGFAEA 505
++GFA+
Sbjct: 386 NNKVGFAQT 394
>gi|117662285|gb|ABK55693.1| aspartic proteinase [Cucumis sativus]
Length = 196
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 137/196 (69%), Positives = 168/196 (85%)
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS+YKS++S+TY + GKS I YG+G+ISG FS+DNV+VGD++VK Q FIEATRE
Sbjct: 1 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGCFSEDNVKVGDLIVKKQDFIEATREP 60
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GGEIV
Sbjct: 61 SLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIV 120
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLAGPT
Sbjct: 121 FGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPT 180
Query: 301 PVVTEINHAIGGEGVV 316
++T++NHAIG GVV
Sbjct: 181 TIITQVNHAIGASGVV 196
>gi|336454164|gb|AEI58896.1| cathepsin D [Pinctada maxima]
Length = 390
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 142/287 (49%), Positives = 199/287 (69%), Gaps = 8/287 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL---GDSDEDILPLKNFMDAQYFGEIGIGS 93
R+ LH + + R T +E G + ++ + G + PL N++DAQY+G IGIG+
Sbjct: 21 RIKLHKIKSVRRTLQEV---GTSIESLQQKYSGYGITGPAPEPLSNYLDAQYYGVIGIGT 77
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
P QNF V+FDTGSSNLWVPS KC + I+C H++Y S KS+TY + G EI YG+GS+
Sbjct: 78 PAQNFKVVFDTGSSNLWVPSKKCKVTDIACLLHNKYDSSKSSTYKKNGTDFEIRYGTGSL 137
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+ F +I+V VPV+ NM
Sbjct: 138 TGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMAFEKISVDGVVPVFYNM 197
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
V+QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYWQFE+ +
Sbjct: 198 VKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWQFEMDGVS 257
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+G S C GGC AI D+GTSL+AGPT ++++N AIG + +V+ E
Sbjct: 258 VGG-SAKFCSGGCNAIADTGTSLIAGPTSEISKLNKAIGAKPLVAGE 303
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 80/138 (57%), Gaps = 3/138 (2%)
Query: 368 EKENVSAGDSA-VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
E + VS G SA CS A+ L T E +S +N+ + P GE +DC+
Sbjct: 252 EMDGVSVGGSAKFCSGGCNAIADTGTSLIAGPTSE--ISKLNKAIGAKPLVAGEYTVDCN 309
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P ++FT+G K F+L + Y+L + C+SGF D+PPP GPLWILGDVF+
Sbjct: 310 AIPKLPKITFTLGGKQFDLEGKDYVLTVTQQGQTTCLSGFAPIDVPPPAGPLWILGDVFI 369
Query: 487 GVYHTVFDSGKLRIGFAE 504
G ++T FD G ++GFA+
Sbjct: 370 GKFYTEFDMGNTQVGFAQ 387
>gi|329754204|gb|AEC03508.1| cathepsin-D [Polyrhachis vicina]
Length = 384
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/284 (50%), Positives = 193/284 (67%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + +AR KE S ++ + + PL N++DAQY+G I IG+PPQ
Sbjct: 20 RIPLHKIKSARKHFKEVDTEICPTSILQGGMPHPE----PLSNYLDAQYYGAISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
NF VIFDTGSSNLWVPS KC+F+ I+C H++Y + KS+TY + G I+YGSGS+SG+
Sbjct: 76 NFKVIFDTGSSNLWVPSKKCHFTNIACLLHNKYDTTKSSTYKKNGTDFAIHYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + VKDQ F EA E L F+ A+FDGI+G+ + I+V PV+ NMV+Q
Sbjct: 136 LSTDTVTIGGLKVKDQTFAEAMSEPGLAFVAAKFDGILGMAYTTISVDGVTPVFYNMVKQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLVS+ VFSF+LNRDPDA+EGGE++ GG DP H+KG TYVPV +K YWQF++ + IG+
Sbjct: 196 GLVSQPVFSFYLNRDPDAKEGGELILGGSDPNHYKGDFTYVPVDRKAYWQFKMDSVQIGS 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+C+ GC AI D+GTSL+AGP + IN AIG +V E
Sbjct: 256 D-LKLCKQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIVGGE 298
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 50/104 (48%), Positives = 68/104 (65%)
Query: 402 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 461
K + IN+ + P GE ++DC+ IP +P ++F +G K F L E Y+LK + +
Sbjct: 280 KEIEAINKAIGATPIVGGEYMVDCNSIPNLPTINFVLGGKSFTLEGEDYVLKVAQFGKTI 339
Query: 462 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
C+SGFM D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 340 CLSGFMGMDIPPPNGPLWILGDVFIGKYYTEFDMGNNRVGFATA 383
>gi|405951067|gb|EKC19012.1| Lysosomal aspartic protease [Crassostrea gigas]
Length = 439
Score = 291 bits (744), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 144/297 (48%), Positives = 195/297 (65%), Gaps = 3/297 (1%)
Query: 36 RRLDLHSLN-AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
+R+ LH ++ R T ER + +R ++ + PL N+MDAQY+G I IG+P
Sbjct: 21 QRIKLHKIDKTVRETLLERGTTAEYLKRKYNRY-ETGPEPEPLSNYMDAQYYGPISIGTP 79
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQNF VIFDTGSSNLWVPS KC S I+C H++Y S KS+TY G EI YG+GS+
Sbjct: 80 PQNFKVIFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTKSSTYKANGTDFEIRYGTGSLK 139
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S D V VGD+ VKDQ F EAT + +TF+ A+FDGI+G+GF EI+V PV++NMV
Sbjct: 140 GFLSTDTVTVGDIKVKDQTFAEATEQPGITFVAAKFDGILGMGFPEISVKGVTPVFNNMV 199
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q LV +FSF+L+R+P GGE++ GG DPK++ G TYV VT+KGYWQF++ + +
Sbjct: 200 AQKLVPAPIFSFYLDRNPTGTPGGEMILGGSDPKYYSGNFTYVNVTRKGYWQFKMDGVKV 259
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+++ C GGC AI D+GTSLLAGP+ V +N IG + + E + S+ G L
Sbjct: 260 NGKASKYCSGGCNAIADTGTSLLAGPSTEVKSLNAMIGAKPFAAGEYTVDCSKIGSL 316
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/94 (43%), Positives = 59/94 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + + P GE +DC +I ++P VSFT+ K F L + YIL E +C+SGF
Sbjct: 292 LNAMIGAKPFAAGEYTVDCSKIGSLPPVSFTLNGKDFTLQGKDYILTVSEMGQTICLSGF 351
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
+ D+P P GPLWILGD+F+G ++T FD G R+
Sbjct: 352 IGLDIPAPAGPLWILGDIFIGAFYTEFDMGNSRV 385
>gi|224548868|dbj|BAH24176.1| aspartic proteinase [Sitophilus zeamais]
Length = 389
Score = 291 bits (744), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 138/279 (49%), Positives = 189/279 (67%), Gaps = 3/279 (1%)
Query: 42 SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI 101
SL + R G V V+ R D PL N++DAQY+G I IG+PPQNF+VI
Sbjct: 25 SLTKGKSVRNTLRDVGTHVQQVKLRYVSVDPSPEPLTNYLDAQYYGPISIGTPPQNFNVI 84
Query: 102 FDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDN 160
FDTGSSNLWVPS KC +I+C H++Y + KS+TY E G I YGSGS+SG+ S D+
Sbjct: 85 FDTGSSNLWVPSKKCELLNIACLLHNKYDATKSSTYKENGTEFAITYGSGSLSGYLSTDS 144
Query: 161 VEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSE 220
+ VG V VKDQ F EA +E LTF+ A+FDGI+G+ + I+V PV+ NM++Q LV+
Sbjct: 145 LSVGSVQVKDQTFGEAIKEPGLTFIAAKFDGILGMAYPRISVDGVTPVFYNMIDQNLVAA 204
Query: 221 EVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
+FSF+LNRDP+A+ GGEI+ GG DP +++G TY+PV ++ YWQF++ + + +QS +
Sbjct: 205 PIFSFYLNRDPNAQTGGEIILGGSDPNYYEGDFTYLPVDRQAYWQFKMDSVQVADQS--L 262
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
C+GGC AI D+GTSL+AGPT + +N AIG +V E
Sbjct: 263 CKGGCEAIADTGTSLIAGPTEEIAALNKAIGASAIVGGE 301
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 80/138 (57%), Gaps = 2/138 (1%)
Query: 368 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
+ ++V D ++C A+ L T+E ++ +N+ + GE I+DC+
Sbjct: 251 KMDSVQVADQSLCKGGCEAIADTGTSLIAGPTEE--IAALNKAIGASAIVGGEYIVDCNS 308
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
I ++P ++ T+G +F L E Y+LK E CISGF+ D+P P GPLWILGDVF+G
Sbjct: 309 ISSLPKINITLGGNLFTLEGEDYVLKVSELGQVTCISGFLGLDVPAPAGPLWILGDVFIG 368
Query: 488 VYHTVFDSGKLRIGFAEA 505
Y+T FD+G R+GFA A
Sbjct: 369 KYYTEFDAGNNRVGFATA 386
>gi|4586590|dbj|BAA76427.1| aspartic proteinase [Cicer arietinum]
Length = 204
Score = 290 bits (743), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 132/204 (64%), Positives = 166/204 (81%), Gaps = 2/204 (0%)
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIK 364
EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P +C Q+GLC+ + S GI+
Sbjct: 1 EINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVNPGDICSQVGLCSVRSDQSKSAGIE 60
Query: 365 TVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 422
V E + +SA D+ +CS+C+M V+WVQNQLKQK TKE+V +Y+N+LC+SLP+P GES+
Sbjct: 61 MVTENKQSEMSATDTPLCSSCQMLVIWVQNQLKQKATKERVFNYVNQLCESLPSPSGESV 120
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
I C+ + MPN+SFTIGDK F L+PEQY+L+TGEGI EVC+S F+AFD+PPP+GPLWILG
Sbjct: 121 ISCNDLSRMPNISFTIGDKPFVLTPEQYVLRTGEGITEVCLSAFIAFDIPPPKGPLWILG 180
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVFM YHTVFD G L++GFAEAA
Sbjct: 181 DVFMRAYHTVFDYGNLQVGFAEAA 204
>gi|33347413|gb|AAQ15289.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 132/192 (68%), Positives = 161/192 (83%)
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + GK I YG+G+ISGFFS+D+V VGD+VVKDQ FIEAT+E +TFL A+FDGI+GL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLAAKFDGILGL 64
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF+EI+VG+AVPVW NMV QGL+ E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSFWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++GD++I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 316 VSAECKLVVSQY 327
VS ECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|380746491|gb|AFE48185.1| cathepsin D [Pinctada margaritifera]
Length = 390
Score = 287 bits (735), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 139/298 (46%), Positives = 201/298 (67%), Gaps = 8/298 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL---GDSDEDILPLKNFMDAQYFGEIGIGS 93
R+ LH + + R T +E G + ++ + G + PL N++DAQY+G IGIG+
Sbjct: 21 RIKLHKIKSVRRTLQEV---GTSIESLQQKYSGYGITGPAPEPLSNYLDAQYYGVIGIGT 77
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
P QNF V+FDTGSSNLWVPS KC FS I+C H++Y S KS+TY + + EI YG+GS+
Sbjct: 78 PAQNFKVVFDTGSSNLWVPSKKCKFSDIACLLHNKYDSSKSSTYKKNDTTFEIRYGTGSL 137
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+ F +I+V VPV+ NM
Sbjct: 138 TGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMAFDKISVDGVVPVFYNM 197
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
++QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYW+F++ +
Sbjct: 198 IKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWEFKMDGVS 257
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+G ++ C GGC I D+GTSL+AGP+ V ++N AIG + E + ++ DL
Sbjct: 258 VG-ENHKFCTGGCNTIADTGTSLIAGPSSEVKKLNAAIGATAIPGGEYMIDCTKIPDL 314
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 64/98 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + P GE +IDC +IP +P ++F++G + F+L + Y+L + C+SGF
Sbjct: 290 LNAAIGATAIPGGEYMIDCTKIPDLPKITFSLGGQQFDLEGKDYVLTVTQQGQTTCLSGF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
D+PPP GPLWILGDVF+G ++T FD G ++GFA+
Sbjct: 350 AGIDVPPPAGPLWILGDVFIGKFYTEFDMGNTQVGFAQ 387
>gi|336454162|gb|AEI58895.1| cathepsin D [Pteria penguin]
Length = 392
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/291 (48%), Positives = 195/291 (67%), Gaps = 13/291 (4%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GDSDEDILPLKNFMDAQYFGEI 89
+R+ LH + R T +E G + ++++ G + E PL N+MDAQY+G+I
Sbjct: 20 QRIKLHKFKSVRRTLQEV---GTSIEALQNKYNVYKVEGPAPE---PLSNYMDAQYYGDI 73
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IG+P Q+F VIFDTGSSNLWVPS KC S I+C H++Y S KS+TY G EI YG
Sbjct: 74 TIGTPGQSFKVIFDTGSSNLWVPSKKCKLSDIACLLHNKYDSSKSSTYKANGTDFEIRYG 133
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS++GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+G++ I+V VPV
Sbjct: 134 TGSLTGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMGYQTISVDGVVPV 193
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+ NMV+Q LV VFSF+LNRDP A +GGE++ GG D K++KG TY+PVTK+GYW+F++
Sbjct: 194 FYNMVKQNLVPASVFSFYLNRDPGASDGGELILGGSDSKYYKGNFTYLPVTKQGYWRFKM 253
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
I++ +++ C GGC AI D+GTSLLAGP V +N IG + + E
Sbjct: 254 DGIMMNGKASKYCSGGCKAIADTGTSLLAGPKTEVDALNKQIGATPLAAGE 304
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 62/97 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ + P GE ++DC + +P +SF +G + F+L + Y+L + +C+SGF
Sbjct: 291 LNKQIGATPLAAGEYMVDCSSVSKLPVISFMLGGQQFDLQGKDYVLTVTQQGQTICLSGF 350
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
D+PPP GPLWILGDVF+G ++T FD G ++GFA
Sbjct: 351 TGIDVPPPNGPLWILGDVFIGKFYTEFDLGNNQVGFA 387
>gi|33347411|gb|AAQ15288.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 131/192 (68%), Positives = 161/192 (83%)
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + GK I YG+G+ISGFFS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGL 64
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF+EI+VG+AVPVW NMV QGL+ E VFS W NR+ D EEGGEIVFGGVDP H+KGKHTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSLWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++GD++I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 316 VSAECKLVVSQY 327
VS ECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|227018334|gb|ACP18833.1| aspartic proteinase 1 [Chrysomela tremula]
Length = 386
Score = 285 bits (730), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 200/321 (62%), Gaps = 24/321 (7%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + + SVFC+ +C + R+ LH ++ A+ T + R G
Sbjct: 1 MLRIFVLSVFCVLATVNCDFV---------------RVPLHKMDTAKSTLQSR-----GY 40
Query: 61 SGVRHRLGDSDED-ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
+ + D PL N+MDAQY+GEI IG+P Q F+VIFDTGSSNLW+PS KC
Sbjct: 41 KSNENLVKKYTTDGYAPLTNYMDAQYYGEITIGTPGQKFNVIFDTGSSNLWIPSHKCKLL 100
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+++C H++Y S KS+TYT G I YGSGS+ GF S D VEV + VKDQ+F EAT
Sbjct: 101 NVACRTHNQYNSDKSSTYTSNGTDFSITYGSGSLKGFLSSDIVEVAGLTVKDQIFAEATE 160
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F+ +FDGI+GL + I+V P + ++EQG+V E VFSF+LNRDP+AE GGE
Sbjct: 161 EPGLAFIAGKFDGILGLAYDTISVNQVTPFFYKLIEQGVVKEPVFSFYLNRDPNAEVGGE 220
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
IVFGG DPK++ G TY+PVT+KGYWQ ++ ++ S +C+GGC AIVD+GTSL+ G
Sbjct: 221 IVFGGSDPKYYTGDFTYLPVTRKGYWQIKMDKAVV--DSNTLCDGGCQAIVDTGTSLITG 278
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P+ + +I A+G + + E
Sbjct: 279 PSDEIEKIVKAVGATAITAGE 299
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 58/87 (66%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +DC+++ +MPN+ F +G K F L+P+ Y+L+ + C+ GFM D+ P GPL
Sbjct: 298 GEYTVDCNKLSSMPNIDFVLGGKTFTLTPKDYVLQVKQLFLTTCLLGFMGLDVAEPAGPL 357
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+T FD G R+G A A
Sbjct: 358 WILGDVFIGKYYTEFDLGNNRVGLAPA 384
>gi|91093044|ref|XP_966517.1| PREDICTED: similar to cathepsin D isoform 1 [Tribolium castaneum]
gi|270002651|gb|EEZ99098.1| hypothetical protein TcasGA2_TC004989 [Tribolium castaneum]
Length = 384
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 192/292 (65%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ L+ + +AR + +E G V VR R G + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLYKVKSARRSLQEV---GTHVQQVRMRYGGPTPE--PLSNYLDAQYYGPISIGNPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
NF V+FDTGSSNLWVPS KC+++ I+C H++Y S +S TY + G I YGSGS+SGF
Sbjct: 76 NFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSQSKTYKKNGTDFAIQYGSGSLSGF 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + V+ Q F EA E L F+ A+FDGI+G+ + I+V PV+ NM++Q
Sbjct: 136 LSTDIVTVGGLKVQQQTFAEAMSEPGLAFVAAKFDGILGMAYNRISVDGVTPVFYNMIQQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV++ VFSF+LNRDP A +GGEI+ GG DP H+KG TY+ V ++ YWQF++ I +G
Sbjct: 196 NLVAQPVFSFYLNRDPSAAQGGEIILGGSDPAHYKGDFTYLSVDRQAYWQFKMDSISVGG 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
++T C GC AI D+GTSL+AGP V IN AIG +V E C L+
Sbjct: 256 KNT-FCANGCEAIADTGTSLIAGPVSEVQGINKAIGATPIVGGEYMVDCNLI 306
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 67/100 (67%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ + P GE ++DC+ IP +P + FT+G K F L + Y+L+ + +C+SGF
Sbjct: 285 INKAIGATPIVGGEYMVDCNLIPNLPLIDFTLGGKNFTLEGKDYVLRVAQMGKTICLSGF 344
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D+PPP GPLWILGDVF+G ++T FD G R+GFA AA
Sbjct: 345 MGIDIPPPNGPLWILGDVFIGKFYTEFDLGNNRVGFAVAA 384
>gi|195332251|ref|XP_002032812.1| GM20753 [Drosophila sechellia]
gi|194124782|gb|EDW46825.1| GM20753 [Drosophila sechellia]
Length = 392
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 198/322 (61%), Gaps = 21/322 (6%)
Query: 11 CLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS----GVRHR 66
L ++A A N + GL R+ LH +AR R+ G +R+
Sbjct: 5 ALLLVAFLTAAVAHPNSQEKPGL--LRVPLHKFQSAR-----RHFADVGTELQQLRIRYG 57
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFH 125
GD E PL N+MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H
Sbjct: 58 GGDVPE---PLSNYMDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMH 114
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
++Y + KS TYT+ G I+YGSGS+SG+ S D V + + +KDQ F EA E L F+
Sbjct: 115 NKYDASKSKTYTKNGTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFV 174
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+GLG+ I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG D
Sbjct: 175 AAKFDGILGLGYSSISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSD 234
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ G+ TY+PVT+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T
Sbjct: 235 PNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATS 292
Query: 306 INHAIGGEGVVSAE----CKLV 323
IN IGG ++ + C L+
Sbjct: 293 INQKIGGTPIIGGQYVVSCDLI 314
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 372
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA++
Sbjct: 373 GKYYTEFDMGNDRVGFADS 391
>gi|257228998|gb|ACV53024.1| cathepsin D2 [Homarus americanus]
Length = 385
Score = 285 bits (728), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 186/292 (63%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + + R T +E V+ + G+ PL N+MDAQY+G I IG+PPQ
Sbjct: 19 RIPLHKIKSVRRTLQEV---DTAVTRAHRKWGNRGPMPEPLSNYMDAQYYGPISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS +C+++ I+C H++Y +RKS+TY + G I YGSGS+SG+
Sbjct: 76 SFRVVFDTGSSNLWVPSKQCHYTNIACMIHNKYDARKSSTYKKNGTDFAIQYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + V+ Q F EA E L F+ A+FDGI+G+GF IAV PV+ NMV+Q
Sbjct: 136 LSTDTVAVGSLAVRQQTFAEALSEPGLAFVAAKFDGILGMGFDNIAVDGVTPVFYNMVKQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
L+ VFSF+LNRDP + EGGE++ GG DP ++ G TY+PV +KGYWQ ++ I +
Sbjct: 196 SLIPAPVFSFYLNRDPSSPEGGELILGGSDPNYYSGNFTYIPVDRKGYWQIKMDGIQMNG 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
CEGGC AI D+GTSL+A P IN IG + + S E C L+
Sbjct: 256 ARVPFCEGGCEAIADTGTSLIAAPVEEARSINKKIGAKPIASGEWSVDCSLI 307
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ + P GE +DC IP +P +SF + + F L + YILK E C+SGF
Sbjct: 286 INKKIGAKPIASGEWSVDCSLIPHLPKISFVLNGQPFTLEGKDYILKVSVFGREECVSGF 345
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+PPP GPLWILGD F+G ++T FD G R+GFA A
Sbjct: 346 IGLDVPPPMGPLWILGDTFIGRFYTEFDLGNNRVGFAIA 384
>gi|327259983|ref|XP_003214815.1| PREDICTED: cathepsin D-like [Anolis carolinensis]
Length = 399
Score = 284 bits (727), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 203/320 (63%), Gaps = 24/320 (7%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK----RRL------DLHSLNAARITRKERYMG-GAGV 60
L L L + A+ + L RI LKK R + DL L+ K +Y G GAG
Sbjct: 3 LRALVLLLSVAAAYSALIRIPLKKFPSPRSIYAEYGTDLQDLDKLGEMLKYKYGGPGAGT 62
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C
Sbjct: 63 PTPET-----------LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCRLLD 111
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C H +Y S KSNTY + G I+YG+GS+SGF SQD V +GD+ VK+Q+F EAT E
Sbjct: 112 IACMLHHKYDSSKSNTYVQNGTKFAIHYGTGSLSGFISQDTVTIGDIAVKNQMFGEATSE 171
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+TFL A+FDGI+GLGF +I+V P +DN ++QGL+ + +FSF+LNRDP + GGEI
Sbjct: 172 PGITFLAAKFDGILGLGFPKISVDKVTPFFDNAMKQGLLDKNMFSFFLNRDPSSSPGGEI 231
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGGVDPK++ G +V VT+K YWQ + + + + T VC+ GC AIVD+GTSL+ GP
Sbjct: 232 IFGGVDPKYYSGDFNWVNVTRKAYWQVHMDRVEVPSGLT-VCKNGCEAIVDTGTSLITGP 290
Query: 300 TPVVTEINHAIGGEGVVSAE 319
T V + AIG + ++ +
Sbjct: 291 TDEVKALQKAIGAKPIIKGQ 310
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 80/138 (57%), Gaps = 3/138 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+++ V +G + + CE A+V L T E + + + + P G+ I+ C+
Sbjct: 260 MDRVEVPSGLTVCKNGCE-AIVDTGTSLITGPTDE--VKALQKAIGAKPIIKGQYILPCE 316
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ T+P VSF +G + ++LS E Y+LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLATLPIVSFVLGGRSYSLSAENYVLKVTVQGETLCLSGFSGLDVPPPGGPLWILGDVFI 376
Query: 487 GVYHTVFDSGKLRIGFAE 504
G Y+T FD +GFA+
Sbjct: 377 GPYYTAFDRDNDAVGFAK 394
>gi|159468321|ref|XP_001692331.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158278517|gb|EDP04281.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 303
Score = 284 bits (726), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 200/302 (66%), Gaps = 13/302 (4%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M + + ++ L +++ L + A G+ R+ L+K + L +L R Y+
Sbjct: 1 MARSYVPALIALAAVSALLGVAAEQQAGMLRVTLRKTEM-LTTLG-----RPRPYL---- 50
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
G + LG SD+ + LKNFMDAQY+GEIG+G+PPQ F+VIFDTGS+NLWVPSSKC F
Sbjct: 51 -LGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFNVIFDTGSANLWVPSSKCALF 109
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+I+C H +Y + KS TY G I YG+GS+ G+ SQD + G + +KDQ F EA
Sbjct: 110 NIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQDVLTWGGLTIKDQGFAEAIN 169
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF+ A+FDGI+G+GF I+V P + +VE+G ++ VFSFWLNRDP+A GGE
Sbjct: 170 EPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGE 229
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GG+DP HF G+HT+VPVT++GYWQF + + +G S +C GCAAI D+GTSL+AG
Sbjct: 230 LVLGGIDPTHFTGEHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAG 289
Query: 299 PT 300
P+
Sbjct: 290 PS 291
>gi|332024025|gb|EGI64243.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 381
Score = 284 bits (726), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 140/288 (48%), Positives = 193/288 (67%), Gaps = 11/288 (3%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
+R+ LH ++ R KE G ++ VR + PL N++DAQY+G I IG+PP
Sbjct: 20 QRIPLHKTDSIRKALKEV---GTDLTQVRTFTTTDNYTPEPLSNYLDAQYYGVISIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
QNF VIFDTGSSNLWVPS KC+ + I+C H++Y S KS TY + G I YGSGS+SG
Sbjct: 77 QNFKVIFDTGSSNLWVPSKKCHITNIACLLHNKYTSEKSTTYKKNGTIFAIRYGSGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S+D V V + V+ Q F EA E + F+ A+FDGI+G+G+ I+V PV+ NMV+
Sbjct: 137 FLSEDVVTVAGLAVQHQTFAEAISEPGIAFVAAKFDGILGMGYSTISVDGVTPVFYNMVK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
Q LVS+ VFSF+LNRD A EGGE++ GG DP H++G+ TY+PVT+KGYWQF++ + +
Sbjct: 197 QNLVSQAVFSFYLNRDSSAAEGGEMILGGSDPDHYEGEFTYIPVTRKGYWQFKMDGVQVK 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH-----AIGGEGVVS 317
+ + C+ GC AI D+GTSL+AGPT + +IN +IGGE +V+
Sbjct: 257 DHA--FCKEGCQAIADTGTSLIAGPTSEIKDINEMIGATSIGGEAMVN 302
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 6/134 (4%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+ V D A C A+ L T E + INE+ + + GE++++C++I
Sbjct: 251 DGVQVKDHAFCKEGCQAIADTGTSLIAGPTSE--IKDINEMIGA-TSIGGEAMVNCNQIS 307
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
+MP++SFT+G+K F L E Y+LK + +C+SGFM DLP LWILGDVF+G Y
Sbjct: 308 SMPSISFTLGNKNFTLIGEDYVLKIKQFGKTICMSGFMGMDLPQ---SLWILGDVFIGRY 364
Query: 490 HTVFDSGKLRIGFA 503
+T FD R+GFA
Sbjct: 365 YTEFDMENDRVGFA 378
>gi|332376487|gb|AEE63383.1| unknown [Dendroctonus ponderosae]
Length = 388
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 199/315 (63%), Gaps = 18/315 (5%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
L C + + L R+ L K + + I R+ G V VR R E +
Sbjct: 8 LIICFIATITCENLVRVPLTKGK------SPKNILREV----GTHVQQVRLRYTSGAEPV 57
Query: 75 L-PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
PL N++DAQYFG I IG+PPQ F V+FDTGSSNLWVPS KC F+ I+C H++Y S K
Sbjct: 58 PEPLSNYLDAQYFGAISIGTPPQKFVVVFDTGSSNLWVPSKKCSFTNIACLLHNKYDSSK 117
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY E G I YGSGS+SGF S D V V D+ VK Q F EA E L F+ A+FDGI
Sbjct: 118 SSTYKENGTEFAIRYGSGSLSGFLSTDVVGVSDINVKGQTFAEALSEPGLAFVAAKFDGI 177
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+V VP++ NMV QG+VS+ VFSF+LNR+PD + GGE++FGG DP ++ G
Sbjct: 178 LGLAYSRISVDGVVPLFYNMVNQGIVSQAVFSFYLNRNPDGKVGGELIFGGSDPNYYSGN 237
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY+PV ++ YWQF++ ++++G ++ C+GGC AI D+GTSL+AGP V +N AIG
Sbjct: 238 FTYLPVDRQAYWQFKMDEVIVGQKT--FCKGGCEAIADTGTSLIAGPVDEVKALNEAIGA 295
Query: 313 EGVVSAE----CKLV 323
+V E C L+
Sbjct: 296 TPLVGGEYAVDCSLI 310
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/97 (45%), Positives = 58/97 (59%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+NE + P GE +DC IP +P + F +G F L + Y+L VC+SGF
Sbjct: 289 LNEAIGATPLVGGEYAVDCSLIPNLPAIKFILGGNTFVLEGKDYVLAESAMGKTVCLSGF 348
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
D+PPP GPLWILGDVF+G Y+T FD+ R+GFA
Sbjct: 349 FGIDIPPPNGPLWILGDVFIGKYYTEFDAQNNRVGFA 385
>gi|312861579|gb|ADR10277.1| cathepsin D [Branchiostoma belcheri]
Length = 395
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/328 (45%), Positives = 203/328 (61%), Gaps = 15/328 (4%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
K L +F L V AS L RI L K + L IT + + SG
Sbjct: 2 KFLSVLFALVVFASAL---------HRIPLTKMKTVRRQLADVGITYDQ--VLDKDYSGK 50
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ + D+ E PL N++DAQY+G I IG+P QNF V+FDTGSSNLWVPS KC S I+C
Sbjct: 51 YYNIKDAPE---PLTNYLDAQYYGPISIGTPAQNFQVVFDTGSSNLWVPSKKCKLSDIAC 107
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H++Y S +S+TY + G I YGSGS++GF S+D V +G + V++Q F EA + +
Sbjct: 108 LLHNKYDSTQSSTYMKNGTDFAIRYGSGSLTGFLSEDTVTIGGLKVQNQTFAEAVTQPGI 167
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+ A+FDGI+G+G+ I+V VP + NMV+Q LV + VFSF+LNRDP + GE++ G
Sbjct: 168 TFVAAKFDGILGMGYDTISVDGVVPPFYNMVQQKLVDKPVFSFYLNRDPSSTTRGELLLG 227
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DPK++ G T++ VTK GYWQF++ I+I ++T C+GGCAAI D+GTSL+AGPT
Sbjct: 228 GTDPKYYTGDFTFLDVTKPGYWQFKMDGIMINGKATDYCKGGCAAIADTGTSLIAGPTTE 287
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDL 330
V +N IG + E + SQ L
Sbjct: 288 VQALNKQIGATPIPGGEYMVDCSQVSSL 315
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 2/100 (2%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ + P P GE ++DC ++ ++P +SF +G K F L + Y+L+ VC+SGF
Sbjct: 291 LNKQIGATPIPGGEYMVDCSQVSSLPPISFMLGGKAFELQGKDYVLQVTTMGQTVCVSGF 350
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ D+ P GPLWILGDVF+G Y+T+FD G R+GFA A
Sbjct: 351 LGIDV--PAGPLWILGDVFIGPYYTLFDMGNNRVGFAPTA 388
>gi|380018765|ref|XP_003693293.1| PREDICTED: lysosomal aspartic protease-like [Apis florea]
Length = 385
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 200/319 (62%), Gaps = 18/319 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ R++ CL C + ++ + RI LH +++ R KE +
Sbjct: 1 MFRAILCL-----CAFIAIANADITRI-------PLHKIDSIRKQFKEY---NTEIYQTH 45
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCY 123
GD + PL N++DAQY+G I IG+PPQ+F VIFDTGSSNLWVPS KC+ + I+C
Sbjct: 46 ILQGDFPQP-EPLSNYLDAQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACK 104
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y + KS+TY + G I YGSGS+SG+ S D V++ + + DQ F EA E L
Sbjct: 105 LHRKYDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDTVDIAGMKISDQTFAEALSEPGLA 164
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +IAV D PV+ NMV+QGLV + VFSF+LNR+PD + GGE++ GG
Sbjct: 165 FVAAKFDGILGMAYSKIAVDDVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKYGGELILGG 224
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H++G TYVPV KKGYWQF++ I IG+ VC+ GC AI D+GTSL+AGP V
Sbjct: 225 SDPNHYEGSFTYVPVDKKGYWQFKMDSIQIGSD-LKVCQQGCEAIADTGTSLIAGPVKEV 283
Query: 304 TEINHAIGGEGVVSAECKL 322
IN AIG + + E +
Sbjct: 284 GAINKAIGATPIAAGEAMI 302
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 78/130 (60%), Gaps = 2/130 (1%)
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
D VC A+ L KE + IN+ + P GE++IDC+ IP +P ++
Sbjct: 257 DLKVCQQGCEAIADTGTSLIAGPVKE--VGAINKAIGATPIAAGEAMIDCNSIPNLPTIN 314
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
F +G K F+L E Y+LK + VC+SGFM D+PPP GPLWILGDVF+G Y+T FD
Sbjct: 315 FVLGGKSFSLKGEDYVLKVTQFRKTVCLSGFMGMDIPPPNGPLWILGDVFIGRYYTEFDM 374
Query: 496 GKLRIGFAEA 505
G R+GFA+A
Sbjct: 375 GNNRVGFAKA 384
>gi|383859202|ref|XP_003705085.1| PREDICTED: lysosomal aspartic protease-like [Megachile rotundata]
Length = 384
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 145/297 (48%), Positives = 190/297 (63%), Gaps = 8/297 (2%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDAQYFGEIGIGSP 94
RR+ LH ++ R KE V+ R+ D + PL N++DAQY+G I IG+P
Sbjct: 19 RRIKLHKIDRIRSQLKEY-----DTDLVQTRIVQGDVILPEPLSNYLDAQYYGVINIGTP 73
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ F VIFDTGSSNLWVPS KC+ + I+C H +Y S KS+TY + G I YGSGS+S
Sbjct: 74 PQKFRVIFDTGSSNLWVPSKKCHLTNIACKLHYKYDSTKSSTYKKNGTDFSIRYGSGSLS 133
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
G+ S D V+V + V DQ F EA E L F+ A+FDGI+G+ + IAV PV+ NMV
Sbjct: 134 GYLSTDMVDVAGIKVNDQTFAEALSEPGLAFVAAKFDGIMGMAYSTIAVDGVTPVFYNMV 193
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
+QGLVS+ VFSF+LNRDP+AE GGE++ GG DP H+ G TYVPV KKGYWQF + + +
Sbjct: 194 KQGLVSQPVFSFYLNRDPNAEFGGEMILGGSDPNHYVGPFTYVPVDKKGYWQFAMDRVEV 253
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
G+ VCE GC AI D+GTSL+AGP + +N IG + + E + + DL
Sbjct: 254 GSD-VKVCEKGCEAIADTGTSLIAGPVKEIELLNKKIGATPIAAGEAMVECDKIPDL 309
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 77/130 (59%), Gaps = 2/130 (1%)
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
D VC A+ L KE + +N+ + P GE++++CD+IP +P ++
Sbjct: 256 DVKVCEKGCEAIADTGTSLIAGPVKE--IELLNKKIGATPIAAGEAMVECDKIPDLPTIT 313
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
F G + F L E Y+LK + VCISGFM D+PPP GPLWILGDVF+G Y+T FD
Sbjct: 314 FVFGGRSFPLRGEDYVLKVTQLGKTVCISGFMGMDIPPPNGPLWILGDVFIGRYYTEFDM 373
Query: 496 GKLRIGFAEA 505
G RIGFAEA
Sbjct: 374 GNNRIGFAEA 383
>gi|194863696|ref|XP_001970568.1| GG10707 [Drosophila erecta]
gi|190662435|gb|EDV59627.1| GG10707 [Drosophila erecta]
Length = 390
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 193/308 (62%), Gaps = 21/308 (6%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNF 80
SN + GL R+ LH +AR R+ G +R+ GD E PL N+
Sbjct: 17 SNPQEKPGL--LRVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNY 66
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+
Sbjct: 67 MDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKN 126
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YGSGS+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+
Sbjct: 127 GTEFAIQYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSS 186
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT
Sbjct: 187 ISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVT 246
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T IN IGG ++ +
Sbjct: 247 RKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQ 304
Query: 320 ----CKLV 323
C L+
Sbjct: 305 YVVSCDLI 312
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 253 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 310
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 311 LIPKLPVIKFVLGGKTFELEGKDYILRVSQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 370
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 371 GKYYTEFDMGNDRVGFADA 389
>gi|21355083|ref|NP_652013.1| cathD [Drosophila melanogaster]
gi|6685167|gb|AAF23824.1|AF220040_1 cathepsin D precursor [Drosophila melanogaster]
gi|7304149|gb|AAF59186.1| cathD [Drosophila melanogaster]
gi|15292549|gb|AAK93543.1| SD07085p [Drosophila melanogaster]
gi|220946566|gb|ACL85826.1| cathD-PA [synthetic construct]
Length = 392
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 188/296 (63%), Gaps = 19/296 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
R+ LH +AR R+ G +R+ GD E PL N+MDAQY+G I IG
Sbjct: 29 RVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNYMDAQYYGPIAIG 80
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
SPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+ G I YGSGS
Sbjct: 81 SPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYGSGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P +
Sbjct: 141 LSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYNSISVDKVKPPFYA 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGLDIPPPNGPLWILGDVFI 372
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 373 GKYYTEFDMGNDRVGFADA 391
>gi|195474504|ref|XP_002089531.1| GE23596 [Drosophila yakuba]
gi|194175632|gb|EDW89243.1| GE23596 [Drosophila yakuba]
Length = 392
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 188/296 (63%), Gaps = 19/296 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
R+ LH +AR R+ G +R+ GD E PL N+MDAQY+G I IG
Sbjct: 29 RVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNYMDAQYYGPIAIG 80
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
SPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+ G I YGSGS
Sbjct: 81 SPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYGSGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P +
Sbjct: 141 LSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDKVKPPFYA 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 255 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYVVSCD 312
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 313 LIPKLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 372
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 373 GKYYTEFDMGNDRVGFADA 391
>gi|146217392|gb|ABQ10738.1| cathepsin D [Penaeus monodon]
Length = 386
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 189/292 (64%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +AR + +E V V + G+ PL N+MDAQY+G I IG+PPQ
Sbjct: 20 RIKLHKFKSARRSLQEV---DTAVKVVHRKWGNKGPMPEPLSNYMDAQYYGPITIGTPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS +C+F+ I+C H++Y + KS+TY + G +I YGSGS+SG+
Sbjct: 77 SFRVVFDTGSSNLWVPSKQCHFTNIACLIHNKYDATKSSTYKKNGTKFDIQYGSGSLSGY 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG V VKDQ F EA E L F+ A+FDGI+G+ + IAV PV+ NMV Q
Sbjct: 137 LSTDTVSVGSVSVKDQTFAEAMSEPGLAFVAAKFDGILGMAYDRIAVDGVTPVFYNMVNQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
+V +FSF+LNRDP A EGGE++ GG DP ++ G TYVPV ++GYWQF++ + +
Sbjct: 197 NVVPAPIFSFYLNRDPAAAEGGELILGGSDPAYYTGDFTYVPVDRQGYWQFKMDGLQMNG 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
+ C+GGC AI D+GTSL+A P+ IN IG + ++ S +C L+
Sbjct: 257 TTVPFCDGGCEAIADTGTSLIAAPSEEARLINKKIGAKPIMGGEWSVDCNLI 308
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ + P GE +DC+ IP +P +SF + K F L + YIL+ + C+SGF
Sbjct: 287 INKKIGAKPIMGGEWSVDCNLIPHLPTISFVLAGKPFTLEGKDYILRVSQFGQTTCLSGF 346
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+PPP GP+WILGD+F+G ++T FD G R+GFAE+
Sbjct: 347 IGLDVPPPMGPIWILGDIFIGRFYTEFDMGNNRVGFAES 385
>gi|238816835|gb|ACR56788.1| aspartic protease 4 [Strongyloides ratti]
Length = 428
Score = 281 bits (719), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 138/281 (49%), Positives = 189/281 (67%), Gaps = 12/281 (4%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFS 99
L+ L RI + E+Y G HRL DS+E L+N+MDAQY+GEI IG+P QNFS
Sbjct: 34 LNFLENERINKGEKY-------GAVHRLMDSEE---ILRNYMDAQYYGEISIGTPGQNFS 83
Query: 100 VIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQ 158
VIFDTGSSNLW+PS KC ++I+C H++Y S S+TY G++ I YG+GS+ GF S+
Sbjct: 84 VIFDTGSSNLWIPSKKCPIYNIACLLHNKYDSSSSSTYVTDGRTMAIQYGTGSMKGFLSK 143
Query: 159 DNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
D V + D+ +DQ F EAT E +TF+ A+FDGI+G+ ++ IAV PV++ +++Q V
Sbjct: 144 DKVCIADLCAEDQTFAEATSEPGVTFIAAKFDGILGMAYQNIAVLGVKPVFNTLIDQHKV 203
Query: 219 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 278
+ +F+FWLNR D +GGEI GG+DPKH+KG TYVPV++KGYWQF++ D +G+
Sbjct: 204 PQPIFAFWLNRIADDSDGGEITLGGMDPKHYKGDITYVPVSRKGYWQFKM-DGFVGDNEK 262
Query: 279 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
C+ GC AI D+GTSL+AGP V I IG E + E
Sbjct: 263 IACKNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLARGE 303
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 61/99 (61%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE ++ CD++ ++P V+ IG + F LS + YIL + +SGF
Sbjct: 290 IQKFIGAEPLARGEYMVPCDKVSSLPIVNIVIGGQAFALSGKDYILNVTAMGKSIRLSGF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M DLP G LWILGDVF+G Y+TVFD GK R+GFA A
Sbjct: 350 MGMDLPERVGELWILGDVFIGRYYTVFDFGKDRVGFAVA 388
>gi|170063951|ref|XP_001867326.1| lysosomal aspartic protease [Culex quinquefasciatus]
gi|167881401|gb|EDS44784.1| lysosomal aspartic protease [Culex quinquefasciatus]
Length = 387
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 134/253 (52%), Positives = 176/253 (69%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N+MDAQYFG I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYMDAQYFGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V VG V ++ Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFAIQYGSGSLSGYLSTDTVTVGGVAIQKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPPFYNMYNQGLIDSPVFSFYLNRDPSAAEGGEIIFGGSDSAKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+PV +K YWQF++ + +G+ T C GC AI D+GTSL+AGPT VT IN AIGG
Sbjct: 239 YLPVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPTSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IINGEYMVDCSLI 309
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 79/137 (57%), Gaps = 4/137 (2%)
Query: 370 ENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
++V GD+ C+ CE A+ L T E ++ IN+ P GE ++DC I
Sbjct: 253 DSVKVGDTEFCNNGCE-AIADTGTSLIAGPTSE--VTAINKAIGGTPIINGEYMVDCSLI 309
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 488
P +P + F +G K F L YIL+ + +C+SGFM D+PPP GPLWILGDVF+G
Sbjct: 310 PKLPKIKFVLGGKEFELEGADYILRIAQMGKTICLSGFMGIDIPPPNGPLWILGDVFIGK 369
Query: 489 YHTVFDSGKLRIGFAEA 505
Y+T FD G R+GFA A
Sbjct: 370 YYTEFDMGNDRVGFATA 386
>gi|350411706|ref|XP_003489428.1| PREDICTED: lysosomal aspartic protease-like [Bombus impatiens]
Length = 386
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 194/316 (61%), Gaps = 17/316 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ R+ CL C + ++ L+RI LH +++ R KE V+
Sbjct: 1 MYRAALCL-----CACIALANADLQRI-------TLHKMDSVRKQFKEYNTEVYQAHMVQ 48
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCY 123
+ PL N++DAQY+G I IG+P Q+F VIFDTGSSNLWVPS KC+ + I+C
Sbjct: 49 GGFPQPE----PLSNYLDAQYYGVISIGTPSQDFKVIFDTGSSNLWVPSQKCHLTNIACK 104
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y + KS+TY + G I YGSGS+SG+ S D V + + V DQ F EA E +
Sbjct: 105 LHHKYDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDVVNIAGLKVSDQTFAEALSEPGMA 164
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + IAV PV+ NMV+QGLV + VFSF+LNR+PD + GGE++ GG
Sbjct: 165 FVAAKFDGILGMAYSRIAVDGVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKAGGELILGG 224
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H++G TYVPV +KGYWQF + I +G+Q +CE GC AI D+GTSL+AGP V
Sbjct: 225 SDPNHYEGPFTYVPVDRKGYWQFRMDGIKVGSQHLAICEKGCEAIADTGTSLIAGPVKEV 284
Query: 304 TEINHAIGGEGVVSAE 319
IN AIG + + E
Sbjct: 285 EAINSAIGATNIAAGE 300
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 63/87 (72%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE+++DC+ IP +P ++F +G + F L+ + Y+LK + VC+SGFM D+P P GPL
Sbjct: 299 GEAMVDCNSIPNLPTINFVLGGRSFPLTGKDYVLKVTQFGKTVCLSGFMGMDIPEPNGPL 358
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+T FD G R+GFA+A
Sbjct: 359 WILGDVFIGRYYTEFDMGNNRVGFAKA 385
>gi|321472775|gb|EFX83744.1| hypothetical protein DAPPUDRAFT_92408 [Daphnia pulex]
Length = 379
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 202/302 (66%), Gaps = 7/302 (2%)
Query: 33 LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGEIG 90
+K +R+ L + + R T + G + ++ + G S+ P LKN+MDAQY+G+I
Sbjct: 5 VKLQRVTLEKVPSVRKTLESV---GTSIKVIQKKWGASEAGPTPEELKNYMDAQYYGQIT 61
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
+G+PPQ F+V+FDTGS+NLWVPS+ C+ + ++C H++Y KS TY G I YGS
Sbjct: 62 LGTPPQTFNVVFDTGSANLWVPSTHCHLTNLACLLHNKYNGGKSQTYKANGTDFAIQYGS 121
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
G +SG+ S D + +G +VKDQ F EA E SLTF+ A+FDGI+G+ + I+V PV+
Sbjct: 122 GKLSGYLSTDTLGLGGALVKDQTFAEAISEPSLTFVAAKFDGILGMSYPSISVNGVPPVF 181
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
+NM+EQGLV + VFSFWL+R+PDA +GGEI FGG DP+ + G+ ++ PVT+K YWQF++
Sbjct: 182 NNMIEQGLVEDPVFSFWLSRNPDAAQGGEITFGGADPERYTGEISWAPVTRKAYWQFKVD 241
Query: 270 DILIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
+ + N++ G C+GGC I D+GTSL+AGP + ++N IGG +++ E + S+
Sbjct: 242 GVQVSNEADGAFCQGGCQMIADTGTSLIAGPVDEIKKLNTLIGGIPIMAGEYFINCSRID 301
Query: 329 DL 330
+L
Sbjct: 302 EL 303
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 70/101 (69%), Gaps = 3/101 (2%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK--TGEGIAEVCIS 464
+N L +P GE I+C RI +P +SF+IG K F+L ++Y+++ GI+ CIS
Sbjct: 279 LNTLIGGIPIMAGEYFINCSRIDELPTISFSIGGKSFSLEGKEYVMQIVKSNGIS-ACIS 337
Query: 465 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GF+ ++PPP GPLWILGDVF+G Y+T+FD G R+GFA+A
Sbjct: 338 GFIGLEIPPPAGPLWILGDVFIGRYYTIFDFGNDRVGFADA 378
>gi|195429864|ref|XP_002062977.1| GK21682 [Drosophila willistoni]
gi|194159062|gb|EDW73963.1| GK21682 [Drosophila willistoni]
Length = 389
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 149/327 (45%), Positives = 203/327 (62%), Gaps = 23/327 (7%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKK-RRLDLHSLNAARITRKERYMGGAGVS 61
QKLL + +V+A+ S GL R+ LKK + H + ++ R
Sbjct: 2 QKLLILLAIGFVVAA---EAGDSAGLLRVPLKKFQSARRHFADVGTELQQLR-------- 50
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
+++ GD+ E PL N+MDAQY+G I IG+P Q+F V+FDTGSSNLWVPS KC+F+ I
Sbjct: 51 -IKYGGGDAPE---PLSNYMDAQYYGPISIGTPAQSFKVVFDTGSSNLWVPSKKCHFTNI 106
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H++Y + KSNTY + G I+YGSGS+SG+ S D V +G + +K Q F EA E
Sbjct: 107 ACLMHNKYDATKSNTYAKNGTEFAIHYGSGSLSGYLSTDTVGIGGLNIKGQTFAEALSEP 166
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F+ A+FDGI+GLG+ I+V P + M EQGL+S VFSF+LNRDP A EGGEI+
Sbjct: 167 GLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQGLISSPVFSFYLNRDPSAPEGGEII 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG DP H+ G TY+PVT+K YWQ ++ +G+ VC+GGC I D+GTSL+A P
Sbjct: 227 FGGSDPNHYTGDFTYLPVTRKAYWQIKMDSASVGDLQ--VCQGGCQVIADTGTSLIAAPL 284
Query: 301 PVVTEINHAIGGEGVVSAE----CKLV 323
T IN IGG ++ + C L+
Sbjct: 285 SEATSINQKIGGTPIIGGQYVVSCDLI 311
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ ++ S GD VC + L E + IN+ P G+ ++ CD
Sbjct: 252 IKMDSASVGDLQVCQGGCQVIADTGTSLIAAPLSEA--TSINQKIGGTPIIGGQYVVSCD 309
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G + F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 310 LIPNLPVIKFVLGGRTFELEGKDYILRVSQMGKSICLSGFMGMDIPPPNGPLWILGDVFI 369
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA A
Sbjct: 370 GKYYTEFDMGNDRVGFANA 388
>gi|260810438|ref|XP_002599971.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
gi|229285255|gb|EEN55983.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
Length = 388
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 197/317 (62%), Gaps = 8/317 (2%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
L L + A++N L RI L K + L + + SG + + +
Sbjct: 4 LLVLLAIVATANALHRIPLTKMKTVRRHLAEVGVPYDKII---KDYSGKYYNMTGPQPE- 59
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKS 133
PL N++DAQYFG I IG+PPQ+F V+FDTGSSNLWVPS KC++S I+C H++Y + KS
Sbjct: 60 -PLSNYLDAQYFGPISIGTPPQSFQVVFDTGSSNLWVPSKKCHYSNIACLLHNKYDASKS 118
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY + G+ I YGSGS+SGF SQD V V + VKDQ F EA E + F+ A+FDGI+
Sbjct: 119 STYKKNGEKFAIQYGSGSLSGFLSQDTVSVAGIEVKDQTFAEALSEPGMAFVAAKFDGIL 178
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+G+ IAV VP + NMV QG V E VFSF+LNRDP A GGE++ GG DP ++ G
Sbjct: 179 GMGYSNIAVDGVVPPFYNMVSQGAVPEPVFSFYLNRDPSATAGGELILGGADPNYYTGDF 238
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++ VT+KGYWQF++ I +G + C+ GC AI D+GTSL+AGP V +++ IG
Sbjct: 239 TFLDVTRKGYWQFKMDGINVGGST--FCQEGCQAIADTGTSLIAGPIEEVNKLHKQIGAT 296
Query: 314 GVVSAECKLVVSQYGDL 330
+ E K+ S+ L
Sbjct: 297 PLAGGEYKVDCSKVTSL 313
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 2/136 (1%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+ ++ G S C A+ L +E ++ +++ + P GE +DC ++
Sbjct: 254 DGINVGGSTFCQEGCQAIADTGTSLIAGPIEE--VNKLHKQIGATPLAGGEYKVDCSKVT 311
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P +SF +G K F L+ ++YIL+ + +C+SGFM D+PPP GPLWILGDVF+G Y
Sbjct: 312 SLPTISFILGGKEFELTGKEYILQVKQFGMTICLSGFMGMDIPPPAGPLWILGDVFIGSY 371
Query: 490 HTVFDSGKLRIGFAEA 505
+T FD GK +GFA A
Sbjct: 372 YTQFDLGKNLVGFATA 387
>gi|224050910|ref|XP_002199093.1| PREDICTED: cathepsin D [Taeniopygia guttata]
Length = 396
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 145/293 (49%), Positives = 198/293 (67%), Gaps = 13/293 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+RRI LK+ ++ +N+ IT +Y G G +IL KN+MDAQYFG
Sbjct: 30 MRRI-LKEAGSEIPDMNS--ITEAIKYKLGFA------EAGKPTPEIL--KNYMDAQYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
IGIG+PPQNF+VIFDTGSSNLWVPS C I+C H +Y S KS+TY + G I
Sbjct: 79 VIGIGTPPQNFTVIFDTGSSNLWVPSVHCSLLDIACMVHHKYDSAKSSTYVKNGTKFAIR 138
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+SG+ SQD V +GD+ + DQ+F EAT++ +TF+ A+FDGI+GL F +I+V A
Sbjct: 139 YGTGSLSGYLSQDIVTLGDLKIMDQIFGEATKQPGITFIAAKFDGILGLAFPKISVEGAE 198
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P +DN+++Q LV + +FSF+LNRDP GGE+V GG DPK++KG+ ++ VT+K YWQ
Sbjct: 199 PFFDNVMKQKLVEKNMFSFYLNRDPSGVPGGEMVLGGTDPKYYKGEFSWFNVTRKAYWQI 258
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + +GN T VCEGGC AIVD+GTSL+ GPT V +I AIG + ++ E
Sbjct: 259 HMDSVDVGNGPT-VCEGGCEAIVDTGTSLITGPTKEVKKIQEAIGAKPLIKGE 310
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/141 (41%), Positives = 82/141 (58%), Gaps = 3/141 (2%)
Query: 367 VEKENVSAGD-SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ ++V G+ VC A+V L TKE + I E + P GE +I C
Sbjct: 258 IHMDSVDVGNGPTVCEGGCEAIVDTGTSLITGPTKE--VKKIQEAIGAKPLIKGEYMIPC 315
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+++PT+P VS IG K F L+ +QY+LK +C+SGF D+PPP GPLWILGDVF
Sbjct: 316 EKVPTLPVVSMNIGGKTFGLTGDQYVLKMTAQGETICMSGFSGLDIPPPGGPLWILGDVF 375
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G Y+T FD R+GFA++A
Sbjct: 376 IGPYYTSFDRDNNRVGFAQSA 396
>gi|17981530|gb|AAL51056.1|AF454831_1 cathepsin D [Apriona germari]
Length = 386
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 197/321 (61%), Gaps = 22/321 (6%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L SVFC+++ +C L+ R+ L +AR T +E V
Sbjct: 1 MSRLFLMSVFCVFITVNCDLI---------------RVPLERGKSARRTLQEV---NTHV 42
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS- 119
VR R G PL N++DAQYFG I IG+PPQ F V+FDTGSSNLWVPS KC+++
Sbjct: 43 QQVRFRYGVGGPAPEPLSNYLDAQYFGPISIGNPPQKFKVVFDTGSSNLWVPSKKCHYTN 102
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C H++Y S KS+TY + G I YGSGS+SGF S D V VG + VKDQ F EA E
Sbjct: 103 IACLLHNKYDSSKSSTYKKNGTDFSIKYGSGSLSGFLSTDVVTVGSLAVKDQTFAEAMSE 162
Query: 180 GSLTFLLARFDGIIGLGFRE-IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
L F+ A+FD G ++ + ++P + NM+ QGLVS+ VFSF+LNRDPDA EGGE
Sbjct: 163 PGLAFVAAKFDEYPWHGLQQDLGSRASLPFFYNMITQGLVSQPVFSFYLNRDPDAAEGGE 222
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+ GG DPK++KG TY+ V ++ YWQF++ I +G T C+ GC AI D+GTSL+AG
Sbjct: 223 LSLGGSDPKYYKGNFTYLSVDRQAYWQFKMDKIQLGK--TVFCKSGCQAIADTGTSLVAG 280
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P VT IN IGG ++ E
Sbjct: 281 PVDEVTSINKLIGGTPIIGGE 301
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 78/137 (56%), Gaps = 3/137 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+ + G + C + A+ L E ++ IN+L P GE ++DC IP
Sbjct: 253 DKIQLGKTVFCKSGCQAIADTGTSLVAGPVDE--VTSINKLIGGTPIIGGEYVVDC-LIP 309
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
+P + F +G K + L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+G +
Sbjct: 310 KLPEIDFILGGKTYTLEGKDYILRVSQAGKTICLSGFMGIDIPPPNGPLWILGDVFIGKF 369
Query: 490 HTVFDSGKLRIGFAEAA 506
+T FD G RIGFAEAA
Sbjct: 370 YTEFDLGNNRIGFAEAA 386
>gi|56118817|ref|NP_001008172.1| MGC89016 protein precursor [Xenopus (Silurana) tropicalis]
gi|51950197|gb|AAH82490.1| MGC89016 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 205/329 (62%), Gaps = 22/329 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
++ +CLL A SNGL RI L + + +L+ G+ V VR + D+
Sbjct: 7 LVVTCLLFVAFSNGLERIKLHRFKSVARTLHDV----------GSAVEHVRMKYVDNHMK 56
Query: 74 ILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKS 130
P L N+MD QY+G I IG+PPQ+F V+FDTGSSNLWVPS KC ++ I+C+ H +Y S
Sbjct: 57 SAPEPLTNYMDVQYYGVISIGTPPQSFRVVFDTGSSNLWVPSKKCKWTDIACWLHRKYDS 116
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+KS+TY G I+YG+GS++GF S D V VG + VK Q F EA + +TF+ A+FD
Sbjct: 117 KKSSTYKANGTEFAIHYGTGSLTGFLSTDTVSVGSLSVKSQTFAEAITQPGITFVAAKFD 176
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+ + I+V VPV++NMV Q LV + +FSF+L+RD A+EGGEI+ GG DP H+
Sbjct: 177 GILGMAYPSISVDGVVPVFNNMVNQKLVDQAIFSFYLSRDASAKEGGEIILGGSDPDHYV 236
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGV---------CEGGCAAIVDSGTSLLAGPTP 301
G TY+ VT+K YWQ ++ + + ++S + C+GGC AI D+GTSL+ GP+
Sbjct: 237 GNFTYLDVTRKAYWQIKMDSVTVSSESECMNAMMVGGEYCKGGCQAIADTGTSLIVGPSS 296
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDL 330
V ++N IG ++S E + S+ L
Sbjct: 297 DVEKLNAEIGALPIISGEYWINCSKIASL 325
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 65/97 (67%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N +LP GE I+C +I ++P ++F +G K F+L+ + Y++ + +C+SGF
Sbjct: 301 LNAEIGALPIISGEYWINCSKIASLPTINFVLGGKSFSLTGKDYVVVVTQMGQTICLSGF 360
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
+A D+PPP GPLWILGD+F+G Y+T FD R+GFA
Sbjct: 361 VAMDIPPPAGPLWILGDIFIGKYYTEFDLANNRVGFA 397
>gi|156553448|ref|XP_001600543.1| PREDICTED: lysosomal aspartic protease-like [Nasonia vitripennis]
Length = 384
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 19/317 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+LR VF LAS L +++ LR + L+ + +AR T +E G + ++
Sbjct: 1 MLRLVF----LASLCLAFVTADVLR--------VPLYRVKSARRTLQEV---GTELHQIK 45
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCY 123
R G +D PL N++DAQY+GEIGIGSP Q F+VIFDTGSSNLWVPS KC+ + I+C
Sbjct: 46 LRYG-ADPVPEPLSNYLDAQYYGEIGIGSPMQKFTVIFDTGSSNLWVPSKKCHITNIACL 104
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y SRKS +Y G I YGSGS+SGF S D V + V VKD F EA E L
Sbjct: 105 LHNKYDSRKSKSYKANGTDFSIRYGSGSLSGFLSTDVVTIAGVDVKDTTFAEAMSEPGLA 164
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + I+V PV+ NMV+Q LV + +FSF+LNRDP+A+ GGE++ GG
Sbjct: 165 FVAAKFDGILGMAYDRISVDGVPPVFYNMVKQNLVPQPIFSFYLNRDPNAKIGGEMILGG 224
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
D H+ G TYVPV++K YWQF++ I IG++ CE GC AI D+GTSL+AGP +
Sbjct: 225 SDSAHYTGDFTYVPVSRKAYWQFKMDKITIGDKL--FCENGCEAIADTGTSLIAGPVGEI 282
Query: 304 TEINHAIGGEGVVSAEC 320
IN IG +V+ E
Sbjct: 283 EGINKKIGATPIVAGEA 299
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 368 EKENVSAGDSAVC-SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ + ++ GD C + CE A+ L E + IN+ + P GE+++ CD
Sbjct: 248 KMDKITIGDKLFCENGCE-AIADTGTSLIAGPVGE--IEGINKKIGATPIVAGEAMVSCD 304
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+P +P + F +G K F+L E Y+LK + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 305 AVPNLPTIDFVVGGKKFSLKGEDYVLKVSQFGKTICLSGFMGIDIPPPNGPLWILGDVFI 364
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G ++T FD G RIGFA A
Sbjct: 365 GRFYTEFDMGNDRIGFANA 383
>gi|194757447|ref|XP_001960976.1| GF11236 [Drosophila ananassae]
gi|190622274|gb|EDV37798.1| GF11236 [Drosophila ananassae]
Length = 388
Score = 278 bits (711), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 145/302 (48%), Positives = 190/302 (62%), Gaps = 13/302 (4%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDAQYFGEIGIGSPP 95
R+ L AR R+ G + R+ D+ PL N+MDAQY+G I IGSPP
Sbjct: 25 RVPLQKFTTAR-----RHFADVGTELQQLRIKYGGGDVPEPLSNYMDAQYYGPISIGSPP 79
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
QNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS +Y + G I YGSGS+SG
Sbjct: 80 QNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKSYVKNGTEFAIQYGSGSLSG 139
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
+ S D V +G + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P + M E
Sbjct: 140 YLSTDTVSIGGLNIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDRVKPPFYAMYE 199
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QGL+S +FSF+LNRDP EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++ IG
Sbjct: 200 QGLISAPIFSFYLNRDPAGPEGGEIIFGGSDPKHYSGDFTYLPVTRKAYWQIKMDAASIG 259
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL 334
+ +C+GGC I D+GTSL+A P T IN IGG ++ + VVS DLI +L
Sbjct: 260 DLE--LCKGGCQVIADTGTSLIAAPMSEATSINQKIGGTPIIGGQ--YVVSC--DLIPNL 313
Query: 335 LV 336
V
Sbjct: 314 PV 315
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L E + IN+ P G+ ++ CD
Sbjct: 251 IKMDAASIGDLELCKGGCQVIADTGTSLIAAPMSEA--TSINQKIGGTPIIGGQYVVSCD 308
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 309 LIPNLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 368
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 369 GKYYTEFDMGNDRVGFADA 387
>gi|322796189|gb|EFZ18765.1| hypothetical protein SINV_10075 [Solenopsis invicta]
Length = 366
Score = 278 bits (710), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 140/270 (51%), Positives = 182/270 (67%), Gaps = 10/270 (3%)
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
++ RH L S E PL N++DAQY+GEI IG+PPQ F VIFDTGSSNLWVPS KC Y
Sbjct: 26 LATTRH-LHSSTE---PLSNYLDAQYYGEITIGTPPQKFKVIFDTGSSNLWVPSKKCRYT 81
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+I+C H++Y SRKS TY + G I YG+GS+SGF S D V V + V++Q F EA
Sbjct: 82 NIACLLHNKYDSRKSITYQKNGTPFAIRYGTGSLSGFLSTDVVNVAGLNVQNQTFAEAVS 141
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF+ A+FDGI+G+G+ I+V PV+ NMV+Q LV + +FSF+LNRDP A +GGE
Sbjct: 142 EPGLTFVAAKFDGILGMGYSTISVDGVTPVFYNMVKQKLVPQPIFSFYLNRDPTAAQGGE 201
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTG--VCEGGCAAIVDSGTSLL 296
++ GG DP+H+ G TYV VT+KGYWQF + I +G+ S +C+ C AI D+GTSL+
Sbjct: 202 MILGGSDPEHYVGSMTYVDVTRKGYWQFTMDRITVGDSSPSHILCKNTCQAIADTGTSLI 261
Query: 297 AGPTPVVTEINHAIGGE---GVVSAECKLV 323
AGPT + EIN IG G C +V
Sbjct: 262 AGPTVEINEINKQIGATMIGGQALVNCAMV 291
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 78/142 (54%), Gaps = 14/142 (9%)
Query: 370 ENVSAGDSA----VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPM--GESII 423
+ ++ GDS+ +C A+ L T E INE+ + M G++++
Sbjct: 232 DRITVGDSSPSHILCKNTCQAIADTGTSLIAGPTVE-----INEINKQIGATMIGGQALV 286
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
+C +P +P V+F +G K F+L E Y+L+ E +C+SGF D+ PLWILGD
Sbjct: 287 NCAMVPHLPKVNFILGGKTFSLKGEDYVLEITEMGHTICMSGFQGMDM---GDPLWILGD 343
Query: 484 VFMGVYHTVFDSGKLRIGFAEA 505
VF+G Y+T FD G R+GFAEA
Sbjct: 344 VFIGRYYTEFDLGNNRVGFAEA 365
>gi|46309251|dbj|BAD15111.1| cathepsin D [Todarodes pacificus]
Length = 392
Score = 277 bits (709), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 133/291 (45%), Positives = 186/291 (63%), Gaps = 14/291 (4%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL------PLKNFMDAQYFGEI 89
+R+ LH + +AR+ ++ G+G S ++ + PL N++DAQY+G I
Sbjct: 23 QRIQLHKITSARM-----HLIGSGTSNSTLKMISQLQQRYRAPTPEPLSNYLDAQYYGVI 77
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IG+P QNF V+FDTGSSNLWVPS KC S I+C H++Y S +S+TY G I YG
Sbjct: 78 SIGTPAQNFKVVFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTQSSTYKANGTDFHIQYG 137
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
SGS+ GF S D V +G V +K Q F EAT + L F+ A+FDGI+G+ + I+V PV
Sbjct: 138 SGSLDGFLSTDTVAIGSVAIKAQTFAEATNQPGLVFVAAKFDGILGMAYDTISVDKVTPV 197
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+ ++ Q LV + VFSF+LNRDP +EGGE++ GG DPKH+ G TY+PVT+KGYWQ ++
Sbjct: 198 FYQIISQKLVDQPVFSFYLNRDPSGKEGGELILGGSDPKHYTGNFTYLPVTRKGYWQIKM 257
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
++ G + C GGC AI D+GTSL+AGP + ++N AIGG + E
Sbjct: 258 DKVVSGENT--FCSGGCQAIADTGTSLIAGPVDEIKKLNEAIGGRALPGGE 306
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 80/139 (57%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + V +G++ CS A+ L E + +NE P GE ++DC
Sbjct: 255 IKMDKVVSGENTFCSGGCQAIADTGTSLIAGPVDE--IKKLNEAIGGRALPGGEYMVDCA 312
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +PNV F +G K F+L Y+L + +C+SGFM ++PPP GPLWILGDVF+
Sbjct: 313 SIPKLPNVDFVLGGKTFSLKTSDYVLTIKQAGQTICLSGFMGINIPPPAGPLWILGDVFI 372
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+TVFD GK ++GFA A
Sbjct: 373 GKYYTVFDLGKNQVGFAVA 391
>gi|146454530|gb|ABQ41931.1| aspartic proteinase 1 [Sonneratia caseolaris]
Length = 203
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 164/203 (80%), Gaps = 3/203 (1%)
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 355 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 411
G V GIK+VV+ + ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDDKSTSSGSVHDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 412 DSLPNPMGESIIDCDRIPTMPNV 434
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|66560290|ref|XP_392857.2| PREDICTED: lysosomal aspartic protease [Apis mellifera]
Length = 385
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 137/287 (47%), Positives = 187/287 (65%), Gaps = 6/287 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +++ R KE + GD + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RIPLHKIDSIRKQFKEY---NTEIYQTHIFQGDLPQP-EPLSNYLDAQYYGVISIGTPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLWVPS KC+ + I+C H +Y + KS+TY + G I YGSGS+SG+
Sbjct: 77 DFRVIFDTGSSNLWVPSKKCHLTNIACKLHRKYDNTKSSTYKKNGTDFAIRYGSGSLSGY 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V++ + + DQ F EA E L F+ A+FDGI+G+ + +I+V PV+ NMV+Q
Sbjct: 137 LSTDTVDIAGMKISDQTFAEALSEPGLAFVAAKFDGILGMAYSKISVDGVTPVFYNMVKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV KKGYWQF + I IG+
Sbjct: 197 GLVPQPVFSFYLNRNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFRMDSIQIGS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
VC+ GC AI D+GTSL+AGP + IN AIG + + E +
Sbjct: 257 D-LKVCQQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIAAGEAMI 302
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 76/130 (58%), Gaps = 2/130 (1%)
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
D VC A+ L KE + IN+ + P GE++IDC+ IP +P ++
Sbjct: 257 DLKVCQQGCEAIADTGTSLIAGPVKE--IEAINKAIGATPIAAGEAMIDCNSIPNLPTIN 314
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
F +G K F+L E Y+LK + VC+SGFM D+ PP GPLWILGDVF+G Y+T FD
Sbjct: 315 FVLGGKSFSLKGEDYVLKVTQFGKTVCLSGFMGMDISPPNGPLWILGDVFIGRYYTEFDM 374
Query: 496 GKLRIGFAEA 505
G R+GFA A
Sbjct: 375 GNNRVGFATA 384
>gi|326920173|ref|XP_003206349.1| PREDICTED: cathepsin D-like [Meleagris gallopavo]
Length = 397
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 197/286 (68%), Gaps = 5/286 (1%)
Query: 63 VRHRLGDSD-EDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF- 118
++ +LG SD + P LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+
Sbjct: 52 LKFKLGFSDLAEPTPEILKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLL 111
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
I+C H +Y + KS+TY E G I+YG+GS+SGF SQD V +G++ +K+Q+F EA +
Sbjct: 112 DIACLLHHKYDASKSSTYVENGTEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVK 171
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +TF+ A+FDGI+G+ F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE
Sbjct: 172 QPGITFIAAKFDGILGMAFPRISVDKVTPFFDNVMKQKLIEKNIFSFYLNRDPTAQPGGE 231
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GG DPK+++G ++V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ G
Sbjct: 232 LLLGGTDPKYYRGDFSWVNVTRKAYWQVHMDSVNVANGLT-LCKGGCEAIVDTGTSLITG 290
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKV 344
PT V E+ AIG + ++ + + + L L+ G P K+
Sbjct: 291 PTKEVKELQTAIGAKPLIKGQYIIPCDKISSLPVVTLMLGGKPYKL 336
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 78/138 (56%), Gaps = 3/138 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ NV+ G + CE A+V L TKE + + + P G+ II CD
Sbjct: 261 MDSVNVANGLTLCKGGCE-AIVDTGTSLITGPTKE--VKELQTAIGAKPLIKGQYIIPCD 317
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+I ++P V+ +G K + L+ EQY+ K +C+SGF D+PPP GPLWILGDVF+
Sbjct: 318 KISSLPVVTLMLGGKPYKLTGEQYVFKVSAQGETICLSGFSGLDVPPPGGPLWILGDVFI 377
Query: 487 GVYHTVFDSGKLRIGFAE 504
G Y+TVFD +GFA+
Sbjct: 378 GPYYTVFDRDNDSVGFAK 395
>gi|146454528|gb|ABQ41930.1| aspartic proteinase 1 [Sonneratia alba]
Length = 203
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 123/203 (60%), Positives = 163/203 (80%), Gaps = 3/203 (1%)
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 355 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 411
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 412 DSLPNPMGESIIDCDRIPTMPNV 434
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|195997283|ref|XP_002108510.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589286|gb|EDV29308.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 389
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 204/321 (63%), Gaps = 13/321 (4%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK-RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
L V+A+ L+ SS+ L R+ L K ++ L IT + + ++ LG S
Sbjct: 4 LLVIAALFLI--SSDALVRVPLYKFKKTPREHLAEVGIT--------SSMLSEKYELGAS 53
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYK 129
L N++DAQY+GEI IG+PPQ F V+FDTGSSNLWVPSSKC F +I+C FHS+Y
Sbjct: 54 RNATEMLNNYLDAQYYGEISIGTPPQKFKVLFDTGSSNLWVPSSKCSFLNIACLFHSKYD 113
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TY + I YG+GS++GF S D V + V VK+Q F EA E LTF+ A+F
Sbjct: 114 HSKSSTYKKNSTKFSIRYGTGSLTGFLSVDTVRIQGVSVKNQGFAEAVSEPGLTFVAAQF 173
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+G++EIAV PV++N++ Q V + VFSF+LNR A+ GGE++ GG D KH+
Sbjct: 174 DGILGMGYQEIAVDGVPPVFNNIMAQKQVGKSVFSFYLNRKEGAKPGGELILGGSDSKHY 233
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TY+PVTKKGYWQF++ I + + + C+GGC AI D+GTSLLAGPT V +I
Sbjct: 234 SGNFTYLPVTKKGYWQFKMDGISVKGKGS-FCKGGCQAIADTGTSLLAGPTAEVNKIQTL 292
Query: 310 IGGEGVVSAECKLVVSQYGDL 330
IG +++ E + S+ L
Sbjct: 293 IGATPLLNGEYTIDCSKISSL 313
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 76/130 (58%), Gaps = 2/130 (1%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + C A+ L T E ++ I L + P GE IDC +I ++P +
Sbjct: 259 GKGSFCKGGCQAIADTGTSLLAGPTAE--VNKIQTLIGATPLLNGEYTIDCSKISSLPPI 316
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+G K F L+ +QY+LK +VC+SGF D+P PRGPLWILGDVF+G Y+T FD
Sbjct: 317 TFTLGGKKFTLTGKQYVLKVSSLGLDVCLSGFTGIDIPKPRGPLWILGDVFIGQYYTEFD 376
Query: 495 SGKLRIGFAE 504
K R+GFA+
Sbjct: 377 MAKNRVGFAK 386
>gi|351712803|gb|EHB15722.1| Cathepsin D, partial [Heterocephalus glaber]
Length = 390
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/303 (46%), Positives = 200/303 (66%), Gaps = 23/303 (7%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH--------RLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ + H +L +P LKN+MDAQY+
Sbjct: 3 RIPLHKFKSIRRTMTE--VGGSVEDLIAHGPLTKYSPQLSTKTTGPVPETLKNYMDAQYY 60
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPSS+C I+C+FH +Y S KS+TY + G S +I
Sbjct: 61 GEIGIGTPPQCFTVVFDTGSSNLWVPSSRCNMLDIACWFHHKYHSDKSSTYVKNGSSFDI 120
Query: 146 NYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+YGSGS+SG+ SQD V V ++ V+ Q F EAT++ +TF+ A+FDGI+G+
Sbjct: 121 HYGSGSLSGYLSQDTVSVPCQSAESNPRNLRVEKQTFGEATKQPGITFIAAKFDGILGMA 180
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V + +PV+DN++ Q LV + VFSF+LNRDP A+ GGE++ GG+D K++KG TY+
Sbjct: 181 YPRISVNNVLPVFDNLMSQKLVDKNVFSFYLNRDPSAQPGGELMLGGIDSKYYKGSFTYL 240
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
VT+K YWQ + + +G+ +C+GGC AIVD+GTSLL GP V E+ AIG ++
Sbjct: 241 NVTRKAYWQVHMDQLEVGS-GLNLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAIPLI 299
Query: 317 SAE 319
E
Sbjct: 300 QGE 302
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 67/94 (71%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ C+++ ++P+V+ +G + LSPE Y+LK + +C+SGFM D+P
Sbjct: 295 AIPLIQGEYMVPCEKVSSLPSVTLKLGGSAYPLSPEDYVLKVSQAGRTICLSGFMGMDIP 354
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 355 PPTGPLWILGDVFIGRYYTVFDRDNNRVGFAQAA 388
>gi|218847782|ref|NP_001136375.1| cathepsin D-like precursor [Xenopus (Silurana) tropicalis]
gi|159155417|gb|AAI54878.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 399
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 203/321 (63%), Gaps = 22/321 (6%)
Query: 10 FCLWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAG 59
+WV LAS LL P S+ L RI LKK H+ A KE +Y G
Sbjct: 4 LLVWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFP 61
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
SG + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C
Sbjct: 62 PSG--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSML 113
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
I+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA +
Sbjct: 114 DIACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVK 173
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE
Sbjct: 174 QPGVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGE 233
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ G
Sbjct: 234 LLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITG 292
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P VT + AIG ++ +
Sbjct: 293 PLEEVTALQKAIGAVPLIQGQ 313
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 73/102 (71%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ + + ++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+
Sbjct: 297 VTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICL 356
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
SGFM ++PPP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 357 SGFMGLNIPPPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 398
>gi|66911216|gb|AAH96630.1| LOC613063 protein, partial [Xenopus (Silurana) tropicalis]
Length = 395
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 203/319 (63%), Gaps = 22/319 (6%)
Query: 12 LWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAGVS 61
+WV LAS LL P S+ L RI LKK H+ A KE +Y G S
Sbjct: 2 VWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFPPS 59
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
G + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I
Sbjct: 60 G--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++
Sbjct: 112 ACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++
Sbjct: 172 GVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELL 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP
Sbjct: 232 LGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPL 290
Query: 301 PVVTEINHAIGGEGVVSAE 319
VT + AIG ++ +
Sbjct: 291 EEVTALQKAIGAVPLIQGQ 309
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 73/102 (71%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ + + ++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+
Sbjct: 293 VTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICL 352
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
SGFM ++PPP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 353 SGFMGLNIPPPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 394
>gi|116284100|gb|AAI23963.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 396
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/321 (44%), Positives = 203/321 (63%), Gaps = 22/321 (6%)
Query: 10 FCLWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAG 59
+WV LAS LL P S+ L RI LKK H+ A KE +Y G
Sbjct: 1 LLVWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFP 58
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
SG + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C
Sbjct: 59 PSG--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSML 110
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
I+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA +
Sbjct: 111 DIACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVK 170
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE
Sbjct: 171 QPGVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGE 230
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ G
Sbjct: 231 LLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITG 289
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P VT + AIG ++ +
Sbjct: 290 PLEEVTALQKAIGAVPLIQGQ 310
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 73/102 (71%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ + + ++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+
Sbjct: 294 VTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICL 353
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
SGFM ++PPP GPLWILGDVF+G Y++VFD +GFA+A
Sbjct: 354 SGFMGLNIPPPAGPLWILGDVFIGQYYSVFDRANDCVGFAKA 395
>gi|307167890|gb|EFN61279.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 354
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 137/265 (51%), Positives = 175/265 (66%), Gaps = 14/265 (5%)
Query: 63 VRHRLGDSDEDILP-----------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
+R+ L + D D+ P L N++DAQY+G I IG+PPQ F VIFDTGSSNLWV
Sbjct: 4 IRNSLKEVDADLQPVHLTGGITPEPLSNYLDAQYYGVISIGTPPQEFKVIFDTGSSNLWV 63
Query: 112 PSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKD 170
PS C+F+ I+C H +Y S+KS+TY G S I YGSGS+SG+ S D V V + V
Sbjct: 64 PSKNCHFTNIACQLHHKYNSKKSSTYEPNGASFAIQYGSGSLSGYLSADVVNVAGLNVTS 123
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
QVF EA E L F+ A+FDGI+G+G+ IAV PV+ NMV+Q LV + VFSF+LNRD
Sbjct: 124 QVFAEAISEPGLAFVAAKFDGILGMGYSTIAVDGVTPVFYNMVKQKLVPKAVFSFYLNRD 183
Query: 231 PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVD 290
P AE GGE++ GG DP H++ TYVPVT+KGYWQF + I +GN++ C GC AI D
Sbjct: 184 PSAEVGGELILGGSDPDHYEADLTYVPVTRKGYWQFSMDGIEVGNRT--FCNNGCQAIAD 241
Query: 291 SGTSLLAGPTPVVTEINHAIGGEGV 315
+GTSL+AGP V IN IG +
Sbjct: 242 TGTSLIAGPVADVAAINKLIGASAI 266
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 61/87 (70%), Gaps = 1/87 (1%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G++I+DC++IP +P ++F +G+K F+LS E Y+L+ + +C+SGFM FD+ G
Sbjct: 268 GQAIVDCNKIPQLPEINFNLGNKKFSLSGEDYVLQIKQFGTTICMSGFMGFDI-GSHGLE 326
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+T FD R+GFA A
Sbjct: 327 WILGDVFIGRYYTEFDLDNDRVGFAPA 353
>gi|125807245|ref|XP_001360320.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|195149648|ref|XP_002015768.1| GL11239 [Drosophila persimilis]
gi|54635492|gb|EAL24895.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|194109615|gb|EDW31658.1| GL11239 [Drosophila persimilis]
Length = 388
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 187/292 (64%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
RL LN + R+ G + +R R G D PL N+MDAQY+G I IGSPPQ
Sbjct: 22 RLLRVPLNRFQSARRHFADVGTELQQLRIRYGGGDVP-EPLSNYMDAQYYGPISIGSPPQ 80
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+ + I+C H++Y + KS+TY + G + I YGSGS+SG+
Sbjct: 81 SFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSSTYAKNGTTFAIQYGSGSLSGY 140
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + +G + +K Q F EA E L F+ A+FDGI+GLG+ I+V P + M EQ
Sbjct: 141 LSTDTLSMGGLDIKGQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQ 200
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S VFSF+LNRDP + EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++ +G+
Sbjct: 201 GLISSPVFSFYLNRDPASPEGGEIIFGGSDPKHYTGDFTYLPVTRKAYWQIKMDSAALGD 260
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
+C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 LE--LCKGGCQVIADTGTSLIAAPMTEATSINQKIGGTPIIGGQYIVSCDLI 310
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ ++ + GD +C + L E + IN+ P G+ I+ CD
Sbjct: 251 IKMDSAALGDLELCKGGCQVIADTGTSLIAAPMTEA--TSINQKIGGTPIIGGQYIVSCD 308
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 309 LIPKLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 368
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 369 GKYYTEFDMGNDRVGFADA 387
>gi|31197673|ref|XP_307784.1| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|347969584|ref|XP_003436430.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|347969586|ref|XP_003436431.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|347969588|ref|XP_003436432.1| AGAP003277-PD [Anopheles gambiae str. PEST]
gi|30179074|gb|EAA03535.2| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|333466215|gb|EGK96172.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|333466216|gb|EGK96173.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|333466217|gb|EGK96174.1| AGAP003277-PD [Anopheles gambiae str. PEST]
Length = 389
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 177/255 (69%), Gaps = 7/255 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQYFG I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 61 PLSNYLDAQYFGAISIGTPPQSFKVVFDTGSSNLWVPSKQCSFTNIACLMHNKYDAKKSS 120
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
++ + G + I YG+GS+SG+ S D V VG V V+ Q F EA +E L F+ A+FDGI+G
Sbjct: 121 SFEKNGTAFHIQYGTGSLSGYLSTDTVTVGGVPVEKQTFAEAIQEPGLVFVAAKFDGILG 180
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L ++ I+V +PV+ NM QG + VFSF+LNRDP A EGGEI+FGG D KH+ G T
Sbjct: 181 LAYKSISVDGVMPVFYNMFNQGKIDAPVFSFYLNRDPSAAEGGEIIFGGSDSKHYTGDFT 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 241 YLSVDRKAYWQFKMDSVTVGDAQ--YCNNGCEAIADTGTSLIAGPVAEVTAINKAIGGTP 298
Query: 315 VVSAE----CKLVVS 325
V++ E C L+ S
Sbjct: 299 VLNGEYMVDCSLIPS 313
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/139 (41%), Positives = 83/139 (59%), Gaps = 4/139 (2%)
Query: 368 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ ++V+ GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 253 KMDSVTVGDAQYCNNGCE-AIADTGTSLIAGPVAE--VTAINKAIGGTPVLNGEYMVDCS 309
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP++P ++FT+G K F L YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 310 LIPSLPKITFTLGGKQFTLEGADYILRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 369
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA A
Sbjct: 370 GKYYTEFDMGNDRVGFATA 388
>gi|348565205|ref|XP_003468394.1| PREDICTED: cathepsin D-like [Cavia porcellus]
Length = 407
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 11/307 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKN 79
P S+ L RI L K + H++ A E + ++ +L +P L N
Sbjct: 15 PFSTTALIRIPLHKFKSIRHTMTEAG-GSVENLIARDPLTKYSPQLSTKATGPVPEPLSN 73
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTE 138
+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS+KC I+C+FH +Y KS+TY +
Sbjct: 74 YMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSAKCKMLDIACWFHHKYHGDKSSTYVK 133
Query: 139 IGKSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARFDGI 192
G S +I+YGSGS+SG+ SQD V V V V Q F EAT++ + F+ A+FDGI
Sbjct: 134 NGTSFDIHYGSGSLSGYLSQDTVSVPCKSSNSSVKVSKQTFGEATKQPGIVFVAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+V + +PV+DN++EQ LV + +FSF+LNRDP A+ GGE+V GG+D K++KG
Sbjct: 194 LGLAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRDPTAQPGGELVLGGIDSKYYKGS 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 254 FTYLNVTRKAYWQVHMDQLQVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGA 312
Query: 313 EGVVSAE 319
++ E
Sbjct: 313 LPLIQGE 319
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+LP GE +I C+++ ++P+V+ +G + L+ E Y+LK + +C+SGFM D+P
Sbjct: 312 ALPLIQGEYMIPCEKVSSLPSVTLKLGGTDYTLASEDYVLKVSQAGKTICLSGFMGMDIP 371
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA++A
Sbjct: 372 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQSA 405
>gi|62319754|dbj|BAD93734.1| putative aspartic proteinase [Arabidopsis thaliana]
Length = 205
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 129/205 (62%), Positives = 160/205 (78%), Gaps = 5/205 (2%)
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
INHAIG GVVS +CK VV QYG I DLL+S P+K+C QIGLC F+G VS GI++
Sbjct: 2 INHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGIES 61
Query: 366 VVEKENVS----AGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES 421
VV+KEN GD+A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+PMGES
Sbjct: 62 VVDKENAKLSNGVGDAA-CSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSPMGES 120
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGPLWIL
Sbjct: 121 AVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGPLWIL 180
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEAA 506
GDVFMG YHTVFD G ++GFAEAA
Sbjct: 181 GDVFMGKYHTVFDFGNEQVGFAEAA 205
>gi|45384002|ref|NP_990508.1| cathepsin D precursor [Gallus gallus]
gi|461696|sp|Q05744.1|CATD_CHICK RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|259835|gb|AAB24157.1| prepro-cathepsin D [Gallus gallus]
Length = 398
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 178/244 (72%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+ I+C H +Y + KS+T
Sbjct: 70 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDASKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YG+GS+SGF SQD V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 130 YVENGTEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++
Sbjct: 190 AFPRISVDKVTPFFDNVMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + +
Sbjct: 250 VNVTRKAYWQVHMDSVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPL 308
Query: 316 VSAE 319
+ +
Sbjct: 309 IKGQ 312
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 78/138 (56%), Gaps = 3/138 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ +V+ G + CE A+V L TKE + + + P G+ +I CD
Sbjct: 262 MDSVDVANGLTLCKGGCE-AIVDTGTSLITGPTKE--VKELQTAIGAKPLIKGQYVISCD 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+I ++P V+ +G K + L+ EQY+ K +C+SGF D+PPP GPLWILGDVF+
Sbjct: 319 KISSLPVVTLMLGGKPYQLTGEQYVFKVSAQGETICLSGFSGLDVPPPGGPLWILGDVFI 378
Query: 487 GVYHTVFDSGKLRIGFAE 504
G Y+TVFD +GFA+
Sbjct: 379 GPYYTVFDRDNDSVGFAK 396
>gi|242013446|ref|XP_002427417.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
gi|212511797|gb|EEB14679.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
Length = 383
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 135/284 (47%), Positives = 184/284 (64%), Gaps = 8/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ L+ +AR T + G V +R R G + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLYKFQSARRTLRGV---GTDVEHLRMRYGGPTPE--PLSNYLDAQYYGPISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F VIFDTGSSNLW+PS KC FS I+C H++Y S +S+TY G I YGSGS+SG+
Sbjct: 76 QFKVIFDTGSSNLWIPSKKCLFSNIACLLHNKYDSSRSSTYIRNGTEFSIQYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G + +K Q F EA E L F+ A+FDGI+G+G+ IAV VP + NM EQ
Sbjct: 136 LSTDDVTLGGLTIKRQTFAEAISEPGLAFVAAKFDGILGMGYMSIAVDGVVPPFYNMYEQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV +FSF+LNR+P+ + GGE++ GG DP ++KG TY+PV +K YWQF++ +++
Sbjct: 196 RLVDSPIFSFYLNRNPNEKVGGELLLGGSDPNYYKGNFTYLPVNRKAYWQFQMDKVMM-- 253
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ VC GGC AI D+GTSL+AGP V +IN + G V E
Sbjct: 254 EDITVCRGGCQAIADTGTSLIAGPVEDVNKINKKLNGVPVSGGE 297
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 2/139 (1%)
Query: 368 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
+ + V D VC A+ L ++ ++ IN+ + +P GE +I+C
Sbjct: 247 QMDKVMMEDITVCRGGCQAIADTGTSLIAGPVED--VNKINKKLNGVPVSGGEYMIECRN 304
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
IP +P ++F + + F L + YIL+ + VC+SGFM D+P P GPLWILGDVF+G
Sbjct: 305 IPNLPKINFVLKGRSFVLEAKDYILRVSQFGKTVCLSGFMGIDIPKPNGPLWILGDVFIG 364
Query: 488 VYHTVFDSGKLRIGFAEAA 506
++T FD R+GFAE+A
Sbjct: 365 KFYTEFDMKNNRVGFAESA 383
>gi|146454534|gb|ABQ41933.1| aspartic proteinase 1 [Sonneratia apetala]
Length = 203
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 122/203 (60%), Positives = 163/203 (80%), Gaps = 3/203 (1%)
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGGE+VFGGVDP H+KG+HTYVPVT+KGYWQF++G++LIG++++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKGEHTYVPVTQKGYWQFDMGEVLIGDEASGFCGSGCAAIADSGTS 60
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 355 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 411
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 412 DSLPNPMGESIIDCDRIPTMPNV 434
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|225717994|gb|ACO14843.1| Lysosomal aspartic protease precursor [Caligus clemensi]
Length = 386
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 131/271 (48%), Positives = 183/271 (67%), Gaps = 3/271 (1%)
Query: 50 RKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNL 109
R+ + G+ + +R R PL N++DAQY+G I IG+PPQ+F+VIFDTGSSNL
Sbjct: 32 RRHFFEVGSSIQLIRRRWNSVGAHPEPLSNYLDAQYYGPITIGTPPQSFNVIFDTGSSNL 91
Query: 110 WVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
WVPS C+ + I+C H ++ KS++Y G I YGSGS+ GF S D+V +G V +
Sbjct: 92 WVPSKSCHITNIACLLHHKFDHSKSSSYVVNGTEFAIQYGSGSLFGFLSTDSVSMGGVEI 151
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +QGL+ E VFSF+LN
Sbjct: 152 GSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQGLIQEPVFSFYLN 211
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
RDP+A+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +++ C+ GC AI
Sbjct: 212 RDPNAQVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDGMKVSSKT--FCQNGCQAI 269
Query: 289 VDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+AGP+ V +N +GG +V+ E
Sbjct: 270 ADTGTSLIAGPSVEVNALNQLLGGMPIVNGE 300
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 69/99 (69%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+L +P GE + +C +PT+P ++FTIG F L+ E Y++K + VC+SGF
Sbjct: 287 LNQLLGGMPIVNGEYMFNCADVPTLPAITFTIGGTDFVLTGEDYVMKITQFGKTVCLSGF 346
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+P P GP+WILGDVF+G Y+T+FD GK R+GFA++
Sbjct: 347 MGLDVPAPMGPIWILGDVFIGRYYTIFDMGKDRVGFAQS 385
>gi|226437842|gb|ACO56332.1| putative gut cathepsin D-like aspartic protease [Callosobruchus
maculatus]
Length = 389
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 140/291 (48%), Positives = 193/291 (66%), Gaps = 13/291 (4%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVS-----GVRHR-LGDSDEDILPLKNFMDAQYFGEI 89
R+ L+ + R T +E G VS G ++R LG + PL N++DAQY+G I
Sbjct: 19 HRIPLYKFKSIRRTFQEV---GTDVSQVVLNGNKYRNLGGPVPE--PLSNYLDAQYYGPI 73
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IG+PPQ F VIFDTGSSNLWVPS C+F+ I+C H++Y S KS+TY + G + I YG
Sbjct: 74 SIGTPPQTFKVIFDTGSSNLWVPSKLCHFTNIACLLHNKYDSSKSSTYKKNGTAFAIRYG 133
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
SGS+ GF S D+V G + V++Q F EA E + F+ A+FDGI+G+G+ IAV PV
Sbjct: 134 SGSLDGFLSTDHVSFGGLKVENQTFAEAMNEPGMAFVAAKFDGILGMGYSRIAVDGVPPV 193
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+ NMV Q LVS+ VFSF+LNRDP A +GGE++ GG D H+KG+ TY+PV ++ YWQF++
Sbjct: 194 FYNMVSQKLVSQPVFSFYLNRDPAAPQGGELILGGSDKAHYKGEFTYLPVDRQAYWQFKM 253
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ +G ++T +C GC AI D+GTSL+AGP+ V IN AIG ++ E
Sbjct: 254 DKVQVGPETT-LCAKGCEAIADTGTSLIAGPSEEVKAINKAIGATPIMGGE 303
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ + P GE ++ C+ IP +P ++F +G K F L + YIL+ + +C+SGF
Sbjct: 290 INKAIGATPIMGGEYLVSCESIPKLPTINFVLGGKPFALEGKDYILRVSQAGQTLCLSGF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD G R+GFAEA
Sbjct: 350 MGIDIPPPNGPLWILGDVFIGRYYTEFDLGNNRVGFAEA 388
>gi|195027894|ref|XP_001986817.1| GH21578 [Drosophila grimshawi]
gi|193902817|gb|EDW01684.1| GH21578 [Drosophila grimshawi]
Length = 388
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 128/245 (52%), Positives = 170/245 (69%), Gaps = 3/245 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS+
Sbjct: 60 PLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDATKSS 119
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I+YGSGS+SG+ S D V + + +KD F EA E L F+ A+FDGI+G
Sbjct: 120 TYVKNGTEFAIHYGSGSLSGYLSTDTVNIAGLDIKDHTFAEALSEPGLVFVAAKFDGILG 179
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + M EQGL+S+ VFSF+LNRDP A EGGEI+FGG DP H+ G T
Sbjct: 180 LGYSSISVDGVKPSFYAMYEQGLISDPVFSFYLNRDPKAPEGGEIIFGGSDPNHYTGDFT 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+PVT+KGYWQ ++ + + +C+GGC I D+GTSL+A P T IN AIGG
Sbjct: 240 YLPVTRKGYWQIKMDSAQLNDIE--LCKGGCQVIADTGTSLIAAPQDEATSINQAIGGTP 297
Query: 315 VVSAE 319
++ +
Sbjct: 298 ILGGQ 302
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P G+ ++ CD IP +P + F K F L + YIL+ + +C+SGF
Sbjct: 289 INQAIGGTPILGGQYVVSCDAIPNLPVIKFVFNGKTFELEGKDYILRVAQMGKTICLSGF 348
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 349 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFANA 387
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 199/317 (62%), Gaps = 11/317 (3%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+++ + L AS LL G + K+ + L + R + G + +R
Sbjct: 1 MIKYILFLLFEASVLL-----QGFHAV--KEEKFIRVPLTRIKTARNYFHEVGTELQQLR 53
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ G + D PL N++DAQY+G I IG+P Q+F V+FDTGSSNLWVPS +CYF+ I+C
Sbjct: 54 LKYGSANDVRPEPLSNYLDAQYYGPISIGTPSQDFKVVFDTGSSNLWVPSKQCYFTNIAC 113
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H++Y + KS++Y + G I+YGSGS+SG+ S D V + + ++ Q F EA E L
Sbjct: 114 LMHNKYDANKSSSYKKNGTEFAIHYGSGSLSGYLSTDTVNIAGLGIEGQTFAEALSEPGL 173
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F+ A+FDGI+GLG+ IAV P + M EQGL+S+ VFSF+LNRDP A EGGEI+FG
Sbjct: 174 VFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQPVFSFYLNRDPKAPEGGEIIFG 233
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DP H+KG+ TY+PVT+K YWQ ++ +GN + +C+GGC I D+GTSL+A P
Sbjct: 234 GSDPNHYKGEFTYLPVTRKAYWQIKMDSASMGNLN--LCQGGCQVIADTGTSLIALPPSE 291
Query: 303 VTEINHAIGGEGVVSAE 319
T IN AIGG ++ +
Sbjct: 292 ATSINKAIGGTPIMGGQ 308
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P G+ ++ C+ IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 295 INKAIGGTPIMGGQYMVACENIPKLPVIRFVLGGKTFELEGKDYILRIAQMGKTICLSGF 354
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GP+WILGDVF+G Y+T FD G R+GFAEA
Sbjct: 355 MGIDIPPPNGPIWILGDVFIGKYYTEFDMGNDRVGFAEA 393
>gi|156406785|ref|XP_001641225.1| predicted protein [Nematostella vectensis]
gi|156228363|gb|EDO49162.1| predicted protein [Nematostella vectensis]
Length = 370
Score = 274 bits (700), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 140/284 (49%), Positives = 189/284 (66%), Gaps = 3/284 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + R + KE + + G + + PL N+MDAQY+GEI IG+PPQ
Sbjct: 3 RIPLHKMPTPRQSLKEVGISVEQLLGKYGGKYEGGDVPEPLINYMDAQYYGEITIGTPPQ 62
Query: 97 NFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+V+FDTGSSNLWVPS KC + +I+C H +Y S KS+TY + G I YGSGS+SGF
Sbjct: 63 KFTVVFDTGSSNLWVPSKKCSWTNIACLLHDKYDSTKSSTYKKNGTEFAIRYGSGSLSGF 122
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + VK Q F EA +E LTF+ A+FDGI+G+GF I+V VPV+ +MV Q
Sbjct: 123 LSIDTVSVGGIDVKGQTFAEALKEPGLTFVAAKFDGILGMGFSSISVDQVVPVFYDMVLQ 182
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV VFSF+LNR+P A GGE++ GG DPK++KG +YVPVT++GYWQF++ I +
Sbjct: 183 KLVPAPVFSFYLNREPGASPGGELLLGGSDPKYYKGNFSYVPVTQEGYWQFKMDGISVKE 242
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
S C GC AI D+GTSL+AGPT + ++N+ IG + ++ E
Sbjct: 243 GS--FCSDGCQAIADTGTSLIAGPTDEIEKLNNLIGAKIIIGGE 284
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 81/136 (59%), Gaps = 2/136 (1%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+ +S + + CS A+ L T E + +N L + GE ++C I
Sbjct: 236 DGISVKEGSFCSDGCQAIADTGTSLIAGPTDE--IEKLNNLIGAKIIIGGEYTVNCSAID 293
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P+++FTIG K + L+ +QYILK VCISGF+ D+PPPRGPLWILGDVF+G Y
Sbjct: 294 SLPDITFTIGGKKYVLTGKQYILKVTTLGQSVCISGFLGLDVPPPRGPLWILGDVFIGPY 353
Query: 490 HTVFDSGKLRIGFAEA 505
+T FD G R+GFAEA
Sbjct: 354 YTEFDFGNKRVGFAEA 369
>gi|60678793|gb|AAX33731.1| Blo t allergen [Blomia tropicalis]
Length = 402
Score = 274 bits (700), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 205/316 (64%), Gaps = 18/316 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
++ L LA+ LL+ A L RI L+K + SL R E + A + H
Sbjct: 1 MKYSLVLVFLATILLVDAK---LHRIKLQKAQ----SLRK-RFVEVESPIKLAYTTHHYH 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
+ + PL N+ DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPS KC F+ ++C
Sbjct: 53 HWYNGFPE--PLSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLACLL 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS++Y G S EI YG+GS++GF S D V V + +++Q F EA E +TF
Sbjct: 111 HHKYDSSKSSSYVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
+ A+FDGI+GLGF I+V D VP V+D+MV+QGLV + VFSF+LNRD + + GGEI+FGG
Sbjct: 171 VFAKFDGILGLGFNTISV-DGVPTVFDSMVKQGLVQQPVFSFYLNRDTNGKVGGEIIFGG 229
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTG-----VCEGGCAAIVDSGTSLLAG 298
DP ++KG TY P+TK GYWQF++ IL+ N+S VCE GC AI D+GTSL+AG
Sbjct: 230 SDPAYYKGDFTYAPLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEG 314
P+ V +N A+G G
Sbjct: 290 PSDQVEHLNRALGAIG 305
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 54/85 (63%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G +++C I T+P++ F I F LSP+QY+++ E+CIS F++ P PL
Sbjct: 309 GIFVLNCSHINTLPSIIFQINGVKFPLSPDQYVMRQSAMGKEICISSFISL---PANIPL 365
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y+T FD G R+GFA
Sbjct: 366 WILGDVFIGNYYTEFDYGNKRVGFA 390
>gi|449280808|gb|EMC88033.1| Cathepsin D, partial [Columba livia]
Length = 387
Score = 274 bits (700), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 178/244 (72%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+ I+C H +Y S KS+T
Sbjct: 59 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDSSKSST 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YG+GS+SG+ SQD V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 119 YVENGTDFAIHYGTGSLSGYLSQDTVTLGNLKIKNQIFGEALKQPGITFIAAKFDGILGM 178
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++
Sbjct: 179 AFPRISVDKVTPFFDNIMQQKLIEKNIFSFYLNRDPSAQPGGELLLGGTDPKYYSGDFSW 238
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + +
Sbjct: 239 VNVTRKAYWQVHMDAVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPL 297
Query: 316 VSAE 319
+ +
Sbjct: 298 IKGQ 301
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 77/134 (57%), Gaps = 3/134 (2%)
Query: 371 NVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
+V+ G + CE A+V L TKE + + + P G+ +I CD++ +
Sbjct: 255 DVANGLTLCKGGCE-AIVDTGTSLITGPTKE--VKELQTAIGAKPLIKGQYVIPCDKVSS 311
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
+P ++ T+G K + L+ EQY+ K +C+SGF D+PPP GPLWILGDVF+G Y+
Sbjct: 312 LPVITLTLGGKPYQLTGEQYVFKVSVQGETICLSGFSGLDVPPPGGPLWILGDVFIGPYY 371
Query: 491 TVFDSGKLRIGFAE 504
TVFD +GFA+
Sbjct: 372 TVFDRDNDSVGFAK 385
>gi|387015018|gb|AFJ49628.1| Cathepsin D [Crotalus adamanteus]
Length = 399
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 125/247 (50%), Positives = 179/247 (72%), Gaps = 2/247 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+P Q F+V+FDTGSSNLWVPSS C I+C H +Y S KS+T
Sbjct: 68 LKNYMDAQYYGEIGIGTPQQRFTVVFDTGSSNLWVPSSHCTLLDIACLIHHKYDSSKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I+YG+GS+SG+ SQD V +GD+ VK+Q+F EAT++ +TF+ A+FDGI+G+
Sbjct: 128 YVKNGTDFAIHYGTGSLSGYLSQDTVTIGDMCVKNQLFGEATKQPGITFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ EI+V P +DN++EQGL+ + +FSF+LNRDP E GGE++FGG D +++ G ++
Sbjct: 188 AYPEISVDKVAPFFDNVMEQGLLEKNLFSFYLNRDPKGETGGELLFGGTDSQYYSGDFSW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V V++K YWQ + + + N T VC+ GC AIVD+GTSL+ GPT + E+ AIG + +
Sbjct: 248 VNVSRKAYWQVHMDKVDVANGLT-VCKDGCEAIVDTGTSLITGPTKEIKELQKAIGAKPI 306
Query: 316 VSAECKL 322
+ + L
Sbjct: 307 IKGQYML 313
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/138 (38%), Positives = 81/138 (58%), Gaps = 3/138 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++K +V+ G + CE A+V L TKE + + + + P G+ ++ CD
Sbjct: 260 MDKVDVANGLTVCKDGCE-AIVDTGTSLITGPTKE--IKELQKAIGAKPIIKGQYMLPCD 316
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ T+P VS +G + + L+P+QY LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLSTLPTVSLVLGGQSYALTPDQYALKVTVQGETLCLSGFSGLDVPPPGGPLWILGDVFI 376
Query: 487 GVYHTVFDSGKLRIGFAE 504
G Y+TVFD +GFA+
Sbjct: 377 GPYYTVFDRDNDSVGFAK 394
>gi|158523297|gb|ABW70789.1| cathepsin D [Scophthalmus maximus]
Length = 396
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 196/316 (62%), Gaps = 17/316 (5%)
Query: 15 LASCLL-----LPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ SCLL L S + L RI LKK R L A + + + +G G
Sbjct: 1 MRSCLLVVFVSLALSGDALVRIPLKKFHSVRRELTDSGRKAEELLADKHSLKYSG--GFP 58
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
G + E LKNF+DAQY+G+I +GSPPQ FSV+FDTGSSNLWVPS C I+C
Sbjct: 59 SSNGPTPE---MLKNFLDAQYYGDIALGSPPQTFSVVFDTGSSNLWVPSVHCSLLDIACL 115
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS+TY + G + I YGSGS+SGF SQD +GDV V++QVF EAT++ +
Sbjct: 116 LHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGFLSQDTCTIGDVTVENQVFGEATKQPGVA 175
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ F I+V VPV+DN++ Q V + VFSF+LNR+PD GGE++ GG
Sbjct: 176 FIAAKFDGILGMAFPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLLGG 235
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DPK++ G Y+ +T+K YWQ + + +G+Q T +C GGC IVD+GTSL+ GP V
Sbjct: 236 TDPKYYTGDFNYINITRKAYWQIHMDGLAVGSQLT-LCNGGCEVIVDTGTSLITGPAAEV 294
Query: 304 TEINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 295 KALQKAIGAVPLIQGE 310
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 67/93 (72%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ CD+IP++P ++F +G + ++L+ +QY+LK +C+SGFM D+P
Sbjct: 303 AVPLIQGEYMVSCDKIPSLPVITFNLGGRGYSLTGDQYVLKESHAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNDRVGFAKS 395
>gi|324507249|gb|ADY43078.1| Cathepsin D [Ascaris suum]
Length = 437
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 203/322 (63%), Gaps = 19/322 (5%)
Query: 14 VLASCLL---LPASSNGLRRIGLKKRRLDLHSL----------NAARITRKERYMGGAG- 59
++AS LL LP + + R+ ++++ L + K+ + G A
Sbjct: 4 IVASVLLSLFLPVYTQYVMRVPIRRQDTIKEQLMESGSWSDYLHYRHHALKKHFYGIANH 63
Query: 60 -VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
V +R + G+ +++L KN+MDAQY+G+I IG+PPQNF+VIFDTGS+NLWVPS KC F
Sbjct: 64 RVHSLRGQSGNEIDELL--KNYMDAQYYGDISIGTPPQNFTVIFDTGSANLWVPSRKCPF 121
Query: 119 S-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
+ I+C H +Y + KS+TY E G+ +I YG+GS+ GF S DNV V DV +Q F EAT
Sbjct: 122 TDIACLLHHKYDAAKSSTYAEDGRKLQIQYGTGSMKGFISLDNVCVADVCATEQPFAEAT 181
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
E LTF+ A+FDGI+G+ F EIAV PV+ M++Q L++ VF+FWL+R+PD + GG
Sbjct: 182 SEPGLTFIAAKFDGILGMAFPEIAVLGVKPVFHTMIDQQLLAAPVFAFWLDRNPDDQIGG 241
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
EI FGG D K + TY PVT++GYWQF++ D ++G ++ C GC AI D+GTSL+A
Sbjct: 242 EITFGGTDTKRYVEPITYTPVTRRGYWQFKM-DKVVGEEAVLACANGCQAIADTGTSLIA 300
Query: 298 GPTPVVTEINHAIGGEGVVSAE 319
GP V I IG E + E
Sbjct: 301 GPKQQVDTIQKFIGAEPLFRGE 322
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE +I CD++P++P+VSF I K ++L P Y+ VCISGF
Sbjct: 309 IQKFIGAEPLFRGEYMIPCDKVPSLPDVSFVIASKTYSLKPTDYVFNMTAMGKSVCISGF 368
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M +LP G LWILGDVF+G Y+TVFD G R+GFAEA
Sbjct: 369 MGIELPERVGELWILGDVFIGRYYTVFDVGHERVGFAEA 407
>gi|146454532|gb|ABQ41932.1| aspartic proteinase 1 [Sonneratia ovata]
Length = 203
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 122/203 (60%), Positives = 162/203 (79%), Gaps = 3/203 (1%)
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGGE+VFGGVDP H+K +HTYVPVT+KGYWQF++G++LIG+Q++G C GCAAI DSGTS
Sbjct: 1 EGGELVFGGVDPSHYKEEHTYVPVTQKGYWQFDMGEVLIGDQASGFCGSGCAAIADSGTS 60
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGPT ++T+INHAIG GVVS ECK VV+QYG I ++L+S PEK+C QIG C F+
Sbjct: 61 LLAGPTSIITQINHAIGASGVVSQECKAVVAQYGKTILEMLLSQSQPEKICSQIGFCTFD 120
Query: 355 GAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELC 411
G V GIK+VV+ ++ S A CSACEMAVVW+QN+L+Q QT++++L+Y+NELC
Sbjct: 121 GTRGVDMGIKSVVDDNKSTSSGSVRDASCSACEMAVVWMQNKLRQNQTEDQILNYVNELC 180
Query: 412 DSLPNPMGESIIDCDRIPTMPNV 434
+ +P+PMGES+++C + TMP V
Sbjct: 181 ERIPSPMGESVVECSSLSTMPKV 203
>gi|347451476|gb|AEO94539.1| aspartate protease cathepsin D [Triatoma infestans]
Length = 393
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 196/326 (60%), Gaps = 28/326 (8%)
Query: 14 VLASCLLLPAS-------SNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSG 62
+LA LLL +S S+ L R+ L K RR + A +Y G GV G
Sbjct: 1 MLAHTLLLISSFCGVLLGSDNLVRVPLTKIQSARRF-FQDVGTAVEQLTLKYDTGNGVEG 59
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSIS 121
PL N++DAQY+G I +GSPPQ+F V+FDTGSSNLWVPS KC F+I+
Sbjct: 60 PFPE---------PLSNYLDAQYYGAITLGSPPQSFRVVFDTGSSNLWVPSKKCSRFNIA 110
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ H +Y S S TY G+ I YGSGS+SGF SQD + +G V V +Q F EA E
Sbjct: 111 CWVHRKYDSSNSKTYVPNGEKFAIQYGSGSLSGFLSQDQLSIGGVTVANQTFAEAVNEPG 170
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+GLG+ I+V P + NM +QG V VFSF+LNRDP A GGEI+F
Sbjct: 171 MVFVAAKFDGILGLGYDTISVDKVTPPFYNMYQQGAVQNPVFSFYLNRDPAAAVGGEIIF 230
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DP+ + G TYVPV K+GYWQF + +++ ++ C+GGC AI D+GTSL+AGPT
Sbjct: 231 GGSDPEKYVGDFTYVPVDKQGYWQFNMDKVIVNGKT--FCKGGCQAIADTGTSLIAGPTE 288
Query: 302 VVTEINHAIGGEGVVSAE----CKLV 323
V +N +GG + E C L+
Sbjct: 289 DVIALNKLLGGTPIAGGEYMISCDLI 314
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 62/97 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+L P GE +I CD IP +P + F IG F+L + YIL+ +C+SGF
Sbjct: 293 LNKLLGGTPIAGGEYMISCDLIPKLPKIDFVIGGNKFSLEGKDYILRVSAMGKTICLSGF 352
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
+ D+PPP GPLWILGDVF+G ++T FD G R+GFA
Sbjct: 353 LGLDVPPPHGPLWILGDVFIGRFYTEFDLGNNRVGFA 389
>gi|83319201|dbj|BAE53722.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 176/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G++ +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y S+KS+
Sbjct: 62 PLKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ G S D +GD+ ++ Q F E RE L F+ A+FDGI+G
Sbjct: 122 TYAKNGTQFEIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V + PV+DNMV QG+ + VFS +L+R+ GGE++FGG+D H+ G T
Sbjct: 182 LGYPQISVLNVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMNGVKVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FMAGE 305
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 372 VSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
V GD+A C+ A+ L T+E + +N + P GE I+ C IPT
Sbjct: 258 VKVGDNATFCNGGCAAIADTGTSLIAGPTEE--IHKLNVAIGAAPFMAGEYIVSCKSIPT 315
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
+P ++F + F L + Y+L+ + +C+SGF+ D+P P GPLWILGDVF+G Y+
Sbjct: 316 LPKINFNLNGNEFVLEGKDYVLQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGRYY 375
Query: 491 TVFDSGKLRIGFAEA 505
T+FD G R+GFAE+
Sbjct: 376 TIFDRGNDRVGFAES 390
>gi|157779726|gb|ABV71391.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 176/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G++ +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y S+KS+
Sbjct: 62 PLKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ G S D +GD+ ++ Q F E RE L F+ A+FDGI+G
Sbjct: 122 TYAKNGTQFEIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V + PV+DNMV QG+ + VFS +L+R+ GGE++FGG+D H+ G T
Sbjct: 182 LGYPQISVLNVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMNGVKVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FMAGE 305
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 372 VSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPT 430
V GD+A C+ A+ L T+E + +N + P GE I+ C IPT
Sbjct: 258 VKVGDNATFCNGGCAAIADTGTSLIAGPTEE--IHKLNVAIGAAPFMAGEYIVSCKSIPT 315
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
+P ++F + F L + Y+L+ + +C+SGF+ D+P P GPLWILGDVF+G Y+
Sbjct: 316 LPKINFNLNGNEFVLEGKDYVLQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGRYY 375
Query: 491 TVFDSGKLRIGFAEA 505
T+FD G R+GFAE+
Sbjct: 376 TIFDRGNDRVGFAES 390
>gi|340729556|ref|XP_003403066.1| PREDICTED: lysosomal aspartic protease-like [Bombus terrestris]
Length = 385
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 185/294 (62%), Gaps = 12/294 (4%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
L C L+ ++ L+RI LH ++ R KE V+ +
Sbjct: 5 LCLCALIALANADLQRI-------TLHKIDTVRKQFKEYNTEVYQAHMVQGNFPQPE--- 54
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKS 133
PL N++DAQY+G I IG+P Q+F VIFDTGSSNLWVPS KC+ + I+C H +Y + KS
Sbjct: 55 -PLSNYLDAQYYGVISIGTPSQDFKVIFDTGSSNLWVPSKKCHLTNIACKLHHKYDNTKS 113
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY + G I YGSGS+SG+ S D V V + V DQ F EA E + F+ A+FDGI+
Sbjct: 114 STYKKNGTDFAIRYGSGSLSGYLSTDVVNVAGLKVSDQTFAEALSEPGMAFVAAKFDGIL 173
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+ + +IAV PV+ NMV+QGLV + VFSF+LNR+PD + GGE++ GG DP H++G
Sbjct: 174 GMAYSKIAVDGVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPF 233
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
TYVPV +KGYWQF + I +G+Q +C+ GC AI D+GTSL+AGP V IN
Sbjct: 234 TYVPVDRKGYWQFRMDGIKVGSQHLAICQKGCEAIADTGTSLIAGPVKEVEAIN 287
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 61/86 (70%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E+++DC IP +P ++F +G K F L+ + Y+LK + VC+SGFM D+P P GPLW
Sbjct: 299 EAMVDCSSIPNLPTINFVLGGKSFPLTGKDYVLKVTQFGKTVCLSGFMGMDIPEPNGPLW 358
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGDVF+G Y+T FD G R+GFA+A
Sbjct: 359 ILGDVFIGRYYTEFDMGNNRVGFAKA 384
>gi|190576608|gb|ACE79095.1| cathepsin D precursor (predicted) [Sorex araneus]
Length = 405
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/294 (47%), Positives = 196/294 (66%), Gaps = 11/294 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS KC I+C+ H +Y S KS+T
Sbjct: 72 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVKCQLLDIACWLHHKYNSAKSST 131
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---GDVVVKDQVFIEATREGSLTFLLARFDGI 192
Y + G + +I+YGSGS+SG+ SQD V V + V Q+F EAT++ +TF+ A+FDGI
Sbjct: 132 YVKNGTAFDIHYGSGSLSGYLSQDTVSVPCNSGIQVARQLFGEATKQPGVTFIAAKFDGI 191
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+ + I+V + PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K+FKG
Sbjct: 192 LGMAYPRISVNNVPPVFDNLMQQKLVDKNIFSFYLNRDPTAQPGGELMLGGIDSKYFKGS 251
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY VT++ YWQ + I +GN T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 252 MTYHNVTRQAYWQVHMDQIDVGNGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGA 310
Query: 313 EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV 366
++ E + + DL L G ++ L + A VS G KT+
Sbjct: 311 VPLIQGEYIIPCEKLPDLPTVSLTLG------GKEYSLSPHDYALQVSQGGKTI 358
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 49/93 (52%), Positives = 67/93 (72%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE II C+++P +P VS T+G K ++LSP Y L+ +G +C+SGFM D+P
Sbjct: 310 AVPLIQGEYIIPCEKLPDLPTVSLTLGGKEYSLSPHDYALQVSQGGKTICLSGFMGMDIP 369
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 370 PPAGPLWILGDVFIGRYYTVFDREQNRVGLAEA 402
>gi|443723962|gb|ELU12180.1| hypothetical protein CAPTEDRAFT_225009 [Capitella teleta]
Length = 364
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 128/238 (53%), Positives = 171/238 (71%), Gaps = 1/238 (0%)
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGK 141
AQY+G I IG+P Q F V+FDTGSSNLWVPS KC ++ I+C+ H+RY S KS +Y + G
Sbjct: 23 AQYYGAITIGTPAQTFKVVFDTGSSNLWVPSQKCKWTDIACWLHNRYDSTKSTSYKKNGT 82
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+I YGSGS+SGF S D V +GDV V Q F EAT + +TF+ A+FDGI+G+G+ I+
Sbjct: 83 EFKIQYGSGSLSGFLSTDIVTIGDVSVTAQTFAEATAQPGITFVAAKFDGILGMGYPTIS 142
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V PV++NMV+Q VS VFSF+LNRDP A EGGE++ GG DPK+++G TY+PV+KK
Sbjct: 143 VDGVTPVFNNMVKQKSVSSPVFSFFLNRDPSASEGGELILGGSDPKYYEGNFTYLPVSKK 202
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GYWQF++ + + ST C+GGC AI D+GTSLLAGP+ V ++N +GG + E
Sbjct: 203 GYWQFKMDGMKLAGSSTSYCDGGCQAIADTGTSLLAGPSAEVQKLNQELGGTAIPGGE 260
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 62/88 (70%)
Query: 417 PMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG 476
P GE IIDC++IP +PN++F + K F L+ + YIL + +CISGF+ D+P P G
Sbjct: 257 PGGEYIIDCNKIPQLPNITFMLAGKPFTLTGKDYILAVKQLGKTICISGFIGLDVPAPLG 316
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAE 504
PLWILGDVF+G ++T FD G R+GFA+
Sbjct: 317 PLWILGDVFIGRFYTEFDFGNNRVGFAK 344
>gi|227336874|gb|ACP21315.1| aspartic proteinase precursor [Rhipicephalus microplus]
Length = 391
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 172/245 (70%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C+ H +Y S KS
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSKST 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + EI YGSGS+ G S D +G+V V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYQKNGTAFEIRYGSGSVKGVLSADMFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V PV+DNMV QG+ + VFS +L+R+ +GGE++FGG+D H+ G T
Sbjct: 182 LGYPRISVLGVPPVFDNMVAQGVAANPVFSVYLDRNTSDPQGGEVLFGGIDKAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G +T C GGC AI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGVTVGTNAT-FCNGGCEAIADTGTSLIAGPTAEIQKLNMAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FLAGE 305
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 79/137 (57%), Gaps = 3/137 (2%)
Query: 370 ENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
+ V+ G +A C+ A+ L T E + +N + P GE ++ C I
Sbjct: 256 DGVTVGTNATFCNGGCEAIADTGTSLIAGPTAE--IQKLNMAIGAAPFLAGEYMVSCKSI 313
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 488
P +PN++FT+ + F L + YI++ + +C+SGF+ D+P P GPLWILGDVF+G
Sbjct: 314 PKLPNITFTLNGQEFQLQGKDYIMQVSQAGIPMCLSGFIGLDVPAPMGPLWILGDVFIGR 373
Query: 489 YHTVFDSGKLRIGFAEA 505
Y+T+FD G R+GFA++
Sbjct: 374 YYTIFDRGNDRVGFAQS 390
>gi|262232673|gb|ACY38599.1| cathepsin D-like aspartic protease [Anisakis simplex]
Length = 453
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 130/244 (53%), Positives = 168/244 (68%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L+N+MDAQY+G I IG+PPQNF+VIFDTGSSNLWVPS KC ++ I+C+ H +Y + KS+T
Sbjct: 100 LRNYMDAQYYGVISIGTPPQNFTVIFDTGSSNLWVPSRKCKWTDIACWLHHKYDAAKSST 159
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G+ +I YG+GS+ GF S D V V ++ +DQ F EA E +TF+ A+FDGI+G+
Sbjct: 160 HKADGRELQIQYGTGSMKGFISLDTVCVAELCARDQPFAEAASEPGITFVAAKFDGILGM 219
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EIA + PV++ MV Q LV+E VF+FWLNR PD E GGEI FGG DPKHF Y
Sbjct: 220 AFPEIAALNVTPVFNTMVNQQLVAEPVFAFWLNRTPDDEIGGEITFGGTDPKHFVEPIVY 279
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PVT++ YWQF++ D + G T C GC AI D+GTSL+AGP V I IG E +
Sbjct: 280 APVTRRAYWQFKM-DKISGQDGTLACSDGCQAIADTGTSLIAGPKQQVQLIQKYIGAEPL 338
Query: 316 VSAE 319
S E
Sbjct: 339 FSGE 342
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/109 (44%), Positives = 70/109 (64%), Gaps = 4/109 (3%)
Query: 397 KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGE 456
KQ + + YI + P GE +I CD++P++P+VS IG K F+L+ Y+L +
Sbjct: 323 KQQVQLIQKYIG----AEPLFSGEYMIPCDKVPSLPDVSLVIGGKTFSLTSLDYVLNITK 378
Query: 457 GIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SGFM DLP G LWILGDVF+G ++TVFD G+ R+GFA+A
Sbjct: 379 AGKSICLSGFMGIDLPERVGQLWILGDVFIGRFYTVFDMGQERVGFAQA 427
>gi|122938524|gb|ABM69086.1| aspartic proteinase AspMD03 [Musca domestica]
Length = 390
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 183/284 (64%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ + + +AR K Y G + +R G PL N++DAQY+G I IG+PPQ
Sbjct: 26 RVPIQKIKSAR---KHFYEVGTELQQLRLTYGAGGVTPEPLSNYLDAQYYGPISIGTPPQ 82
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+ + I+C H++Y + KS T+ + G I+YGSGS+SG+
Sbjct: 83 DFKVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDATKSKTFKQNGTEFAIHYGSGSLSGY 142
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P + M EQ
Sbjct: 143 LSTDTVNIGGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQ 202
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S+ +FSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+K YWQ ++ +G+
Sbjct: 203 GLISQPIFSFYLNRDPKAPEGGEIIFGGSDPDHYTGDFTYLPVTRKAYWQIKMDSASMGD 262
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+GGC I D+GTSL+A P T IN AIGG ++ +
Sbjct: 263 LK--CAKGGCQVIADTGTSLIALPPSEATSINQAIGGTPIMGGQ 304
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P G+ ++ C+ IP +P + F +G K F L + Y+L+ + +C+SGF
Sbjct: 291 INQAIGGTPIMGGQYMVACEDIPKLPVIKFVLGGKTFELEGKDYVLRIAQMGKTICLSGF 350
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA A
Sbjct: 351 MGIDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFAIA 389
>gi|60678795|gb|AAX33732.1| Blo t allergen isoform 2 [Blomia tropicalis]
Length = 402
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 202/316 (63%), Gaps = 18/316 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
++ L LA+ LL+ A L RI L+K + + R E + A + H
Sbjct: 1 MKYSLVLVFLATILLVDAK---LHRIKLQKAQS-----HRKRFVEVESPIKLAYTTHHYH 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
+ + PL N+ DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPS KC F+ + C
Sbjct: 53 HWYNGFPE--PLSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLVCLL 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS++Y G S EI YG+GS++GF S D V V + +++Q F EA E +TF
Sbjct: 111 HHKYDSSKSSSYVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
+ A+FDGI+GLGF I+V D VP V+D+MV+QGLV VFSF+LNRD + + GGEI+FGG
Sbjct: 171 VFAKFDGILGLGFNTISV-DGVPTVFDSMVKQGLVQHPVFSFYLNRDTNGKVGGEIIFGG 229
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTG-----VCEGGCAAIVDSGTSLLAG 298
DP ++KG TY P+TK GYWQF++ IL+ N+S VCE GC AI D+GTSL+AG
Sbjct: 230 SDPAYYKGDFTYAPLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEG 314
P+ V +N A+G G
Sbjct: 290 PSDQVEHLNRALGAIG 305
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 53/85 (62%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G +++C I +PN+ F I F LSP+QY+++ E+CIS F++ P PL
Sbjct: 309 GIFVLNCSHINALPNIIFQINGVKFPLSPDQYVMRQSAMGKEICISSFISL---PANIPL 365
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y+T FD G R+GFA
Sbjct: 366 WILGDVFIGNYYTEFDYGNKRVGFA 390
>gi|157112486|ref|XP_001657556.1| cathepsin d [Aedes aegypti]
gi|205831550|sp|Q03168.2|ASPP_AEDAE RecName: Full=Lysosomal aspartic protease; Flags: Precursor
gi|108878060|gb|EAT42285.1| AAEL006169-PA [Aedes aegypti]
Length = 387
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V +G V V Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VPV+ NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ T C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 239 YLSVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IMNGEYMVDCSLI 309
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 368 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ ++V GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 251 KMDSVKVGDTEFCNNGCE-AIADTGTSLIAGPVSE--VTAINKAIGGTPIMNGEYMVDCS 307
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P +SF +G K F+L Y+L+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 308 LIPKLPKISFVLGGKSFDLEGADYVLRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 367
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA A
Sbjct: 368 GKYYTEFDMGNDRVGFATA 386
>gi|293230|gb|AAA29350.1| aspartic protease [Aedes aegypti]
Length = 387
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V +G V V Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VPV+ NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ T C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 239 YLSVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IMNGEYMVDCSLI 309
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/139 (41%), Positives = 81/139 (58%), Gaps = 4/139 (2%)
Query: 368 EKENVSAGDSAVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ ++V GD+ C+ CE A+ L E ++ IN+ P GE ++DC
Sbjct: 251 KMDSVKVGDTEFCNNGCE-AIADTGTSLIAGPVSE--VTAINKAIGGTPIMNGEYMVDCS 307
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P +SF +G K F+L Y+L+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 308 LIPKLPKISFVLGGKSFDLEGADYVLRVAQMGKTICLSGFMGIDIPPPNGPLWILGDVFI 367
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA A
Sbjct: 368 GKYYTEFDMGNDRVGFATA 386
>gi|3378673|emb|CAA08878.1| Cathepsin D [Podarcis siculus]
Length = 399
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 215/328 (65%), Gaps = 28/328 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKK----RRL------DLHSLNAARITRKERYM 55
LRS L +LAS ++ +S+ L RI LKK R + ++ LN K ++
Sbjct: 3 LRS---LILLASLVV---ASSALIRIPLKKFPSMRTIYTEYGTNVQDLNELGEMLKYKF- 55
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
GGAGV LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS K
Sbjct: 56 GGAGVGAPTPE---------ALKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVK 106
Query: 116 CYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C+ I+C H +Y S KS++Y + G I+YG+GS+SGF SQD+V +GD++V++Q+F
Sbjct: 107 CHLLDIACLLHHKYDSSKSSSYVKNGTDFAIHYGTGSLSGFLSQDHVTIGDLIVQNQLFG 166
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA ++ +TF+ A+FDGI+GL + +I+V +P +DN ++Q L+ + +FSF+LNRDP
Sbjct: 167 EAVKQPGITFIAAKFDGILGLAYPKISVDKVLPFFDNAMKQALMEKNLFSFYLNRDPKGA 226
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++FGGVDP+++ G T+V VT+K YWQ + + + N T VC+ GC AIVD+GTS
Sbjct: 227 TGGELLFGGVDPQYYTGDFTWVNVTRKAYWQIHMEKVDVDNGLT-VCKDGCEAIVDTGTS 285
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKL 322
L+ GPT + ++ AIG + ++ + L
Sbjct: 286 LITGPTDEIKQLQKAIGAKPIIKGQYML 313
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 82/140 (58%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+EK +V G + CE A+V L T E + + + + P G+ ++ CD
Sbjct: 260 MEKVDVDNGLTVCKDGCE-AIVDTGTSLITGPTDE--IKQLQKAIGAKPIIKGQYMLPCD 316
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ ++PNV+ +G K + L+P QY+LK +C+SGF D+PPP GPLWILGDVF+
Sbjct: 317 KLSSLPNVNLVLGGKSYALTPNQYVLKVTVQGETLCLSGFSGLDVPPPAGPLWILGDVFI 376
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G Y+TVFD +GFA+++
Sbjct: 377 GSYYTVFDRDNDAVGFAKSS 396
>gi|387915174|gb|AFK11196.1| cathepsin D1 [Callorhinchus milii]
Length = 394
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 136/289 (47%), Positives = 189/289 (65%), Gaps = 27/289 (9%)
Query: 63 VRHRLGDSD---EDILP------------------LKNFMDAQYFGEIGIGSPPQNFSVI 101
+R L DS ED+LP LKN++DAQY+GE+GIG+PPQ F+V+
Sbjct: 31 IRRALSDSGRSVEDLLPENKYKTDSPGINGPTPETLKNYLDAQYYGEVGIGTPPQPFTVV 90
Query: 102 FDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDN 160
FDTGSSNLWVPS C F I+C H +Y S KS++Y G I YGSGS+SG+ S+D
Sbjct: 91 FDTGSSNLWVPSVHCSMFDIACLLHHKYNSDKSSSYVRNGTKFAIRYGSGSLSGYLSKDT 150
Query: 161 VEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSE 220
V +G++ V+ Q+F EA ++ L F+ A+FDGI+G+G+ I+V +PV+DN+V Q LV
Sbjct: 151 VLIGNIKVQSQLFGEAIKQPGLAFIAAKFDGILGMGYPLISVDGVIPVFDNIVTQKLVPN 210
Query: 221 EVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
VFSF+LNR+PD+ GGE++ GG DPK++ G Y+ VT+K YWQ ++ ++ IG Q T +
Sbjct: 211 NVFSFYLNRNPDSLPGGELILGGTDPKYYTGDFHYLNVTRKAYWQVKMDEVSIGEQLT-L 269
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
C+GGCAAIVD+GTSL+ GP + + AIG ++ E CK V S
Sbjct: 270 CKGGCAAIVDTGTSLITGPAQEIKALQKAIGAIPLIQGEYLIDCKKVAS 318
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V+ + VS G+ +C A+V L +E + + + ++P GE +IDC
Sbjct: 256 VKMDEVSIGEQLTLCKGGCAAIVDTGTSLITGPAQE--IKALQKAIGAIPLIQGEYLIDC 313
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
++ ++P ++F +G +++ L+ EQY+L + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 314 KKVASLPAINFKLGGQVYTLTAEQYVLNETQAGHSICLSGFMGLDIPPPGGPLWILGDVF 373
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+T+FD K R+GFA++
Sbjct: 374 IGQYYTMFDREKDRVGFAKS 393
>gi|148229393|ref|NP_001085403.1| MGC82347 protein precursor [Xenopus laevis]
gi|48734644|gb|AAH72252.1| MGC82347 protein [Xenopus laevis]
Length = 401
Score = 271 bits (694), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 196/305 (64%), Gaps = 4/305 (1%)
Query: 16 ASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL 75
S LL P S+ L RI LKK H+L A KE G + +
Sbjct: 14 GSSLLHPGSA--LIRIPLKKFPSIRHTLTEAGGDAKELLGNGMPLKYSTGFPPNGKATPE 71
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
L N++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+
Sbjct: 72 ALMNYLDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSMFDIACWMHHKYDSSKSS 131
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I YG+GS+SG+ S+D V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G
Sbjct: 132 TYVKNGTEFAIQYGTGSLSGYLSKDTVTIGNLGIKEQLFGEAIKQPGVTFIAAKFDGILG 191
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V PV+DN++ Q LV VFSF+LNR+PD + GGE++ GG DPK++ G
Sbjct: 192 MAYPIISVDGVSPVFDNIMAQKLVESNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFH 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G+Q T +C+GGC AIVD+GTSL+ GP VT + AIG
Sbjct: 252 YLNVTRKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLITGPLEEVTALQKAIGAVP 310
Query: 315 VVSAE 319
++ +
Sbjct: 311 LIQGQ 315
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 87/140 (62%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + + GD +C A+V L +E ++ + + ++P G+ ++ C
Sbjct: 263 IHMDQLGVGDQLTLCKGGCEAIVDTGTSLITGPLEE--VTALQKAIGAVPLIQGQYMVQC 320
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D+IPT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++PPP GPLWILGDVF
Sbjct: 321 DKIPTLPVISLTLGGQVYTLTGEQYIMKVSQRGSTICLSGFMGLNIPPPAGPLWILGDVF 380
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y++VFD R+GFA++
Sbjct: 381 IGQYYSVFDRANDRVGFAKS 400
>gi|348530268|ref|XP_003452633.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 396
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 207/325 (63%), Gaps = 26/325 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV-- 63
+R++F L+V+A+ L +++ L RI LKK R R+E G G+ +
Sbjct: 1 MRTLF-LFVIAALAL---TNDALVRIPLKK----------FRSIRRELTDSGKGIEELVA 46
Query: 64 -RHRLG-----DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
+H L S P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS
Sbjct: 47 DKHSLKYNFGFPSSNGPTPETLKNYLDAQYYGEITLGTPPQKFTVVFDTGSSNLWVPSVH 106
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +F I+C+ H +Y S KS+TY + G S I YGSGS+SG+ SQD +GD+ V+ Q+F
Sbjct: 107 CSFFDIACWLHHKYNSAKSSTYVKNGTSFAIQYGSGSLSGYLSQDTCSIGDISVEKQIFG 166
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA ++ + F+ A+FDGI+G+ + I+V VPV+DNM+ Q V + VFSF+LNR+PD E
Sbjct: 167 EAIKQPGVAFIAAKFDGILGMAYPSISVDGVVPVFDNMMNQKKVEKNVFSFYLNRNPDTE 226
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++ GG DPK++ G Y ++++ YWQ + + +G+Q + +C+GGC AIVD+GTS
Sbjct: 227 PGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGMTVGSQLS-LCKGGCEAIVDTGTS 285
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAE 319
L+ GP V + AIG ++ E
Sbjct: 286 LITGPAAEVKALQKAIGAIPLIQGE 310
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +++C +IP++P ++F +G + + L+ EQY+L+ + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYLVNCSKIPSLPVITFNVGGQSYTLTGEQYVLQESQAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
>gi|403305561|ref|XP_003943328.1| PREDICTED: cathepsin D [Saimiri boliviensis boliviensis]
Length = 522
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 181/255 (70%), Gaps = 13/255 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSAKSST 95
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTF 184
Y + G S +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF
Sbjct: 96 YVKNGTSFDIHYGSGSLSGYLSQDTVLVPCRPSSSASALGGVKVERQVFGEATKQPGITF 155
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG
Sbjct: 156 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGT 215
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D K++KG +Y+ VT+K YWQ + + + + T +C+GGC AIVD+GTSL+ GP V
Sbjct: 216 DSKYYKGSLSYLNVTRKAYWQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVR 274
Query: 305 EINHAIGGEGVVSAE 319
E+ AIG ++ E
Sbjct: 275 ELQKAIGAVPLIQGE 289
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 27/41 (65%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK 453
++P GE +I C+++ T+P ++ +G K + LSPE Y LK
Sbjct: 282 AVPLIQGEYMIPCEKVSTLPTITLKLGGKDYKLSPEDYTLK 322
>gi|170649686|gb|ACB21270.1| cathepsin D preproprotein (predicted) [Callicebus moloch]
Length = 412
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGG--------AGVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQGMPTVPAGPVPEILKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSAKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVLVPCRSSSSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + + + T +C+GGC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKDYRLSPEDYTLKVSQAGKAICLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQAT 410
>gi|332264729|ref|XP_003281384.1| PREDICTED: cathepsin D [Nomascus leucogenys]
Length = 412
Score = 270 bits (691), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ S L E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPSSKYSQALPAVTEGPVPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGSVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|157644743|gb|ABV59077.1| cathepsin D [Lates calcarifer]
gi|396084116|gb|AFN84539.1| cathepsin D [Lates calcarifer]
Length = 396
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 205/318 (64%), Gaps = 12/318 (3%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+RS+F L V A+ L SS+ L RI LKK R L + TR E + A +++
Sbjct: 1 MRSLF-LVVFAALAL---SSDALVRIPLKKFRSIRRELTDSG-TRLEELL--ADKHSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSIS 121
G S P LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+
Sbjct: 54 NFGFPSSNGPTPETLKNYLDAQYYGDISLGTPPQTFSVVFDTGSSNLWVPSVHCSLLDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y S KS+TY + G + I YGSGS+SG+ S+D +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGYLSEDTCTIGDISVEKQLFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + I+V VPV+DN++ Q V + VFSF+LNR+PD GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV +T++ YWQ + ++++G Q + +C+GGC AIVD+GTSL+ GP+
Sbjct: 234 GGTDPKYYTGDFNYVNITRQAYWQIHMDELVVGTQLS-LCKGGCEAIVDTGTSLITGPSA 292
Query: 302 VVTEINHAIGGEGVVSAE 319
V + AIG ++ E
Sbjct: 293 EVKALQKAIGAIPLIQGE 310
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 69/93 (74%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +++CD++P++P ++F +G + ++L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVNCDKVPSLPVITFNVGGQSYSLTGEQYILKESQAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
>gi|184185542|gb|ACC68942.1| cathepsin D (predicted) [Rhinolophus ferrumequinum]
Length = 410
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 133/281 (47%), Positives = 191/281 (67%), Gaps = 11/281 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCNSALLGLGGVKVERQVFGEATKQPGITFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+++KG +Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V E+
Sbjct: 251 RYYKGALSYLNVTRKAYWQVHMDQVDVGNSLT-LCKAGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQ 347
AIG ++ E + + L +L G K+C +
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVSSLPEVILKLGGKDYKLCAE 350
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 80/141 (56%), Gaps = 3/141 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V + V G+S +C A A+V L +E + + + ++P GE +I C
Sbjct: 270 VHMDQVDVGNSLTLCKAGCEAIVDTGTSLIVGPVEE--VRELQKAIGAVPLIQGEYMIPC 327
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+++ ++P V +G K + L E Y LK + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 328 EKVSSLPEVILKLGGKDYKLCAEDYTLKVSQAGKTICLSGFMGMDIPPPGGPLWILGDVF 387
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G Y+TVFD + R+G AEA
Sbjct: 388 IGRYYTVFDRDENRVGLAEAT 408
>gi|25452827|sp|Q9DEX3.1|CATD_CLUHA RecName: Full=Cathepsin D; Flags: Precursor
gi|11037777|gb|AAG27733.1|AF312364_1 muscular cathepsin D [Clupea harengus]
Length = 396
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 200/308 (64%), Gaps = 12/308 (3%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDEDILP--LKNF 80
+S+ + RI LKK R +L+ + + ++ AG + ++H G S P LKN+
Sbjct: 15 TSDAIVRIPLKKFRSIRRTLSDSGLNVEQLL---AGTNSLQHNQGFPSSNAPTPETLKNY 71
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+GEIG+G+P Q F+V+FDTGSSNLW+PS C F+ I+C H +Y KS+TY +
Sbjct: 72 MDAQYYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIACLLHHKYNGAKSSTYVKN 131
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YGSGS+SG+ SQD+ +GD+VV+ Q+F EA ++ + F+ A+FDGI+G+ +
Sbjct: 132 GTEFAIQYGSGSLSGYLSQDSCTIGDIVVEKQLFGEAIKQPGVAFIAAKFDGILGMAYPR 191
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V PV+D M+ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YVPVT
Sbjct: 192 ISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVPVT 251
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
++ YWQ + + IG+Q T +C+ GC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 252 RQAYWQIHMDGMSIGSQLT-LCKDGCEAIVDTGTSLITGPPAEVRALQKAIGAIPLIQGE 310
Query: 320 ----CKLV 323
CK V
Sbjct: 311 YMIDCKKV 318
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 69/94 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +IDC ++PT+P +SF +G K ++L+ EQY+LK +G +C+SG M ++P
Sbjct: 303 AIPLIQGEYMIDCKKVPTLPTISFNVGGKTYSLTGEQYVLKESQGGKTICLSGLMGLEIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRESNRVGFAKST 396
>gi|197099366|ref|NP_001125492.1| cathepsin D precursor [Pongo abelii]
gi|55728229|emb|CAH90861.1| hypothetical protein [Pongo abelii]
Length = 412
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAMPAVTEGPVPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHRKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|427789779|gb|JAA60341.1| Putative cathepsin d isoform 1 protein [Rhipicephalus pulchellus]
Length = 391
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 125/245 (51%), Positives = 172/245 (70%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C+ H +Y S +S
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSRST 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + EI YGSGS+ G S D +G+V V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYQKNGTAFEIRYGSGSVKGVLSTDVFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V PV+DNMV QG+ ++ VFS +L+R+ +GGE++FGG+D H+ G T
Sbjct: 182 LGYPRISVLGVPPVFDNMVAQGVAAKPVFSVYLDRNASDPQGGEVLFGGIDKAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G +T C GGC AI D+GTSL+AGP+ + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGVTVGTNTT-FCNGGCEAIADTGTSLIAGPSEEIQKLNLAIGAAP 300
Query: 315 VVSAE 319
+ E
Sbjct: 301 FTAGE 305
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + P GE ++ C IP +PN++FT+ F L + Y+++ + +C+SGF
Sbjct: 292 LNLAIGAAPFTAGEYLVSCKSIPKLPNITFTLNGHDFQLQGKDYVMQVSQAGIPLCLSGF 351
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+P P GPLWILGDVF+G Y+T+FD G R+GFA++
Sbjct: 352 IGLDVPAPMGPLWILGDVFIGRYYTIFDRGNDRVGFAQS 390
>gi|195581342|ref|XP_002080493.1| GD10217 [Drosophila simulans]
gi|194192502|gb|EDX06078.1| GD10217 [Drosophila simulans]
Length = 324
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 129/248 (52%), Positives = 169/248 (68%), Gaps = 7/248 (2%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+
Sbjct: 1 MDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKN 60
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+
Sbjct: 61 GTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSS 120
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT
Sbjct: 121 ISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVT 180
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T IN IGG ++ +
Sbjct: 181 RKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQ 238
Query: 320 ----CKLV 323
C L+
Sbjct: 239 YLVSCDLI 246
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/139 (38%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
++ + S GD +C + L +E + IN+ P G+ ++ CD
Sbjct: 187 IKMDAASIGDLQLCKGGCQVIADTGTSLIAAPLEEA--TSINQKIGGTPIIGGQYLVSCD 244
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P + F +G K F L + YIL+ + +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 245 LIPQLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGFMGMDIPPPNGPLWILGDVFI 304
Query: 487 GVYHTVFDSGKLRIGFAEA 505
G Y+T FD G R+GFA+A
Sbjct: 305 GKYYTEFDMGNDRVGFADA 323
>gi|357627475|gb|EHJ77155.1| cathepsin D [Danaus plexippus]
Length = 358
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 173/253 (68%), Gaps = 8/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ F V+FDTGSSNLWVPS KC+++ I+C H++Y S KS
Sbjct: 31 PLSNYLDAQYYGPISIGNPPQTFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSKSK 90
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G I+YGSGS+SGF S D+V +G + VK Q F EA E L F+ A+FDGI+G
Sbjct: 91 SYHKNGTEFAIHYGSGSLSGFLSVDDVTLGGMTVKSQTFAEAMSEPGLAFVAAKFDGILG 150
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ F IAV PV+DNMV+QGLV+ VFSF+LNRD A +GGE+V GG DP H++G T
Sbjct: 151 MAFASIAVDGVTPVFDNMVKQGLVA-PVFSFYLNRDASAAQGGELVLGGSDPAHYRGPLT 209
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVP++K YWQF++ +L+ S C+ GC AI D+GTSL+ GP V +N IG
Sbjct: 210 YVPLSKDTYWQFQMDGVLVNGSS--FCKRGCQAIADTGTSLIGGPVEEVAALNAKIGATP 267
Query: 314 ---GVVSAECKLV 323
G + +C L+
Sbjct: 268 MAFGQFALDCSLI 280
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 62/99 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + P G+ +DC IP +P V+FTI ++ F L Y+L+ + VC+SGF
Sbjct: 259 LNAKIGATPMAFGQFALDCSLIPRLPPVTFTIANQKFTLEGTDYVLRVSQFGKTVCLSGF 318
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD RIGFA A
Sbjct: 319 MGLDIPPPAGPLWILGDVFIGRYYTEFDVANRRIGFAPA 357
>gi|268581165|ref|XP_002645565.1| C. briggsae CBR-ASP-4 protein [Caenorhabditis briggsae]
Length = 446
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 20/307 (6%)
Query: 28 LRRIGLKKR-RLDLHSLNAARITRKERYMGGAGVSGVRHR-------------LGDSDED 73
LR I LKK+ L L A R+ G ++H LG+ DE
Sbjct: 27 LRTISLKKQPTLRETLLQAGTFETFARHRHGYQKKFLKHHGNHHFDKYNGVKPLGEIDE- 85
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C H RY S+
Sbjct: 86 --LLRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWVPSKKCPFYDIACMLHHRYDSKS 143
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY E G+ I YG+GS+ GF S+D+V V + +DQ F EAT E +TF+ A+FDGI
Sbjct: 144 SSTYKEDGRKMAIQYGTGSMKGFISKDSVCVAGICAEDQPFAEATSEPGITFVAAKFDGI 203
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+ + EIAV PV++ + EQ V VFSFWLNR+PD+E GGEI FGG+D + +
Sbjct: 204 LGMAYPEIAVLGVQPVFNTLFEQKKVPSNVFSFWLNRNPDSELGGEITFGGIDARRYVEP 263
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY PVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP + I + IG
Sbjct: 264 ITYTPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGA 321
Query: 313 EGVVSAE 319
E ++ E
Sbjct: 322 EPLIKGE 328
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 315 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 374
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 375 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 413
>gi|417400425|gb|JAA47158.1| Putative cathepsin d [Desmodus rotundus]
Length = 409
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 180/252 (71%), Gaps = 10/252 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C +C+ H +Y S KS T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDFACWIHHKYNSGKSTT 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV--------GDVVVKDQVFIEATREGSLTFLLA 187
Y + G + +I+YGSGS+SG+ SQD V V V V+ QVF EAT++ +TF+ A
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNSAASGSGVKVERQVFGEATKQPGVTFIAA 190
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + I+V + +PV+DN+++Q LV E VFSF+LNRDP+A+ GGE++ GGVD K
Sbjct: 191 KFDGILGMAYPRISVNNVLPVFDNLMQQKLVDENVFSFYLNRDPNAQPGGELMLGGVDSK 250
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++KG TY+ VT+K YWQ + ++ +G+ T +C+ GC AIVD+GTSLL GP V E+
Sbjct: 251 YYKGPITYLNVTRKAYWQVHMDEVAVGSGLT-LCKEGCEAIVDTGTSLLVGPVEEVRELQ 309
Query: 308 HAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 KAIGAVPLIQGE 321
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 65/94 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ C+++ ++P V+ +G K + LS E Y LK +G +C+SGFM D+P
Sbjct: 314 AVPLIQGEYMVPCEKVSSLPEVTLKLGGKAYRLSAEDYTLKVSQGGKSICLSGFMGMDIP 373
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 374 PPAGPLWILGDVFIGRYYTVFDRDENRVGLAEAT 407
>gi|30584113|gb|AAP36305.1| Homo sapiens cathepsin D (lysosomal aspartyl protease) [synthetic
construct]
gi|60653917|gb|AAX29651.1| cathepsin D [synthetic construct]
Length = 413
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|123993743|gb|ABM84473.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
Length = 412
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y+LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYMLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|426366854|ref|XP_004050458.1| PREDICTED: cathepsin D [Gorilla gorilla gorilla]
Length = 412
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|213625094|gb|AAI69806.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 127/287 (44%), Positives = 187/287 (65%), Gaps = 25/287 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 32 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 91
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 92 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 151
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 152 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 211
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 212 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 271
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
T +C+GGC AIVD+GTSL+ GP VT + AIG ++ E ++
Sbjct: 272 LT-LCKGGCEAIVDTGTSLITGPVEEVTALQRAIGAIPLIRGEYMIL 317
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 261 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VTALQRAIGAIPLIRGEYMILC 318
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 319 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 378
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+TVFD R+GFA+A
Sbjct: 379 IGQYYTVFDRANDRVGFAKA 398
>gi|4503143|ref|NP_001900.1| cathepsin D preproprotein [Homo sapiens]
gi|115717|sp|P07339.1|CATD_HUMAN RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|29678|emb|CAA28955.1| cathepsin D [Homo sapiens]
gi|179948|gb|AAA51922.1| cathepsin D [Homo sapiens]
gi|181180|gb|AAB59529.1| preprocathepsin D [Homo sapiens]
gi|16740920|gb|AAH16320.1| Cathepsin D [Homo sapiens]
gi|30582659|gb|AAP35556.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|48146011|emb|CAG33228.1| CTSD [Homo sapiens]
gi|54697170|gb|AAV38957.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|61356567|gb|AAX41260.1| cathepsin D [synthetic construct]
gi|61362282|gb|AAX42193.1| cathepsin D [synthetic construct]
gi|119622866|gb|EAX02461.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622867|gb|EAX02462.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622868|gb|EAX02463.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|123994405|gb|ABM84804.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
gi|261860344|dbj|BAI46694.1| cathepsin D [synthetic construct]
Length = 412
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|60654209|gb|AAX29797.1| cathepsin D [synthetic construct]
Length = 413
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIEGE 324
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIEGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|60820131|gb|AAX36524.1| cathepsin D [synthetic construct]
gi|61363243|gb|AAX42359.1| cathepsin D [synthetic construct]
Length = 412
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIEGE 324
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIEGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|31559113|gb|AAP50847.1| cathepsin D [Bombyx mori]
gi|90992734|gb|ABE03014.1| aspartic protease [Bombyx mori]
Length = 385
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 133/292 (45%), Positives = 185/292 (63%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + AR E G + +R + + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLHRMKTARTHFHEV---GTELELLRLKYDVTGPSPEPLSNYLDAQYYGVISIGTPPQ 77
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+++ I+C H++Y SRKS +Y G I YGSGS+SGF
Sbjct: 78 SFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSRKSKSYVANGTQFAIQYGSGSLSGF 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V VG + V+ Q F EA E L F+ A+FDGI+G+ F IAV PV+DNMV Q
Sbjct: 138 LSTDDVTVGGLKVRRQTFAEAVSEPGLAFVAAKFDGILGMAFSTIAVDHVTPVFDNMVAQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + + +
Sbjct: 198 GLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNV-- 254
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
++ C GC+AI D+GTSL+AGP+ V +N A+G + + +C L+
Sbjct: 255 NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYAVDCSLI 306
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 69/128 (53%), Gaps = 2/128 (1%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
++V+ S C+ A+ L +KE + +N + G+ +DC IP
Sbjct: 250 DSVNVNASRFCAQGCSAIADTGTSLIAGPSKE--VEALNAAVGATAIAFGQYAVDCSLIP 307
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
+P V+FTI F L Y+L+ + VC+SGFMA D+P P GPLWILGDVF+G Y
Sbjct: 308 HLPRVTFTIAGNDFTLEGNDYVLRVAQMGHTVCLSGFMALDVPKPMGPLWILGDVFIGKY 367
Query: 490 HTVFDSGK 497
+T FD+G
Sbjct: 368 YTEFDAGN 375
>gi|112983576|ref|NP_001037351.1| cathepsin D precursor [Bombyx mori]
gi|66269351|gb|AAY43135.1| CathD [Bombyx mori]
Length = 384
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 134/292 (45%), Positives = 184/292 (63%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + AR E G + +R + + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLHRMKTARTHFHEV---GTELELLRLKYDVTGPSPEPLSNYLDAQYYGVISIGTPPQ 77
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+++ I+C H++Y SRKS TY G I YGSGS+SGF
Sbjct: 78 SFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSRKSKTYVANGTQFAIQYGSGSLSGF 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V VG + V+ Q F EA E L F+ A+FDGI+G+ F IAV PV+DNMV Q
Sbjct: 138 LSTDDVTVGGLKVRRQTFAEAVSEPGLAFVAAKFDGILGMAFSTIAVDHVTPVFDNMVAQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + + +
Sbjct: 198 GLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNV-- 254
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
++ C GC+AI D+GTSL+AGP+ V +N A+G + +C L+
Sbjct: 255 NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYVVDCSLI 306
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 75/138 (54%), Gaps = 2/138 (1%)
Query: 368 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
++V+ S C+ A+ L +KE + +N + G+ ++DC
Sbjct: 248 HMDSVNVNASRFCAQGCSAIADTGTSLIAGPSKE--VEALNAAVGATAIAFGQYVVDCSL 305
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
IP +P V+FTI F L Y+L+ + VC+SGFMA D+P P PLWILGDVF+G
Sbjct: 306 IPHLPRVTFTIAGNDFTLEGHDYVLRVAQFGHTVCLSGFMALDVPKPMAPLWILGDVFIG 365
Query: 488 VYHTVFDSGKLRIGFAEA 505
Y+T FD+G ++GFA A
Sbjct: 366 KYYTEFDAGNRQLGFAPA 383
>gi|148232796|ref|NP_001083566.1| napsin A aspartic peptidase precursor [Xenopus laevis]
gi|38197533|gb|AAH61685.1| MGC68767 protein [Xenopus laevis]
Length = 392
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 201/316 (63%), Gaps = 22/316 (6%)
Query: 19 LLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
LLL ++G+ RI LKK RR+ S+ A + GA ++ ++
Sbjct: 9 LLLFWDTDGVIRIPLKKFPSIRRMLSDSMTAEELK-------GATKENLQQQMFPEK--- 58
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
L N++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC +F +C+ H +Y+S+ S
Sbjct: 59 --LTNYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSVKCSFFDFACWVHKKYRSQNS 116
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY + + I YG+GS+SGF SQD V +G + V +Q F EA ++ + F+ A FDGI+
Sbjct: 117 STYRQNNTAFAIQYGTGSLSGFLSQDTVSIGSIEVANQTFAEAIKQPGIVFVFAHFDGIL 176
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+G+ +I+V VPV+DNM++Q L+ E VFSF+L+RDP A GGE++ GG DP ++ G
Sbjct: 177 GMGYPDISVDGVVPVFDNMMQQNLLEENVFSFYLSRDPMATVGGELILGGTDPNYYTGDF 236
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ VT+ YWQ + ++ + NQ +C+GGC AIVD+GTSL+ GP + ++ AIG
Sbjct: 237 HYLNVTRMAYWQIKADEVRVNNQLV-LCKGGCQAIVDTGTSLITGPKEEIRALHKAIGAF 295
Query: 314 GVVSAE----CKLVVS 325
+ + E CK + S
Sbjct: 296 PLFAGEYFINCKRIQS 311
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 71/106 (66%), Gaps = 1/106 (0%)
Query: 400 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 459
KE++ + +++ + P GE I+C RI ++P VSF +G +NL+ EQYILK +
Sbjct: 282 KEEIRA-LHKAIGAFPLFAGEYFINCKRIQSLPTVSFILGGVAYNLTGEQYILKISKFGH 340
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SGFM D+ PP GP+WILGDVF+G Y+TVFD R+GFA A
Sbjct: 341 TICLSGFMGLDIRPPAGPIWILGDVFIGQYYTVFDRDHDRVGFATA 386
>gi|56417363|gb|AAV90625.1| cathepsin D protein [Sus scrofa]
Length = 395
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 194/301 (64%), Gaps = 19/301 (6%)
Query: 37 RLDLHSLNAARITRKE------RYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGE 88
R+ LH + R T E + +S + + +P LKN+MDAQY+GE
Sbjct: 8 RIPLHKFTSIRRTMSEVGGPVENLIAKGPISKYSQGVPAVTQGPIPEVLKNYMDAQYYGE 67
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G + I+Y
Sbjct: 68 IGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTYVKNGTTFAIHY 127
Query: 148 GSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GSGS+SG++SQD V V G + V+ Q F EAT++ LTF+ A+FDGI+G+ +
Sbjct: 128 GSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFIAAKFDGILGMAYP 187
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K++KG Y V
Sbjct: 188 RISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDSKYYKGSLDYHNV 247
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++
Sbjct: 248 TRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQG 306
Query: 319 E 319
E
Sbjct: 307 E 307
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 300 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 359
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+G AEAA
Sbjct: 360 PPGGPLWILGDVFIGRYYTVFDRDLNRVGLAEAA 393
>gi|397490270|ref|XP_003816129.1| PREDICTED: cathepsin D [Pan paniscus]
Length = 603
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPSVTAGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 59/87 (67%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLR 499
PP GPLWILGDVF+G Y+TVFD R
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNR 403
>gi|54020914|ref|NP_001005701.1| napsin A aspartic peptidase precursor [Xenopus (Silurana)
tropicalis]
gi|49522956|gb|AAH75272.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 402
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 202/314 (64%), Gaps = 18/314 (5%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP-- 76
LLL +++ L RI LKK +L+ + +T++E GA ++ + +P
Sbjct: 9 LLLVWTTDALIRIPLKKFPSIRRTLSDS-MTKEE--FNGATKEFLKQQ-------TIPEK 58
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC +F +C+ H +Y+S+ S+T
Sbjct: 59 LTNYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSIKCSFFDFACWLHKKYRSKDSST 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + I YG+GS+SGF SQD V VG + V +Q F EA ++ + F+ A FDGI+G+
Sbjct: 119 YQQNNTEFAIQYGTGSLSGFLSQDTVTVGSIDVANQTFAEAVKQPGIVFVFAHFDGILGM 178
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ I+V VPV+DNM+EQ L+ E VFSF+L+RDP A GGE+V GG DP ++ G Y
Sbjct: 179 GYPNISVDGVVPVFDNMMEQKLLEENVFSFYLSRDPMAMVGGELVLGGTDPNYYTGDFHY 238
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+ YWQ + ++ + NQ +C+GGC AIVD+GTSL+ GP + ++ AIG +
Sbjct: 239 LNVTRMAYWQIKADEVRVANQLV-LCKGGCQAIVDTGTSLITGPREEIRALHKAIGAFPL 297
Query: 316 VSAE----CKLVVS 325
S E CK + S
Sbjct: 298 FSGEYFVNCKRIQS 311
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++ + P GE ++C RI ++P VSF +G +NL+ EQY+LK + +C+SGF
Sbjct: 288 LHKAIGAFPLFSGEYFVNCKRIQSLPTVSFILGGVAYNLTGEQYVLKISKFGHTLCLSGF 347
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+ PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 348 MGLDIRPPAGPLWILGDVFIGQYYTVFDRDNDRVGFATA 386
>gi|339237491|ref|XP_003380300.1| lysosomal aspartic protease [Trichinella spiralis]
gi|316976887|gb|EFV60084.1| lysosomal aspartic protease [Trichinella spiralis]
Length = 405
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 128/244 (52%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MDAQY+GEI IG+PPQNF+VIFDTGSSNLWVPSSKC +F I+C+ H+RY S+KS+T
Sbjct: 73 LHNYMDAQYYGEISIGTPPQNFTVIFDTGSSNLWVPSSKCSFFDIACWLHNRYNSKKSST 132
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ EI YGSGS+ GF S+D V + + VK Q F EAT + L F+ A FDGI+G+
Sbjct: 133 YEASGETIEIRYGSGSMRGFKSKDTVCIASLCVKGQGFAEATSQPGLAFIFAHFDGILGM 192
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAVG PV+ M+EQ L+SE VF+FWLNR+P+ + GG I FG VD K++ G T+
Sbjct: 193 AFPSIAVGGIQPVFQAMIEQNLISEAVFAFWLNRNPEDDLGGLISFGTVDEKYYIGNITW 252
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+ + YW+F + I +G++ G C GC I D+GTSL+AGP V + AIG + +
Sbjct: 253 VPLVNQRYWEFNMETIKVGDEHVG-CIDGCTTIADTGTSLIAGPKDEVERLQEAIGAKPL 311
Query: 316 VSAE 319
+ +
Sbjct: 312 IMGQ 315
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 45/100 (45%), Positives = 66/100 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ E + P MG+ + C+ + ++PNV IG ++F+L PE Y+L+ + +C+SGF
Sbjct: 302 LQEAIGAKPLIMGQYYVSCNEVDSLPNVQMKIGGRMFDLKPEDYVLRVKQMGQSICLSGF 361
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M DLPP G LWILGD+F+G+Y+TVFD G R+GFA A
Sbjct: 362 MGLDLPPQVGKLWILGDIFIGLYYTVFDVGNSRLGFANAT 401
>gi|431920733|gb|ELK18506.1| Napsin-A [Pteropus alecto]
Length = 760
Score = 267 bits (683), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 134/291 (46%), Positives = 184/291 (63%), Gaps = 14/291 (4%)
Query: 47 RITRKERYMGGAGVSGVRH--------RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNF 98
RI + Y G ++ +R R+GD +PL NFM+AQY+GEIG+G+PPQNF
Sbjct: 18 RIPLRRVYTGRRTLNPLRRWGNPEEPLRMGDPKFISVPLSNFMNAQYYGEIGLGTPPQNF 77
Query: 99 SVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFS 157
SV+FDTGSSNLWVPS +CYF S+ C+FH R+ S+ S+++ G I YG+G +SG S
Sbjct: 78 SVVFDTGSSNLWVPSKRCYFFSLPCWFHHRFDSKASSSFKPNGTKFAIQYGTGRLSGVLS 137
Query: 158 QDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGL 217
+D + +G + F EA E SLTF+ ARFDGI+GLGF +AV P D +V QGL
Sbjct: 138 EDKLTIGGITGASVTFGEALWEPSLTFIFARFDGILGLGFPALAVEGVRPPLDMLVAQGL 197
Query: 218 VSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQS 277
+ + VFSF+L RDP+ +GGE+V GG DP H+ TYVPVT YWQ + + +G
Sbjct: 198 LDKPVFSFYLTRDPEEADGGELVLGGSDPTHYIPPLTYVPVTVPAYWQIHMERVQVGTGL 257
Query: 278 TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVV 324
T +C GCAAI+D+GTSL+ GP+ + ++ AIGG ++ E C L+
Sbjct: 258 T-LCAHGCAAILDTGTSLITGPSEEIRALHRAIGGISLLVGEYLIQCSLIT 307
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 38/81 (46%), Positives = 52/81 (64%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+GE +I C I +P VSF +G FNL+ + Y+++ G VC+SGF + D+PP GP
Sbjct: 296 VGEYLIQCSLITELPPVSFNLGGVWFNLTAQDYVIQIARGGVRVCLSGFRSLDMPPSLGP 355
Query: 478 LWILGDVFMGVYHTVFDSGKL 498
LWILGDVF+ Y VFD G +
Sbjct: 356 LWILGDVFLRSYVPVFDRGNM 376
>gi|17549909|ref|NP_510191.1| Protein ASP-4 [Caenorhabditis elegans]
gi|3879202|emb|CAA90633.1| Protein ASP-4 [Caenorhabditis elegans]
Length = 444
Score = 267 bits (683), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 129/244 (52%), Positives = 172/244 (70%), Gaps = 3/244 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLW+PS KC ++ I+C H RY S+ S+T
Sbjct: 86 LRNYMDAQYFGTISIGTPAQNFTVIFDTGSSNLWIPSKKCPFYDIACMLHHRYDSKSSST 145
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G+ I YG+GS+ GF S+D+V V V +DQ F EAT E +TF+ A+FDGI+G+
Sbjct: 146 YKEDGRKMAIQYGTGSMKGFISKDSVCVAGVCAEDQPFAEATSEPGITFVAAKFDGILGM 205
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ EIAV PV++ + EQ V +FSFWLNR+PD+E GGEI FGG+D + + TY
Sbjct: 206 AYPEIAVLGVQPVFNTLFEQKKVPSNLFSFWLNRNPDSEIGGEITFGGIDSRRYVEPITY 265
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP + I + IG E +
Sbjct: 266 VPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPL 323
Query: 316 VSAE 319
+ E
Sbjct: 324 IKGE 327
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/99 (49%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 314 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 373
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M DLP G LWILGDVF+G Y++VFD + R+GFA+A
Sbjct: 374 MGIDLPERVGELWILGDVFIGRYYSVFDFDQNRVGFAQA 412
>gi|290561455|gb|ADD38128.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 267 bits (683), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 138/284 (48%), Positives = 190/284 (66%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ +H +AR K Y G+ + +R R PL N++DAQY+G I IGSPPQ
Sbjct: 20 RVPVHKFQSAR---KHFYEVGSSIQLIRKRWNTVGAHPEPLSNYLDAQYYGPITIGSPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLW+PS C+ + I+C H +Y KS+TY G I YGSGS+SGF
Sbjct: 77 SFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSSTYVANGTEFAIQYGSGSLSGF 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G+V + Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +Q
Sbjct: 137 LSSDSVSMGEVEIGSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +
Sbjct: 197 GLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 257 KS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 69/99 (69%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+L P GE + +C+ IP +P ++FTIG + F LS E Y+++ + VC+SGF
Sbjct: 285 LNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLSGEDYVMQITQFGKTVCLSGF 344
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+P P GP+WILGDVF+G Y+TVFD GK R+GFA++
Sbjct: 345 MGLDVPEPMGPIWILGDVFIGRYYTVFDMGKDRVGFAQS 383
>gi|311324976|gb|ADP89523.1| cathepsin D [Miichthys miiuy]
Length = 396
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 201/319 (63%), Gaps = 16/319 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
LL SVF L +++ L RI LKK R L + R E + A ++
Sbjct: 4 LLLSVFAALAL--------TNDALVRIPLKKFRSIRRELTDSG-KRAEELL--ADRHSLK 52
Query: 65 HRLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
+ G S P LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 53 YNFGFPSSNGPTPELLKNYLDAQYYGEIGLGTPPQLFTVVFDTGSSNLWVPSVHCQILDI 112
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y S KS+TY + G + I YGSGS+SGF SQD +GD+ V++Q+F EAT++
Sbjct: 113 ACLLHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGFLSQDTCTIGDISVQNQLFGEATKQP 172
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+ A+FDGI+G+ + I+V PV+DN++ Q V + VFSF+LNR+PD + GGE++
Sbjct: 173 GVAFIAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELL 232
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G YV +T++ YWQ + + +G+Q T +C+ GC AIVD+GTSL+ GP+
Sbjct: 233 LGGTDPKYYSGDFHYVNITRQAYWQIHVDGMAVGSQLT-LCKSGCEAIVDTGTSLITGPS 291
Query: 301 PVVTEINHAIGGEGVVSAE 319
V + AIG ++ E
Sbjct: 292 AEVRSLQKAIGAIPLIQGE 310
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ CD+IP++P ++F +G + ++L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVSCDKIPSLPVITFNVGGQSYSLTGEQYILKETQAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395
>gi|332514729|gb|AEE69372.1| cathepsin D [Fasciola gigantica]
Length = 429
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/303 (47%), Positives = 188/303 (62%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + RI L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRIKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG V V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+K + +NE P G +++CD+I + ++F + + L + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVLEAKDYIMKLSNMGRT 343
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|346469557|gb|AEO34623.1| hypothetical protein [Amblyomma maculatum]
Length = 391
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/245 (51%), Positives = 174/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y ++KS+
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYYAKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS++G S D +GDV V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYVKNGTKFEIRYGSGSVTGELSTDVFGLGDVRVQSQTFAEILHESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V PV+DNMV QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +
Sbjct: 182 LGYPQISVLGVPPVFDNMVAQGVATKPVFSVYLDRNATDPNGGEVLFGGIDEAHYTGNIS 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGC AI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGLKVGDNAT-FCNGGCEAIADTGTSLIAGPTEEIQKLNLAIGAAP 300
Query: 315 VVSAE 319
+ E
Sbjct: 301 FTAGE 305
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 78/137 (56%), Gaps = 3/137 (2%)
Query: 370 ENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRI 428
+ + GD+A C+ A+ L T+E + +N + P GE ++ C I
Sbjct: 256 DGLKVGDNATFCNGGCEAIADTGTSLIAGPTEE--IQKLNLAIGAAPFTAGEYLVSCKSI 313
Query: 429 PTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGV 488
PT+P ++F + F L + YIL+ + +C+SGF+ D+P P GPLWILGDVF+G
Sbjct: 314 PTLPKITFNLNGHEFVLEGKDYILQVSQAGIPLCLSGFIGLDVPAPLGPLWILGDVFIGR 373
Query: 489 YHTVFDSGKLRIGFAEA 505
Y+T+FD G R+GFAE+
Sbjct: 374 YYTIFDRGNDRVGFAES 390
>gi|237874218|ref|NP_001153867.1| cathepsin D [Acyrthosiphon pisum]
Length = 393
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 187/295 (63%), Gaps = 7/295 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +++ R + R G R+ +++ PL N++DAQY+G I IG+PPQ
Sbjct: 30 RVKLHKIDSVRNQLRGRTSNLFGFVQRRYDPLNAE----PLSNYLDAQYYGPITIGTPPQ 85
Query: 97 NFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+V+FDTGSSNLWVPS +C +I+C H++Y KS TY + G I+YGSGS+SG+
Sbjct: 86 PFNVVFDTGSSNLWVPSKQCSVLNIACMLHNKYNMAKSTTYXKNGTEFSIHYGSGSLSGY 145
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + + + +Q F EA +E L F+ A+FDGI+GLG+ IAV VP + NMV Q
Sbjct: 146 LSTDVMSMDGTSIVNQTFAEAIQEPGLAFVAAKFDGILGLGYNTIAVDGVVPPFYNMVNQ 205
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G++ +FSF+LNRDP + GGEI+FGG DP+ + G TYVPVT+ GYWQF L ++++GN
Sbjct: 206 GIIKSAIFSFYLNRDPSSTPGGEIIFGGSDPEKYTGPFTYVPVTRHGYWQFGLDEVIVGN 265
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
T + G AI D+GTSL+AGP + +IN +GG + E + Q +L
Sbjct: 266 --TSIVSGALQAIADTGTSLIAGPVDNIKQINELLGGTAIPGGEYIIACDQIDNL 318
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/135 (41%), Positives = 74/135 (54%), Gaps = 2/135 (1%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+ V G++++ S A+ L + INEL P GE II CD+I
Sbjct: 259 DEVIVGNTSIVSGALQAIADTGTSLIAGPVDN--IKQINELLGGTAIPGGEYIIACDQID 316
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
+P +SF IG F L + YILK + +C+SGFM D+PPP GPLWILGDVF+G Y
Sbjct: 317 NLPVLSFVIGSTTFKLEGKDYILKVSQFGKTICLSGFMGIDIPPPNGPLWILGDVFIGRY 376
Query: 490 HTVFDSGKLRIGFAE 504
+T FD R+GFA
Sbjct: 377 YTEFDLENNRVGFAN 391
>gi|148231809|ref|NP_001085308.1| cathepsin D precursor [Xenopus laevis]
gi|62739292|gb|AAH94178.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 186/287 (64%), Gaps = 25/287 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 32 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 91
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 92 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 151
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 152 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 211
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 212 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 271
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
T +C+GGC AIVD+GTSL+ GP V + AIG ++ E ++
Sbjct: 272 LT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGEYMIL 317
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 261 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VAALQRAIGAIPLIRGEYMILC 318
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 319 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 378
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+TVFD R+GFA+A
Sbjct: 379 IGQYYTVFDRANDRVGFAKA 398
>gi|49522906|gb|AAH75134.1| LOC443721 protein, partial [Xenopus laevis]
Length = 398
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 186/287 (64%), Gaps = 25/287 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 31 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 90
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 91 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 150
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 151 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 210
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 211 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 270
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
T +C+GGC AIVD+GTSL+ GP V + AIG ++ E ++
Sbjct: 271 LT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGEYMIL 316
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 86/140 (61%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +S GD +C A+V L +E ++ + ++P GE +I C
Sbjct: 260 IRMDQLSVGDQLTLCKGGCEAIVDTGTSLITGPVEE--VAALQRAIGAIPLIRGEYMILC 317
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D IP++P +SFT G ++++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDVF
Sbjct: 318 DNIPSLPVISFTFGGRVYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDVF 377
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+TVFD R+GFA+A
Sbjct: 378 IGQYYTVFDRANDRVGFAKA 397
>gi|339460405|gb|AEJ76922.1| aspartic protease [Dimocarpus longan]
Length = 222
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 131/215 (60%), Positives = 166/215 (77%), Gaps = 6/215 (2%)
Query: 10 FCLWVLASCLLLP----ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS--GV 63
F + + S LL P A +GL RIGLKK++LD S + +I E A + +
Sbjct: 8 FWVALFLSLLLSPTAFSAPKDGLVRIGLKKKKLDQISRVSGQINSNEGEAIRAPIKKYNL 67
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
R LGDSD DI+ LKN+MDAQYFGE+GIG+P Q F+VIFDTGSSNLWVPSSKCYFS++CY
Sbjct: 68 RSNLGDSDTDIVSLKNYMDAQYFGEVGIGTPSQTFTVIFDTGSSNLWVPSSKCYFSVACY 127
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+Y+S +S+TY + G S I YG+G++SGFFSQD+V+VGD+ VK+Q FIEAT+E S+T
Sbjct: 128 FHSKYRSTQSSTYKKNGTSAAIQYGTGAVSGFFSQDSVKVGDLFVKNQDFIEATKEASIT 187
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
FL A+FDGI+GLGF+EI+VG+AVPVWDNMV QGLV
Sbjct: 188 FLAAKFDGILGLGFQEISVGNAVPVWDNMVNQGLV 222
>gi|26354406|dbj|BAC40831.1| unnamed protein product [Mus musculus]
Length = 445
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 47/70 (67%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILG 482
PP GPLWIL
Sbjct: 375 PPSGPLWILA 384
>gi|74198620|dbj|BAE39786.1| unnamed protein product [Mus musculus]
Length = 410
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSGQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DNM++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNMMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVGEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|355566182|gb|EHH22561.1| Cathepsin D [Macaca mulatta]
Length = 450
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 203/330 (61%), Gaps = 33/330 (10%)
Query: 18 CLLLPASSNGLRRIGLKKR------RLDLHSLNAARITRKERYMGG--------AGVSGV 63
C +L ASS RR L R+ LH + R T E MGG +S
Sbjct: 38 CAMLAASSG--RREDLPDMPQPLVDRIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKY 93
Query: 64 RHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
+ E +P LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 94 SQAMPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDI 153
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV-----------GDVVVK 169
+C+ H +Y S KS+TY + G S I+YGSGS+SG+ SQD V V G V V+
Sbjct: 154 ACWLHHKYNSDKSSTYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVE 213
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
QVF EA ++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNR
Sbjct: 214 RQVFGEAIKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNR 273
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
DP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L + + + T +C+ GC AIV
Sbjct: 274 DPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIV 332
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+ GP V E+ AIG ++ E
Sbjct: 333 DTGTSLMVGPVDEVRELQKAIGAVPLIQGE 362
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 355 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 414
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 415 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 448
>gi|281182624|ref|NP_001162374.1| cathepsin D precursor [Papio anubis]
gi|160904227|gb|ABX52210.1| cathepsin D (predicted) [Papio anubis]
Length = 412
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 194/305 (63%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGG--------AGVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHRKYNSDKSSTYVKNGTSFAI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|386869594|ref|NP_001247483.1| cathepsin D precursor [Macaca mulatta]
gi|67971186|dbj|BAE01935.1| unnamed protein product [Macaca fascicularis]
gi|384939322|gb|AFI33266.1| cathepsin D preproprotein [Macaca mulatta]
Length = 412
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 194/305 (63%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGG--------AGVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSSTYVKNGTSFAI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|205289916|gb|ACI02330.1| aspartic protease 1 [Uncinaria stenocephala]
Length = 447
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 200/326 (61%), Gaps = 24/326 (7%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAAR-ITRKERYMGG---------------- 57
LA C L AS + RR + R + S++ +R T +ER +G
Sbjct: 8 LALCTLAVASIH--RRTFHQPARRHVQSVSLSRQPTLRERLLGTGSWEDYQKQRYHYQRK 65
Query: 58 --AGVSGVR-HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
A +G + +L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS
Sbjct: 66 LLAKYAGNKASKLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSR 125
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++ I+C H RY S S+TY E G+ I YG+GS+ GF S+DNV + + ++Q F
Sbjct: 126 KCPFYDIACMLHHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPF 185
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EAT E LTF+ A+FDGI+G+ F EI+V PV+ +EQ V +F+FWLNR+PD+
Sbjct: 186 AEATSEPGLTFIAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPMFAFWLNRNPDS 245
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
E GGEI GG+DP+ + T+ PVT++GYWQF++ D++ G S+ C GC AI D+GT
Sbjct: 246 ELGGEITLGGMDPRRYVEPLTWTPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGT 304
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAE 319
SL+AGP V I IG E ++ E
Sbjct: 305 SLIAGPKAQVEAIQKFIGAEPLMRGE 330
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 65/99 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE +I CD++P++P++SF IG + F L E Y+L G +C+SGF
Sbjct: 317 IQKFIGAEPLMRGEYMIPCDKVPSLPDLSFVIGGQTFTLKGEDYVLTVKAGGKSICLSGF 376
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 377 MGMDFPERIGELWILGDVFIGKYYTVFDVGQARLGFAQA 415
>gi|116282368|gb|ABJ97285.1| cathepsin D-like aspartic protease [Fasciola hepatica]
Length = 429
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 188/303 (62%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + RI L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRIKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQVNGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG + V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLITVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 59/103 (57%), Gaps = 2/103 (1%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+K + +NE P G +++ D+I + ++F + + + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNWDKIDNLSAITFVVAGRKMVFEAKDYIMKLSNMGRT 343
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|225713714|gb|ACO12703.1| Lysosomal aspartic protease precursor [Lepeophtheirus salmonis]
gi|290462953|gb|ADD24524.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/284 (48%), Positives = 189/284 (66%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ +H +AR K Y G+ + +R R PL N++DAQY+G I IGSPPQ
Sbjct: 20 RVPVHKFQSAR---KHFYEVGSSIQLIRKRWNTVGAHPEPLSNYLDAQYYGPITIGSPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLW+PS C+ + I+C H +Y KS+TY G I YGSGS+SGF
Sbjct: 77 SFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSSTYVANGTEFAIQYGSGSLSGF 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G V + Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +Q
Sbjct: 137 LSSDSVSMGGVEIGSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +
Sbjct: 197 GLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 257 KS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 69/99 (69%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+L P GE + +C+ IP +P ++FTIG + F LS E Y+++ + VC+SGF
Sbjct: 285 LNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLSGEDYVMQITQFGKTVCLSGF 344
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+P P GP+WILGDVF+G Y+TVFD GK R+GFA++
Sbjct: 345 MGLDVPEPMGPIWILGDVFIGRYYTVFDMGKDRVGFAQS 383
>gi|395851770|ref|XP_003798425.1| PREDICTED: cathepsin D [Otolemur garnettii]
Length = 405
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 182/250 (72%), Gaps = 8/250 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPSSKC I+C+ H+RY S +S T
Sbjct: 69 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSSKCKMLDIACWLHNRYHSDRSTT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG------DVVVKDQVFIEATREGSLTFLLARF 189
Y + G + +I+YGSGS+SG+ SQD V + +V V+ QVF EAT++ +TF+ A+F
Sbjct: 129 YVKNGTAFDIHYGSGSLSGYLSQDTVLMPCKSVSVNVKVEKQVFGEATKQPGITFIAAKF 188
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + I+V + +P +DN++EQ LV + +FSF+LNRDP+A+ GGE++ GGVD K++
Sbjct: 189 DGILGMAYPRISVDNVLPFFDNLMEQKLVEKNIFSFYLNRDPNAQPGGELMLGGVDSKYY 248
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G +Y+ VT+K YW+ + + + + T +C+GGC AIVD+GTSL+ GP V E+ A
Sbjct: 249 TGSLSYLNVTRKAYWEVHMEQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKA 307
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 308 IGAIPLIQGE 317
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 84/140 (60%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V++G + CE A+V L E + + + ++P GE +I C+
Sbjct: 267 MEQVEVASGLTLCKGGCE-AIVDTGTSLMVGPVDE--VRELQKAIGAIPLIQGEYMIPCE 323
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ ++P+V+ + K + LS E Y LK +G +C+SGFM D+P P GPLWI+GDVF+
Sbjct: 324 KVSSLPSVTLKLAGKDYTLSGEDYTLKVSQGGKTICLSGFMGMDIPKPVGPLWIIGDVFI 383
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G ++TVFD K R+GFA+AA
Sbjct: 384 GCFYTVFDREKDRVGFAKAA 403
>gi|74207446|dbj|BAE30902.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYMKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANAV 408
>gi|205363469|gb|ACI04164.1| cathepsin D-like aspartic protease precursor [Fasciola hepatica]
Length = 429
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 187/303 (61%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + R L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRTKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG V V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 61/103 (59%), Gaps = 2/103 (1%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+K + +NE P G +++CD+I + ++F + + L + YI+K
Sbjct: 284 KKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVLEAKDYIMKLSNMGRT 343
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
VC++ F+ D+P GPLWILGDVF+G Y+TVFD G+ RIGFA
Sbjct: 344 VCVTSFIGIDVP--VGPLWILGDVFIGSYYTVFDMGQKRIGFA 384
>gi|122114359|gb|AAY42145.2| cathepsin D [Sus scrofa]
Length = 410
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 177/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + I+YGSGS+SG+ SQD V V G + V+ Q F EAT++ LTF+
Sbjct: 131 YVKNGTTFAIHYGSGSLSGYLSQDTVSVPCNSASSGVGGIKVERQTFGEATKQPGLTFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ G E++ GG+D
Sbjct: 191 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGSELMLGGIDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGSLDYHNVTRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVF R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFGRDLNRVGSAEAA 408
>gi|224460527|gb|ACN43675.1| cathepsin D [Paralichthys olivaceus]
Length = 396
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGDIALGTPPQTFSVVFDTGSSNLWVPSVHCSILDIACWLHHKYNSAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SGF SQD +GD+ V+ QVF EAT++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTTFAIQYGSGSLSGFLSQDTCTIGDLTVEKQVFGEATKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V E VFSF+LNR+PD GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVAPVFDNIMSQKKVEENVFSFYLNRNPDMAPGGELLLGGTDPKYYSGDFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ +G + G+Q T +C+ GC AIVD+GTSL+ GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIHMGGMGAGSQLT-LCKDGCEAIVDTGTSLITGPSAEVKALQKAIGAVPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ CD+IP++P ++F +G + ++L+ +QY+LK + +C+SGFM D+P
Sbjct: 303 AVPLIQGEYMVSCDKIPSLPVITFNLGGQSYSLTGDQYVLKVSQAGKVICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRENNRVGFAKS 395
>gi|71043798|ref|NP_001020792.1| cathepsin D precursor [Canis lupus familiaris]
gi|85540968|sp|Q4LAL9.1|CATD_CANFA RecName: Full=Cathepsin D; Flags: Precursor
gi|70561318|emb|CAJ14973.1| cathepsin D [Canis lupus familiaris]
Length = 410
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 125/253 (49%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q F EAT++ +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSALSGLAGIKVERQTFGEATKQPGITFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG +Y+ VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAIVDTGTSLIVGPVDEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 82/141 (58%), Gaps = 3/141 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E V G S +C A+V L E + + + ++P GE +I C
Sbjct: 270 VHMEQVDVGSSLTLCKGGCEAIVDTGTSLIVGPVDE--VRELQKAIGAVPLIQGEYMIPC 327
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+++ T+P+V+ +G K++ LS E Y LK +G +C+SGFM D+PPP GPLWILGDVF
Sbjct: 328 EKVSTLPDVTLKLGGKLYKLSSEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVF 387
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G Y+TVFD + R+G A+A
Sbjct: 388 IGCYYTVFDRDQNRVGLAQAT 408
>gi|74198040|dbj|BAE35200.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|6753556|ref|NP_034113.1| cathepsin D precursor [Mus musculus]
gi|115718|sp|P18242.1|CATD_MOUSE RecName: Full=Cathepsin D; Flags: Precursor
gi|50299|emb|CAA37067.1| cathepsin D [Mus musculus]
gi|50301|emb|CAA37423.1| unnamed protein product [Mus musculus]
gi|817945|emb|CAA48453.1| cathepsin d [Mus musculus]
gi|32452040|gb|AAH54758.1| Cathepsin D [Mus musculus]
gi|34785578|gb|AAH57931.1| Cathepsin D [Mus musculus]
gi|74139562|dbj|BAE40918.1| unnamed protein product [Mus musculus]
gi|74139905|dbj|BAE31791.1| unnamed protein product [Mus musculus]
gi|74151769|dbj|BAE29674.1| unnamed protein product [Mus musculus]
gi|74177956|dbj|BAE29773.1| unnamed protein product [Mus musculus]
gi|74178091|dbj|BAE29834.1| unnamed protein product [Mus musculus]
gi|74181413|dbj|BAE29980.1| unnamed protein product [Mus musculus]
gi|74184920|dbj|BAE39078.1| unnamed protein product [Mus musculus]
gi|74185047|dbj|BAE39131.1| unnamed protein product [Mus musculus]
gi|74185557|dbj|BAE30245.1| unnamed protein product [Mus musculus]
gi|74186716|dbj|BAE34813.1| unnamed protein product [Mus musculus]
gi|74189047|dbj|BAE39288.1| unnamed protein product [Mus musculus]
gi|74191359|dbj|BAE30262.1| unnamed protein product [Mus musculus]
gi|74191542|dbj|BAE30346.1| unnamed protein product [Mus musculus]
gi|74197068|dbj|BAE35086.1| unnamed protein product [Mus musculus]
gi|74197198|dbj|BAE35144.1| unnamed protein product [Mus musculus]
gi|74199016|dbj|BAE30724.1| unnamed protein product [Mus musculus]
gi|74204247|dbj|BAE39883.1| unnamed protein product [Mus musculus]
gi|74207294|dbj|BAE30833.1| unnamed protein product [Mus musculus]
gi|74207430|dbj|BAE30895.1| unnamed protein product [Mus musculus]
gi|74212520|dbj|BAE31001.1| unnamed protein product [Mus musculus]
gi|74212556|dbj|BAE31018.1| unnamed protein product [Mus musculus]
gi|74212558|dbj|BAE31019.1| unnamed protein product [Mus musculus]
gi|74213416|dbj|BAE35523.1| unnamed protein product [Mus musculus]
gi|74214708|dbj|BAE31193.1| unnamed protein product [Mus musculus]
gi|74217133|dbj|BAE31236.1| unnamed protein product [Mus musculus]
gi|74219445|dbj|BAE29499.1| unnamed protein product [Mus musculus]
gi|74220283|dbj|BAE31319.1| unnamed protein product [Mus musculus]
gi|74220373|dbj|BAE31412.1| unnamed protein product [Mus musculus]
gi|74220638|dbj|BAE31529.1| unnamed protein product [Mus musculus]
gi|74220740|dbj|BAE31342.1| unnamed protein product [Mus musculus]
gi|74222921|dbj|BAE42305.1| unnamed protein product [Mus musculus]
gi|74225262|dbj|BAE31566.1| unnamed protein product [Mus musculus]
gi|74225282|dbj|BAE31575.1| unnamed protein product [Mus musculus]
gi|148686195|gb|EDL18142.1| cathepsin D, isoform CRA_a [Mus musculus]
Length = 410
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|74220304|dbj|BAE31329.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|90076280|dbj|BAE87820.1| unnamed protein product [Macaca fascicularis]
Length = 412
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 192/303 (63%), Gaps = 21/303 (6%)
Query: 37 RLDLHSLNAARITRKE------RYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGE 88
R+ LH + R T E + +S + E +P LKN+MDAQY+GE
Sbjct: 23 RIPLHKFTSIRRTMSEIGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYYGE 82
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I+Y
Sbjct: 83 IGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSSTYVKNGTSFAIHY 142
Query: 148 GSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G+
Sbjct: 143 GSGSLSGYLSQDTVSVPCKSAPSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILGMA 202
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+
Sbjct: 203 YPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYL 262
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++
Sbjct: 263 NVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLI 321
Query: 317 SAE 319
E
Sbjct: 322 QGE 324
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P ++ +G K + LSPE Y LK + +C+SGFM D+P
Sbjct: 317 AVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP 376
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFAEAA
Sbjct: 377 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEAA 410
>gi|74142218|dbj|BAE31874.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (676), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 64/94 (68%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GF A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFTNAV 408
>gi|355681641|gb|AER96810.1| cathepsin D [Mustela putorius furo]
Length = 410
Score = 265 bits (676), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 130/268 (48%), Positives = 185/268 (69%), Gaps = 13/268 (4%)
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
GV GD ++L +N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 58 GVPSVAGDPVPEVL--RNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQ 171
+C+ H +Y S KS+TY + G S +I+YGSGS+SG+ SQD V V V V+ Q
Sbjct: 116 ACWIHHKYNSGKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSGLSSLAGVKVERQ 175
Query: 172 VFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP 231
F EAT++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP
Sbjct: 176 TFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFYLNRDP 235
Query: 232 DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
A+ GGE++ GG D K++KG +Y+ VT+K YWQ + + +G+ T +C+GGC AIVD+
Sbjct: 236 GAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEXVDVGSSLT-LCKGGCEAIVDT 294
Query: 292 GTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GTSL+ GP V E+ AIG ++ E
Sbjct: 295 GTSLIVGPVDEVRELQKAIGAVPLIQGE 322
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 64/94 (68%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P V+ +G K + L E Y LK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSTLPEVTLKLGGKPYKLLSEDYTLKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAT 408
>gi|432850599|ref|XP_004066827.1| PREDICTED: cathepsin D-like isoform 1 [Oryzias latipes]
Length = 396
Score = 264 bits (675), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 138/310 (44%), Positives = 196/310 (63%), Gaps = 8/310 (2%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C +Y
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYN 121
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S KS+TY + G S I YGSGS+SG+ SQD +GD+ V++QVF EA ++ + F+ A+F
Sbjct: 122 SAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAAKF 181
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + I+V VPV+DN+++Q V VFSF+LNR+PD E GGE++ GG DPK++
Sbjct: 182 DGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYY 241
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G YV ++++ YWQ + + +G+Q + +C+GGC AIVD+GTSLL GP+ V + A
Sbjct: 242 SGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQKA 300
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 301 IGAIPLIQGE 310
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 395
>gi|42476045|ref|NP_599161.2| cathepsin D precursor [Rattus norvegicus]
gi|38303993|gb|AAH62032.1| Cathepsin D [Rattus norvegicus]
gi|149061703|gb|EDM12126.1| cathepsin D, isoform CRA_c [Rattus norvegicus]
Length = 407
Score = 264 bits (675), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 181/250 (72%), Gaps = 8/250 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARF 189
Y + G S +I+YGSGS+SG+ SQD V V G + V+ Q+F EAT++ + F+ A+F
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKF 190
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP + GGE++ GG D +++
Sbjct: 191 DGILGMGYPFISVNNVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYY 250
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ +Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ A
Sbjct: 251 HGELSYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKA 309
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 310 IGAVPLIQGE 319
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 68/94 (72%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P ++F +G + + L PE+YILK + +C+SGFM D+P
Sbjct: 312 AVPLIQGEYMIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIP 371
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 372 PPSGPLWILGDVFIGCYYTVFDREYNRVGFAKAA 405
>gi|354496335|ref|XP_003510282.1| PREDICTED: cathepsin D [Cricetulus griseus]
gi|344248735|gb|EGW04839.1| Cathepsin D [Cricetulus griseus]
Length = 408
Score = 264 bits (674), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 125/251 (49%), Positives = 181/251 (72%), Gaps = 9/251 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV-------GDVVVKDQVFIEATREGSLTFLLAR 188
+ + G S +I+YGSGS+SG+ SQD V V G + V+ Q+F EA ++ +TF+ A+
Sbjct: 131 FVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSEQPGGLKVEKQIFGEAIKQPGITFIAAK 190
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+G+ I+V + VPV+DN+++Q LV + +FSF+LNRDP + GGE++ GG+D K+
Sbjct: 191 FDGILGMGYPSISVNNVVPVFDNLMQQKLVEKNIFSFFLNRDPTGQPGGELMLGGIDSKY 250
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++G+ +Y+ VT+K YWQ + + + N T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 YEGELSYLNVTRKAYWQVHMDQLDVANGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQK 309
Query: 309 AIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 AIGAVPLIQGE 320
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 69/94 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P+V+ +G K + LSP +Y+LK +G +C+SGFM D+P
Sbjct: 313 AVPLIQGEYMIPCEKVSSLPSVTLKLGGKDYELSPSKYVLKVSQGGKTICLSGFMGMDIP 372
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 373 PPSGPLWILGDVFIGTYYTVFDRDNNRVGFAKAA 406
>gi|147743000|sp|P85137.1|CARDF_CYNCA RecName: Full=Cardosin-F; Contains: RecName: Full=Cardosin-F heavy
chain; Contains: RecName: Full=Cardosin-F light chain
Length = 281
Score = 264 bits (674), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 142/261 (54%), Positives = 169/261 (64%), Gaps = 35/261 (13%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D Y+GEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 2 DSGSAVVALTNDRDTSYYGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSMY 53
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY SQD+V +GD+VVK+Q FIEAT E FL
Sbjct: 54 ESSGSSTYK-------------------SQDSVTIGDLVVKEQDFIEATEEADNVFLNRL 94
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I+V PVW NM+ QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 95 FDGILGLSFQTISV----PVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 148
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 149 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 208
Query: 309 AIGGEGVVSAECK--LVVSQY 327
AIG G K L QY
Sbjct: 209 AIGANGSEELNVKFGLTPEQY 229
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/64 (67%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 443 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 502
F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +GF
Sbjct: 222 FGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGF 277
Query: 503 AEAA 506
AEAA
Sbjct: 278 AEAA 281
>gi|74191361|dbj|BAE30263.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNTFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|226822856|gb|ACO83090.1| cathepsin D preproprotein (predicted) [Dasypus novemcinctus]
Length = 410
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/263 (48%), Positives = 181/263 (68%), Gaps = 15/263 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+P Q F V+FDTGSSNLWVPS C +C+ H +Y S +S+T
Sbjct: 71 LRNYMDAQYYGEIGIGTPAQCFRVVFDTGSSNLWVPSIHCRLLDFACWLHRKYNSGRSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVV---------VKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V +V V QVF EAT++ +TFL+
Sbjct: 131 YVKNGSAFDIHYGSGSLSGYLSQDTVSVSPLVPCSAPVGVSVGKQVFGEATKQPGITFLM 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+VG +PV+DN+++Q LV + VFSF+LNRDP A+ GGE+V GG+DP
Sbjct: 191 AKFDGILGMAYPSISVGGVLPVFDNLMQQKLVDKNVFSFYLNRDPTAQPGGELVLGGMDP 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+H+ G Y+ +T+K YWQ + + +G+ T +C+ GC AIVD+GTSL+ GP V E+
Sbjct: 251 RHYTGSVDYLNITRKAYWQVHMDRLEVGDGLT-LCKQGCEAIVDTGTSLMVGPVAEVREL 309
Query: 307 NHAIGGEGVVSAE----CKLVVS 325
AIG ++ E C+ V S
Sbjct: 310 QKAIGAVPLIQGEYMISCEKVAS 332
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 69/100 (69%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ + ++P GE +I C+++ ++P ++ +G++ + LS E Y LK +G VC+SGF
Sbjct: 309 LQKAIGAVPLIQGEYMISCEKVASLPPITLMLGNRGYRLSGEDYTLKVSQGGQTVCLSGF 368
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D+PPP GPLWILGD+F+G ++TVFD R+GFA+AA
Sbjct: 369 MGMDIPPPGGPLWILGDIFIGRFYTVFDRDLNRVGFAKAA 408
>gi|146286061|sp|O93428.2|CATD_CHIHA RecName: Full=Cathepsin D; Flags: Precursor
Length = 396
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 120/244 (49%), Positives = 170/244 (69%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ + Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + + +G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIRVDSMAVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+ P GE +++CD +P++P +SFT+G +++ L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLTGEQYILKVTQAGKTMCLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVFMG Y+TVFD R+GFA+A
Sbjct: 363 APAGPLWILGDVFMGQYYTVFDRDANRVGFAKA 395
>gi|83523775|ref|NP_001032810.1| cathepsin D precursor [Sus scrofa]
gi|65330113|gb|AAY42144.1| cathepsin D [Sus scrofa]
Length = 410
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 178/253 (70%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQ +GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQNYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + I+YGSGS+SG++SQD V V G + V+ Q F EAT++ LTF+
Sbjct: 131 YVKNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D
Sbjct: 191 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGSLDYHNVTRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 67/94 (71%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDLNRVGLAEAA 408
>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
Length = 396
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 197/315 (62%), Gaps = 11/315 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+F L V A+ L LP S+ L RI L K R L + T +E A + +++ LG
Sbjct: 3 LFLLGVFAA-LALP--SDALIRIPLTKFRSIRRELTDSGRTAEELL---ADKNSLKYNLG 56
Query: 69 -DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
S P LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C
Sbjct: 57 FPSSNGPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSILDIACLL 116
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS+TY + G + I YG+GS+SG+ SQD +GD+ V Q+F EA ++ + F
Sbjct: 117 HHKYNSAKSSTYVKNGTAFAIQYGTGSLSGYLSQDTCTIGDISVDKQLFGEAIKQPGVAF 176
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ + I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG
Sbjct: 177 IAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTEPGGELLLGGT 236
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DPK++ G YV +T++ YWQ + + +G Q +C GC AIVD+GTSL+ GP+ V
Sbjct: 237 DPKYYSGDFHYVNITRQAYWQIHMDGMAVGGQ-LNLCTSGCEAIVDTGTSLITGPSAEVR 295
Query: 305 EINHAIGGEGVVSAE 319
+ AIG + E
Sbjct: 296 SLQKAIGAIPFIQGE 310
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 67/93 (72%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++ CD+IP++P ++F +G + + L+ EQY+LK + +C+SGFM D+P
Sbjct: 303 AIPFIQGEYMVSCDKIPSLPVITFNVGGQSYVLTGEQYVLKVSQAGKTICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD ++GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRENNQVGFAKS 395
>gi|432102593|gb|ELK30160.1| Napsin-A [Myotis davidii]
Length = 357
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 134/281 (47%), Positives = 179/281 (63%), Gaps = 8/281 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS----DEDILPLKNFMDAQYFGEIGIG 92
R+ LH + A +R + G G LG +PL N+M+AQY+G+IG+G
Sbjct: 29 RIPLHRVYAG--SRTPNPLRGWGSPEEPRGLGAPPPGGKSAFVPLSNYMNAQYYGKIGLG 86
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ + S+T+ G I YGSG
Sbjct: 87 TPPQNFSVVFDTGSSNLWVPSRRCSFFSLPCWFHHRFDPKASSTFKPNGTKFAIQYGSGQ 146
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG S+D + +G + VF EA E SL F+ A FDGI+GLGF +AVG P D
Sbjct: 147 LSGILSEDKLTIGGIKNASVVFGEALWEPSLVFVFAHFDGILGLGFPVLAVGGVRPPLDT 206
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
MV+QGL+ + VFSF+LNRDP+A EGGE+V GG DP H+ TYVPVT YWQ + +
Sbjct: 207 MVDQGLLDKPVFSFYLNRDPEAAEGGELVLGGSDPAHYIPPLTYVPVTVPAYWQVHMERV 266
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+G T +C GC AI+D+GTSL+ GPT + ++ AIGG
Sbjct: 267 TVGPGLT-LCAQGCPAILDTGTSLITGPTEEIRALHRAIGG 306
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 4/96 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E V+ G +C+ A++ L T+E + ++ P +G+ II+C
Sbjct: 261 VHMERVTVGPGLTLCAQGCPAILDTGTSLITGPTEE--IRALHRAIGGFPL-LGKYIIEC 317
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 461
IP +P VSF++G FNL+ + Y+++ G G +V
Sbjct: 318 SVIPALPPVSFSLGGVWFNLTSQDYVIQVGSGQNDV 353
>gi|115720|sp|P24268.1|CATD_RAT RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D 12
kDa light chain; Contains: RecName: Full=Cathepsin D 9
kDa light chain; Contains: RecName: Full=Cathepsin D 34
kDa heavy chain; Contains: RecName: Full=Cathepsin D 30
kDa heavy chain; Flags: Precursor
gi|55882|emb|CAA38349.1| preprocathepsin D [Rattus norvegicus]
Length = 407
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 180/250 (72%), Gaps = 8/250 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARF 189
Y + G S +I+YGSGS+SG+ SQD V V G + V+ Q+F EAT++ + F+ A+F
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKF 190
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+G+ I+V +PV+DN+++Q LV + +FSF+LNRDP + GGE++ GG D +++
Sbjct: 191 DGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYY 250
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ +Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ A
Sbjct: 251 HGELSYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKA 309
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 310 IGAVPLIQGE 319
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 68/94 (72%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P ++F +G + + L PE+YILK + +C+SGFM D+P
Sbjct: 312 AVPLIQGEYMIPCEKVSSLPIITFKLGGQNYELHPEKYILKVSQAGKTICLSGFMGMDIP 371
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 372 PPSGPLWILGDVFIGCYYTVFDREYNRVGFAKAA 405
>gi|312097106|ref|XP_003148873.1| aspartic protease BmAsp-2 [Loa loa]
gi|307755962|gb|EFO15196.1| aspartic protease BmAsp-2 [Loa loa]
Length = 417
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 193/306 (63%), Gaps = 20/306 (6%)
Query: 30 RIGLKKR---RLDL---------HSLNAARITRK--ERYMG-GAGVSGVRHRLGDSDEDI 74
RI L+K+ R DL + L +I RK +R +G G++ + ++DE
Sbjct: 4 RIALRKQNSLRADLIKTGSLESYNKLLNFQIQRKKTQRKIGLDFGLASRPRTISETDE-- 61
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKS 133
LKN+MDAQY+G+I IG+P QNFSV+FDTGSSNLW+PS KC FS I+C FH++YK +S
Sbjct: 62 -ILKNYMDAQYYGQISIGTPAQNFSVVFDTGSSNLWIPSVKCPFSDIACLFHNKYKGAQS 120
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
TY G+ +I YG GS+ GF S D V + D+ V DQ F EAT E +TF++A+FDGI+
Sbjct: 121 TTYKPDGRKIKIQYGRGSMEGFISSDTVCIADICVTDQPFAEATSEPGVTFVMAKFDGIL 180
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+ F EIAV PV+ M++Q V E +F+FWL+R+P+ E GGEI GG+D F
Sbjct: 181 GMAFPEIAVLGLSPVFHTMIKQKTVKESLFAFWLDRNPNDEIGGEITLGGIDVNRFVAPL 240
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y P++K GYWQF++ D + G+ C GC AI D+GTSL+AGP + +I IG E
Sbjct: 241 VYTPISKHGYWQFQM-DSIQGDGKAISCANGCQAIADTGTSLIAGPKSQIDKIQKYIGAE 299
Query: 314 GVVSAE 319
+ + E
Sbjct: 300 HLYADE 305
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 55/86 (63%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E II C ++P++P ++F I K + L Y+L +C+SGFM DLP G LW
Sbjct: 305 EYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVSAQGKSICLSGFMGIDLPERVGELW 364
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 365 ILGDVFIGHYYTVFDVGNSQIGFAQA 390
>gi|74204520|dbj|BAE35336.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
+ + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 HVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|301769501|ref|XP_002920177.1| PREDICTED: cathepsin D-like [Ailuropoda melanoleuca]
Length = 371
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 194/305 (63%), Gaps = 27/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDAQ 84
R+ LH + R T E +GG GV G +IL KN+MDAQ
Sbjct: 23 RIPLHKFTSIRRTMSE--LGGPVEDLIAKGPISKYAQGVPSVAGGPIPEIL--KNYMDAQ 78
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSF 138
Query: 144 EINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+I+YGSGS+SG+ SQD V V V V+ Q F EA ++ +TF+ A+FDGI+G
Sbjct: 139 DIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIAAKFDGILG 198
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +
Sbjct: 199 MAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLS 258
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG
Sbjct: 259 YLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVP 317
Query: 315 VVSAE 319
++ E
Sbjct: 318 LIQGE 322
>gi|74219443|dbj|BAE29498.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+G SLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGASLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|74191270|dbj|BAE39462.1| unnamed protein product [Mus musculus]
gi|74204799|dbj|BAE35462.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
A G ++ E
Sbjct: 310 QKATGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|21907889|dbj|BAC05689.1| aspartic protease BmAsp-2 [Brugia malayi]
Length = 452
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 125/244 (51%), Positives = 167/244 (68%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEI IG+PPQNFSV+FDTGSSNLWVPS KC + I+C FH++YK KS T
Sbjct: 91 LKNYMDAQYYGEISIGTPPQNFSVVFDTGSSNLWVPSVKCPFLDIACLFHNKYKGTKSTT 150
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ +I YG+GS+ GF S D V + ++ V Q F EAT E TF++A+FDGI+G+
Sbjct: 151 YKPDGRKIQIQYGTGSMEGFISLDTVCIANICVTGQPFAEATSEPGATFVMAKFDGILGM 210
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ M+ Q +V + VF+FWL+R+P + GGEI FGG+D F TY
Sbjct: 211 AFPEISVLGLNPVFHTMISQKVVHQPVFAFWLDRNPSDKIGGEITFGGIDANRFVSPITY 270
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PV++ GYWQF++ +L ++ G C GC AI D+GTSL+AGP + +I IG E V
Sbjct: 271 TPVSRHGYWQFKMDRVLGRGKAIG-CGNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHV 329
Query: 316 VSAE 319
+ E
Sbjct: 330 YAGE 333
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 57/87 (65%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE II C ++P++P ++F I K + L Y+L A +C+SGFM DLP G L
Sbjct: 332 GEYIIPCYKVPSLPEITFVIAGKSYTLKGSDYVLNVTSKGATICLSGFMGIDLPKRVGEL 391
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 392 WILGDVFIGRYYTVFDVGNSQIGFAQA 418
>gi|3378161|emb|CAA07719.1| cathepsin D precursor [Chionodraco hamatus]
Length = 396
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 120/244 (49%), Positives = 170/244 (69%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ + Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + + +G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIRVDSMAVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+ P GE +++CD +P++P +SFT+G +++ L+ EQYILK + +C+SGFM D+P
Sbjct: 303 AFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLTGEQYILKVTQAGKTMCLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVFMG Y+TVFD R+GFA+A
Sbjct: 363 APAGPLWILGDVFMGQYYTVFDRDANRVGFAKA 395
>gi|281344446|gb|EFB20030.1| hypothetical protein PANDA_008874 [Ailuropoda melanoleuca]
Length = 345
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 194/305 (63%), Gaps = 27/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDAQ 84
R+ LH + R T E +GG GV G +IL KN+MDAQ
Sbjct: 8 RIPLHKFTSIRRTMSE--LGGPVEDLIAKGPISKYAQGVPSVAGGPIPEIL--KNYMDAQ 63
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S
Sbjct: 64 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSF 123
Query: 144 EINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+I+YGSGS+SG+ SQD V V V V+ Q F EA ++ +TF+ A+FDGI+G
Sbjct: 124 DIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIAAKFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +
Sbjct: 184 MAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLS 243
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG
Sbjct: 244 YLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVP 302
Query: 315 VVSAE 319
++ E
Sbjct: 303 LIQGE 307
Score = 38.5 bits (88), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEG 457
++P GE +I C+++ T+P V+ +G + + LS E Y LK G
Sbjct: 300 AVPLIQGEYMIPCEKVSTLPEVTLKLGGRAYTLSSEDYTLKVSGG 344
>gi|27803878|gb|AAO22152.1| cathepsin D-like aspartic protease [Ancylostoma ceylanicum]
Length = 446
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 171/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+DNV + + +Q F EAT E LTF
Sbjct: 136 HHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAVEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EI+V PV+ +EQ V VF+FWLNR+PD+E GGEI GG+
Sbjct: 196 IAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGM 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+ + T+ PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP V
Sbjct: 256 DPRRYVEPITWTPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKFIGAEPLMKGE 329
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE +I CD++P++P +SF I + F L E Y+L G +C+SGF
Sbjct: 316 IQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTFILKGEDYVLTVKAGGKSICLSGF 375
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPERIGELWILGDVFIGKYYTVFDIGQARLGFAQA 414
>gi|74192771|dbj|BAE34900.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|144228219|gb|ABO93618.1| aspartic proteinase [Vitis vinifera]
Length = 194
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/194 (63%), Positives = 151/194 (77%), Gaps = 3/194 (1%)
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
INHAIG GVVS ECK VV+QYG+ I DLL+S P+K+C QIGLC F+G V GI++
Sbjct: 1 INHAIGATGVVSQECKTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGIES 60
Query: 366 VVEKEN---VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESI 422
VV+++N S A CSACEMAVVW+Q+QL+Q QTKE++L Y+NELCD LP+PMGES
Sbjct: 61 VVDEKNGDKSSGVHDAGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSPMGESA 120
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
+DC ++ +MPNVS TI K+F+LS +Y+LK GEG A CISGF+A D+PPPRGPLWILG
Sbjct: 121 VDCLQLSSMPNVSLTISGKVFDLSANEYVLKVGEGAAAQCISGFIAMDVPPPRGPLWILG 180
Query: 483 DVFMGVYHTVFDSG 496
DVFMG YHTVFD G
Sbjct: 181 DVFMGRYHTVFDYG 194
>gi|167524529|ref|XP_001746600.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774870|gb|EDQ88496.1| predicted protein [Monosiga brevicollis MX1]
Length = 381
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/290 (47%), Positives = 176/290 (60%), Gaps = 19/290 (6%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
G+++ R L A T+ + M G V PL N+ DAQYFGEI I
Sbjct: 25 GMERTRDSLRRQGAMLTTKYQNIMAGTNV---------------PLSNYEDAQYFGEISI 69
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
G+P Q F VIFDTGSSNLWVPSS+C +I+C H++Y S S+TY G I YG+G
Sbjct: 70 GTPAQKFKVIFDTGSSNLWVPSSQCPKTNIACDVHAKYDSSASSTYKANGTKFAIQYGTG 129
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S+SGF S D +GD+ VKDQ F EA E +TF+ A+FDGI+G+GF I+V VPVW
Sbjct: 130 SLSGFLSTDTACIGDLCVKDQTFAEALEEPGVTFVAAKFDGILGMGFSTISVDHVVPVWY 189
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+Q +V + ++SF+LNR+P+ GGE+ GG D HF G + VT GYWQF +
Sbjct: 190 NMVQQQVVEQNMYSFYLNRNPNGVSGGELTLGGYDESHFAGPIHWTDVTVDGYWQFTMTG 249
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+ I N T C C AI D+GTSLLAGPT VV +IN AIG + + E
Sbjct: 250 LSIEN--TPYCT-NCKAIADTGTSLLAGPTDVVKQINKAIGATTIAAGEA 296
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/105 (51%), Positives = 70/105 (66%), Gaps = 2/105 (1%)
Query: 403 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEV 461
V+ IN+ + GE+I+DC++IP MPNV+ I ++LS EQY+L+ T EG E
Sbjct: 278 VVKQINKAIGATTIAAGEAIVDCNKIPHMPNVTIVINGIQYSLSAEQYVLQVTAEGETE- 336
Query: 462 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
CISGF D+P P GPLWILGDVF+G Y TVFD G R+GF +A
Sbjct: 337 CISGFAGIDVPAPEGPLWILGDVFIGAYTTVFDMGNNRVGFGASA 381
>gi|341884635|gb|EGT40570.1| CBN-ASP-4 protein [Caenorhabditis brenneri]
Length = 447
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 176/254 (69%), Gaps = 6/254 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG+ DE L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLW+PS KC ++ I+C H
Sbjct: 80 LGEIDE---LLRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWIPSKKCPFYDIACMLH 136
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY S+ S+TY E G+ I YG+GS+ GF S+D+V + + +DQ F EAT E +TF+
Sbjct: 137 HRYDSKASSTYKEDGRKMAIQYGTGSMKGFISKDSVCLAGICAEDQPFAEATSEPGITFV 196
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + EIAV PV++ + EQ V +F+FWLNR+PD++ GGEI FGG+D
Sbjct: 197 AAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPANLFAFWLNRNPDSDLGGEITFGGID 256
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
+ + TY PVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP +
Sbjct: 257 SRRYVEPITYAPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEA 314
Query: 306 INHAIGGEGVVSAE 319
I + IG E ++ E
Sbjct: 315 IQNFIGAEPLIKGE 328
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/99 (50%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + P GE +I CD++PT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 315 IQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSLKGEDYVLKVSQGGKTICLSGF 374
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 375 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 413
>gi|73947914|ref|XP_533610.2| PREDICTED: napsin-A [Canis lupus familiaris]
Length = 422
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 194/310 (62%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA ++ L RI L++ L +LN+ R K GV GD + +PL N+M
Sbjct: 19 PARAS-LIRIPLRRVYPGLETLNSLRGWGKPTVPPSLGVPSS----GD-NPVFVPLSNYM 72
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
+ QY+GEIG+G+PPQNFSVIFDTGSSNLWVPS +C +FS+ C+FH RY S+ S+++ G
Sbjct: 73 NVQYYGEIGLGTPPQNFSVIFDTGSSNLWVPSIRCHFFSLPCWFHHRYNSKASSSFQPNG 132
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G V +F EA E SL F LA FDGI+GLGF +
Sbjct: 133 TKFAIQYGTGRLDGILSEDKLTIGGVKSASVIFGEALWEPSLVFTLAHFDGILGLGFPIL 192
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
AVG P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT
Sbjct: 193 AVGGVQPPLDLLVDQGLLDKPVFSFYLNRDPEAVDGGELVLGGSDPAHYIPPLTFLPVTV 252
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E
Sbjct: 253 PAYWQIHMERVKVGTGLI-LCAQGCAAILDTGTSLITGPTEEIQALNAAIGGFSLLLGEY 311
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 312 LIQCSEIPTL 321
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 79/145 (54%), Gaps = 7/145 (4%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V G + C+ A++ L T+E + +N +GE +I C
Sbjct: 258 IHMERVKVGTGLILCAQGCAAILDTGTSLITGPTEE--IQALNAAIGGFSLLLGEYLIQC 315
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF
Sbjct: 316 SEIPTLPPISFLLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDIPPPTGPLWILGDVF 375
Query: 486 MGVYHTVFDSGKL----RIGFAEAA 506
+G + VFD G L R+G A A+
Sbjct: 376 LGAHVAVFDRGNLTGGARVGLARAS 400
>gi|241275826|ref|XP_002406708.1| aspartic protease, putative [Ixodes scapularis]
gi|215496940|gb|EEC06580.1| aspartic protease, putative [Ixodes scapularis]
Length = 345
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 182/284 (64%), Gaps = 5/284 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + ++R + +G R G E PLKN++DAQY+GEI +G+PPQ
Sbjct: 23 RMPLHKMQSSRAHLLDATTPLTRPAGHATRGGPIPE---PLKNYLDAQYYGEITLGTPPQ 79
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS+KC F+ I+C H +Y SRKS+TY + G EI YGSGS+ G
Sbjct: 80 SFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSSTYVKNGTQFEIRYGSGSVRGE 139
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + VGD V Q F E E L FL A+FDGI+GLG+ EI+V V+D MV Q
Sbjct: 140 LSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGLGYPEISVLGVPTVFDTMVAQ 199
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ + +GN
Sbjct: 200 GVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYIGNISYVPVSKRGYWQVHMDGTRVGN 259
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 260 NGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 302
>gi|74151850|dbj|BAE29712.1| unnamed protein product [Mus musculus]
gi|74151877|dbj|BAE29725.1| unnamed protein product [Mus musculus]
Length = 410
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ G D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGDTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 64/93 (68%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK G +C+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSHGGKTICLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 375 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 407
>gi|358255149|dbj|GAA56870.1| cathepsin D [Clonorchis sinensis]
Length = 425
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 128/237 (54%), Positives = 173/237 (72%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
++++ G +C A+ L ++E + +N+ ++ P G IDC R+
Sbjct: 254 DSMNVGSMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAIKIPGGTYYIDCSRVS 311
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
T+P V F+I K+ L P YIL+ +CISGFM D+ P GPLWILGDVF+G Y
Sbjct: 312 TLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGFMGIDI--PAGPLWILGDVFIGKY 369
Query: 490 HTVFDSGKLRIGFAEA 505
+T+FD G R+GFA A
Sbjct: 370 YTIFDVGNARVGFATA 385
>gi|9581805|emb|CAC00543.1| necepsin II [Necator americanus]
Length = 446
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQY+G I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSANEIDELLRNYMDAQYYGVIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+D V + + ++Q F EAT E LTF
Sbjct: 136 HHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDIVCIAGICAEEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EIAV PV+ +EQ V VF+FWLNR+P++E GGEI FGGV
Sbjct: 196 IAAKFDGILGMAFPEIAVLGVTPVFHTFIEQKKVPSPVFAFWLNRNPESEIGGEITFGGV 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D + + T+ PVT++GYWQF++ D++ G S+ C GC AI D+GTSL+AGP V
Sbjct: 256 DTRRYVEPITWTPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKYIGAEPLMKGE 329
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE +I CD++P++P+VSF I K F L E Y+L +C+SGF
Sbjct: 316 IQKYIGAEPLMKGEYMIPCDKVPSLPDVSFIIDGKTFTLKGEDYVLTVKAAGKSICLSGF 375
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPEKIGELWILGDVFIGKYYTVFDVGQARVGFAQA 414
>gi|410982348|ref|XP_003997519.1| PREDICTED: napsin-A [Felis catus]
Length = 422
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/296 (44%), Positives = 187/296 (63%), Gaps = 7/296 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S L RI L++ +LN R K G GD + ++PL N+M+ Q
Sbjct: 21 SASLIRIPLRRVHTGHRTLNPPRGWGKPAATPALGAPSP----GD-NPTVIPLSNYMNVQ 75
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 YYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFFSLPCWLHHRFNPKASSSFQPNGTKF 135
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+I YG+G ++G S+D + +G ++ +F EA E SL F LARFDGI+GL F +AVG
Sbjct: 136 DIQYGTGRLAGILSEDKLTIGGMMNASVIFGEALWESSLVFTLARFDGILGLAFPVLAVG 195
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT Y
Sbjct: 196 GVRPPLDVLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAY 255
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
WQ + + +G T +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E
Sbjct: 256 WQIHMERMKVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALNTAIGGISLLVGE 310
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + +N + +GE +I C+
Sbjct: 260 MERMKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALNTAIGGISLLVGEYLIQCE 316
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IPT+P VSF +G FNL+ + Y+++ G +C+SGF A D+P P GPLWILGDVF+
Sbjct: 317 TIPTLPPVSFLLGGVWFNLTAQDYVIQIVRGGFRLCLSGFQALDMPSPAGPLWILGDVFL 376
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
Y VFD G L R+G A +
Sbjct: 377 RTYVAVFDRGNLTSGARVGLARS 399
>gi|391329068|ref|XP_003738999.1| PREDICTED: lysosomal aspartic protease-like [Metaseiulus
occidentalis]
Length = 384
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 132/294 (44%), Positives = 184/294 (62%), Gaps = 13/294 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L+K + +L R + R LG + E P+ N+MDAQY+G
Sbjct: 18 LLRIPLQKSKSLRQTLIEKNTPRHVMFS--------RPILGGNVE---PIANYMDAQYYG 66
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEIN 146
I IG+PPQ F V+FDTGSSNLWVPS+ C + ++C H++Y S KS +Y G + I
Sbjct: 67 PISIGNPPQPFQVVFDTGSSNLWVPSANCPITNVACLLHNKYHSSKSTSYLANGTTFSIQ 126
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YGSG++SG S D+V V V + Q F E +E L F+ +FDGI+G+G+ +I+V +
Sbjct: 127 YGSGAVSGLLSADDVSVNGVNITRQTFAEILKESGLGFIAGKFDGILGMGYPQISVLGVL 186
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+D MV Q ++ +FSF+L RD D G E+V GG+DPKH KG+ TY+PV++KGYWQF
Sbjct: 187 PVFDQMVAQNAIAAPIFSFYLTRDNDHPTGSELVIGGIDPKHHKGEITYIPVSRKGYWQF 246
Query: 267 ELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
++ + IG+ S T +C GC AI D+GTSL+AGPT V +N AIG ++ E
Sbjct: 247 KMDSVKIGDVSKTTLCANGCQAIADTGTSLIAGPTSEVKALNKAIGAAPFLNGE 300
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 85/142 (59%), Gaps = 7/142 (4%)
Query: 368 EKENVSAGDSAVCSACE---MAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+ ++V GD + + C A+ L T E + +N+ + P GE +++
Sbjct: 247 KMDSVKIGDVSKTTLCANGCQAIADTGTSLIAGPTSE--VKALNKAIGAAPFLNGEYLVN 304
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C+ +PTMPN++FT+G K F L+P Y++K +G +C+SGF+ D+ PRGPLWILGDV
Sbjct: 305 CNNLPTMPNITFTLGGKDFELTPNDYVMKMSQGGLPLCLSGFIGLDV--PRGPLWILGDV 362
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
F+G Y TVFD R+GFA AA
Sbjct: 363 FIGRYFTVFDRQSDRVGFAVAA 384
>gi|118429511|gb|ABK91803.1| aspartic protease precursor [Clonorchis sinensis]
Length = 425
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 128/237 (54%), Positives = 173/237 (72%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
++++ G +C A+ L ++E + +N+ ++ P G IDC R+
Sbjct: 254 DSMNVGSMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAIKIPGGTYYIDCSRVS 311
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
T+P V F+I K+ L P YIL+ +CISGFM D+ P GPLWILGDVF+G Y
Sbjct: 312 TLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGFMGIDI--PAGPLWILGDVFIGKY 369
Query: 490 HTVFDSGKLRIGFAEA 505
+T+FD G R+GFA A
Sbjct: 370 YTIFDVGNARVGFATA 385
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 126/245 (51%), Positives = 170/245 (69%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G+I IG+PPQ F+V+FDTGSSNLWVPS C + I+C H++YK+ +S+
Sbjct: 59 PLTNYLDAQYYGKIYIGTPPQPFTVVFDTGSSNLWVPSVHCAITDIACLIHNKYKASESS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G S I YGSGS+SG+ S D V + V K+Q+F EAT+E LTF+ A+FDGI+G
Sbjct: 119 SYKSNGTSFAIQYGSGSLSGYVSSDIVSIAGVKSKNQLFAEATKEPGLTFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ EI+V PV++ M +Q ++ FSF+LNRD +A GGE+ GGVD K F G +
Sbjct: 179 MGYPEISVNGITPVFNQMFKQEALAHNQFSFYLNRDANASSGGELYLGGVDTKKFTGSFS 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y PVT KGYWQ + + +G+ ST C GC AIVDSGTSLLAGPT + +IN IG
Sbjct: 239 YHPVTVKGYWQISMDSVSVGS-STSACVSGCKAIVDSGTSLLAGPTDEIEKINKLIGATK 297
Query: 315 VVSAE 319
++ E
Sbjct: 298 FLNGE 302
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/142 (40%), Positives = 84/142 (59%), Gaps = 5/142 (3%)
Query: 367 VEKENVSAGDS--AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+ ++VS G S A S C+ A+V L T E + IN+L + GE I+
Sbjct: 250 ISMDSVSVGSSTSACVSGCK-AIVDSGTSLLAGPTDE--IEKINKLIGATKFLNGEYIVQ 306
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C+++ TMP+++F++ + L P Y++K +CISGFM D+PPPRGPLWILGD+
Sbjct: 307 CNKMATMPDITFSLSGVKYILKPNDYVMKESTAGESICISGFMGLDVPPPRGPLWILGDI 366
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
FMG ++T FD R+GFA+ A
Sbjct: 367 FMGKFYTTFDFANNRVGFAQLA 388
>gi|344307517|ref|XP_003422427.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin D-like [Loxodonta
africana]
Length = 419
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 177/254 (69%), Gaps = 12/254 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 79 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSAKSST 138
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV----------GDVVVKDQVFIEATREGSLTFL 185
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EAT++ +TF+
Sbjct: 139 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCSSASASALGGVRVERQTFGEATKQPGITFI 198
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + I+V VPV+DN++ Q LV + +FSF+LNRDP A+ GGE++ GG+D
Sbjct: 199 AAKFDGILGMAYPRISVNKVVPVFDNLMAQKLVEKNMFSFYLNRDPTAQPGGELMLGGID 258
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
K++ G + VT++ YWQ + + +GN T +C+GGC AIVD+GTSL+ GP +TE
Sbjct: 259 SKYYTGTLNFNKVTREAYWQIHMDRVDVGNGLT-LCKGGCEAIVDTGTSLMVGPVEEITE 317
Query: 306 INHAIGGEGVVSAE 319
+ A+G ++ E
Sbjct: 318 LQKALGAIPLIQGE 331
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 81/140 (57%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+++ +V G + CE A+V L +E ++ + + ++P GE +I C+
Sbjct: 281 MDRVDVGNGLTLCKGGCE-AIVDTGTSLMVGPVEE--ITELQKALGAIPLIQGEYMIPCE 337
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ ++P VS +G + + LS E Y+LK + VC+SGFM+ D+PPP PL L DVF+
Sbjct: 338 KVSSLPPVSLQLGGRSYTLSSEDYVLKVSQAGRSVCLSGFMSMDIPPPEEPLXDLSDVFI 397
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G Y+TVFD +GFAEAA
Sbjct: 398 GRYYTVFDRDNNTVGFAEAA 417
>gi|195120065|ref|XP_002004549.1| GI19550 [Drosophila mojavensis]
gi|193909617|gb|EDW08484.1| GI19550 [Drosophila mojavensis]
Length = 387
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 134/258 (51%), Positives = 179/258 (69%), Gaps = 6/258 (2%)
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
+++ GDS E PL N++DAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + I+
Sbjct: 49 IKYGAGDSPE---PLSNYLDAQYYGPISIGTPPQNFKVVFDTGSSNLWVPSKKCHLTNIA 105
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H++Y + KS+TY + G S +I+YGSGS+SG+ S D V + + +K Q F EA E
Sbjct: 106 CLMHNKYDASKSSTYNKNGTSFDIHYGSGSLSGYLSSDTVNIAGLDIKGQTFAEALSEPG 165
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F+ A+FDGI+GLG+ I+V P + NM EQ L+++ VFSF+LNRDP A EGGEI+F
Sbjct: 166 LVFVAAKFDGILGLGYSSISVDGVKPPFYNMFEQSLIAQPVFSFYLNRDPKAPEGGEIIF 225
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DP H+ G TY+PVT+KGYWQ ++ I N +C+GGC I D+GTSL+A P
Sbjct: 226 GGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQINNVE--LCKGGCQVIADTGTSLIAAPAA 283
Query: 302 VVTEINHAIGGEGVVSAE 319
T IN AIGG +V +
Sbjct: 284 EATSINQAIGGTPIVGGQ 301
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P G+ ++ CD IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 288 INQAIGGTPIVGGQYVVSCDMIPNLPVIKFVLGGKTFELEGKDYILRIAQMGKTICLSGF 347
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA+A
Sbjct: 348 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFADA 386
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 177/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 95
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y G + +I+YGSGS+SG+ SQD V V V V+ Q+F EAT++ +TF+
Sbjct: 96 YVRNGTAFDIHYGSGSLSGYLSQDTVSVPCKSAPSPPSSVKVERQIFGEATKQPGITFIA 155
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 156 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDS 215
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G +Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V +
Sbjct: 216 KYYTGSLSYLNVTRKAYWQVHMEQVDVGNSLT-LCKAGCEAIVDTGTSLVVGPVEEVRAL 274
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 275 QKAIGAVPLIQGE 287
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/141 (39%), Positives = 82/141 (58%), Gaps = 3/141 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E V G+S +C A A+V L +E + + + ++P GE +I C
Sbjct: 235 VHMEQVDVGNSLTLCKAGCEAIVDTGTSLVVGPVEE--VRALQKAIGAVPLIQGEYMIPC 292
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+++ ++P V+ +G K + L E Y LK +G +C+SGFM D+PPP GPLWILGDVF
Sbjct: 293 EKVSSLPEVTLKLGGKGYKLGAEDYTLKVSQGGKTICLSGFMGMDIPPPGGPLWILGDVF 352
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G Y+TVFD + R+G AEA
Sbjct: 353 IGRYYTVFDRDENRVGLAEAT 373
>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
Length = 396
Score = 261 bits (667), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGDITLGTPPQTFSVVFDTGSSNLWVPSIHCSLLDIACLLHKKYNSAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + V++Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDTCSIGGLTVENQLFGEAIKQPGIAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V +PV+DN+++Q V VFSF+LNR+PD GGE++ GG DP ++ G+ Y
Sbjct: 188 AYPRISVDGVLPVFDNIMQQKKVESNVFSFYLNRNPDTAPGGELLLGGTDPTYYTGEFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + ++ +G+Q T +C+GGC AIVD+GTSLL GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQVSMDELAVGSQLT-LCKGGCQAIVDTGTSLLTGPSAEVKALQKAIGAIPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 70/93 (75%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +++CD+IP++P ++F +G + ++L+ EQYILK + +C+SGFMA D+P
Sbjct: 303 AIPLIQGEYMVNCDKIPSLPVITFKMGGQSYSLTGEQYILKESQAGKTICLSGFMALDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 363 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 395
>gi|326433118|gb|EGD78688.1| cathepsin D [Salpingoeca sp. ATCC 50818]
Length = 385
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 202/351 (57%), Gaps = 28/351 (7%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRI---GLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
+ L +A+ L+ + NGL R+ G+ + R L + AA + +
Sbjct: 3 MARTMALLAVATLLMAACAVNGLHRVPLTGMPRSRDTLRNAGAALLNK------------ 50
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ LG+ +P+ NF DAQY+GEI IG+PPQ F V+FDTGSSNLWVPS +C S++C
Sbjct: 51 --YSLGNGTN--VPIYNFEDAQYYGEITIGTPPQRFKVVFDTGSSNLWVPSKQCK-SLAC 105
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H +Y S +S+TY G I YGSGS++GF S D VGD+ V+ Q+F EAT E +
Sbjct: 106 DLHHKYDSSQSSTYFPNGTKFAIEYGSGSLTGFLSGDKTCVGDLCVEKQLFAEATNEPGI 165
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+ A+FDGI+G+GF EI+V VP W N+V G V +++FWLNR A GGE+ G
Sbjct: 166 TFVAAKFDGILGMGFVEISVDQVVPYWYNLVSAGKVESNMYTFWLNRVQGAPSGGELTLG 225
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DPKH G +VP+T+ GYWQF + + + S C C AI D+GTSLLAGPT
Sbjct: 226 GYDPKHMSGPIQWVPLTRDGYWQFAMDSLSVNGDS--YCS-NCQAIADTGTSLLAGPTDA 282
Query: 303 VTEINHAIG----GEGVVSAECKLVVSQYG-DLIWDLLVSGLLPEKVCQQI 348
+ ++N IG +G +CK + + D++ + L P++ Q+
Sbjct: 283 IKKLNKQIGAIPIAQGEYMVDCKKIPTMPNVDIVLNGQKFTLTPQQYVLQV 333
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 81/137 (59%), Gaps = 3/137 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+++S + CS C+ A+ L T + +N+ ++P GE ++DC +IP
Sbjct: 252 DSLSVNGDSYCSNCQ-AIADTGTSLLAGPTD--AIKKLNKQIGAIPIAQGEYMVDCKKIP 308
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
TMPNV + + F L+P+QY+L+ C+SGF D+PPP GPLWILGDVF+G Y
Sbjct: 309 TMPNVDIVLNGQKFTLTPQQYVLQVSAQGQTECLSGFFGLDVPPPAGPLWILGDVFIGAY 368
Query: 490 HTVFDSGKLRIGFAEAA 506
TVFD G R+GFA +A
Sbjct: 369 TTVFDMGNNRVGFAPSA 385
>gi|301619112|ref|XP_002938948.1| PREDICTED: cathepsin D-like [Xenopus (Silurana) tropicalis]
Length = 355
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 117/241 (48%), Positives = 171/241 (70%), Gaps = 2/241 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTE 138
++ AQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY +
Sbjct: 30 YLQAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACWMHHKYDSSKSSTYVK 89
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++ +TF+ A+FDGI+G+ +
Sbjct: 90 NGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVTFVAAKFDGILGMAYP 149
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ V
Sbjct: 150 VISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSV 209
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++
Sbjct: 210 TRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQG 268
Query: 319 E 319
+
Sbjct: 269 Q 269
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 70/93 (75%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P G+ ++ CD++PT+P +S T+G +++ L+ EQYI+K + + +C+SGFM ++P
Sbjct: 262 AVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLTGEQYIMKVSQLGSTICLSGFMGLNIP 321
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y++VFD R+GFA+A
Sbjct: 322 PPAGPLWILGDVFIGQYYSVFDRANNRVGFAKA 354
>gi|205364148|gb|ACI04532.1| aspartic protease 1 precursor [Ancylostoma duodenale]
Length = 446
Score = 261 bits (666), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 171/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+DNV + + ++Q F EAT E LTF
Sbjct: 136 HRRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EI+V PV+ +EQ V VF+FWLNR+PD+E GGEI GG+
Sbjct: 196 IAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGM 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D + + T+ PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP V
Sbjct: 256 DTRRYVEPITWTPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKFIGAEPLMKGE 329
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 62/99 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + P GE +I CD++P++P +SF I + L E Y+L G +C+SGF
Sbjct: 316 IQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTSTLKGEDYVLTVKAGGKSICLSGF 375
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D P G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 376 MGMDFPERIGELWILGDVFIGKYYTVFDIGQARLGFAQA 414
>gi|27503926|gb|AAH42316.1| Ctsd protein [Danio rerio]
gi|38571742|gb|AAH62824.1| Ctsd protein [Danio rerio]
gi|197247273|gb|AAI64814.1| Ctsd protein [Danio rerio]
Length = 398
Score = 260 bits (665), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 203/326 (62%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V+A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVVA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
>gi|407728652|gb|AFU24355.1| cathepsin D [Ctenopharyngodon idella]
Length = 398
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 201/319 (63%), Gaps = 16/319 (5%)
Query: 17 SCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSD 71
+CLLL A +S+ + RI L K R +L+ + +E AG +++ LG +
Sbjct: 4 ACLLLAAAFFWTSDAIVRIPLTKFRSIRRTLSDSGRAVEELV---AGSVPLKYNLGFPAS 60
Query: 72 EDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
P LKN++DAQY+GEIG+G+P Q+F+V+FDTGSSNLWVPS C I+C H +Y
Sbjct: 61 NGPTPGTLKNYLDAQYYGEIGLGTPVQSFTVVFDTGSSNLWVPSVHCSLMDIACLLHHKY 120
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
KS+TY + G I YGSGS+SG+ SQD VGD+ V+ Q+F EA ++ + F+ A+
Sbjct: 121 NGGKSSTYVKNGTEFAIQYGSGSLSGYLSQDTCTVGDIAVEKQIFGEAIKQPGVAFIAAK 180
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+ + IAV PV+D M+ Q V + +FSF+LNR+PD + GGE++ GG DPK+
Sbjct: 181 FDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNIFSFYLNRNPDTQPGGELLLGGTDPKY 240
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G YV ++++ YWQ + + IG++ T +C+GGC AIVD+GTSL+ GP + +
Sbjct: 241 YTGDFNYVDISRQAYWQIHMDGMSIGSELT-LCKGGCEAIVDTGTSLITGPATEIKALQK 299
Query: 309 AIGG----EGVVSAECKLV 323
AIG +G +CK V
Sbjct: 300 AIGAIPLIQGEYMVDCKKV 318
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 71/94 (75%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++PT+P +SF +G K ++L+ EQYILK + E+C+SGFM D+P
Sbjct: 303 AIPLIQGEYMVDCKKVPTLPTISFVLGGKTYSLTGEQYILKESQAGQEICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+AA
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKAA 396
>gi|195380081|ref|XP_002048799.1| GJ21122 [Drosophila virilis]
gi|194143596|gb|EDW59992.1| GJ21122 [Drosophila virilis]
Length = 391
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 195/315 (61%), Gaps = 21/315 (6%)
Query: 12 LWVLASCLLLP---ASSNGLRRIGLKK---RRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L + A CL L A+ L R+ L K R + + +Y G GVS
Sbjct: 5 LLLFAVCLALAWAVAAEPKLLRVPLNKFQSARRHFADVGTELQQLRIKYGGAGGVSPE-- 62
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
PL N++DAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C
Sbjct: 63 ----------PLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIACLM 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y + KS++Y++ G I+YGSGS+SG+ S D V + + +KDQ F EA E L F
Sbjct: 113 HNKYDASKSSSYSKNGTEFAIHYGSGSLSGYLSSDTVNIAGLDIKDQTFAEALSEPGLVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLG+ I+V P + +M EQGL+S+ VFSF+LNRDP A EGGEI+FGG
Sbjct: 173 VAAKFDGILGLGYSSISVDGVKPPFYSMFEQGLISQPVFSFYLNRDPKAPEGGEIIFGGS 232
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+ G TY+PVT+KGYWQ ++ + N +C+GGC I D+GTSL+A P T
Sbjct: 233 DPNHYTGDFTYLPVTRKGYWQIKMDSAQLNNLE--LCKGGCQIIADTGTSLIAAPVAEAT 290
Query: 305 EINHAIGGEGVVSAE 319
IN AIGG +V +
Sbjct: 291 SINQAIGGTPIVGGQ 305
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 48/99 (48%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P G+ I+ CD IP +P + F +G K F L + YIL+ + +C+SGF
Sbjct: 292 INQAIGGTPIVGGQYIVSCDMIPNLPVIKFVLGGKTFELEGKDYILRVAQMGKTICLSGF 351
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D+PPP GPLWILGDVF+G Y+T FD G R+GFA+A
Sbjct: 352 MGMDIPPPNGPLWILGDVFIGKYYTEFDMGNDRVGFADA 390
>gi|86278345|gb|ABC88426.1| cathepsin D-like aspartic proteinase preproprotein [Meloidogyne
incognita]
Length = 454
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/250 (50%), Positives = 165/250 (66%), Gaps = 7/250 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+G I IGSPPQNFSVIFDTGSSNLWVPS KC ++ I+C H +Y S KS++
Sbjct: 82 LRNYMDAQYYGPISIGSPPQNFSVIFDTGSSNLWVPSKKCPFYDIACLLHHKYDSTKSSS 141
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+ +I YG+GS+ GF S+D V + ++ V Q F EA E LTF+ A+FDGI+G+
Sbjct: 142 YKDDGRKMQIQYGTGSMKGFVSKDTVCIANICVAGQEFAEAVSEPGLTFVAAKFDGILGM 201
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ M+ Q V E VFSFWLNRDP ++ GGEI GG D + + Y
Sbjct: 202 AFPEISVLGVQPVFQQMISQQKVPEPVFSFWLNRDPYSKVGGEITIGGTDKRRYVEPLNY 261
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG---- 311
PVT+K YWQF++ + C+ GC AI D+GTSL+AGP + EI H IG
Sbjct: 262 TPVTRKAYWQFKMEGVHNSKGEKIACQNGCEAIADTGTSLIAGPKAQIEEIQHYIGAVPL 321
Query: 312 --GEGVVSAE 319
GE +VS E
Sbjct: 322 MHGEYMVSCE 331
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/109 (43%), Positives = 66/109 (60%), Gaps = 4/109 (3%)
Query: 397 KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGE 456
K E++ YI ++P GE ++ C+R+P +P+++ IG + L YIL
Sbjct: 306 KAQIEEIQHYIG----AVPLMHGEYMVSCERVPRLPDIALVIGGHSYVLKGSDYILNVTA 361
Query: 457 GIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SGFM DLPP G LWILGDVF+G Y+TVFD G+ RIG A+A
Sbjct: 362 MGKSICLSGFMGIDLPPKVGELWILGDVFIGRYYTVFDVGQQRIGLAQA 410
>gi|118344558|ref|NP_001072052.1| cathepsin D1 precursor [Takifugu rubripes]
gi|55771082|dbj|BAD69801.1| cathepsin D1 [Takifugu rubripes]
Length = 396
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS++
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHHKYNSAKSSS 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIRYGSGSLSGYLSQDTCTLGDLAVEKQLFGEAIKQPGIAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVTPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFDY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + + +G+Q + +C+ GC AIVD+GTSLL GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIHMDGMSVGSQLS-LCKSGCEAIVDTGTSLLTGPSEEVKALQKAIGAMPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/125 (44%), Positives = 82/125 (65%), Gaps = 3/125 (2%)
Query: 381 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 440
S CE A+V L ++E + + + ++P GE ++ CD+IP++P ++F IG
Sbjct: 274 SGCE-AIVDTGTSLLTGPSEE--VKALQKAIGAMPLIQGEYMVSCDKIPSLPVITFNIGG 330
Query: 441 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
K F+LS +QY+LK + +C+SGFMA D+P P GPLWILGDVF+G Y+TVFD R+
Sbjct: 331 KPFSLSGDQYVLKVSQAGKTICLSGFMALDIPAPAGPLWILGDVFIGQYYTVFDRDNNRV 390
Query: 501 GFAEA 505
GFA+A
Sbjct: 391 GFAKA 395
>gi|71727523|gb|AAZ39883.1| cathepsin D-like aspartic protease [Opisthorchis viverrini]
Length = 425
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 124/237 (52%), Positives = 173/237 (72%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS+ C F+I+C+ H +Y S +S+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFQVVFDTGSSNLWVPSTHCSIFNIACWLHHKYDSARSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V+VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YYPNGTEFSIRYGSGSVSGILSTDYVSVGTVIVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLV E VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKSISV-DGVPTLFDNMISQGLVPEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMSVGGMK--LCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 77/137 (56%), Gaps = 4/137 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+++S G +C A+ L ++E + +N+ ++ P G I+CDR+
Sbjct: 254 DSMSVGGMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAIKLPGGTYYINCDRVS 311
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
T+P V F I K+ L P YIL+ +CISGFM D+ P GPLWILGDVF+G Y
Sbjct: 312 TLPLVQFNINGKLMELEPSDYILRMTSFGKTLCISGFMGIDI--PAGPLWILGDVFIGKY 369
Query: 490 HTVFDSGKLRIGFAEAA 506
+T+FD G R+GFA A+
Sbjct: 370 YTIFDVGNARVGFATAS 386
>gi|315274244|gb|ADU03674.1| cathepsin D2 [Ixodes ricinus]
Length = 387
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 182/286 (63%), Gaps = 10/286 (3%)
Query: 37 RLDLHSLNAAR--ITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
R+ LH + +AR + + V R + + PLKN++DAQY+GEI +G+P
Sbjct: 23 RMPLHKMQSARAHLLDATTPLTRPAVHATRGPIPE------PLKNYLDAQYYGEITLGTP 76
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ+F V+FDTGSSNLWVPS+KC F+ I+C H +Y SRKS+TY + G EI YGSGS+
Sbjct: 77 PQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSSTYVKNGTQFEIRYGSGSVR 136
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
G S D + VGD V Q F E E L FL A+FDGI+GLG+ EI+V V+D MV
Sbjct: 137 GELSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGLGYPEISVLGVPTVFDTMV 196
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ + +
Sbjct: 197 AQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYTGNISYVPVSKRGYWQVHMDGTRV 256
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GN + C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 257 GNNGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 301
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 72/130 (55%), Gaps = 2/130 (1%)
Query: 376 DSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVS 435
+ + CS A++ L + E + +N L + P GE I+ C I +P ++
Sbjct: 259 NGSFCSGGCEAILDTGTSLIAGPSDE--IEKLNLLIGAAPFASGEYIVSCKSIDKLPKIT 316
Query: 436 FTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDS 495
FT+ K F L + Y+L+ +C+SGF+ D+P P GPLWILGDVF+G Y+T+FD
Sbjct: 317 FTLAGKDFVLEGKDYVLQMSSAGVPLCLSGFIGLDVPAPLGPLWILGDVFIGRYYTIFDR 376
Query: 496 GKLRIGFAEA 505
G R+G A A
Sbjct: 377 GNDRVGLANA 386
>gi|311258028|ref|XP_003127411.1| PREDICTED: napsin-A [Sus scrofa]
Length = 416
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 134/296 (45%), Positives = 185/296 (62%), Gaps = 14/296 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDILPLKNFMDAQ 84
L RI L++ L +LN R K S RLG D+ +PL N+++ Q
Sbjct: 23 LIRIPLRRVHAGLRTLNPLRAWEK---------SAEPPRLGAPSPGDKTFVPLSNYLNVQ 73
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIG+G+PPQNFSVIFDTGSSNLWVPS +C+F S+ C+ H RY S+ S+++
Sbjct: 74 YYGEIGLGTPPQNFSVIFDTGSSNLWVPSGRCHFLSLPCWLHHRYHSKASSSFHSNETKF 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YG+G ++G S+D + +G + +F EA E SL F A FDGI+GLGF +AVG
Sbjct: 134 AIQYGTGRLNGILSEDKLTIGGLTGASVIFGEALWEPSLVFAFAHFDGILGLGFPVLAVG 193
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P D++V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT Y
Sbjct: 194 GVRPPLDSLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTVPAY 253
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
WQ + + +G T +C GCAAI+D+GTSL+ GPT + + AIGG ++ E
Sbjct: 254 WQVHVERVHVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALQAAIGGIPLLMGE 308
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 81/143 (56%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE+ +V G + C A++ L T+E + + +P MGE +I C
Sbjct: 258 VERVHVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALQAAIGGIPLLMGEYLIQCS 314
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P VSF +G FNL+ + Y+++ G A +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 315 KIPTLPPVSFHLGGVWFNLTAQDYVIQITRGGASLCLSGFQALDMPPPTGPLWILGDVFL 374
Query: 487 GVYHTVFDSG----KLRIGFAEA 505
G Y VFD G R+G A A
Sbjct: 375 GSYVAVFDRGDRKSDARVGLARA 397
>gi|315440803|gb|ADU20407.1| aspartic protease 1 [Clonorchis sinensis]
Length = 425
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 127/237 (53%), Positives = 172/237 (72%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S K +T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKYST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 76/136 (55%), Gaps = 4/136 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
++++ G +C A+ L ++E + +N+ ++ P G IDC R+
Sbjct: 254 DSMNVGSMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAIKIPGGTYYIDCSRVS 311
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
T+P V F+I K+ L P YIL+ +CISGFM D+ P GPLWILGDVF+G Y
Sbjct: 312 TLPPVQFSISGKLMQLDPSDYILRMTSFGKTICISGFMGIDI--PAGPLWILGDVFIGKY 369
Query: 490 HTVFDSGKLRIGFAEA 505
+T+FD G R+GFA A
Sbjct: 370 YTIFDVGNARVGFATA 385
>gi|22651403|gb|AAL61540.1| cathepsin D precursor [Danio rerio]
Length = 398
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 202/326 (61%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVAA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
>gi|395858453|ref|XP_003801583.1| PREDICTED: napsin-A [Otolemur garnettii]
Length = 419
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/290 (46%), Positives = 183/290 (63%), Gaps = 7/290 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
S L R+ L++ +LN R R+ + S ++LG ++PL +F+D
Sbjct: 21 SGATLIRVSLRRVHSGHKTLNLLRRWREPAELSSLEASSPGNKLG-----LVPLSDFLDV 75
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS 142
QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 76 QYFGEIGLGTPPQNFSVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFQPNGTK 135
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YGSG ++G S+D + +G + VF EA E SLTF A FDGI+GLGF +AV
Sbjct: 136 FAIEYGSGRLNGILSKDKLTIGGLKGASVVFGEALWEPSLTFTFAPFDGILGLGFPILAV 195
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
P D +VEQGL+ + VFSF+LNRDPD +GGE+V GG DP H+ T+VPVT
Sbjct: 196 EGVRPPLDVLVEQGLLDKPVFSFYLNRDPDVADGGELVLGGSDPAHYIPPLTFVPVTIPA 255
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG
Sbjct: 256 YWQIHMERVKVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGG 304
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P P GE +I+C
Sbjct: 261 MERVKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLPPGEHLIECS 317
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ + Y+++ G +C+SGF D+PPP GPLWILGDVF+
Sbjct: 318 EIPRLPPVSFLLGGVWFNLTGKDYVVQITWGGVHLCLSGFQPLDMPPPAGPLWILGDVFL 377
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G R+G A A
Sbjct: 378 GAYVAVFDRGDTNTGARVGLARA 400
>gi|147906891|ref|NP_001082550.1| cathepsin D precursor [Xenopus laevis]
gi|28436104|dbj|BAC57431.1| cathepsin D [Xenopus laevis]
Length = 409
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 197/326 (60%), Gaps = 20/326 (6%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMG 56
M + S+ CL C L+ + L RI LKK RR A T K+
Sbjct: 1 MASAPVWSLLCL-----CCLVFQPGSSLVRIPLKKFTSIRR-------AMSDTDKDSLKL 48
Query: 57 GAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
+ ++ + P L N++DAQY+GEI IG+PPQ F+V+FDTGSSNLWV S
Sbjct: 49 SGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVASV 108
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
C F I+C+ H +Y S KS+TY + G I YG+GSISG+ S+D V +G++ K+Q+F
Sbjct: 109 HCSMFDIACWMHRKYDSSKSSTYVKNGTEFAIQYGTGSISGYLSKDTVTIGNLGYKEQIF 168
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EA ++ +TF+ A+FDGI+G+ + I+V P +DN++ Q LV VFSF+LNR+PD
Sbjct: 169 GEAIKQPGVTFIAAKFDGILGMAYPIISVDGVSPCFDNIMAQKLVESNVFSFYLNRNPDT 228
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
+ GGE++ GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC AIVD+GT
Sbjct: 229 QPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGT 287
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAE 319
SL+ GP V + AIG ++ E
Sbjct: 288 SLITGPVEEVAALQRAIGAIPLIRGE 313
>gi|380036056|ref|NP_001244039.1| cathepsin D1 precursor [Ictalurus punctatus]
gi|330689904|gb|AEC33270.1| cathepsin D1 [Ictalurus punctatus]
Length = 396
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 6/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+GSP Q F+V+FDTGSSNLWVPS C + I+C H +Y KS+T
Sbjct: 68 LKNYLDAQYYGEIGLGSPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHHKYNGAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDVCTIGDIAVEKQIFGEAIKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++ GG DPK + G Y
Sbjct: 188 AYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKFYTGDFHY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
V +T++ YWQ + + IG+Q T +C+GGC AIVD+GTSL+ GP V + AIG
Sbjct: 248 VNITRQAYWQIHMDGMTIGSQLT-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPL 306
Query: 313 -EGVVSAECKLVVS 325
+G +CK V S
Sbjct: 307 IQGEYMVDCKKVPS 320
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 48/93 (51%), Positives = 70/93 (75%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++P++P +SF +G + + L+ EQYILK + E+C+SGFMA D+P
Sbjct: 303 AIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLTGEQYILKESQAGREICLSGFMALDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+T+FD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTMFDRENNRVGFAKA 395
>gi|432870116|ref|XP_004071815.1| PREDICTED: cathepsin D-like [Oryzias latipes]
Length = 397
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 129/300 (43%), Positives = 186/300 (62%), Gaps = 12/300 (4%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------LKNFMDAQYFGEIG 90
R+ LH + R + M + + R+G D P L NFMDAQY+G I
Sbjct: 23 RVPLHKTRSLRRLMSDNGMSLDDLRALGMRVGSLDSSASPELPVERLTNFMDAQYYGLIS 82
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
IG+PPQNFSV+FDTGSSNLWVPS C F ++C+ H RY S+KS++Y + G I YG
Sbjct: 83 IGTPPQNFSVLFDTGSSNLWVPSIHCSFLDVACWVHRRYNSKKSSSYVKNGTEFSIRYGR 142
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GS+SGF SQD V V + V Q F EA ++ +TF +ARFDG++G+ + I+V + PV+
Sbjct: 143 GSLSGFISQDTVSVAGLSVPGQQFGEAVKQPGITFAVARFDGVLGMAYPSISVANVTPVF 202
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
D + L+ + +FS +++RD AE GGE++ GG+DP++F G YV VT+K YWQ ++
Sbjct: 203 DTAMAAKLLPQNIFSVYISRDTAAEVGGELILGGIDPQYFSGDLHYVNVTRKAYWQIQMD 262
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
+ +GNQ T +C+ GC +IVD+GTSL+ GP + ++ AIG ++ E CK + S
Sbjct: 263 RVDVGNQLT-LCKAGCQSIVDTGTSLMVGPAEEIRALHKAIGALPLLMGEYFIDCKKIPS 321
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 88/140 (62%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
++ + V G+ +C A ++V L +E + +++ +LP MGE IDC
Sbjct: 259 IQMDRVDVGNQLTLCKAGCQSIVDTGTSLMVGPAEE--IRALHKAIGALPLLMGEYFIDC 316
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+IP++P +SF IG K FNL+ E YILK + A +C+SGFMA D+PPP GPLWILGDVF
Sbjct: 317 KKIPSLPVISFNIGGKTFNLTGEDYILKESQMGASICLSGFMAMDIPPPAGPLWILGDVF 376
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+TVFD R+GFA A
Sbjct: 377 IGKYYTVFDRNADRVGFAAA 396
>gi|4099023|gb|AAD00524.1| aspartic protease [Onchocerca volvulus]
Length = 422
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/292 (46%), Positives = 182/292 (62%), Gaps = 18/292 (6%)
Query: 21 LPASSNGLRRIGLKKR-RLDLHSLNAA-----------RITRKE-RYMGGAGVSGVRHRL 67
+ A N RI L K+ + H L A +I RK ++ G +
Sbjct: 26 IAAEENHFTRIALHKQDSIHSHLLKAGSWEAYSELVNFQIQRKRIQHKYEFGSRSGKSIA 85
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHS 126
G++DE LKN+MDAQY+GEI IG+PPQNFSVIFDTGSSNLW+PS KC + I+C H+
Sbjct: 86 GETDE---VLKNYMDAQYYGEISIGTPPQNFSVIFDTGSSNLWIPSIKCPFLDIACLLHN 142
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+YK +S TY G+ EI YG GS+ GF S D V + DV V DQ F EAT E +TF++
Sbjct: 143 KYKGTESKTYKSDGRKIEIQYGRGSMKGFVSMDTVCIADVCVTDQPFAEATSEPGVTFIM 202
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ F EIAV PV++ M+ Q ++ + VF+FWL+R+P E GGEI GG+D
Sbjct: 203 AKFDGILGMAFPEIAVLGLSPVFNTMISQKVLQQPVFAFWLDRNPSDEVGGEITLGGIDT 262
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
F TY PV++ GYWQF++ I +++ G C GC AI D+GTSL+AG
Sbjct: 263 NRFVSPITYTPVSRHGYWQFKMDSIQGKDEAIG-CANGCQAIADTGTSLIAG 313
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/106 (41%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
Query: 400 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 459
K K++ + N L ++ P GE II C ++ ++P ++F I K + L Y+ +
Sbjct: 315 KVKLIKFSNILVLNMCMP-GEYIIPCYKVSSLPEITFVIAGKSYTLKGSDYVFECNNKGK 373
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SG M DLP G LWILGDVF+G Y+TVFD G +IGFA+A
Sbjct: 374 SICLSGSMGIDLPERLGELWILGDVFIGRYYTVFDVGNSQIGFAQA 419
>gi|260837471|ref|XP_002613727.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
gi|229299116|gb|EEN69736.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
Length = 392
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 127/268 (47%), Positives = 177/268 (66%), Gaps = 13/268 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKNFMD QY+G I +G+PPQ+F+VIFDTGSSNLWVPS KC +C H RY KS TY
Sbjct: 66 LKNFMDVQYYGVISLGTPPQDFNVIFDTGSSNLWVPSVKCE-GAACANHQRYNHSKSCTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ +I YGSGS+SGF SQD V +G +V+K+Q F EAT E F +FDGI+GL
Sbjct: 125 KADGRPLKITYGSGSLSGFLSQDVVMIGSIVIKNQTFGEATNEPGSAFATGKFDGILGLA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +IAV PV+D +++Q LV + VFSF+L+RDP GGE++ GG DP ++ G TY+
Sbjct: 185 YPQIAVDHIRPVFDMIMDQKLVDKNVFSFYLDRDPSRAPGGELLLGGTDPTYYTGNFTYI 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PV+ +GYWQ + + +G+Q +C GGC AIVD+GTSL+AGP+ + ++ AIG + +
Sbjct: 245 PVSYQGYWQLNMDGVHVGDQK--LCAGGCQAIVDTGTSLIAGPSEEIHKLQAAIGSQQIS 302
Query: 317 SAE----------CKLVVSQYGDLIWDL 334
+ +V Q+GD +++L
Sbjct: 303 PGQYLVDCGRLDDLPVVSFQFGDKLFNL 330
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 80/140 (57%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ + V GD +C+ A+V L ++E + + S G+ ++DC
Sbjct: 254 LNMDGVHVGDQKLCAGGCQAIVDTGTSLIAGPSEE--IHKLQAAIGSQQISPGQYLVDCG 311
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
R+ +P VSF GDK+FNL+ ++Y +K +VC+ GFM D+P PRGPLWILGDVF
Sbjct: 312 RLDDLPVVSFQFGDKLFNLTGQEYTVKEQASPTTQVCLVGFMPMDIPNPRGPLWILGDVF 371
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y+T FD G R+GFA A
Sbjct: 372 IGQYYTEFDRGNNRVGFARA 391
>gi|94732449|emb|CAK11131.1| cathepsin D [Danio rerio]
gi|94733132|emb|CAK05390.1| cathepsin D [Danio rerio]
gi|158253911|gb|AAI54316.1| Ctsd protein [Danio rerio]
Length = 398
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 202/326 (61%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVAA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + I+V PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 303 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 363 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 395
>gi|13637914|sp|P80209.2|CATD_BOVIN RecName: Full=Cathepsin D; Flags: Precursor
Length = 390
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 51 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 110
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 111 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 170
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 171 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 230
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 231 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 289
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 290 QKAIGAVPLIQGE 302
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 295 AVPLIQGEYMIPCEKVSSLPEVTVKLGGKDYALSPEDYALKVSQAETTVCLSGFMGMDIP 354
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 355 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 388
>gi|308483047|ref|XP_003103726.1| CRE-ASP-4 protein [Caenorhabditis remanei]
gi|308259744|gb|EFP03697.1| CRE-ASP-4 protein [Caenorhabditis remanei]
Length = 462
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 133/270 (49%), Positives = 177/270 (65%), Gaps = 22/270 (8%)
Query: 67 LGDSDEDILPLKNFMD----------------AQYFGEIGIGSPPQNFSVIFDTGSSNLW 110
LG+ DE L+N+MD AQYFG I IG+P QNF+VIFDTGSSNLW
Sbjct: 80 LGEIDE---LLRNYMDVRAQRLCCLKSKIIFQAQYFGTISIGTPGQNFTVIFDTGSSNLW 136
Query: 111 VPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
VPS KC ++ I+C H RY S+ S+TY E G+ I YG+GS+ GF S+D+V V V +
Sbjct: 137 VPSKKCPFYDIACMLHHRYDSKSSSTYKEDGRKMAIQYGTGSMKGFISKDSVCVAGVCAE 196
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
+Q F EAT E +TF+ A+FDGI+G+ + EIAV PV++ + EQ V VFSFWLNR
Sbjct: 197 EQPFAEATSEPGITFVAAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPSNVFSFWLNR 256
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+PD++ GGEI FGG+DP+ + TY PVT+KGYWQF++ D ++G+ G C GC AI
Sbjct: 257 NPDSDLGGEITFGGIDPRRYVEPITYTPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIA 314
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+AGP + I + IG E ++ E
Sbjct: 315 DTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 344
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + P GE +I CD+IPT+P VSF IG + F+L E Y+LK +G +C+SGF
Sbjct: 331 IQNFIGAEPLIKGEYMISCDKIPTLPPVSFVIGGQEFSLKGEDYVLKIAQGGKTICLSGF 390
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M DLP G LWILGDVF+G Y+TVFD + R+GFA+A
Sbjct: 391 MGIDLPERVGELWILGDVFIGRYYTVFDFDQNRVGFAQA 429
>gi|344269496|ref|XP_003406588.1| PREDICTED: LOW QUALITY PROTEIN: napsin-A-like [Loxodonta africana]
Length = 396
Score = 258 bits (659), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 182/295 (61%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L + D +LN+ R RK +S V GD +PL N+M+ QYFG
Sbjct: 26 LIRIPLHRVHPDPRTLNSPRAWRK----AAEHMSLVASSPGDKST-FVPLSNYMNVQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+ H R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFLSLPCWVHHRFNPNASSSFQPNGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G ++G S+D + +G + VF EA E SL F A FDGI+GLGF +AV
Sbjct: 141 YGTGRLTGILSEDKLTIGGIEGTSVVFGEALWEPSLVFTFAPFDGILGLGFPILAVDGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGLV + VFSF+LNRDP+A +GGE+V GG DP H+ ++PVT YWQ
Sbjct: 201 PPLDILVEQGLVDKPVFSFYLNRDPEAPDGGELVLGGSDPAHYIPPLNFMPVTIPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
+ + +G +C GCAAI+D+GTSL+ GP + +N AIGG +++ + +
Sbjct: 261 HMERVKVGT-GLNLCAQGCAAILDTGTSLITGPAEEIQALNSAIGGVALLTGQVR 314
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/38 (60%), Positives = 28/38 (73%)
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
+C+SGF A D+PPP GP WI GDVFMG + VFD G+
Sbjct: 328 RLCLSGFQALDVPPPMGPFWIXGDVFMGSHVAVFDRGE 365
>gi|313219527|emb|CBY30450.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 190/297 (63%), Gaps = 14/297 (4%)
Query: 63 VRHR-LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
++H+ LGD + P+ N+MDAQY+G I IG+PPQ FSVIFDTGSSNLWVPS+KC F+ +
Sbjct: 51 LQHKFLGDGHSE--PITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNV 108
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S+ S ++ G+ I YGSGS+SGF S D VEV V V+DQ F EA E
Sbjct: 109 ACFLHRKYDSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEP 168
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+GLG+ IAV P +NM+EQGL+S+ +FSF+LNR +AE+GGE+
Sbjct: 169 GITFVAAKFDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELT 228
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC---EGGCAAIVDSGTSLLA 297
GGVD F G ++ VT++ YWQ ++ + + + C E GC IVDSGTSLLA
Sbjct: 229 IGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLA 288
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL------LVSGLLPEKVCQQI 348
P + EINHAIG + E +V ++ D + D+ V L PE +I
Sbjct: 289 VPKNLAEEINHAIGAFQFANGEW-IVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKI 344
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 50/100 (50%), Positives = 64/100 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN + GE I+ C + TMP++ FT+ K++ L+PE Y++K E CISGF
Sbjct: 297 INHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKIAAEGQEQCISGF 356
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D+PPP GPLWILGDVFMG Y+T FD R+GFAE A
Sbjct: 357 MGMDIPPPAGPLWILGDVFMGKYYTAFDFDNNRVGFAELA 396
>gi|301764903|ref|XP_002917936.1| PREDICTED: napsin-A-like [Ailuropoda melanoleuca]
Length = 406
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 195/314 (62%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDI-LPL 77
PA ++ L RI L++ +LN R G G V LG D+ I +PL
Sbjct: 19 PAGAS-LIRISLRRVYPGRGTLNPLR---------GWGRPAVPPSLGAPSPGDKPIFVPL 68
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTY 136
N+M+AQY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+ S+++
Sbjct: 69 SNYMNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKASSSF 128
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F A FDG++GLG
Sbjct: 129 HPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGVLGLG 188
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AVG P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++
Sbjct: 189 FPILAVGGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFL 248
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++
Sbjct: 249 PVTIPAYWQIHMERVNVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLL 307
Query: 317 SAECKLVVSQYGDL 330
E + S+ L
Sbjct: 308 VGEYLIQCSKIPTL 321
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ NV G + C A++ L T+E + ++ + +GE +I C
Sbjct: 260 MERVNVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALHAAIGGVSLLVGEYLIQCS 316
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 317 KIPTLPPISFFLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDMPPPAGPLWILGDVFL 376
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
Y +FD G L R+G A A
Sbjct: 377 RTYVAIFDRGNLRGGARVGLARA 399
>gi|440899428|gb|ELR50729.1| Cathepsin D, partial [Bos grunniens mutus]
Length = 394
Score = 258 bits (658), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 55 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 114
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 115 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 174
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 175 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 234
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 235 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 293
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 294 QKAIGAVPLIQGE 306
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 299 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYALSPEDYALKVSQAGTTVCLSGFMGMDIP 358
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 359 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 392
>gi|226476812|emb|CAX72322.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 257 bits (657), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 190/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS+ C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSTHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|21552717|gb|AAM62283.1|AF396662_1 cathepsin D preproprotein [Silurus asotus]
Length = 395
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 202/323 (62%), Gaps = 21/323 (6%)
Query: 17 SCLLLPA----SSNGLRRIGLKKRRLDLHSL-NAARITRKERYMGGAGVS----GVRHRL 67
+CLLL +++ L RI LKK R ++ ++ R + R G + + GV ++
Sbjct: 4 ACLLLLVFIAWTADALVRIPLKKFRSIRRTMSDSGRAVEESR--GNSQNTKYNLGVTNKF 61
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHS 126
G + E LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+C H
Sbjct: 62 GPTPET---LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHH 118
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+Y KS+TY + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+
Sbjct: 119 KYNGAKSSTYVKNGTAFAIQYGSGSLSGYLSQDVCSIGDIAVEKQIFGEAIKQPGVAFIA 178
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + IAV PV+D M+ Q + VFSF+LNR+PD + GGE++ GG DP
Sbjct: 179 AKFDGILGMAYPRIAVDGVPPVFD-MMSQKKFEKNVFSFYLNRNPDTQPGGELLLGGTDP 237
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K + G YV +T++ YWQ + + IG+Q + +C GGC AIVD+GTSL+ GP V +
Sbjct: 238 KFYTGDFHYVNITRQAYWQIHMDGMSIGSQLS-LCNGGCEAIVDTGTSLITGPAAEVKAL 296
Query: 307 NHAIGG----EGVVSAECKLVVS 325
AIG +G +CK V S
Sbjct: 297 QKAIGAIPLIQGEYMVDCKKVPS 319
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 47/93 (50%), Positives = 70/93 (75%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++P++P +SF +G + + L+ EQYILK + E+C+SGFMA D+P
Sbjct: 302 AIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLTGEQYILKESQAGREICLSGFMALDIP 361
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+T+FD ++GFA+A
Sbjct: 362 PPAGPLWILGDVFIGQYYTMFDRENNQVGFAKA 394
>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
Length = 398
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 117/244 (47%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKNFMDAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C F+ I+C H +Y KS+T
Sbjct: 70 LKNFMDAQYYGEIGLGTPAQTFTVVFDTGSSNLWVPSVHCSFTDIACLLHHKYNGAKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + +++QVF EA ++ + F+ A+FDGI+G+
Sbjct: 130 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGGLSIEEQVFGEAIKQPGVAFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V P +DN++ Q V + VFSF+LNR+P++E GGE++ GG DPK++ G Y
Sbjct: 190 AYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPESEPGGELLLGGTDPKYYSGDFQY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ V+++ YWQ + + +G+Q + +C+GGC AIVD+GTSL+ GPT V + AIG +
Sbjct: 250 LNVSRQAYWQVHMDGMGVGSQLS-LCKGGCEAIVDTGTSLITGPTAEVKALQKAIGATPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IQGE 312
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 81/128 (63%), Gaps = 2/128 (1%)
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
++C A+V L T E + + + + P GE +++CD+IPTMP+++F
Sbjct: 272 SLCKGGCEAIVDTGTSLITGPTAE--VKALQKAIGATPLIQGEYMVNCDKIPTMPDITFN 329
Query: 438 IGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGK 497
+G + ++L+ EQY+LK + +C+SGFM D+P P GPLWILGDVF+G Y+TVFD
Sbjct: 330 LGGQSYSLTAEQYVLKESQAGKTICLSGFMGLDIPAPAGPLWILGDVFIGQYYTVFDRDN 389
Query: 498 LRIGFAEA 505
R+GFA++
Sbjct: 390 NRVGFAKS 397
>gi|196123668|gb|ACG70181.1| cathepsin D-like protein [Homarus americanus]
Length = 386
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/298 (46%), Positives = 183/298 (61%), Gaps = 21/298 (7%)
Query: 28 LRRIGLKK--RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI LKK + L L R+ RY G+ D++ L N+ DAQY
Sbjct: 18 LHRIPLKKIEKSRTLQDLRRTRVFLNHRYGVGS--------------DVIDLDNYEDAQY 63
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCE 144
+G I IG+P Q F VIFDTGSSNLW+PS KC+ +++ H+RY S KS+TY E G + +
Sbjct: 64 YGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLARRLHNRYDSTKSSTYIENGTAFD 123
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YGSG++ GF S DNVE+G V Q F EAT+E L F++ + DGI+G+ F EI+V
Sbjct: 124 IQYGSGALHGFLSSDNVEMGGVNAMGQTFAEATQEPGLAFIMGKLDGILGMAFTEISVMG 183
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKK 261
V+D MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+K
Sbjct: 184 IPTVFDTMVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKV 243
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GYWQ I +G+ TG C C AIVD+GTSL+AGP V EI H +GG G ++ E
Sbjct: 244 GYWQVTAEAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVKEIVHMLGGYGFIAGE 300
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 15/146 (10%)
Query: 367 VEKENVSAGD--SAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
V E + GD + C+ CE A+V L E + I + GE +I
Sbjct: 248 VTAEAIKVGDNVTGFCNPCE-AIVDTGTSLIAGPNAE--VKEIVHMLGGYGFIAGEYLIS 304
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILK-----TGEGIAEVCISGFMAFDLPPPRGPLW 479
C ++P MP +FT+ K F++ +++ TG ++CI G M + W
Sbjct: 305 CHKVPEMPEFTFTLNGKDFSIDGPDLVIEDIDPSTG---VKICIVGIMGLQMGELEA--W 359
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ ++T FD G+ RIGFA++
Sbjct: 360 ILGDPFIADWYTEFDVGQKRIGFAKS 385
>gi|226476838|emb|CAX72335.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLKNVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|432850601|ref|XP_004066828.1| PREDICTED: cathepsin D-like isoform 2 [Oryzias latipes]
Length = 398
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 195/312 (62%), Gaps = 10/312 (3%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY--FHSR 127
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACRECPPPS 121
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S KS+TY + G S I YGSGS+SG+ SQD +GD+ V++QVF EA ++ + F+ A
Sbjct: 122 YNSAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAA 181
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + I+V VPV+DN+++Q V VFSF+LNR+PD E GGE++ GG DPK
Sbjct: 182 KFDGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPK 241
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++ G YV ++++ YWQ + + +G+Q + +C+GGC AIVD+GTSLL GP+ V +
Sbjct: 242 YYSGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQ 300
Query: 308 HAIGGEGVVSAE 319
AIG ++ E
Sbjct: 301 KAIGAIPLIQGE 312
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 305 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 364
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 365 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 397
>gi|432099182|gb|ELK28547.1| Cathepsin D [Myotis davidii]
Length = 351
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 125/248 (50%), Positives = 174/248 (70%), Gaps = 11/248 (4%)
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
+AQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY E G
Sbjct: 34 EAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTYVENG 93
Query: 141 KSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDG 191
+ +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDG
Sbjct: 94 TTFDIHYGSGSLSGYLSQDTVSVPCNSGLASLGGVKVERQVFGEATKQPGITFIAAKFDG 153
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K++KG
Sbjct: 154 ILGMAYPRISVNNVVPVFDNLMQQKLVEKNIFSFYLNRDPSAQPGGELMLGGTDSKYYKG 213
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 214 PIAYLNVTRKAYWQVHMDQVDVGNGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIG 272
Query: 312 GEGVVSAE 319
++ E
Sbjct: 273 AVPLIQGE 280
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 56/94 (59%), Gaps = 17/94 (18%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P S + +G +C+SGFM D+P
Sbjct: 273 AVPLIQGEYMIPCEKVSSLPEPS-----------------QVSQGGKTICLSGFMGMDIP 315
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 316 PPAGPLWILGDVFIGRYYTVFDREENRVGLAEAA 349
>gi|299522|gb|AAB26186.1| cathepsin D {EC 3.4.23.5} [cattle, Peptide Partial, 346 aa]
Length = 346
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 66
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 67 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 126
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 127 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 186
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 187 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 245
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 246 QKAIGAVPLIQGE 258
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 251 AVPLIQGEYMIPCEKVSSLPEVTVKLGGKDYALSPEDYALKVSQAETTVCLSGFMGMDIP 310
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 311 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 344
>gi|226476854|emb|CAX72343.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 435
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 23 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 82
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 83 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 142
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 143 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 202
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 203 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 262
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 263 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 313
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 253 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 310
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 311 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 368
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 369 IGKFYTIFDMGKNRVGFAKA 388
>gi|226476810|emb|CAX72321.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|18203300|sp|Q9MZS8.1|CATD_SHEEP RecName: Full=Cathepsin D; Flags: Precursor
gi|8886526|gb|AAF80494.1|AF164143_1 cathepsin D [Ovis aries]
Length = 365
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 175/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 46 LTNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWVHHKYNSDKSST 105
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 106 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 165
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN++ Q LV + VFSF+LNRDP A+ G E++ GG D
Sbjct: 166 AKFDGILGMAYPRISVNNVLPVFDNLMRQKLVDKNVFSFFLNRDPKAQPGEELMLGGTDS 225
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G TY VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 226 KYYRGSLTYHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLMVGPVDEVREL 284
Query: 307 NHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 285 HKAIGAVPLIQGE 297
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 55/81 (67%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++ ++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGF
Sbjct: 284 LHKAIGAVPLIQGEYMIPCEKVSSLPQVTLKLGGKDYTLSPEDYTLKVSQAGTTVCLSGF 343
Query: 467 MAFDLPPPRGPLWILGDVFMG 487
M D+PPP GPLWILGDVF+G
Sbjct: 344 MGMDIPPPGGPLWILGDVFIG 364
>gi|281348334|gb|EFB23918.1| hypothetical protein PANDA_006240 [Ailuropoda melanoleuca]
Length = 379
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 190/306 (62%), Gaps = 15/306 (4%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDI-LPLKNFMDAQY 85
RI L++ +LN R G G V LG D+ I +PL N+M+AQY
Sbjct: 1 RISLRRVYPGRGTLNPLR---------GWGRPAVPPSLGAPSPGDKPIFVPLSNYMNAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+ S+++ G
Sbjct: 52 YGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKASSSFHPNGTKFA 111
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G + G S+D + +G + +F EA E SL F A FDG++GLGF +AVG
Sbjct: 112 IQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGVLGLGFPILAVGG 171
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT YW
Sbjct: 172 VRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYW 231
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
Q + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++ E +
Sbjct: 232 QIHMERVNVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQC 290
Query: 325 SQYGDL 330
S+ L
Sbjct: 291 SKIPTL 296
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 80/143 (55%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ NV G + C A++ L T+E + ++ + +GE +I C
Sbjct: 235 MERVNVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IQALHAAIGGVSLLVGEYLIQCS 291
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P +SF +G FNL+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 292 KIPTLPPISFFLGGVWFNLTAQDYVIQIARGGVRLCLSGFQALDMPPPAGPLWILGDVFL 351
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
Y +FD G L R+G A A
Sbjct: 352 RTYVAIFDRGNLRGGARVGLARA 374
>gi|226476818|emb|CAX72325.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKSGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476830|emb|CAX72331.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476902|emb|CAX72307.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|189502972|gb|ACE06867.1| unknown [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 79/141 (56%), Gaps = 4/141 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKAV 383
>gi|2102722|gb|AAB63357.1| aspartic protease precursor, partial [Schistosoma japonicum]
Length = 428
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 16 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 76 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 135
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 136 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 195
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 196 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 255
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 256 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 306
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 246 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 303
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 304 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 361
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 362 IGKFYTIFDMGKNRVGFAKA 381
>gi|2347147|gb|AAC37302.1| aspartic proteinase precursor [Schistosoma japonicum]
gi|226476814|emb|CAX72323.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476816|emb|CAX72324.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476820|emb|CAX72326.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476822|emb|CAX72327.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476824|emb|CAX72328.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476826|emb|CAX72329.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476834|emb|CAX72333.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476836|emb|CAX72334.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476840|emb|CAX72336.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476842|emb|CAX72337.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476844|emb|CAX72338.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476846|emb|CAX72339.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476852|emb|CAX72342.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476880|emb|CAX72318.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476882|emb|CAX72317.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476886|emb|CAX72315.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476890|emb|CAX72313.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476892|emb|CAX72312.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476894|emb|CAX72311.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476896|emb|CAX72310.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476898|emb|CAX72309.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476900|emb|CAX72308.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226482870|emb|CAX79402.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|313226363|emb|CBY21507.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 189/297 (63%), Gaps = 14/297 (4%)
Query: 63 VRHR-LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
++H+ LGD + P+ N+MDAQY+G I IG+PPQ FSVIFDTGSSNLWVPS+KC F+ +
Sbjct: 51 LQHKFLGDGHSE--PITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNV 108
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y S+ S ++ G+ I YGSGS+SGF S D VEV V V+DQ F EA E
Sbjct: 109 ACLLHRKYDSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEP 168
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+GLG+ IAV P +NM+EQGL+S+ +FSF+LNR +AE+GGE+
Sbjct: 169 GITFVAAKFDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELT 228
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC---EGGCAAIVDSGTSLLA 297
GGVD F G ++ VT++ YWQ ++ + + + C E GC IVDSGTSLLA
Sbjct: 229 IGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLA 288
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL------LVSGLLPEKVCQQI 348
P + EINHAIG + E +V ++ D + D+ V L PE +I
Sbjct: 289 VPKNLAEEINHAIGAFQFANGEW-IVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKI 344
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 64/100 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN + GE I+ C + TMP++ FT+ K++ L+PE Y++K E CISGF
Sbjct: 297 INHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKIAAEGQEQCISGF 356
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D+PPP GPLWILGDVFMG Y+T FD R+GFA+ A
Sbjct: 357 MGMDIPPPAGPLWILGDVFMGKYYTAFDFDNNRVGFADLA 396
>gi|226476888|emb|CAX72314.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476904|emb|CAX72306.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476856|emb|CAX72344.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|149757990|ref|XP_001490885.1| PREDICTED: napsin-A [Equus caballus]
Length = 401
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 122/252 (48%), Positives = 172/252 (68%), Gaps = 2/252 (0%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
+PL ++M+AQY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ + S
Sbjct: 63 VPLSDYMNAQYYGEIGLGTPPQNFSVLFDTGSSNLWVPSVRCHFFSLPCWFHHRFNPKAS 122
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+++ G I YG+G ++G S+D + +G + VF EA E SL F +A FDGI+
Sbjct: 123 SSFKPNGTKFAIQYGTGRLNGILSEDKLTIGGITGASVVFGEALSEPSLIFTIAHFDGIL 182
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GLGF +AV P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+
Sbjct: 183 GLGFPILAVEGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPSHYIPPL 242
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T+VPVT YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG
Sbjct: 243 TFVPVTIPAYWQIHMKRVKVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGI 301
Query: 314 GVVSAECKLVVS 325
+++ E L S
Sbjct: 302 PLLAGEYLLQCS 313
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+++ V G + C A++ L T+E + ++ +P GE ++ C
Sbjct: 257 MKRVKVGTGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYLLQCS 313
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VS +G F L+ + Y+++ G +C+SGF A D+PPP GPLWILGDVF+
Sbjct: 314 TIPRLPPVSLLLGGTWFTLTAQDYVIQIVRGGVRLCLSGFAALDMPPPTGPLWILGDVFL 373
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G + VFD G + R+G A A
Sbjct: 374 GSFVAVFDRGDMNGGARVGLARA 396
>gi|226476832|emb|CAX72332.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DLP R LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVFKLGSEICLTGFIGMDLP--RKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|115279794|gb|ABI85390.1| cathepsin D [Hippoglossus hippoglossus]
Length = 399
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 189/298 (63%), Gaps = 10/298 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLK---NFMDAQYFGEIGIG 92
R+ LH + R + M + + G SD + LP++ NFMDAQY+GEIGIG
Sbjct: 27 RVPLHKTRSLRRLMTDNGMSLQELQALASSTGASDSVLSLPVERPTNFMDAQYYGEIGIG 86
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQ F+V+FDTGSSNLW+PS C F+++C+ H RY S+KS+TY + G I YG GS
Sbjct: 87 TPPQPFTVLFDTGSSNLWIPSIHCNLFNVACWLHHRYNSKKSSTYVKNGTEFSIQYGRGS 146
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
++G+ S+D V + + V Q F EA ++ +TF +ARFDG++G+G+ I+V PV+D+
Sbjct: 147 LTGYISEDTVSLAGLSVPGQQFAEAVKQPGITFAVARFDGVLGMGYPSISVDKVKPVFDS 206
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
+ L+ + VFSF+++RD A GGE++ GG DP+++ G YV VT+K YWQ ++ +
Sbjct: 207 AMAAKLLPQNVFSFYISRDASATVGGELILGGTDPQYYTGDLHYVNVTRKAYWQIKMDGV 266
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
+G Q T +C+ GC AIVD+GTSL+ GP V ++ AIG ++ E CK + S
Sbjct: 267 EVGTQLT-LCKAGCQAIVDTGTSLIVGPREEVRALHRAIGALPLIMGEYLIDCKKIPS 323
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/99 (54%), Positives = 72/99 (72%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
++ +LP MGE +IDC +IP++P VSF IG K+ NL+ E YI+K + + +C+SGF
Sbjct: 300 LHRAIGALPLIMGEYLIDCKKIPSLPVVSFNIGGKMLNLTGEDYIMKEFQKGSSICLSGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
MA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 360 MAMDIPPPAGPLWILGDVFIGKYYTVFDRNADRLGFAPA 398
>gi|296417651|ref|XP_002838466.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634405|emb|CAZ82657.1| unnamed protein product [Tuber melanosporum]
Length = 396
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 31/327 (9%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVR----- 64
+ A+ LL ++ G+ R LKK +L H +N ++YMG +R
Sbjct: 6 IFAAGSLLGSAMAGVHRAPLKKVPLTEQLSHHDINTQMRALGQKYMG------IRPEKID 59
Query: 65 ------HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+ D +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C
Sbjct: 60 EEMFKTQEIKTDDGHPVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCG- 118
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY HS+Y S S+TY G S EI YGSGS+SGF SQDN+E+G++ +KDQ F EAT
Sbjct: 119 SIACYLHSKYDSSTSSTYRPNGTSFEIRYGSGSLSGFVSQDNIEIGNLKIKDQTFAEATS 178
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F RFDGI+GLG+ I+V VP + MV+QGL+ E VF+F+L D ++ E
Sbjct: 179 EPGLAFAFGRFDGILGLGYDSISVNHIVPPFYQMVDQGLLDEPVFAFYLG---DKDDQSE 235
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGG+D H++GK +PV +K YW+ E I G +ST E AIVD+GTSL+A
Sbjct: 236 AIFGGIDKAHYQGKLIKLPVRRKAYWEVEFEAITFG-KSTAQFE-NTGAIVDTGTSLIAL 293
Query: 299 PTPVVTEINHAIGGE----GVVSAECK 321
P+ + +N IG + G S EC+
Sbjct: 294 PSTLAELLNKEIGAKKGFNGQYSVECE 320
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C++ ++P+++FT+ F ++ YIL+ + CIS FM D P P GPL
Sbjct: 313 GQYSVECEKRDSLPDLTFTLTGHDFTITAYDYILE----VQGSCISAFMGMDFPEPIGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V++ G IG A++
Sbjct: 369 AILGDAFLRRYYSVYNLGDNTIGLAKS 395
>gi|185132376|ref|NP_001118183.1| cathepsin D precursor [Oncorhynchus mykiss]
gi|1858020|gb|AAC60301.1| cathepsin D [Oncorhynchus mykiss]
Length = 398
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 117/244 (47%), Positives = 170/244 (69%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKNFMDAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C F+ I+C H +Y KS+T
Sbjct: 70 LKNFMDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSFTDIACLLHHKYNGAKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + ++DQ F EA ++ + F+ A+FDGI+G+
Sbjct: 130 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGGLSIEDQGFGEAIKQPGVAFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V P +DN++ Q V + VFSF+LNR+PD+E GGE++ GG DPK++ G Y
Sbjct: 190 AYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPDSEPGGELLLGGTDPKYYSGDFQY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ V+++ YWQ + + +G+Q + +C+GGC AIVD+GTSL+ GP V + AIG +
Sbjct: 250 LDVSRQAYWQIHMDGMGVGSQLS-LCKGGCEAIVDTGTSLITGPAAEVKALQRAIGATPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IQGE 312
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+ P GE +++CD+IPTMP ++F +G + ++L+ EQY+LK + +C+SGFM D+P
Sbjct: 305 ATPLIQGEYMVNCDKIPTMPVITFNLGGQSYSLTAEQYVLKESQAGKTICLSGFMGLDIP 364
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 365 APAGPLWILGDVFIGQYYTVFDRDNNRVGFAKS 397
>gi|226476906|emb|CAX72305.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I +G+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITVGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|226476876|emb|CAX72320.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q + EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTYGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|74198157|dbj|BAE35255.1| unnamed protein product [Mus musculus]
Length = 335
Score = 255 bits (651), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 121/248 (48%), Positives = 176/248 (70%), Gaps = 11/248 (4%)
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G
Sbjct: 1 DAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLLARFDG 191
S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+ A+FDG
Sbjct: 61 TSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDG 120
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G
Sbjct: 121 ILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHG 180
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 181 ELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIG 239
Query: 312 GEGVVSAE 319
++ E
Sbjct: 240 AMPLIQGE 247
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 240 AMPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 299
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 300 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 332
>gi|209154266|gb|ACI33365.1| Cathepsin D precursor [Salmo salar]
Length = 402
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 200/326 (61%), Gaps = 13/326 (3%)
Query: 11 CLWVL-ASCLLLPASSNGLRRIGLKKRR-----LDLHSLNAARITRKERYMGGAGVSGVR 64
CL +L + LL A S+ + RI L K R + + ++ ++ + GAG + V
Sbjct: 3 CLKILYITIALLIAHSSAIIRIPLHKTRSMRRLMSDNGMSFEQLQDMAKTGCGAG-ANVP 61
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCY 123
+ L NFMDAQY+G I IG+PPQ+F+V+FDTGSSNLWVPS C F ++C+
Sbjct: 62 INAPSPKVPVERLTNFMDAQYYGVISIGTPPQDFTVLFDTGSSNLWVPSIHCSFLDVACW 121
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H RY S+KS+TY + G I YG GS+SGF S D V + + V Q F EA ++ +T
Sbjct: 122 LHHRYNSKKSSTYVQNGTKFSIQYGRGSLSGFISGDTVSLAGMQVTGQQFGEAVKQPGIT 181
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F +ARFDG++G+G+ I+V + PV+D + L+ + +FSF+++RDP A GGE++ GG
Sbjct: 182 FAVARFDGVLGMGYPTISVNNITPVFDTAMAAKLLPQNIFSFYISRDPLAAVGGELMLGG 241
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP ++ G YV VT+K YWQ E+ ++ +GNQ T +C+ GC AIVD+GTSL+ GP V
Sbjct: 242 TDPLYYTGDLHYVNVTRKAYWQIEMSNVEVGNQLT-LCKAGCQAIVDTGTSLIIGPAEEV 300
Query: 304 TEINHAIGGEGVVSAE----CKLVVS 325
++ AIG ++ E CK V S
Sbjct: 301 RVLHKAIGALPLLMGEYWIDCKKVPS 326
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 60/140 (42%), Positives = 89/140 (63%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+E NV G+ +C A A+V L +E + +++ +LP MGE IDC
Sbjct: 264 IEMSNVEVGNQLTLCKAGCQAIVDTGTSLIIGPAEE--VRVLHKAIGALPLLMGEYWIDC 321
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
++P++P ++F +G K+FNL+ + YILK + ++C+SGFMA D+PPP GPLWILGDVF
Sbjct: 322 KKVPSLPVIAFNLGGKMFNLTGDDYILKESQMGLKICLSGFMAMDIPPPAGPLWILGDVF 381
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G Y++VFD R+GFA A
Sbjct: 382 IGRYYSVFDRDADRMGFAPA 401
>gi|307203870|gb|EFN82801.1| Lysosomal aspartic protease [Harpegnathos saltator]
Length = 374
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 188/292 (64%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + R +E G + VR G + PL N++DAQY+G I IG+PPQ
Sbjct: 11 RIQLHKTESIRRILQEV---GTDLHQVR-LYGVTTPTPEPLSNYLDAQYYGVITIGTPPQ 66
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F VIFDTGSSNLWVPS KC + I+C H +Y SRKS+TY + G I YGSGS+SGF
Sbjct: 67 EFRVIFDTGSSNLWVPSKKCSITNIACLLHHKYDSRKSSTYQKNGTEFAIRYGSGSLSGF 126
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + V+ Q F EA +E L F+ A+FDGI+G+G+ IAV PV+ NMV+Q
Sbjct: 127 LSSDVVNIGGLNVQGQTFAEAVKEPGLVFVAAKFDGILGMGYSTIAVDGVTPVFYNMVKQ 186
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV + VFSF+LNRDPDA+ GGE++ GG D H++G+ TYVPV++KGYWQF + I +
Sbjct: 187 DLVPKAVFSFYLNRDPDAKVGGEMLLGGSDSDHYEGEFTYVPVSRKGYWQFAMDSIQVHG 246
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
+ +C GC AI D+GTSL+AGP V IN IG +++ E C L+
Sbjct: 247 HT--LCASGCQAIADTGTSLIAGPVEEVAVINSLIGATTIIAGEAIVDCDLI 296
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 67/102 (65%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ IN L + GE+I+DCD I +P + IG K+F+LS + YIL+ + +C+
Sbjct: 272 VAVINSLIGATTIIAGEAIVDCDLIEKLPGIDVIIGGKMFSLSGKDYILRVKQFGKTICM 331
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
SGFM D+PPP GPLWILGDVF+G ++T FD R+GFA A
Sbjct: 332 SGFMGMDIPPPNGPLWILGDVFIGRFYTEFDMENDRVGFAVA 373
>gi|12697815|dbj|BAB21620.1| cathepsin D [Bos taurus]
Length = 386
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 175/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 47 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 106
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 107 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 166
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+F GI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 167 AKFGGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 226
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 227 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 285
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 286 QKAIGAVPLIQGE 298
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 65/94 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V+ +G K + SPE Y LK + VC+SGFM D+P
Sbjct: 291 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYAXSPEDYALKVSQAGTTVCLSGFMGMDIP 350
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 351 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 384
>gi|66815097|ref|XP_641645.1| cathepsin D [Dictyostelium discoideum AX4]
gi|74960832|sp|O76856.1|CATD_DICDI RecName: Full=Cathepsin D; AltName: Full=Ddp44; Flags: Precursor
gi|3288145|emb|CAA76563.1| preprocathepsin D [Dictyostelium discoideum]
gi|6010025|emb|CAB57223.1| cathepsin D [Dictyostelium discoideum]
gi|60469656|gb|EAL67644.1| cathepsin D [Dictyostelium discoideum AX4]
Length = 383
Score = 254 bits (649), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 169/255 (66%), Gaps = 8/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCYFHSRYKSRKS 133
+P+ +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC ++ +C H++Y S S
Sbjct: 53 IPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGAS 112
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G I YGSG++SGF SQD+V VG + VKDQ+F EAT E + F A+FDGI+
Sbjct: 113 STYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGIL 172
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GL F+ I+V PV+ NM+ QGLVS +FSFWL+R P A GGE+ FG +D + G
Sbjct: 173 GLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGA-NGGELSFGSIDNTKYTGDI 231
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-- 311
TYVP+T + YW+F + D I QS G C C AI DSGTSL+AGP +T +N +G
Sbjct: 232 TYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGAV 291
Query: 312 ---GEGVVSAECKLV 323
GEGV S +C ++
Sbjct: 292 ILNGEGVFS-DCSVI 305
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 56/88 (63%), Gaps = 3/88 (3%)
Query: 419 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE + DC I T+PNV+ T+ + F L+P++Y+L+ E C+SGFM +L G
Sbjct: 295 GEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTECLSGFMGIEL--NMGN 352
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ Y+TVFD G ++GFA A
Sbjct: 353 FWILGDVFISAYYTVFDFGNKQVGFATA 380
>gi|226476848|emb|CAX72340.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 188/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
Q +V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QRVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 79/140 (56%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++T+FD GK R+GFA+A
Sbjct: 363 IGKFYTIFDMGKNRVGFAKA 382
>gi|4927648|gb|AAD33219.1| cathepsin D [Hynobius leechii]
Length = 397
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 183/295 (62%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQY 85
+ RI L K R H+L A K A V++ + P LKN++DAQY
Sbjct: 20 MVRIPLTKFRSIRHTLTEAGGDIKNLV---ATSDQVKYNCFPKTQQPTPEILKNYLDAQY 76
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEI IG+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S S+TY + G
Sbjct: 77 YGEICIGTPPQCFTVVFDTGSSNLWVPSVHCSLLDIACLVHPKYDSSSSSTYVKNGTEFS 136
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+GS+SG+ QD V VG + V QVF EA ++ + F+ A+FDGI+G+ + I+V
Sbjct: 137 IQYGTGSLSGYLRQDTVSVGGLGVLKQVFGEAIKQPGVAFIAAKFDGILGMAYPRISVDG 196
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
V+DN++ Q LV + VFSF+LNR+PD GGE++ GG DP ++ G TY+ VT K YW
Sbjct: 197 VTTVFDNIMSQKLVEKNVFSFYLNRNPDTRPGGELLLGGTDPNYYTGDFTYLNVTPKAYW 256
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
Q + + +G+Q T +C+GGC AIVD+GTSL+ GP+ VT + AIG ++ E
Sbjct: 257 QIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLIIGPSAEVTALQKAIGAIPLIQGE 310
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I CD++P++P ++F +G K F +S E Y+LK + +C+SGFM D+P
Sbjct: 303 AIPLIQGEYMIPCDKVPSLPVITFNLGGKAFTVSGEDYVLKVSQAGHTICLSGFMGMDIP 362
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP G LW LGDVF+G Y+TVFD R+G A+A
Sbjct: 363 PPSG-LWTLGDVFIGPYYTVFDRENDRVGLAKA 394
>gi|256072903|ref|XP_002572773.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043053|emb|CCD78465.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 430
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 188/292 (64%), Gaps = 8/292 (2%)
Query: 35 KRRLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
+ R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+
Sbjct: 17 RPRIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGT 76
Query: 94 PPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+
Sbjct: 77 PPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSL 136
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
SGF S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM
Sbjct: 137 SGFLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNM 196
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
++QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ +
Sbjct: 197 IQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLT 256
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
I + + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 257 ISDMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 306
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 249 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 306
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 307 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 364
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++TVFD GK R+GFA+A
Sbjct: 365 IGKFYTVFDMGKNRVGFAKA 384
>gi|74199699|dbj|BAE41511.1| unnamed protein product [Mus musculus]
Length = 419
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 167/250 (66%), Gaps = 2/250 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFATFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGGEGVVSAE 319
IGG ++ +
Sbjct: 298 IGGYPFLNGQ 307
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 486 MGVYHTVFDSGK----LRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|6978973|dbj|BAA90785.1| aspartic proteinase family member similar to renin [Mus musculus]
Length = 419
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 168/250 (67%), Gaps = 2/250 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D+MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDSMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGGEGVVSAE 319
IGG ++ +
Sbjct: 298 IGGYPFLNGQ 307
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ PT+P VS +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSSHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 486 MGVYHTVFDSGK----LRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|24417300|gb|AAN60260.1| unknown [Arabidopsis thaliana]
Length = 168
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 118/168 (70%), Positives = 141/168 (83%)
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S D V VGD+VVKDQ F+EAT+E +TF++A+ DGI+GLGF+EI+VG A PVW NM++QG
Sbjct: 1 SNDAVTVGDLVVKDQEFMEATKELGITFVVAKXDGILGLGFQEISVGKAAPVWYNMLKQG 60
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
L+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+LIG
Sbjct: 61 LIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGA 120
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV
Sbjct: 121 PTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVV 168
>gi|334562337|gb|AEG79714.1| cathepsin D [Apostichopus japonicus]
Length = 372
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 198/307 (64%), Gaps = 8/307 (2%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
LLLP +S L+RI L K L R ++ + G+G+ ++ + + + LK
Sbjct: 9 LLLPIAS-ALQRIPLFKVESARQRLIRTRSSKSDLEAIGSGL-----QVKEVNGSPIILK 62
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYT 137
+++DAQY+G I +G+PPQ+F V+FDTGSSNLWVPSS C + I+C F +Y S+TY
Sbjct: 63 DYLDAQYYGPITLGTPPQDFVVVFDTGSSNLWVPSSTCSWKDIACSFTKKYDHSVSSTYV 122
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ I YGSG+ +GF S D + +G+V VK Q+F EAT E L++++A+FDGI+G+G+
Sbjct: 123 ANDTAFAIPYGSGNCAGFLSYDTLMMGNVAVKSQLFGEATAEPGLSWIMAQFDGILGMGY 182
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V +P +DN++ + L+S +FSF+L++DP A GGE++ GG D K++ G TYV
Sbjct: 183 PTISVDGVIPPFDNIMNRKLISNNIFSFYLSKDPSAAVGGELLLGGTDSKYYTGNFTYVK 242
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
V+KKGYWQF + + IG + G C G C+AI D+GTSL+AGPT + ++N IG ++
Sbjct: 243 VSKKGYWQFAMDKVSIGGKDAGYCTGKNCSAICDTGTSLIAGPTADINDLNKKIGAIPLI 302
Query: 317 SAECKLV 323
E ++
Sbjct: 303 KGEAIIL 309
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 54/80 (67%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ ++P GE+II C+ IP++P++SF + F L P+ Y+LK E +CISGF
Sbjct: 292 LNKKIGAIPLIKGEAIILCNTIPSLPDISFQLNGHDFTLKPDDYVLKVSEANETICISGF 351
Query: 467 MAFDLPPPRGPLWILGDVFM 486
+ DLPP GPLWILGDVF+
Sbjct: 352 LGIDLPPEIGPLWILGDVFI 371
>gi|318977821|ref|NP_001187407.1| cathepsin D precursor [Ictalurus punctatus]
gi|308322929|gb|ADO28602.1| cathepsin D [Ictalurus punctatus]
Length = 398
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 115/244 (47%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L NFMDAQY+G I IG+PPQ F+V+FDTGSSNLWVPS C +F ++C+ H RY S+KS+T
Sbjct: 70 LSNFMDAQYYGVISIGTPPQEFTVLFDTGSSNLWVPSIHCAFFDLACWLHHRYDSKKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG GS+SGFFSQD V + + V++Q+F EA ++ + F LA+FDG++G+
Sbjct: 130 YVQNGTQFSIQYGRGSLSGFFSQDTVTLAGLGVQNQMFAEAVKQPGVVFALAKFDGVLGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ ++VG P++D+++ L+ + +FSF++NRDP AE GGE++ GG D ++F G Y
Sbjct: 190 AYPILSVGKVRPIFDSIMAGKLLQQNIFSFYINRDPKAEVGGELMLGGCDKQYFDGDLHY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ ++ + +G+ T +C+ GC AIVDSGTS++ GP + +N AIG +
Sbjct: 250 LNVTRKAYWQIKMDTVEVGSTLT-LCKDGCQAIVDSGTSMITGPVEEIRALNKAIGAVPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IMGE 312
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/99 (51%), Positives = 70/99 (70%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ ++P MGE I C +IP++P VSF +G K+FNL+ Y+ K+ + VC+SGF
Sbjct: 299 LNKAIGAVPLIMGEYWISCSKIPSLPVVSFHLGGKVFNLTGGDYVYKSTKMGVSVCLSGF 358
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
MA D+PPP GPLWILGDVFMG ++TVFD ++GFA A
Sbjct: 359 MALDIPPPAGPLWILGDVFMGRFYTVFDRDNNQVGFAPA 397
>gi|256072901|ref|XP_002572772.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043052|emb|CCD78464.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 428
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 187/290 (64%), Gaps = 8/290 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+PP
Sbjct: 17 RIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM++
Sbjct: 137 FLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNMIQ 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ + I
Sbjct: 197 QGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
+ + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 257 DMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 304
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 247 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 305 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 362
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++TVFD GK R+GFA+A
Sbjct: 363 IGKFYTVFDMGKNRVGFAKA 382
>gi|1778026|gb|AAB63442.1| aspartic proteinase [Schistosoma mansoni]
Length = 427
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 187/290 (64%), Gaps = 8/290 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+PP
Sbjct: 16 RIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+SG
Sbjct: 76 QTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSLSG 135
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM++
Sbjct: 136 FLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNMIQ 195
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ + I
Sbjct: 196 QGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTIS 255
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
+ + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 256 DMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 303
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + + ++ D C +A+ + T E + IN + P G + C
Sbjct: 246 LFKMDKLTISDMTACPDGCLAIADTGTSMIAGPTDE--IQKINAKLGATRLPGGIYTVSC 303
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
I +P + F I K L P Y+LK + +E+C++GFM DL P+ LWILGD+F
Sbjct: 304 GNINNLPTIDFVINGKAMTLEPTDYLLKVSKMGSEICLTGFMGLDL--PKRKLWILGDIF 361
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
+G ++TVFD GK R+GFA+A
Sbjct: 362 IGKFYTVFDMGKNRVGFAKA 381
>gi|320163747|gb|EFW40646.1| cathepsin D [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/227 (52%), Positives = 154/227 (67%), Gaps = 3/227 (1%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I P N+ DAQY+G+I IG+P Q F+V+FDTGS+NLWVPS KC + I+C H++Y S K
Sbjct: 51 IEPQHNYQDAQYYGDITIGTPGQKFTVVFDTGSANLWVPSKKCPVTDIACQLHNKYDSTK 110
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G S I YGSG +SGF S D+V + V Q F EAT E L+F+ A+FDGI
Sbjct: 111 SSTYKVNGTSFAIQYGSGKLSGFLSTDSVSFAGLTVTGQTFAEATAEPGLSFVAAKFDGI 170
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +IAV PVW+N + QG+ + +F FWLNRDP A +GGEI FG +D H+ G
Sbjct: 171 LGLGFPQIAVDGVTPVWNNAILQGVAAAPLFGFWLNRDPTAADGGEIDFGAIDDSHYTGP 230
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
Y PVT++GYWQF LG + + ++ C GC AI DSGTSLL GP
Sbjct: 231 ILYTPVTRQGYWQFALGAVTVSGKN--YCASGCQAIADSGTSLLVGP 275
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 3/92 (3%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE +DC +I ++PN+ FTI + F L+ Y+LK G C+ G M+ DL
Sbjct: 291 NIAGEYTLDCSKIASLPNLVFTISGQQFALTGADYVLKITSGSTTECLLGLMSMDL-SAE 349
Query: 476 GPLWILGDVFMGVYHTVFD--SGKLRIGFAEA 505
G WILGDVF+G ++TVFD R+GFA A
Sbjct: 350 GIQWILGDVFIGKFYTVFDFNGNAPRVGFATA 381
>gi|195997419|ref|XP_002108578.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589354|gb|EDV29376.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 383
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 197/325 (60%), Gaps = 21/325 (6%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+RS+ L VLA L A+ L+RI L K + +L A IT + M A S
Sbjct: 1 MRSI--LLVLALVLSCAAA---LQRIKLYKMKTIRQTLLDAGITAE---MLKAKYSKFSA 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
GD L N++DAQY+G I IG+PPQNF ++FDTGSS+LWVPS+KC + +C H
Sbjct: 53 SRGDES-----LSNYLDAQYYGPITIGTPPQNFKILFDTGSSDLWVPSTKCNGNAACESH 107
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+Y KS+TY G+ I YGSG+ SGF S+D V V + V++Q F EA E L+F+
Sbjct: 108 DKYDHTKSSTYVSNGQQWSIQYGSGAASGFLSEDVVTVAGISVRNQTFGEAVGEPGLSFV 167
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+G+++++ PV+ NMV+QGLV + VFSF+LNR GGE++ GG D
Sbjct: 168 AAKFDGILGMGYKQLSAERTNPVFVNMVQQGLVRKPVFSFYLNRKQGGAVGGELILGGSD 227
Query: 246 PKHFKGKHTYVPVTKKGYWQFEL--GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
P ++ G+ YVP++++ YWQF + G + G T VC GGC AI D+GT+L+ GP V
Sbjct: 228 PNYYSGQFNYVPLSRESYWQFAMDGGKVATG---TTVCNGGCQAIADTGTTLIVGPPEDV 284
Query: 304 TEINHAIGGE---GVVSAECKLVVS 325
I AIG + G + +C + S
Sbjct: 285 QRIQQAIGAQNAGGQYTVDCSTISS 309
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 57/90 (63%), Gaps = 2/90 (2%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ +DC I ++P ++FTI + L+ EQYI + + E CISGF +
Sbjct: 295 NAGGQYTVDCSTISSLPTITFTINGVNYPLTGEQYIWQVTQQGQEQCISGFQGGVIG--T 352
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GP WILGDVF+GVY+T FD G+ R+GFA+A
Sbjct: 353 GPQWILGDVFIGVYYTEFDMGQNRLGFAKA 382
>gi|6680552|ref|NP_032463.1| napsin-A precursor [Mus musculus]
gi|6016430|sp|O09043.1|NAPSA_MOUSE RecName: Full=Napsin-A; AltName: Full=KDAP-1; AltName:
Full=Kidney-derived aspartic protease-like protein;
Short=KAP; Flags: Precursor
gi|1906810|dbj|BAA19004.1| kidney-derived aspartic protease-like protein [Mus musculus]
gi|7340352|emb|CAB82907.1| Napsin [Mus musculus]
gi|15928694|gb|AAH14813.1| Napsin A aspartic peptidase [Mus musculus]
gi|74220342|dbj|BAE31398.1| unnamed protein product [Mus musculus]
Length = 419
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 167/250 (66%), Gaps = 2/250 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGGEGVVSAE 319
IGG ++ +
Sbjct: 298 IGGYPFLNGQ 307
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 372
Query: 486 MGVYHTVFDSGK----LRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|148690790|gb|EDL22737.1| napsin A aspartic peptidase, isoform CRA_a [Mus musculus]
Length = 393
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 120/257 (46%), Positives = 168/257 (65%), Gaps = 2/257 (0%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISC 122
R + +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C
Sbjct: 27 RTSTSGGNPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 86
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
+FH R+ + S+++ G I YG+G +SG SQDN+ +G + F EA E SL
Sbjct: 87 WFHHRFNPKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSL 146
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F LA FDGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V G
Sbjct: 147 IFALAHFDGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLG 206
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DP H+ T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+
Sbjct: 207 GSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEE 265
Query: 303 VTEINHAIGGEGVVSAE 319
+ +N AIGG ++ +
Sbjct: 266 IRALNKAIGGYPFLNGQ 282
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 230 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 287
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P P GPLWILGDVF
Sbjct: 288 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPKPAGPLWILGDVF 347
Query: 486 MGVYHTVFDSGK----LRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 348 LGPYVAVFDRGDKNVGPRVGLARA 371
>gi|12832561|dbj|BAB22158.1| unnamed protein product [Mus musculus]
Length = 419
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 164/243 (67%), Gaps = 2/243 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGG 312
IGG
Sbjct: 298 IGG 300
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
V E+V G ++C+ A++ L ++E + +N+ P G+ I C
Sbjct: 255 VHMESVKVGTGLSLCAQGCSAILDTGTSLITGPSEE--IRALNKAIGGYPFLNGQYFIQC 312
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ PT+P VSF +G FNL+ + Y++K + +C+ GF A D+P GPLWILGDVF
Sbjct: 313 SKTPTLPPVSFHLGGVWFNLTGQDYVIKILQSDVGLCLLGFQALDIPNAAGPLWILGDVF 372
Query: 486 MGVYHTVFDSGK----LRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 373 LGPYVAVFDRGDKNVGPRVGLARA 396
>gi|432850603|ref|XP_004066829.1| PREDICTED: cathepsin D-like isoform 3 [Oryzias latipes]
Length = 416
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 196/330 (59%), Gaps = 28/330 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C +Y
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYN 121
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV--------------------GDVVVK 169
S KS+TY + G S I YGSGS+SG+ SQD V GD+ V+
Sbjct: 122 SAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTVSVGGAVTPPTTHSVETAKAIGDISVE 181
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
+QVF EA ++ + F+ A+FDGI+G+ + I+V VPV+DN+++Q V VFSF+LNR
Sbjct: 182 NQVFGEAIKQPGVAFIAAKFDGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNR 241
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+PD E GGE++ GG DPK++ G YV ++++ YWQ + + +G+Q + +C+GGC AIV
Sbjct: 242 NPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIV 300
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSLL GP+ V + AIG ++ E
Sbjct: 301 DTGTSLLTGPSAEVKALQKAIGAIPLIQGE 330
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 68/93 (73%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I+CD+IP++P ++F IG + + L+ +QY+LK + +C+SGFM D+P
Sbjct: 323 AIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLTGDQYVLKESQAGKTICLSGFMGLDIP 382
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P GPLWILGDVF+G Y+TVFD R+GFA++
Sbjct: 383 APAGPLWILGDVFIGQYYTVFDRDSNRVGFAKS 415
>gi|74220823|dbj|BAE31380.1| unnamed protein product [Mus musculus]
Length = 404
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 178/253 (70%), Gaps = 17/253 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+S + SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSRYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YW + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYW------LEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 303
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 304 QKAIGAVPLIQGE 316
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 65/93 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V +G K + L P++YILK +G +C+SGFM D+P
Sbjct: 309 AVPLIQGEYMIPCEKVSSLPTVYLKLGGKNYELHPDKYILKVSQGGKTICLSGFMGMDIP 368
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 369 PPSGPLWILGDVFIGSYYTVFDRDNNRVGFANA 401
>gi|449666857|ref|XP_002161366.2| PREDICTED: lysosomal aspartic protease-like [Hydra magnipapillata]
Length = 387
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 195/306 (63%), Gaps = 14/306 (4%)
Query: 17 SCLLLPAS--SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
+ LLL AS + + RI +KK R T +E+ + G + + G+S E
Sbjct: 7 TILLLCASLIFSEIHRIKIKKLE------TTVRRTLREQGFDFQKL-GFQSKWGESPE-- 57
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
L+N+MDAQY+G+I +G+PPQ F V+FDTGSSNLWVPSS C + I+C H++Y KS
Sbjct: 58 -VLRNYMDAQYYGDISLGTPPQPFKVVFDTGSSNLWVPSSHCGWTDIACLTHNKYHGDKS 116
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY + G I YGSGS SG+ S D ++V D+ VK+Q+F EAT E + F+ A+FDG++
Sbjct: 117 STYVQNGTKFSIQYGSGSCSGYQSIDTLQVADISVKNQMFGEATSEPGIAFVAAKFDGLL 176
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+G+ +I+V VP + NMV+Q LV + VFSF+L+R+ + GGE++ GGVD F G
Sbjct: 177 GMGYSQISVNGVVPPFYNMVDQKLVEDAVFSFYLDRNVNDSTGGELLLGGVDSSKFVGDI 236
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
TY PVT +GYWQF++ +++ N C GC AI D+GTSL+AGPT V ++N IG
Sbjct: 237 TYTPVTVEGYWQFKMDKVVV-NGEPMFCASGCNAIADTGTSLIAGPTEEVNKLNQMIGAT 295
Query: 314 GVVSAE 319
+V E
Sbjct: 296 PIVGGE 301
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 79/134 (58%), Gaps = 2/134 (1%)
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
V G+ C++ A+ L T+E ++ +N++ + P GE IIDC ++P++
Sbjct: 255 VVNGEPMFCASGCNAIADTGTSLIAGPTEE--VNKLNQMIGATPIVGGEYIIDCAKVPSL 312
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
P + F IG K F L Y+LK CISGF+A D+PPPRGPLWILGDVF+G Y+T
Sbjct: 313 PALEFWIGGKQFVLKGSDYVLKVSTLGQTECISGFIAIDVPPPRGPLWILGDVFIGPYYT 372
Query: 492 VFDSGKLRIGFAEA 505
VFD R+GFA
Sbjct: 373 VFDLKNNRVGFANT 386
>gi|18858489|ref|NP_571785.1| cathepsin D [Danio rerio]
gi|12053845|emb|CAC20111.1| cathepsin D enzyme [Danio rerio]
Length = 399
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 202/329 (61%), Gaps = 21/329 (6%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLR-RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+R FC C LLP S+ RI LKK R +L+ + + +E + + ++
Sbjct: 1 MRIRFC------CSLLPFSARRRDCRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLK 51
Query: 65 HRLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
+ LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I
Sbjct: 52 YNLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 112 ACLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+ A+FDGI+G+ + I+V PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 172 GVAFIAAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELL 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG--TSLLAG 298
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+G TSL+ G
Sbjct: 232 LGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSTSLITG 290
Query: 299 PTPVVTEINHAIGG----EGVVSAECKLV 323
P V + AIG +G +CK V
Sbjct: 291 PAAEVKALQKAIGAIPLMQGEYMVDCKKV 319
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 73/93 (78%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE ++DC ++PT+P +SF++G K+++L+ EQYILK +G ++C+SGFM D+P
Sbjct: 304 AIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLTGEQYILKESQGGHDICLSGFMGLDIP 363
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 364 PPAGPLWILGDVFIGQYYTVFDRENNRVGFAKA 396
>gi|1585311|prf||2124395A Asp protease
Length = 380
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 187/293 (63%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYHGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FS +FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSAVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 76/138 (55%), Gaps = 4/138 (2%)
Query: 366 VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ + +N++ D ++C+ A+ + T E + IN+ + P G + C
Sbjct: 247 LFKMDNLTISDLSICTDGCQAIADTGTSMIAGPTDE--VKQINQKLGATHLPGGIYTVSC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
D I +P++ F I K L P YI+K + +E+C++GF+ DL PR LWILGDVF
Sbjct: 305 DVINNLPSIDFVINGKHMTLEPTDYIMKVSKLGSEICLTGFIGMDL--PRKKLWILGDVF 362
Query: 486 MGVYHTVFDSGKLRIGFA 503
+G ++T+FD GK R+GF
Sbjct: 363 IGKFYTIFDMGKNRVGFG 380
>gi|315274255|gb|ADU03675.1| putative cathepsin D3 [Ixodes ricinus]
Length = 398
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 120/238 (50%), Positives = 162/238 (68%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQ F V+FDTGSSNLWVPS +C + +I+C H +Y +S
Sbjct: 65 PLSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSR 124
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G + + YG+GS++GF S D V + + V +Q F EA E LTF+ A+FDGI+G
Sbjct: 125 SYRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF IAV A V+DNMV Q LV VFSF+LNR+ + GGEI FGG D + + G +
Sbjct: 185 LGFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDIS 244
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV+ KGYWQF + +I++ N S +C GC AI D+GTSL+AGP+ + ++ IG
Sbjct: 245 YVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 302
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ +L +LP G+ + C+ I +P++ F IG + + L+ Y+LK + +C+SGF
Sbjct: 296 LQKLIGALPFSHGQYTVRCEDIHKLPDIKFHIGGQEYVLTGSDYVLKITQFGRMICLSGF 355
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+P PRGPLWILGDVF+G Y+TVFD G R+GFA+A
Sbjct: 356 VGLDIPEPRGPLWILGDVFIGRYYTVFDYGASRVGFAKA 394
>gi|23237804|dbj|BAC16371.1| aspartic proteinase 5 [Glycine max]
Length = 175
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 116/176 (65%), Positives = 147/176 (83%), Gaps = 3/176 (1%)
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWV 390
DLLVSG+ P+ VC Q+GLC F + S GI+ V EKE +S D+A+C++C+M VVW+
Sbjct: 1 DLLVSGVRPDDVCSQVGLC-FKRTKSESNGIEMVTEKEQRELSTKDTALCTSCQMLVVWI 59
Query: 391 QNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 450
QNQLKQK+TKE V +Y+N+LC+SLP+P GES++DC+ I +PN++FT+GDK F L+PEQY
Sbjct: 60 QNQLKQKKTKEIVFNYVNQLCESLPSPNGESVVDCNSIYGLPNITFTVGDKPFTLTPEQY 119
Query: 451 ILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
ILKTGEGIAEVC+SGF+AFD+PPPRGPLWILGDVFM VYHTVFD G LR+GFA+AA
Sbjct: 120 ILKTGEGIAEVCLSGFIAFDIPPPRGPLWILGDVFMRVYHTVFDYGNLRVGFAKAA 175
>gi|241813645|ref|XP_002416518.1| aspartic protease, putative [Ixodes scapularis]
gi|215510982|gb|EEC20435.1| aspartic protease, putative [Ixodes scapularis]
Length = 392
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 122/251 (48%), Positives = 166/251 (66%), Gaps = 5/251 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQ F V+FDTGSSNLWVPS +C + +I+C H +Y +S
Sbjct: 59 PLSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSR 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G + + YG+GS++GF S D V + + V +Q F EA E LTF+ A+FDGI+G
Sbjct: 119 SYRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF IAV A V+DNMV Q LV VFSF+LNR+ + GGEI FGG D + + G +
Sbjct: 179 LGFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDIS 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV+ KGYWQF + +I++ N S +C GC AI D+GTSL+AGP+ + ++ IG
Sbjct: 239 YVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGALP 298
Query: 313 --EGVVSAECK 321
G + C+
Sbjct: 299 FSHGQYTVRCQ 309
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 65/99 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ +L +LP G+ + C I +P++ F IG + + L+ Y+LK + +C+SGF
Sbjct: 290 LQKLIGALPFSHGQYTVRCQDIHQLPDIKFHIGGQEYVLTGSDYVLKITQFGRMICLSGF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+P PRGPLWILGDVF+G Y+TVFD G R+GFA+A
Sbjct: 350 VGLDIPEPRGPLWILGDVFIGRYYTVFDYGASRVGFAKA 388
>gi|328869722|gb|EGG18099.1| cathepsin D [Dictyostelium fasciculatum]
Length = 476
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 186/304 (61%), Gaps = 24/304 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
V+A ++P N + R L++ +L +K+ + AG +
Sbjct: 103 VVAQAYVVPLGFNKVTRQALRRIPQNL---------QKKYMLAAAGTT------------ 141
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
+PL +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC + I+C H++Y S K
Sbjct: 142 -IPLSDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCPITVIACDLHNKYDSTK 200
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ + G I YGSG++SGF S+D V+VG + VK+Q+F EAT E + F A+FDGI
Sbjct: 201 SSSFVQNGTDFSIQYGSGAMSGFVSEDTVQVGSLSVKNQLFAEATAEPGIAFDFAKFDGI 260
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL F+ I+V + PV+ NM++QGLV++ +F+FWL++ GGE+ FG +D F G
Sbjct: 261 LGLAFQSISVNNIPPVFYNMMDQGLVAQPLFAFWLSKTASPTNGGELSFGSIDNSKFTGA 320
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
TYVP+T + YW+F + D+ S G C + GC AI DSGTSLLAGPT + IN +G
Sbjct: 321 ITYVPLTNRTYWEFSMDDVQYDGNSLGYCGKTGCRAIADSGTSLLAGPTEQIEAINTKLG 380
Query: 312 GEGV 315
V
Sbjct: 381 AVSV 384
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 57/88 (64%), Gaps = 1/88 (1%)
Query: 419 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE+I C+ I ++P+V + F L+P YIL+ E C+SGFM D+P P GP
Sbjct: 386 GEAIFPSCNVISSLPDVQIVLAGTTFVLTPTDYILQITEFGKTTCLSGFMGIDIPAPIGP 445
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
L+ILGDVF+ Y+T+FD G R+GFA+A
Sbjct: 446 LYILGDVFISTYYTIFDFGNSRVGFAQA 473
>gi|403299328|ref|XP_003940441.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 421
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 188/314 (59%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ + + +LN R G G +LG +PL
Sbjct: 22 PAGAT-LIRIPLRRVQPERRTLNLLR---------GWGEPAKLPKLGAPSPGDKPAFVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
N+ D QYFGEIG+G PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 72 SNYRDVQYFGEIGLGMPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YGSG + G S+D + +G + +F EA E SL F A FDGI+GLG
Sbjct: 132 QPNGTKFAIQYGSGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+V
Sbjct: 192 FPVLAVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEKPDGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GTSL+ GPT + +N AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVKVGSGLT-LCARGCAAILDTGTSLITGPTEEIQALNAAIGGFPLL 310
Query: 317 SAECKLVVSQYGDL 330
+ E ++ S+ L
Sbjct: 311 AGEYIILCSEIPKL 324
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V +G + C A++ L T+E + +N P GE II C
Sbjct: 263 MERVKVGSGLTLCARGCA-AILDTGTSLITGPTEE--IQALNAAIGGFPLLAGEYIILCS 319
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 487 GVYHTVFDSG----KLRIGFAEA 505
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDRKSSARVGLARA 402
>gi|13928928|ref|NP_113858.1| napsin A aspartic peptidase precursor [Rattus norvegicus]
gi|6689137|emb|CAB65392.1| napsin [Rattus norvegicus]
gi|51260062|gb|AAH78790.1| Napsin A aspartic peptidase [Rattus norvegicus]
gi|149056039|gb|EDM07470.1| napsin A aspartic peptidase, isoform CRA_a [Rattus norvegicus]
Length = 420
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 118/258 (45%), Positives = 172/258 (66%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL FM+ QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+ +
Sbjct: 63 FVPLSKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKA 122
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G +SG S+DN+ +G + F EA E SL F LARFDGI
Sbjct: 123 SSSFRPNGTKFAIQYGTGRLSGILSRDNLTIGGIHNVSVTFGEALWEPSLVFALARFDGI 182
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AVG P D +VEQ L+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 183 LGLGFPTLAVGGVQPPLDALVEQRLLEKPVFSFYLNRDSEGSDGGELVLGGSDPDHYVPP 242
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PVT YWQ + + +G +C GC AI+D+GTSL+ GP+ + +N A+GG
Sbjct: 243 LTFIPVTIPAYWQVHMQSVKVGT-GLNLCAQGCGAILDTGTSLITGPSEEIRALNKAVGG 301
Query: 313 EGVVSAECKLVVSQYGDL 330
+++ + + S+ +L
Sbjct: 302 FPLLTGQYLIQCSKIPEL 319
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/95 (44%), Positives = 59/95 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ P G+ +I C +IP +P VSF++G FNL+ + Y++K + +C+ GF
Sbjct: 295 LNKAVGGFPLLTGQYLIQCSKIPELPTVSFSLGGVWFNLTGQDYVIKILQSDVGLCLLGF 354
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIG 501
A D+P P GPLWILGDVF+G Y VFD G IG
Sbjct: 355 QALDIPKPEGPLWILGDVFLGSYVAVFDRGDKNIG 389
>gi|330800100|ref|XP_003288077.1| preprocathepsin D [Dictyostelium purpureum]
gi|325081901|gb|EGC35401.1| preprocathepsin D [Dictyostelium purpureum]
Length = 386
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 188/318 (59%), Gaps = 25/318 (7%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
LL + VLA+ L +P S R +K R+ + N I GG +
Sbjct: 4 LLALILTFIVLANALTVPLSFTPASRQAIK--RIPQNVANKYTIAAN----GGTNI---- 53
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCY 123
P+ +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC ++ +C
Sbjct: 54 -----------PISDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCSITVPACD 102
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS++Y G S I YGSG++SGF SQD V VG + VK+Q+F EAT E +
Sbjct: 103 LHEKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVTVGSLSVKNQLFAEATAEPGIA 162
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F A+FDGI+GL F+ I+V D PV+ NM++QGLV + +FSFWL++ P GGE+ FG
Sbjct: 163 FDFAKFDGILGLAFQSISVNDIPPVFYNMIDQGLVGQNLFSFWLSKTP-GSNGGELSFGS 221
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPV 302
+D + G TYVP+T YW+F++ D IG QS G C GC AI DSGTSL+AGP
Sbjct: 222 IDSSKYTGPITYVPLTNTTYWEFKMDDFAIGGQSAGFCGSQGCPAIADSGTSLIAGPIDF 281
Query: 303 VTEINHAIGGEGVVSAEC 320
+T +N +G V+S E
Sbjct: 282 ITALNQKLGAV-VISGEA 298
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 46/88 (52%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 419 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE+I DC I T+PNV+ T+ + FNL+P+ Y+L+ E C+SGFM +LPP GP
Sbjct: 296 GEAIFPDCSVINTLPNVTVTLAGRQFNLTPKDYVLQITEFGKTECLSGFMGIELPPQVGP 355
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ Y+TVFD G ++GFA A
Sbjct: 356 LWILGDVFISTYYTVFDFGNSQVGFATA 383
>gi|332241362|ref|XP_003269849.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 190/310 (61%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G GD +PL N+
Sbjct: 22 PAGAT-LIRIPLHRVQPERRTLNLMRGWREPAELPKLGAPSP----GDK-PTFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
+I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF +
Sbjct: 136 TKFDIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPVDVLVEQGLLDKPIFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCARGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCARGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTLNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 487 GVYHTVFDSG----KLRIGFAEA 505
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDRKSSARVGLARA 402
>gi|355703800|gb|EHH30291.1| hypothetical protein EGK_10923 [Macaca mulatta]
Length = 423
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 195/324 (60%), Gaps = 17/324 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PARAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + +++G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVMVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL-IWDLLVSGL 339
+ E + S+ L LL+ G+
Sbjct: 311 AGEYIIRCSEIPKLPTVSLLIGGV 334
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
++E +P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF
Sbjct: 300 LHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 505
A D+ P P+WILGDVF+G Y VFD G + R+G A A
Sbjct: 360 RALDIALPPVPVWILGDVFLGAYVAVFDRGDMKSGARVGLARA 402
>gi|348511299|ref|XP_003443182.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 397
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 127/303 (41%), Positives = 184/303 (60%), Gaps = 3/303 (0%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
+LL A + R+ L K R L L + + A +G + L
Sbjct: 12 VLLLAQCTAILRVPLYKTR-SLRRLMSDNGMSVDELRALAKSTGSPDSAPSPQLPVERLT 70
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYT 137
NF+D+QY+G I IG+PPQNF+V+FDTGSSNLWVPS C I+C+FH RY S+KS+TY
Sbjct: 71 NFLDSQYYGIISIGTPPQNFTVLFDTGSSNLWVPSIHCSLLDIACWFHHRYNSKKSSTYA 130
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G I YG+GS+SGF S D V + + V Q F EA ++ +TF ARFDG++G+G+
Sbjct: 131 KNGTEFSIQYGTGSLSGFISGDTVTIAGLSVPGQQFGEAVKQPGITFAFARFDGVLGMGY 190
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V + +PV+D + L+ + +FSF+++RDP A GGE++ GG DP+++ G YV
Sbjct: 191 PSISVDNVMPVFDTAMAAKLLPQNIFSFYISRDPTAAVGGELMLGGTDPQYYTGDLHYVN 250
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+K +WQ + + +GNQ T +C+ GC AIVD+GTSL+ GP V + AIG ++
Sbjct: 251 VTRKAFWQIGMNRVDVGNQLT-LCKAGCQAIVDTGTSLIVGPKEEVKALQKAIGAIPLLM 309
Query: 318 AEC 320
E
Sbjct: 310 GEA 312
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 55/106 (51%), Positives = 77/106 (72%), Gaps = 1/106 (0%)
Query: 400 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 459
KE+V + + + ++P MGE++I+C +IPT+P +SF IG K FNL+ E Y++K +
Sbjct: 292 KEEVKA-LQKAIGAIPLLMGEALIECTKIPTLPVISFDIGGKTFNLTGEDYVVKESQMGV 350
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SGFMA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 351 TICLSGFMAMDIPPPTGPLWILGDVFIGKYYTVFDRDADRVGFATA 396
>gi|327278613|ref|XP_003224055.1| PREDICTED: cathepsin E-like [Anolis carolinensis]
Length = 396
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 194/315 (61%), Gaps = 15/315 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDS- 70
VL +C +L S GL+R+ LK+ + SL R E ++ V +++ S
Sbjct: 5 VLITCFILFVS--GLQRVPLKRHK----SLRNILRERGELSKFWKSYKVDNIQYTQDCSA 58
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
E PL N+ D +YFGEI IG+PPQNF+V+FDTGSSNLWVPS C S +C HSR+
Sbjct: 59 FQEANEPLLNYFDVEYFGEISIGTPPQNFTVLFDTGSSNLWVPSVYCA-SKACVEHSRFH 117
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+S+TY E+G S I+YG+GS++G D+V V + V +Q F E+ E TFL + F
Sbjct: 118 PTESSTYNEVGTSFSIHYGTGSLTGIIGMDSVTVEGITVTNQQFAESVSEPGKTFLDSEF 177
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + +AV PV+DNM+ Q LV +FS +L+R+PD+ GGE++FGG DP F
Sbjct: 178 DGILGLAYPSLAVDGVTPVFDNMMAQNLVELPLFSVYLSRNPDSSIGGELIFGGYDPSLF 237
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G ++PV+KKGYWQ +L +I +G + C GC AIVD+GTSL+ GP+ + ++ +
Sbjct: 238 SGNLNWIPVSKKGYWQIQLDNIQVGG-TIAFCAEGCQAIVDTGTSLITGPSDDIKQMQNL 296
Query: 310 IGGE---GVVSAECK 321
IG + G + EC
Sbjct: 297 IGAQPVDGEYAVECS 311
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 55/85 (64%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FT+ ++L+PE Y L ++C SGF A ++ P GPL
Sbjct: 304 GEYAVECSNLSMMPSVTFTLNGIPYSLTPEAYTLMENSDGMQLCSSGFQALNMQTPEGPL 363
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 364 WILGDVFIGQYYSVFDRGNDRVGLA 388
>gi|47213062|emb|CAF91576.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 118/255 (46%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 57 LTNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHRKYNSAKSST 116
Query: 136 YTEIGKSCEINYGSGSISGFFSQDN-----------VEVGDVVVKDQVFIEATREGSLTF 184
Y + G + I YGSGS+SG+ SQD +VG + V+ Q+F EA ++ + F
Sbjct: 117 YVKNGTAFAIRYGSGSLSGYLSQDTCTVRACDPCPFFQVGGLAVEKQLFGEAIKQPGIAF 176
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+G+ I+V PV+DN++ Q V + VFSF+LNR+P + GGE++ GG
Sbjct: 177 IAAKFDGILGMGYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPQTQPGGELLLGGT 236
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+++ G +YV VT++ YWQ + ++ +G+Q T +C+ GC AIVD+GTSLL GP+ V
Sbjct: 237 DPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEVR 295
Query: 305 EINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 296 SLQKAIGALPLIQGE 310
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 81/125 (64%), Gaps = 3/125 (2%)
Query: 381 SACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGD 440
S CE A+V L ++E + + + +LP GE ++ CD+IPT+P ++F IG
Sbjct: 274 SGCE-AIVDTGTSLLTGPSEE--VRSLQKAIGALPLIQGEYMVSCDKIPTLPVITFNIGG 330
Query: 441 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
K ++L+ +QY+LK + +C+SGFM D+P P GPLWILGDVF+G Y+TVFD R+
Sbjct: 331 KPYSLTGDQYVLKVSQAGKTICLSGFMGLDIPAPAGPLWILGDVFIGQYYTVFDRDNNRV 390
Query: 501 GFAEA 505
GFA+A
Sbjct: 391 GFAKA 395
>gi|402906426|ref|XP_003916003.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 423
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 194/324 (59%), Gaps = 17/324 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PAGAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVTVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL-IWDLLVSGL 339
+ E + S+ L LL+ G+
Sbjct: 311 AGEYIIRCSEIPKLPTVSLLIGGV 334
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V+ G +C+ A++ + T+E + ++E +P GE II C
Sbjct: 261 IHMERVTVGSGLTLCARGCAAILDTGTPVIIGPTEE--IRALHEAIGGIPLLAGEYIIRC 318
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 319 SEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIALPPVPVWILGDVF 378
Query: 486 MGVYHTVFDSGKL----RIGFAEA 505
+G Y VFD G + R+G A A
Sbjct: 379 LGAYVAVFDRGDMKSGARVGLARA 402
>gi|156039363|ref|XP_001586789.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980]
gi|154697555|gb|EDN97293.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 396
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 193/321 (60%), Gaps = 17/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR----KERYMGGAGVSGVRHRLGD 69
VLA+ LL + S G+ ++ LKK L A T ++YMG S +
Sbjct: 5 VLAAASLLGSVSAGVHKMPLKKVSLSEQLATANMDTHVKHLGQKYMGVRPQSHASEMFKE 64
Query: 70 SD------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHLEGGDHTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSEC-GSIACY 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L
Sbjct: 124 LHTKYDSSSSSTYEKNGTSFEIRYGSGSLSGFTSRDVMSIGDLEIKDQVFAEATEEPGLA 183
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + NM+ QGL+ E VF+F+L D + E +FGG
Sbjct: 184 FAFGRFDGILGLGYDTISVNQIVPPFYNMINQGLLDEPVFAFYLGDSKDEGDESEAIFGG 243
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ H++GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P+ +
Sbjct: 244 VNKDHYEGKITEIPLRRKAYWEVDLDAISFGDAKADLDNTGV--ILDTGTSLIAVPSTLA 301
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 302 ELLNKEIGAKKGWNGQYSVDC 322
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 51/83 (61%), Gaps = 4/83 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++FT+ F ++P YIL+ + + CIS M D P P GPL
Sbjct: 316 GQYSVDCAKRDSLPDLTFTLSGNDFAITPYDYILE----VQDSCISTIMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIG 501
ILGD F+ Y++V+D GK +G
Sbjct: 372 AILGDAFLRRYYSVYDLGKNTVG 394
>gi|354497676|ref|XP_003510945.1| PREDICTED: napsin-A [Cricetulus griseus]
Length = 569
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/258 (45%), Positives = 168/258 (65%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL FM+ QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ +
Sbjct: 62 FVPLYKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSVRCHFFSLPCWFHRRFNPKA 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YGSG ++G SQDN+ +G++ F EA E S+ F LA FDGI
Sbjct: 122 SSSFRPNGTKLAIQYGSGQLTGILSQDNLTIGEIRGVSVTFGEALWESSMVFTLAHFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AV P D MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+
Sbjct: 182 LGLGFPSLAVDGVQPPLDAMVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPP 241
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PVT YWQ + + +G +C GC I+D+GTSL+ GP+ + +N AIGG
Sbjct: 242 LTFIPVTIPAYWQVHMESVNVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGG 300
Query: 313 EGVVSAECKLVVSQYGDL 330
++ + + S+ +L
Sbjct: 301 LPFLAGQYFIQCSKTPEL 318
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 74/144 (51%), Gaps = 8/144 (5%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E NV G S C + ++ L ++E + +N+ LP G+ I C
Sbjct: 257 MESVNVGTGLSLCAQGCGV-ILDTGTSLITGPSEE--IHALNKAIGGLPFLAGQYFIQCS 313
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ P +P VSF +G FNL+ + Y++K +C+ GF A D+P P GPLWILGDVF
Sbjct: 314 KTPELPTVSFRLGGVWFNLTGQDYVIKILNSDDVGLCLLGFQALDIPKPAGPLWILGDVF 373
Query: 486 MGVYHTVFDSG----KLRIGFAEA 505
+G Y VFD G R+G A A
Sbjct: 374 LGPYVAVFDRGVKTVGPRVGLARA 397
>gi|351702766|gb|EHB05685.1| Napsin-A [Heterocephalus glaber]
Length = 417
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 167/258 (64%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
++PL FM+ QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH RY +
Sbjct: 64 LVPLSKFMNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFFSVPCWFHHRYDPKA 123
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G +SG S+D + +G + F EA E SL F A FDGI
Sbjct: 124 SSSFRPNGTKFAIQYGTGRLSGILSEDKLNIGGISNASVTFGEALWEPSLVFAFASFDGI 183
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
GLGF +AV P D +VEQGL+ + +FSF+LNRD +GGE+V GG DP H+
Sbjct: 184 FGLGFPTLAVDRVPPPLDVLVEQGLLEKPIFSFYLNRDFAGADGGELVLGGADPAHYIPP 243
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+VPVT YWQ + + +G T +C GCAAIVD+GTSL+ GP+ + ++ AIGG
Sbjct: 244 LTFVPVTVPAYWQIHMERVKVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRALHRAIGG 302
Query: 313 EGVVSAECKLVVSQYGDL 330
++ E ++ S+ L
Sbjct: 303 LPWLAGEHFILCSKIPTL 320
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 78/143 (54%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A+V L ++E + ++ LP GE I C
Sbjct: 259 MERVKVGTGLTLCAQGCA-AIVDTGTSLITGPSEE--IRALHRAIGGLPWLAGEHFILCS 315
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P VSF +G FNL+ + Y+++ +G C+SGF A D+PPP GPLWILGDVF+
Sbjct: 316 KIPTLPPVSFLLGGVWFNLTAQDYVIQISQGGFRFCLSGFHALDMPPPAGPLWILGDVFL 375
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G R+G A A
Sbjct: 376 GAYVAVFDRGSTSSGARVGLARA 398
>gi|315440805|gb|ADU20408.1| aspartic protease 2 [Clonorchis sinensis]
Length = 385
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 7/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD+QY+GEI IG+PPQ F V+FDTGSSNLWVPS++C ++ +C H RY KS+T
Sbjct: 60 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 119
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y GK I YG+GS+SG S D V V V+DQ F EA E L F++A+FDGI+GL
Sbjct: 120 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 179
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IAV + VPV+DNM+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +
Sbjct: 180 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 239
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VP++ + YWQ +L I + S +C GC AIVD+GT+L+ GPT V ++N A+G
Sbjct: 240 VPLSSETYWQIDLDGIQV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 297
Query: 313 EGVVSA-ECKLVVS 325
EG +S EC + +
Sbjct: 298 EGGLSVLECSQIYT 311
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ +NE ++ G S+++C +I T+P + F+I + L P Y+ + +C
Sbjct: 285 VNQLNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTDYVQEMSYRGGTICT 344
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
SGF + P P WILGDVF+G Y+TVFD + R+GFA +
Sbjct: 345 SGFSGMETP--GAPTWILGDVFIGAYYTVFDKEQRRVGFARST 385
>gi|321461134|gb|EFX72169.1| hypothetical protein DAPPUDRAFT_189045 [Daphnia pulex]
Length = 391
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 191/322 (59%), Gaps = 25/322 (7%)
Query: 6 LRSVFCLWVL----ASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ +F L+ L A+ LL S L R+ + + L + R + RY G ++
Sbjct: 1 MKKIFVLFALVGLSAAAKLL---SIPLERLPTARSSMSLVEQSMER--TRNRYSSGKILT 55
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SI 120
ED L+NF D+QYFG I +G+PPQ+F+VIFDTGS+NLWVPSS+C ++
Sbjct: 56 ----------ED---LRNFQDSQYFGPITLGTPPQDFTVIFDTGSANLWVPSSQCSEENL 102
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H++Y S S+TY G I YG+G++ GF S D + V V DQ F EA E
Sbjct: 103 ACKVHNQYNSSLSDTYKPNGTEFSIQYGTGAMDGFLSTDILGVAGAQVMDQTFAEAVNEP 162
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEI 239
+TF+ RFDGI+G+ + IAV VP++ NM+ QGLV E VFSFWLNRD D GGEI
Sbjct: 163 GVTFVAGRFDGILGMSYPNIAVQGVVPMFQNMMAQGLVDEPVFSFWLNRDASDPVNGGEI 222
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI-GNQSTGVCEGGCAAIVDSGTSLLAG 298
VFGG +P H+ G+ Y+PVT+K YWQF ++I G C+GGC I D+GTS++AG
Sbjct: 223 VFGGTNPDHYVGEINYIPVTRKAYWQFRADGLMIEGIPEYPFCDGGCEMISDTGTSVIAG 282
Query: 299 PTPVVTEINHAIGGEGVVSAEC 320
P V +N +G +++ E
Sbjct: 283 PAEEVNLLNRLLGAINIINGEA 304
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/104 (43%), Positives = 65/104 (62%), Gaps = 2/104 (1%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV-- 461
++ +N L ++ GE++I C RIP +P ++ TI + L E YILK +
Sbjct: 287 VNLLNRLLGAINIINGEAVISCLRIPYLPPITITISGLPYTLEGEDYILKVDDPTTNTST 346
Query: 462 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
CISGF+ D+PPP GPLWILGDVF+G +++++D G RIG A A
Sbjct: 347 CISGFLGLDIPPPSGPLWILGDVFIGKFYSIYDFGMDRIGLATA 390
>gi|262073106|ref|NP_001159993.1| cathepsin D precursor [Bos taurus]
gi|296471411|tpg|DAA13526.1| TPA: cathepsin D [Bos taurus]
Length = 410
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 121/253 (47%), Positives = 174/253 (68%), Gaps = 13/253 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MD Y+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 73 LKNYMD--YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 48/94 (51%), Positives = 66/94 (70%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ ++P V+ +G K + LSPE Y LK + VC+SGFM D+P
Sbjct: 315 AVPLIQGEYMIPCEKVSSLPQVTVKLGGKDYALSPEDYALKVSQAGTTVCLSGFMGMDIP 374
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEAA
Sbjct: 375 PPGGPLWILGDVFIGRYYTVFDRDQNRVGLAEAA 408
>gi|402906424|ref|XP_003916002.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 421
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G +L +PL N+
Sbjct: 22 PARAT-LIRIPLHRVQPERRTLNLLRGWREPAEVPKLGAPSPGDKL-----TFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E L F A FDGI+GLGF +
Sbjct: 136 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCVPGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 51/135 (37%), Positives = 73/135 (54%), Gaps = 3/135 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCVPGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G+ FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGEVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 487 GVYHTVFDSGKLRIG 501
G Y VFD G + G
Sbjct: 380 GTYVAVFDRGDTKSG 394
>gi|426244096|ref|XP_004015868.1| PREDICTED: napsin-A [Ovis aries]
Length = 443
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 184/304 (60%), Gaps = 11/304 (3%)
Query: 30 RIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
RI L++ +LN R K E GA G + +PL N+++AQY+G
Sbjct: 28 RIPLRRVNTGFKALNPLRGWEKLAEAPRLGAPSPG-------NKSLFVPLSNYLNAQYYG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G I
Sbjct: 81 EIGLGTPPQNFSVVFDTGSSNLWVPSVRCRFFSLPCWLHHRFNPKASSSFRFNGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 141 YGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGGVQ 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +WQ
Sbjct: 201 PPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + +G T +C GCAAI+D+GTSL+ GPT + + AIG ++ E + S+
Sbjct: 261 HMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIGAVPLLMGEYYIKCSK 319
Query: 327 YGDL 330
L
Sbjct: 320 IPTL 323
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 76/135 (56%), Gaps = 3/135 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + + + ++P MGE I C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGEYYIKCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKLRIG 501
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|358333762|dbj|GAA52230.1| cathepsin D [Clonorchis sinensis]
Length = 408
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 7/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD+QY+GEI IG+PPQ F V+FDTGSSNLWVPS++C ++ +C H RY KS+T
Sbjct: 83 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 142
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y GK I YG+GS+SG S D V V V+DQ F EA E L F++A+FDGI+GL
Sbjct: 143 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 202
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IAV + VPV+DNM+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +
Sbjct: 203 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 262
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VP++ + YWQ +L I + S +C GC AIVD+GT+L+ GPT V ++N A+G
Sbjct: 263 VPLSSETYWQIDLDGIQV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 320
Query: 313 EGVVSA-ECKLVVS 325
EG +S EC + +
Sbjct: 321 EGGLSVLECSQIYT 334
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ +NE ++ G S+++C +I T+P + F+I + L P Y+ + +C
Sbjct: 308 VNQLNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTDYVQEMSYRGGTICT 367
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
SGF + P P WILGDVF+G Y+TVFD + R+GFA +
Sbjct: 368 SGFSGMETP--GAPTWILGDVFIGAYYTVFDKEQRRVGFARST 408
>gi|17389633|gb|AAH17842.1| Napsin A aspartic peptidase [Homo sapiens]
gi|123982255|gb|ABM82919.1| napsin A aspartic peptidase [synthetic construct]
gi|123997015|gb|ABM86109.1| napsin A aspartic peptidase [synthetic construct]
Length = 420
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 186/309 (60%), Gaps = 9/309 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFMD 82
S L RI L + + +LN R R+ + G D+ I +PL N+ D
Sbjct: 22 SGATLIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYRD 75
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 VQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANGT 135
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF ++
Sbjct: 136 KFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPILS 195
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 VEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVP 255
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 AYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYI 314
Query: 322 LVVSQYGDL 330
++ S+ L
Sbjct: 315 ILCSEIPKL 323
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|45360583|ref|NP_988964.1| cathepsin D precursor [Xenopus (Silurana) tropicalis]
gi|38174445|gb|AAH61433.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 398
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 198/316 (62%), Gaps = 17/316 (5%)
Query: 12 LWVLAS--CLLLPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVR 64
+W L + C++ P SS L RI LKK R + +A +++ E S
Sbjct: 6 VWALLALCCVMQPGSS--LVRIPLKKFTSIRRAMSETDQDALKLSGNE---AATKYSAFL 60
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCY 123
+ + E +L N++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C F ++C+
Sbjct: 61 NSKNPTPETLL---NYLDAQYYGEIGIGTPPQPFTVVFDTGSSNLWVPSIHCSFWDLACW 117
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS TY G I YGSGS++G+ S+D V +GD+ V Q F EA ++ +T
Sbjct: 118 LHHKYDSSKSTTYINNGTEFAIQYGSGSLTGYLSKDTVTIGDLAVNGQFFAEAIKQPGIT 177
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+G+ +I+V PV+D+++EQ LV +FSF+LNR+PD GGE++ GG
Sbjct: 178 FVAAKFDGILGMGYPKISVDGVPPVFDDIMEQKLVDSNIFSFYLNRNPDTLPGGELLLGG 237
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP + G Y+ VT+K YWQ + + +G++ + +C+ GC AIVD+GTSL+ GP V
Sbjct: 238 TDPAFYTGDFNYMNVTRKAYWQIHMDQLSVGDRLS-LCKDGCEAIVDTGTSLITGPVEEV 296
Query: 304 TEINHAIGGEGVVSAE 319
T + AIG ++ E
Sbjct: 297 TALQRAIGAIPLICGE 312
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 88/141 (62%), Gaps = 5/141 (3%)
Query: 367 VEKENVSAGDS-AVCS-ACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIID 424
+ + +S GD ++C CE A+V L +E ++ + ++P GE +I
Sbjct: 260 IHMDQLSVGDRLSLCKDGCE-AIVDTGTSLITGPVEE--VTALQRAIGAIPLICGEYMIL 316
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
CD IP++P +SFT G + ++L+ EQY+LK + VC+SGF+ D+PPP GPLWI+GDV
Sbjct: 317 CDSIPSLPVISFTFGGRAYSLTGEQYVLKISKAGRTVCLSGFLGLDIPPPAGPLWIIGDV 376
Query: 485 FMGVYHTVFDSGKLRIGFAEA 505
F+G Y+TVFD R+GFA+A
Sbjct: 377 FIGQYYTVFDRANDRVGFAKA 397
>gi|397485038|ref|XP_003813670.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKTIFVPLSNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLEKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|114678580|ref|XP_524345.2| PREDICTED: napsin-A isoform 4 [Pan troglodytes]
Length = 420
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKTIFVPLSNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPTGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|355756059|gb|EHH59806.1| hypothetical protein EGM_10003 [Macaca fascicularis]
Length = 423
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 194/324 (59%), Gaps = 17/324 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PARAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++V P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVTVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL-IWDLLVSGL 339
+ E + S+ L LL+ G+
Sbjct: 311 AGEYIIRCSEIPKLPTVSLLIGGV 334
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 77/144 (53%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V+ G +C+ A++ + T+E + ++E +P GE II C
Sbjct: 261 IHMERVTVGSGLTLCARGCAAILDTGTPVIIGPTEE--IRALHEAIGGIPLLAGEYIIRC 318
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 319 SEIPKLPTVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIALPPVPVWILGDVF 378
Query: 486 MGVYHTVFDSGKL----RIGFAEA 505
+G Y VFD G + R+G A A
Sbjct: 379 LGAYVAVFDRGDMKSGARVGLARA 402
>gi|198421979|ref|XP_002130758.1| PREDICTED: similar to Ctsd protein [Ciona intestinalis]
Length = 385
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 121/246 (49%), Positives = 161/246 (65%), Gaps = 2/246 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS+ C + +C H++Y S S+
Sbjct: 54 PLTNYMDAQYFGEISIGTPEQTFTVIFDTGSSNLWVPSASCPSTNYACMTHNKYNSAASS 113
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G+ I YG+GS+ G+ S D V++ V Q F EA E +TF+ A+FDGI+G
Sbjct: 114 TYVADGEEFRIQYGTGSMVGYDSVDTVKIAGVPSTSQTFAEALEEPGITFVAAKFDGILG 173
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ IAV PV++ M EQG V + +F+F+LNRDP+A +GGEI GGV+P + G
Sbjct: 174 MGYPNIAVNGMKPVFNQMFEQGAVDQNLFAFYLNRDPEAADGGEITLGGVNPARYVGDFN 233
Query: 255 YVPVTKKGYWQFELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y VT++GYWQ ++ + I + + T C GGC IVDSGTSL+ GP+ IN AIG
Sbjct: 234 YHDVTRQGYWQIKMDGLSIADTAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAI 293
Query: 314 GVVSAE 319
V E
Sbjct: 294 KFVQGE 299
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 76/141 (53%), Gaps = 1/141 (0%)
Query: 367 VEKENVSAGDSAVCSACEMAV-VWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
++ + +S D+A +AC V V + IN+ ++ GE ++ C
Sbjct: 245 IKMDGLSIADTAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAIKFVQGEYLVIC 304
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
RIP MP+++F + + L+P+ Y+++ C+S FM D+P P GPLWILGD F
Sbjct: 305 RRIPEMPDITFVLDGIEYVLTPQDYVIQMTADGQTQCLSAFMGMDIPEPTGPLWILGDAF 364
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
MG ++T FD G ++GFA+ A
Sbjct: 365 MGKFYTSFDFGTNQVGFAKLA 385
>gi|297462061|ref|XP_001790669.2| PREDICTED: napsin-A [Bos taurus]
gi|297485858|ref|XP_002695173.1| PREDICTED: napsin-A [Bos taurus]
gi|296477597|tpg|DAA19712.1| TPA: napsin A aspartic peptidase [Bos taurus]
Length = 408
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 178/287 (62%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI L++ + +LN R K E GA G + +PL ++M+ QY
Sbjct: 26 LIRIPLRRVNIGFKALNPLRGWEKLAEPPRLGAPAPG-------NKSLFVPLSDYMNVQY 78
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 79 YGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKASSSFRSNGTKFA 138
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 139 IQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGG 198
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +W
Sbjct: 199 VRPPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFW 258
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q + + +G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 259 QIHMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + + + ++P MG+ I+C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGKYYIECS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKLRIG 501
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|321461133|gb|EFX72168.1| hypothetical protein DAPPUDRAFT_227643 [Daphnia pulex]
Length = 394
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 181/292 (61%), Gaps = 12/292 (4%)
Query: 34 KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
K R+ L ++++R T K G V+ R G PL N+ DAQYFG + +G+
Sbjct: 19 KGLRVPLKQMDSSRKTMKGL---GLAYEKVQRRYGSGKLISEPLTNYQDAQYFGPLTLGT 75
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ F +IFDTGS+NLWVPSS+C +++C H++Y S S+TYT G I YG+G++
Sbjct: 76 PPQEFDIIFDTGSANLWVPSSECAPTNLACRNHNQYNSSLSSTYTPNGTEFSIQYGTGAM 135
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D + + V DQ F EA E + F+ RFDGI+G+ + I+V VP++ NM
Sbjct: 136 TGFLSTDVLGIAGAQVIDQTFAEAVEEPGVVFVAGRFDGILGMSYPSISVQGVVPMFQNM 195
Query: 213 VEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
+ QGLV E VFSFWLNR+ + E GGEI+FGG +P H++G+ +YVPV++K YWQF + +
Sbjct: 196 MAQGLVDEPVFSFWLNRNLNNPENGGEILFGGTNPTHYEGEISYVPVSRKAYWQFSVDGV 255
Query: 272 -LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVV 316
L G C GGC I D+GTSL+ GP+ +T + IG GEG+V
Sbjct: 256 NLAGYDEYPFCNGGCEMISDTGTSLITGPSEEITLFHKLIGAQVNIVGEGIV 307
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 67/102 (65%), Gaps = 2/102 (1%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE--V 461
++ ++L + N +GE I+DC+ IP +P ++FTIG K F L YI+ + +
Sbjct: 288 ITLFHKLIGAQVNIVGEGIVDCNEIPNLPAMTFTIGGKPFVLEGVDYIIPFVDTTTNDTL 347
Query: 462 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
C+SGFM D+P P GPLWILGDVF+G +++V+D G+ RIG A
Sbjct: 348 CLSGFMGLDIPEPAGPLWILGDVFIGKFYSVYDFGQDRIGLA 389
>gi|109125662|ref|XP_001116026.1| PREDICTED: napsin-A-like [Macaca mulatta]
Length = 421
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G +L +PL N+
Sbjct: 22 PARAT-LIRIPLHRVQPERRNLNLLRGWREPAEVPKLGAPSPGDKL-----TFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E L F A FDGI+GLGF +
Sbjct: 136 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCVRGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ P GE II C
Sbjct: 263 MERVKVGPGLTLCVRGCA-AILDTGTSLITGPTEE--IRALHAAIGGYPLLAGEYIILCS 319
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ + Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 320 EIPKLPAVSFLLGGVWFNLTAQDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 379
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G R+G A A
Sbjct: 380 GTYVAVFDRGDTKSGARVGLARA 402
>gi|119592255|gb|EAW71849.1| napsin A aspartic peptidase, isoform CRA_c [Homo sapiens]
Length = 328
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 183/307 (59%), Gaps = 9/307 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFMD 82
S L RI L + + LN R R+ + G D+ I +PL N+ D
Sbjct: 22 SGATLIRIPLHRVQPGRRILNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYRD 75
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 VQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANGT 135
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF ++
Sbjct: 136 KFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPILS 195
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 VEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVP 255
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E +
Sbjct: 256 AYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEVR 314
Query: 322 LVVSQYG 328
YG
Sbjct: 315 SQSGGYG 321
>gi|154309857|ref|XP_001554261.1| hypothetical protein BC1G_06849 [Botryotinia fuckeliana B05.10]
gi|38195404|gb|AAR13364.1| aspartic proteinase precursor [Botryotinia fuckeliana]
Length = 398
Score = 248 bits (632), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 192/309 (62%), Gaps = 15/309 (4%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA--GVSGVRH------- 65
LA+ LL + S G+ ++ LKK L L A + +++G GV H
Sbjct: 6 LAAASLLGSVSAGVHKMPLKKVSLS-EQLATANMQEHAKHLGQKYMGVRPESHASEMFKE 64
Query: 66 -RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ D+ + +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHDAGDHTVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L F
Sbjct: 124 HTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDVMTIGDLKIKDQVFAEATEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + +MV+QGL+ E VF+F+L + D + E +FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDEPVFAFYLGSN-DESDPSEAIFGGV 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P +
Sbjct: 243 NKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAELENTGV--ILDTGTSLIALPADLAG 300
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 301 LLNAEIGAK 309
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P ++FT+ F + P YIL+ + CIS M D P P GPL
Sbjct: 314 GQYTVDCAKRDSLPELTFTLSGHKFPIGPYDYILE----VQGSCISAIMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D GK +G A+A
Sbjct: 370 AILGDAFLRRYYSIYDLGKNTVGLAKA 396
>gi|347836229|emb|CCD50801.1| similar to vacuolar protease A (secreted protein) [Botryotinia
fuckeliana]
Length = 398
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 192/309 (62%), Gaps = 15/309 (4%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA--GVSGVRH------- 65
LA+ LL + S G+ ++ LKK L L A + +++G GV H
Sbjct: 6 LAAASLLGSVSAGVHKMPLKKVSLS-EQLATANMQEHAKHLGQKYMGVRPESHASEMFKE 64
Query: 66 -RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ D+ + +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHDAGDHTVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L F
Sbjct: 124 HTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDVMTIGDLKIKDQVFAEATEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + +MV+QGL+ E VF+F+L + D + E +FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDEPVFAFYLGSN-DESDPSEAIFGGV 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P +
Sbjct: 243 NKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAELENTGV--ILDTGTSLIALPADLAG 300
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 301 LLNAEIGAK 309
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P ++FT+ F + P YIL+ + CIS M D P P GPL
Sbjct: 314 GQYTIDCAKRDSLPELTFTLSGHKFPIGPYDYILE----VQGSCISAIMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D GK +G A+A
Sbjct: 370 AILGDAFLRRYYSIYDLGKNTVGLAKA 396
>gi|410974821|ref|XP_003993838.1| PREDICTED: cathepsin D [Felis catus]
Length = 418
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 188/304 (61%), Gaps = 25/304 (8%)
Query: 36 RRLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDA 83
R+ LH + R T E +GG GV G +IL KN++DA
Sbjct: 32 ERIPLHKFTSVRRTMSE--LGGPVEDLIAKGPISKYAQGVPAVTGGPIPEIL--KNYLDA 87
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS 142
QY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ S Y + G S
Sbjct: 88 QYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWGGSVAXXXXXXXYVKNGTS 147
Query: 143 CEINYGSGSISGFFSQDNVEV-------GDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+I+YGSGS+SG+ SQD V V V V+ Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 148 FDIHYGSGSLSGYLSQDTVSVPCQTPTVAGVKVERQIFGEAIKQPGITFIAAKFDGILGM 207
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V D +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D K++KG +Y
Sbjct: 208 AYPRISVDDVLPVFDNLMKQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSY 267
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ + + +G T +C+GGC AI+D+GTSL+ GP V E+ AIG +
Sbjct: 268 LNVTRKAYWQVHMDQVDVGTSLT-LCKGGCEAILDTGTSLMVGPVDEVRELQKAIGAVPL 326
Query: 316 VSAE 319
+ E
Sbjct: 327 IQGE 330
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 65/94 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P V+ +G K + LS + Y LK +G +C+SGFM D+P
Sbjct: 323 AVPLIQGEYMIPCEKVSTLPEVTVKLGGKGYKLSSKDYTLKVSQGGRTICLSGFMGMDIP 382
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD + R+G AEA
Sbjct: 383 PPGGPLWILGDVFIGRYYTVFDRDENRVGLAEAT 416
>gi|195430468|ref|XP_002063276.1| GK21477 [Drosophila willistoni]
gi|194159361|gb|EDW74262.1| GK21477 [Drosophila willistoni]
Length = 402
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 184/295 (62%), Gaps = 11/295 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL---PLKNFMDAQYFGEIGIGS 93
R+ LH +AR R E++ +R+ + D + L PL N++DAQYFG I IG+
Sbjct: 32 RVPLHRFPSAR-RRFEQFGIRMERLRLRYSVMPRDGEKLRTEPLTNYLDAQYFGPITIGT 90
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ F VIFDTGS+NLWVPS+ C S++C HSR+ +++S +Y IG I+YGSGS+
Sbjct: 91 PPQIFKVIFDTGSANLWVPSTSCSPASVACMIHSRFHAKRSTSYYPIGAPFAIHYGSGSL 150
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
SG+ S+D V V + +++QVF EAT FL A+FDGI GLG+R I+V P + M
Sbjct: 151 SGYLSRDTVRVAGLEIENQVFAEATNMPGPIFLAAKFDGIFGLGYRSISVQRIKPPFYAM 210
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
+EQ L++ VFS +LNRD A+EGG + FGG +P+++ G TYVPV+++ YWQ +
Sbjct: 211 MEQNLLASPVFSVYLNRDVAAKEGGALFFGGSNPQYYTGNFTYVPVSRRSYWQITMDSAH 270
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
I + +CE GC I+D+GTS LA P IN +IGG G+ S C+ V
Sbjct: 271 I--KDLNLCEQGCEVIIDTGTSFLAMPYDQAMLINKSIGGTPSSYGMFSIPCEQV 323
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P+ G I C+++P +P ++F +G + F+L YI K VC S
Sbjct: 302 INKSIGGTPSSYGMFSIPCEQVPHLPTMTFQLGGRKFHLEGRDYIFKDTYQDGIVCASAL 361
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 362 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 400
>gi|380483026|emb|CCF40872.1| vacuolar protease A [Colletotrichum higginsianum]
Length = 399
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 194/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGG-----AGVSGV 63
+L + +LL A+ ++ LKK L+ LN+ I + ++YMG A
Sbjct: 5 LLTAAVLLGAAQAEFHKLKLKKVSLE-EQLNSVPIEHQVRQLGQKYMGARPDNHADAMFK 63
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + + E +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPVQSNGEHPVPVSNFMNAQYFSEIEIGNPPQTFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY G S EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHTKYDSSASSTYKANGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + N+V Q + E VF+F+L + + E FGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
+D H++GK TY+P+ +K YW+ +L I +G+Q+ + G AI+D+GTSL P+ +
Sbjct: 243 LDDSHYEGKITYIPLRRKAYWEVDLDAISLGDQTAEL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGYNGQYSVEC 321
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C + +P+++FT+ F++S YIL+ ++ CIS F D P P GPL
Sbjct: 315 GQYSVECSKRDELPDITFTLAGYNFSISAYDYILE----VSGSCISTFQGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A+A
Sbjct: 371 VILGDAFLRRWYSVYDLGKNAVGLAKA 397
>gi|397485042|ref|XP_003813672.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 184/299 (61%), Gaps = 8/299 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPLKNFMDAQYFGEIGIG 92
R+ L ++ R R + G G +LG ++PL F+DAQYFGEIG+G
Sbjct: 28 RIPLRQVHPGR--RTLNLLRGWGKPAELPKLGAPSPGDKPALVPLSKFLDAQYFGEIGLG 85
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I YG+G
Sbjct: 86 TPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNGTKFAIQYGTGR 145
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V P D
Sbjct: 146 VDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVRPPLDV 205
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
+VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YWQ + +
Sbjct: 206 LVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERV 265
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+ L
Sbjct: 266 KVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKL 323
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V G +C+ A++ + T+E + ++ +P GE II C
Sbjct: 260 IHMERVKVGSRLTLCAQGCAAILDTGTPVIVGPTEE--IRALHAAIGGIPLLAGEYIIRC 317
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 318 SEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPVWILGDVF 377
Query: 486 MGVYHTVFDSGKL----RIGFAEA 505
+G Y TVFD G + R+G A A
Sbjct: 378 LGAYVTVFDRGDMKSGARVGLARA 401
>gi|426389739|ref|XP_004061277.1| PREDICTED: napsin-A-like [Gorilla gorilla gorilla]
Length = 420
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PVDKPIFVPLLNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHDRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAQGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKNSARVGLARA 401
>gi|114678578|ref|XP_530061.2| PREDICTED: napsin-A-like [Pan troglodytes]
Length = 420
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L++ +LN R K + G GD + PL F+
Sbjct: 21 PAGAT-LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFL 74
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 75 DAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNG 134
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 135 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 194
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 195 SVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 254
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E
Sbjct: 255 PAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEY 313
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 314 IIRCSEIPKL 323
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 75/144 (52%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V G +C+ A++ + T+E + ++ +P GE II C
Sbjct: 260 IHMERVKVGSRLTLCAQGCAAILDTGTPVIVGPTEE--IRALHAAIGGIPLLAGEYIIRC 317
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IP +P VS IG F L+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 318 SEIPKLPAVSLLIGGVWFTLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPVWILGDVF 377
Query: 486 MGVYHTVFDSGKL----RIGFAEA 505
+G Y TVFD G + R+G A A
Sbjct: 378 LGAYVTVFDRGDMKSGARVGLARA 401
>gi|310796316|gb|EFQ31777.1| eukaryotic aspartyl protease [Glomerella graminicola M1.001]
Length = 399
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 193/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS--- 70
+L + +LL A+ + ++ LKK L+ LNA I + R +G + + D+
Sbjct: 5 LLTAAVLLGAAQAEVHKLKLKKVPLE-EQLNAVPIEHQVRQLGQKYMGTRPNNHADAMFN 63
Query: 71 -------DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
E +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPIQTDGEHPVPVSNFMNAQYFSEIQIGTPPQTFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY G S EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHTKYDSSASSTYKSNGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + N+V Q + E VF+F+L + + E FGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
+D H++GK TY+P+ +K YW+ +L I +G+++ + G AI+D+GTSL P+ +
Sbjct: 243 LDESHYEGKVTYIPLRRKAYWEVDLDAISLGDETADL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGYNGQYSVEC 321
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C + +P+++FT+ F++S Y+L+ ++ CIS F D P P GPL
Sbjct: 315 GQYSVECSKRDELPDITFTLAGYNFSISAYDYVLE----VSGSCISTFQGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A+A
Sbjct: 371 VILGDAFLRRWYSVYDLGKNAVGLAKA 397
>gi|4758754|ref|NP_004842.1| napsin-A preproprotein [Homo sapiens]
gi|6225749|sp|O96009.1|NAPSA_HUMAN RecName: Full=Napsin-A; AltName: Full=Aspartyl protease 4;
Short=ASP4; Short=Asp 4; AltName: Full=Napsin-1;
AltName: Full=TA01/TA02; Flags: Precursor
gi|4154287|gb|AAD04917.1| napsin A [Homo sapiens]
gi|4235425|gb|AAD13215.1| napsin 1 precursor [Homo sapiens]
gi|6561818|gb|AAF17081.1| aspartyl protease 4 [Homo sapiens]
gi|119592253|gb|EAW71847.1| napsin A aspartic peptidase, isoform CRA_a [Homo sapiens]
Length = 420
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 185/309 (59%), Gaps = 9/309 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFMD 82
S L RI L + + LN R R+ + G D+ I +PL N+ D
Sbjct: 22 SGATLIRIPLHRVQPGRRILNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYRD 75
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 VQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANGT 135
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF ++
Sbjct: 136 KFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPILS 195
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 VEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVP 255
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 AYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYI 314
Query: 322 LVVSQYGDL 330
++ S+ L
Sbjct: 315 ILCSEIPKL 323
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 262 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 319 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 379 GTYVAVFDRGDMKSSARVGLARA 401
>gi|194374823|dbj|BAG62526.1| unnamed protein product [Homo sapiens]
Length = 325
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 126/295 (42%), Positives = 181/295 (61%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L++ +LN R K + G GD + PL F+DAQYFG
Sbjct: 26 LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V
Sbjct: 141 YGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ
Sbjct: 201 PPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
+ + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 261 HMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEVR 314
>gi|348559312|ref|XP_003465460.1| PREDICTED: napsin-A-like [Cavia porcellus]
Length = 523
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 166/254 (65%), Gaps = 5/254 (1%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHS 126
GDS +PL F++ QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS C +FS+ C+FH
Sbjct: 59 GDS-PFFVPLSKFLNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKSCRFFSLPCWFHH 117
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RY + S+++ G I YG+G +SG SQD + +G + F EA E SL F
Sbjct: 118 RYDPKASSSFCPNGTKFAIQYGTGRLSGILSQDKLTIGGINNVSVTFGEALWEPSLVFAF 177
Query: 187 ARFDGIIGLGFREIAVGDAVPV-WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDGI GLGF +AV D VP D MVEQGL+ + VFSF+LNRD + GGE+V GG D
Sbjct: 178 ASFDGIFGLGFPALAV-DGVPTPLDVMVEQGLLDKPVFSFYLNRDFEGTHGGELVLGGSD 236
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ T+VPVT YWQ + +++G T +C GCAAIVD+GTSL+ GP+ +
Sbjct: 237 PAHYIPPLTFVPVTIPAYWQIHMDRVMVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRA 295
Query: 306 INHAIGGEGVVSAE 319
++ AIGG ++ E
Sbjct: 296 LHRAIGGLPWLAGE 309
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 74/131 (56%), Gaps = 6/131 (4%)
Query: 379 VCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTI 438
+C+ A+V L ++E + ++ LP GE I C +IPT+P +SF +
Sbjct: 270 LCAQGCAAIVDTGTSLITGPSEE--IRALHRAIGGLPWLAGEHFIQCSKIPTLPPISFLL 327
Query: 439 GDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL 498
G FNL+ + Y+++ +G +C+SGF A D+PPP GPLWILGDVF+ Y VFD G
Sbjct: 328 GGVWFNLTAQDYVIQISQGGFRLCLSGFQALDVPPPAGPLWILGDVFLRTYVAVFDRGNT 387
Query: 499 ----RIGFAEA 505
R+G A +
Sbjct: 388 SRGARVGLARS 398
>gi|344312912|emb|CCC33063.1| cathepsin D-1 [Dermanyssus gallinae]
Length = 383
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 121/258 (46%), Positives = 162/258 (62%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I PL NF DAQY+G I IG+PPQ F VIFDTGSS+LWVPSSKC S I+C HS+Y + K
Sbjct: 54 IEPLNNFGDAQYYGPITIGTPPQTFQVIFDTGSSDLWVPSSKCPSSNIACATHSKYNAEK 113
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G I YGSGS+SG S D V V + V Q F E T E +F+ ++DGI
Sbjct: 114 SSTYVANGTKFAIQYGSGSVSGVLSTDTVSVSGITVTKQTFGEITEESGDSFIYGKYDGI 173
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+G+ EIA +PV+D MV+Q +V + +FSF+L RDP G E+V GG+DPKH+KG
Sbjct: 174 LGMGYPEIA-SSGLPVFDQMVKQKVVEKAIFSFFLTRDPQHPIGSELVLGGIDPKHYKGD 232
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY P+T++ YWQF + + + ++ VC+ GC I D+GTSL GPT V + +
Sbjct: 233 ITYAPLTRESYWQFRVDKVTLNGKAAPVCQKGCEGIADTGTSLFVGPTADVAALASQLDA 292
Query: 313 EGVVSAECKLVVSQYGDL 330
+ + + GDL
Sbjct: 293 QETAPGLYLVDCEKAGDL 310
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 54/88 (61%), Gaps = 2/88 (2%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G ++DC++ +PN+ FTI + F L+P Y+++ + C+ F D+P P+
Sbjct: 298 GLYLVDCEKAGDLPNIEFTIAGRPFELTPLDYVVRLKQSGQTFCVLAFQGMDIP--DDPI 355
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGD+F+G Y TVFD R+GFA+AA
Sbjct: 356 WILGDIFIGKYFTVFDRENNRVGFADAA 383
>gi|119592251|gb|EAW71845.1| hCG1733572, isoform CRA_a [Homo sapiens]
Length = 449
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 189/314 (60%), Gaps = 8/314 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L++ +LN R K + G GD + PL F+DAQYFG
Sbjct: 26 LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V
Sbjct: 141 YGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ
Sbjct: 201 PPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+
Sbjct: 261 HMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSE 319
Query: 327 YGDL-IWDLLVSGL 339
L LL+ G+
Sbjct: 320 IPKLPAVSLLIGGV 333
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 7/144 (4%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ E V G +C+ A++ + T+E + ++ +P GE II C
Sbjct: 260 IHMERVKVGSRLTLCAQGCAAILDTGTPVIVGPTEE--IRALHAAIGGIPLLAGEYIIRC 317
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF
Sbjct: 318 SEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPVWILGDVF 377
Query: 486 MGVYHTVFDSGKL----RIGFAEA 505
+G Y TVFD G + R+G A A
Sbjct: 378 LGAYVTVFDRGDMKSGARVGLARA 401
>gi|332241360|ref|XP_003269848.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 185/306 (60%), Gaps = 12/306 (3%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N LRR+ +R +LN R K + G GD + PL F+DAQY
Sbjct: 30 NPLRRVHPGRR-----ALNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQY 79
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
FGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 80 FGEIGLGTPPQNFTVTFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNGTKFA 139
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +AV
Sbjct: 140 IQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILAVEG 199
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YW
Sbjct: 200 VRPPLDVLVEQGLLDKPIFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYW 259
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
Q + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 260 QIHMERVKVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHAAIGGISLLAGEYLIRC 318
Query: 325 SQYGDL 330
S+ L
Sbjct: 319 SEIPKL 324
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/91 (45%), Positives = 56/91 (61%), Gaps = 4/91 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +I C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+ P P+
Sbjct: 312 GEYLIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPV 371
Query: 479 WILGDVFMGVYHTVFDSGKL----RIGFAEA 505
WILGDVF+G Y VFD G + R+G A A
Sbjct: 372 WILGDVFLGAYVAVFDRGDMKSGARVGLARA 402
>gi|6561816|gb|AAF17080.1| aspartyl protease 3 [Homo sapiens]
Length = 450
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 189/314 (60%), Gaps = 8/314 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L++ +LN R K + G GD + PL F+DAQYFG
Sbjct: 26 LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V
Sbjct: 141 YGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ
Sbjct: 201 PPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+
Sbjct: 261 HMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSE 319
Query: 327 YGDL-IWDLLVSGL 339
L LL+ G+
Sbjct: 320 IPKLPAVSLLIGGV 333
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/96 (45%), Positives = 59/96 (61%), Gaps = 4/96 (4%)
Query: 414 LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPP 473
+P GE II C IP +P VS IG FNL+ + Y+++ +G +C+SGF A D+
Sbjct: 306 IPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIAS 365
Query: 474 PRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 505
P P+WILGDVF+G Y TVFD G + R+G A A
Sbjct: 366 PPVPVWILGDVFLGAYVTVFDRGDMKSGARVGLARA 401
>gi|307166067|gb|EFN60339.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 370
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 120/240 (50%), Positives = 165/240 (68%), Gaps = 9/240 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK---CYFSISCY---FHSRYKS 130
L ++DAQY+G I IG+PPQNF+V+FDTGSSNLWVPS K ++ +SC+ +H +Y +
Sbjct: 46 LFKYLDAQYYGVISIGTPPQNFTVLFDTGSSNLWVPSIKSEITFYKLSCWTAPYHHKYNN 105
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS TY I YGSG +SGF S D V V + V++Q F EAT E S+ F+L +FD
Sbjct: 106 SKSITYQANSAPFAIEYGSGDLSGFLSTDVVNVAGLNVRNQTFAEATHESSI-FILMQFD 164
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+G+ I+V P++ NM++Q LVS+ +FSF+LNR+P AEEGGE++ GG DP H+
Sbjct: 165 GILGMGYPTISVDGVTPIFQNMIQQRLVSQPIFSFYLNRNPSAEEGGELILGGCDPNHYV 224
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+ TYVPVT +GYWQF + ++ GN +C GC AI D+GTSL+ GP+ + IN I
Sbjct: 225 GEFTYVPVTVEGYWQFTMDSVIAGNYI--LCAQGCQAIADTGTSLIVGPSEDIDVINGYI 282
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 71/138 (51%), Gaps = 13/138 (9%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQL--KQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
++V AG+ +C+ A+ L + + + YI + D+ N +DCD+
Sbjct: 243 DSVIAGNYILCAQGCQAIADTGTSLIVGPSEDIDVINGYIQNISDNDGN------VDCDK 296
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
I +P ++F + K NL+P YI++ E +C SGF L WILGDVF+G
Sbjct: 297 INELPTINFILSGKPHNLTPHDYIIRDTEDGVAICYSGFQGSYLSG-----WILGDVFIG 351
Query: 488 VYHTVFDSGKLRIGFAEA 505
++TVFD G R+GFA +
Sbjct: 352 HFYTVFDMGNNRVGFAPS 369
>gi|41053329|ref|NP_956325.1| uncharacterized protein LOC336746 precursor [Danio rerio]
gi|34783813|gb|AAH56836.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 184/315 (58%), Gaps = 16/315 (5%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSL--NAARITR-------KERYMGGAGVSGVRHRLGDS 70
LL A S + RI L K R L N I K +Y G + +
Sbjct: 13 LLIADSQAIIRIPLHKMRTVRRMLADNGKTIDEIKSLAKMKAKYSDGTFTNQGSVTIPAP 72
Query: 71 DEDILP-----LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYF 124
LP L NFMDAQY+G I IG+PPQ+FSV+FDTGSSNLWVPS C F I+C+
Sbjct: 73 TTTQLPPPVEKLTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWL 132
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S+KS+TY + G I YG GS+SGF SQD V + + V Q F EA ++ + F
Sbjct: 133 HRRYNSKKSSTYVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVF 192
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ARFDG++G+ + I+V PV+D + ++ + +FSF++NRDP + GGE++ GG
Sbjct: 193 AVARFDGVLGMAYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGF 252
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D ++F G YV VT+K YWQ ++ ++ +G+ T +C+ GC AIVD+GTS++ GP V
Sbjct: 253 DQQYFNGDLHYVNVTRKAYWQIKMDEVQVGSTLT-LCKSGCQAIVDTGTSMITGPVQEVR 311
Query: 305 EINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 312 ALQKAIGAIPLLMGE 326
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 96/157 (61%), Gaps = 6/157 (3%)
Query: 353 FNG-AEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYIN 408
FNG YV+ K ++ + V G + +C + A+V + +E + +
Sbjct: 257 FNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIVDTGTSMITGPVQE--VRALQ 314
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMA 468
+ ++P MGE IDC +IPT+P VSF++G K+FNL+ ++Y++K VC+SGFMA
Sbjct: 315 KAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLTGQEYVMKMSHMGMNVCLSGFMA 374
Query: 469 FDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
D+PPP GPLWILGDVF+G Y+TVFD + R+GFA A
Sbjct: 375 MDIPPPAGPLWILGDVFIGRYYTVFDRDQDRVGFAPA 411
>gi|361128953|gb|EHL00878.1| putative Vacuolar protease A [Glarea lozoyensis 74030]
Length = 399
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 195/323 (60%), Gaps = 20/323 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
++A+ LL + S G+ ++ LKK L L A I ++YMG + +
Sbjct: 5 LIAAASLLGSVSAGIHKMPLKKISLS-EQLAGANIDTHVKHLGQKYMGIRPEAHEQEMFK 63
Query: 69 DSDEDI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
DS +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 64 DSSLHTEKGAHPVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSEC-GSIAC 122
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G EI YGSGS+SGF SQD + +GD+ +KDQ+F EAT E L
Sbjct: 123 YLHTKYDSSSSSTYKKNGSDFEIRYGSGSLSGFVSQDTMTIGDLKIKDQIFAEATEEPGL 182
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLGF I+V VP + +M+ QGL+ E VF+F+L + EE E FG
Sbjct: 183 AFAFGRFDGILGLGFDTISVNKIVPPFYSMINQGLLDEPVFAFYLGDTNNGEE-SEATFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ H+ GK T +P+ +K YW+ +L I G+ + + G I+D+GTSL+A P+ +
Sbjct: 242 GVNEDHYTGKMTTIPLRRKAYWEVDLDAITFGDATAELENTGV--ILDTGTSLIALPSTL 299
Query: 303 VTEINHAIGGE----GVVSAECK 321
+N +G + G + EC+
Sbjct: 300 AELLNKEMGAKKGYNGQYTVECE 322
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C++ ++P++SF + F ++P YIL+ + CIS FM D P P GPL
Sbjct: 315 GQYTVECEKRDSLPDMSFNLSGYNFTITPYDYILE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A +
Sbjct: 371 AILGDAFLRKWYSVYDLGKGTVGLAAS 397
>gi|158254091|gb|AAI54325.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 163/244 (66%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
L NFMDAQY+G I IG+PPQ+FSV+FDTGSSNLWVPS C F I+C+ H RY S+KS+T
Sbjct: 84 LTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHRRYNSKKSST 143
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG GS+SGF SQD V + + V Q F EA ++ + F +ARFDG++G+
Sbjct: 144 YVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVFAVARFDGVLGM 203
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+D + ++ + +FSF++NRDP + GGE++ GG D ++F G Y
Sbjct: 204 AYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHY 263
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ ++ ++ +G+ T +C+ GC AIVD+GTS++ GP V + AIG +
Sbjct: 264 VNVTRKAYWQIKMDEVQVGSTLT-LCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPL 322
Query: 316 VSAE 319
+ E
Sbjct: 323 LMGE 326
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 96/157 (61%), Gaps = 6/157 (3%)
Query: 353 FNG-AEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYIN 408
FNG YV+ K ++ + V G + +C + A+V + +E + +
Sbjct: 257 FNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIVDTGTSMITGPVQE--VRALQ 314
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMA 468
+ ++P MGE IDC +IPT+P VSF++G K+FNL+ ++Y++K VC+SGFMA
Sbjct: 315 KAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLTGQEYVMKVSHMGMNVCLSGFMA 374
Query: 469 FDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
D+PPP GPLWILGDVF+G Y+TVFD + R+GFA A
Sbjct: 375 MDIPPPAGPLWILGDVFIGRYYTVFDRDQDRVGFAPA 411
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 198/332 (59%), Gaps = 27/332 (8%)
Query: 20 LLPASSNGLRRIGLKK-----RRLDLHSLNAAR----ITRKERYMGGAGVSGVRHRLGDS 70
LLPA + ++ L+K +L SL+ A + + + GAG +G R + D+
Sbjct: 10 LLPAVYAEVHKLQLQKIPATVGNPELESLHLAEKYGVVNEFQTPLMGAGGAGRRLK-NDA 68
Query: 71 DEDI------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
ED+ +PL NFM+AQYF EI +G+PPQNF VI DTGSSNLWVPSSKC
Sbjct: 69 GEDLFWTQEQVKGGHGVPLTNFMNAQYFTEITLGTPPQNFKVILDTGSSNLWVPSSKCT- 127
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+C+ H++Y S S+TY + G I YGSGS+ GF SQD + +GD+ + Q F EA +
Sbjct: 128 SIACFLHAKYDSSASSTYKQNGTEFSIQYGSGSMEGFVSQDVLTIGDLTIPGQDFAEAVK 187
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF +FDGI+GLG+ I+V VP NM+ +GL+ E VFSF L + E+GGE
Sbjct: 188 EPGLTFAFGKFDGILGLGYDTISVNHIVPPHYNMINKGLLDEPVFSFRLGK--SEEDGGE 245
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGGVD +KG TYVPV +K YW+ EL I G++ + G A +D+GTSL+A
Sbjct: 246 AIFGGVDKSAYKGDLTYVPVRRKAYWEVELEKISFGSEELELESTGAA--IDTGTSLIAL 303
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
PT + IN IG + + + ++ S+ DL
Sbjct: 304 PTDMAEMINAEIGAKKSWNGQYQVECSKVPDL 335
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C ++P +P +S G K + L YIL+ + CIS F D+ P G L
Sbjct: 323 GQYQVECSKVPDLPELSLYFGGKPYTLKGTDYILE----VQGTCISSFTGLDINVPGGSL 378
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y+TV+D G+ +GFAEA
Sbjct: 379 WIIGDVFLRKYYTVYDLGRDAVGFAEA 405
>gi|119592254|gb|EAW71848.1| napsin A aspartic peptidase, isoform CRA_b [Homo sapiens]
Length = 357
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 119/258 (46%), Positives = 168/258 (65%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL N+ D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ +
Sbjct: 4 FVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKA 63
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G + G S+D + +G + +F EA E SL F A FDGI
Sbjct: 64 SSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGI 123
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF ++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+
Sbjct: 124 LGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPP 183
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+VPVT YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG
Sbjct: 184 LTFVPVTVPAYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGG 242
Query: 313 EGVVSAECKLVVSQYGDL 330
+++ E ++ S+ L
Sbjct: 243 IPLLAGEYIILCSEIPKL 260
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + ++ +P GE II C
Sbjct: 199 MERVKVGPGLTLCAKGCA-AILDTGTSLITGPTEE--IRALHAAIGGIPLLAGEYIILCS 255
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VSF +G FNL+ Y+++T +C+SGF A D+PPP GP WILGDVF+
Sbjct: 256 EIPKLPAVSFLLGGVWFNLTAHDYVIQTTRNGVRLCLSGFQALDVPPPAGPFWILGDVFL 315
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 316 GTYVAVFDRGDMKSSARVGLARA 338
>gi|121543617|gb|ABM55520.1| putative cathepsin D [Maconellicoccus hirsutus]
Length = 391
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 203/320 (63%), Gaps = 25/320 (7%)
Query: 6 LRSVFCLWVLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L +F L+ + +C + +SS L RI L + +T +ER ++G
Sbjct: 3 LLCIFVLFSIGTCHVNSVSSSEKLFRISLSRV-----------VTPRERLR----LAGTE 47
Query: 65 HRLGDSDEDIL----PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
+L ++ + PL+N++DAQY+G I IG+PPQ F+V+FDTGSSNLWVPS +C +
Sbjct: 48 FKLLNARYNGTGTPEPLRNYLDAQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSILN 107
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C H++Y S+ S+TY G I+YGSGS+SGF S D V +G + ++ Q F EA +E
Sbjct: 108 IACLIHNKYNSKTSSTYQANGTEFAIHYGSGSLSGFLSSDTVSIGGLDIEKQTFAEAVKE 167
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+ A+FDGI+GLG++EI+VG P + NMV+QGLV + VFSF+LNR+ A +GGEI
Sbjct: 168 PGIAFIAAKFDGILGLGYKEISVGGIPPPFYNMVDQGLVKDSVFSFYLNRNTSAADGGEI 227
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGGVDP F+G TYVPV+ KGYWQF + I +G + + AI D+GTSL+AGP
Sbjct: 228 IFGGVDPSKFRGNFTYVPVSVKGYWQFGMEKISLGGKDIQTSQ----AIADTGTSLIAGP 283
Query: 300 TPVVTEINHAIGGEGVVSAE 319
+ + IN AIG ++ +
Sbjct: 284 SEDIAAINKAIGAVEILGGQ 303
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 44/87 (50%), Positives = 61/87 (70%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ + C+ I +P+++FTI + LS Y+L+ + +CISGFM D+PPPRGPL
Sbjct: 302 GQYTVSCESIDQLPDITFTINGVDYTLSGRDYVLQVSQLGRTLCISGFMGIDIPPPRGPL 361
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+TVFD G R+GFAE+
Sbjct: 362 WILGDVFIGKYYTVFDLGNNRLGFAES 388
>gi|403299330|ref|XP_003940442.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 425
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 187/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L I L++ +LN R K+ + G H+ G +PL F+
Sbjct: 26 PAEAT-LIHIPLRRVHPGRRTLNLLRGWGKQAKLPRLGAPSPGHKPG-----FVPLSKFL 79
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIG 140
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C+ S + C+FH R+ + S+++ G
Sbjct: 80 DVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSKRCHLSSVPCWFHHRFDPKASSSFQPNG 139
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 140 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 199
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
AV P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 200 AVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 259
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAA++D+GT ++ GP + ++ AIGG +++ E
Sbjct: 260 PAYWQIHMERVKVGSELT-LCARGCAAVLDTGTPVIIGPAEEIRALHKAIGGLPLLAGEY 318
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 319 IIRCSEIPKL 328
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/103 (42%), Positives = 61/103 (59%), Gaps = 4/103 (3%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++ LP GE II C IP +P VS +G FNL+ + Y+++ +G C+SGF
Sbjct: 304 LHKAIGGLPLLAGEYIIRCSEIPKLPTVSLFLGGVWFNLTAQDYVIQFVQGDFRFCVSGF 363
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKL----RIGFAEA 505
D+P P GP+WILGDVF+G Y VFD G + R+G A A
Sbjct: 364 RGLDIPSPPGPMWILGDVFLGAYVAVFDRGDMKSGARVGLARA 406
>gi|148227998|ref|NP_001079043.1| cathepsin E-A precursor [Xenopus laevis]
gi|46395761|sp|Q805F3.1|CATEA_XENLA RecName: Full=Cathepsin E-A; Flags: Precursor
gi|28460653|dbj|BAC57453.1| cathepsin E1 [Xenopus laevis]
gi|213625998|gb|AAI69692.1| Cathepsin E1 [Xenopus laevis]
gi|213627772|gb|AAI69694.1| Cathepsin E1 [Xenopus laevis]
Length = 397
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 193/313 (61%), Gaps = 22/313 (7%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGV 60
+R + L + A+ + GL R+ LK+++ + R T KE+ G+
Sbjct: 1 MRQILVLLLFATLVY------GLIRVPLKRQK-------SIRKTLKEKGKLSHIWTQQGI 47
Query: 61 SGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
V++ S++ PL N+MD +YFGEI +G+PPQNF+VIFDTGSSNLWVPS C
Sbjct: 48 DMVQYTDSCSNDQAPSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-I 106
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H R++ + S+TY G + + YG+GS+SG D V V ++V++Q F E+
Sbjct: 107 SQACAQHDRFQPQLSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVS 166
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E TF+ A FDGI+GLG+ IAVGD PV+DNM+ Q LV +FS +++R+P++ GGE
Sbjct: 167 EPGSTFVDAEFDGILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSVYMSRNPNSAVGGE 226
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG D F G+ +VPVT +GYWQ +L ++ I N C GGC AIVD+GTSL+ G
Sbjct: 227 LVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNVQI-NGEVLFCSGGCQAIVDTGTSLITG 285
Query: 299 PTPVVTEINHAIG 311
P+ + ++ + IG
Sbjct: 286 PSSDIVQLQNIIG 298
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + MP V+FTI + ++P+QY L+ G G VC SGF D+PPP GPL
Sbjct: 304 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGG---VCSSGFQGLDIPPPAGPL 360
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|440898030|gb|ELR49612.1| Napsin-A, partial [Bos grunniens mutus]
Length = 406
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 124/287 (43%), Positives = 177/287 (61%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI L++ + +LN R K E A G + +PL ++M+ QY
Sbjct: 26 LIRIPLRRVNIGFKALNPPRGWEKLAEPPRLAAPSPG-------NKSLFVPLSDYMNVQY 78
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 79 YGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKASSSFRSNGTKFA 138
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 139 IQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGG 198
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V++GL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +W
Sbjct: 199 VRPPLDRLVDRGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFW 258
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q + + +G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 259 QIHMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/135 (38%), Positives = 77/135 (57%), Gaps = 3/135 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V G + C A++ L T+E + + + ++P MG+ I+C
Sbjct: 262 MERVQVGTGLTLCARGCA-AILDTGTSLITGPTEE--IRALQKAIGAVPLLMGKYYIECS 318
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+IPT+P VSF +G FNL+ + Y+++ VC+SGFMA D+PPP GP WILGDVF+
Sbjct: 319 KIPTLPPVSFLLGGVWFNLTAQDYVIQITRSGFSVCLSGFMALDVPPPSGPFWILGDVFL 378
Query: 487 GVYHTVFDSGKLRIG 501
G Y VFD G + G
Sbjct: 379 GSYVAVFDRGDRKSG 393
>gi|147743007|sp|P85138.1|CARDG_CYNCA RecName: Full=Cardosin-G; Contains: RecName: Full=Cardosin-G heavy
chain; Contains: RecName: Full=Cardosin-G light chain
Length = 266
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 131/243 (53%), Positives = 153/243 (62%), Gaps = 46/243 (18%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 1 DSGSTVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSYLWVPSSKA--------HSMY 52
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY K+Q FIEAT E FL
Sbjct: 53 ESSDSSTY--------------------------------KEQDFIEATEEADNVFLNRL 80
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I +VPVW NMV QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 81 FDGILGLSFQTI----SVPVWYNMVNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 134
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 135 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 194
Query: 309 AIG 311
AIG
Sbjct: 195 AIG 197
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/64 (67%), Positives = 45/64 (70%), Gaps = 4/64 (6%)
Query: 443 FNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGF 502
F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +GF
Sbjct: 207 FGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLVGF 262
Query: 503 AEAA 506
AEAA
Sbjct: 263 AEAA 266
>gi|406861956|gb|EKD15008.1| aspartic endopeptidase Pep2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 401
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLG- 68
++ + LL ++S G+ ++ LKK +L +++A ++YMG S
Sbjct: 5 LVTAATLLSSASAGIHKLPLKKVSLSEQLATANIDAHVKNLGQKYMGIRPQSHADEMFKE 64
Query: 69 -----DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
D + +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHEDGSDHTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G + EI YGSGS+SGF S+D + +GD+ +K+Q+F EAT+E L
Sbjct: 124 LHTKYDSSSSSTYKKNGTAFEIRYGSGSLSGFTSEDTMSIGDLKIKNQIFAEATQEPGLA 183
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V P + NMV Q L+ E VF+F+L + D E+ E +FG
Sbjct: 184 FAFGRFDGILGLGYDTISVNKIPPPFYNMVNQELLDEPVFAFYLGSTDKGEEDQSEAIFG 243
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ HF GK T +P+ +K YW+ +L I G+ + + G I+D+GTSL+A P+ +
Sbjct: 244 GVNKDHFTGKITEIPLRRKAYWEVDLDAITFGDATAELENTGV--ILDTGTSLIALPSTL 301
Query: 303 VTEINHAIGGE 313
+N +G +
Sbjct: 302 AELLNKEMGAK 312
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P++SFT+ F ++P YIL+ + CIS FM D P P GPL
Sbjct: 317 GQYTVDCAKRDSLPDMSFTLSGHEFTITPYDYILE----VQGSCISSFMGMDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++++D GK +G A A
Sbjct: 373 AILGDAFLRKWYSIYDLGKGTVGLAAA 399
>gi|125858582|gb|AAI29608.1| Ce1-A protein [Xenopus laevis]
Length = 394
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 131/300 (43%), Positives = 187/300 (62%), Gaps = 16/300 (5%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDED 73
LL GL R+ LK+++ + R T KE+ G+ V++ S++
Sbjct: 5 LLFATLVYGLIRVPLKRQK-------SIRKTLKEKGKLSHIWTQQGIDMVQYTDSCSNDQ 57
Query: 74 IL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
PL N+MD +YFGEI +G+PPQNF+VIFDTGSSNLWVPS C S +C H R++ +
Sbjct: 58 APSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-ISQACAQHDRFQPQ 116
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY G + + YG+GS+SG D V V ++V++Q F E+ E TF+ A FDG
Sbjct: 117 LSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVSEPGSTFVDAEFDG 176
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GLG+ IAVGD PV+DNM+ Q LV +FS +++R+P++ GGE+VFGG D F G
Sbjct: 177 ILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSIYMSRNPNSAVGGELVFGGFDASRFSG 236
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ +VPVT +GYWQ +L ++ I N C GGC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 237 QLNWVPVTNQGYWQIQLDNVQI-NGEVLFCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 295
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 57/85 (67%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + MP V+FTI + ++P+QY L+ G G VC SGF D+PPP GPL
Sbjct: 301 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGG---VCSSGFQGLDIPPPAGPL 357
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 358 WILGDVFIGQYYSVFDRGNNRVGLA 382
>gi|157423181|gb|AAI53793.1| Cathepsin E2 [Xenopus laevis]
Length = 397
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 185/292 (63%), Gaps = 16/292 (5%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDIL--PLKN 79
GL R+ LK+++ + R T KE+ G+ V++ +++ PL N
Sbjct: 16 GLIRVPLKRQK-------SIRKTLKEKGKLSHVWTQQGIDMVQYTDSCNNDQAPSEPLIN 68
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD QYFGEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY
Sbjct: 69 YMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQLSSTYESN 127
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + + YG+GS+SG D+V V ++V++Q F E+ E TF+ A FDGI+GLG+
Sbjct: 128 GNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDASFDGILGLGYPS 187
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IAVG PV+DNM+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT
Sbjct: 188 IAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVT 247
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+GYWQ +L +I I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 248 NQGYWQIQLDNIQI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIG 298
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + MP ++FTI + ++P+QY L+ +G VC SGF D+ PP GPL
Sbjct: 304 GDYEVDCTVLNKMPTMTFTINGIGYQMTPQQYTLQDDDG---VCSSGFQGLDISPPAGPL 360
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|16119024|gb|AAL14708.1|AF420068_1 aspartic protease [Clonorchis sinensis]
Length = 419
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 122/237 (51%), Positives = 166/237 (70%), Gaps = 11/237 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFNIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QG F F L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQG------FGFRLDRNRSDPVGGELLLGGTDPKYYKGEIL 233
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 234 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 288
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 75/136 (55%), Gaps = 4/136 (2%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
++++ G +C A+ L ++E + +N+ ++ P G IDC R+
Sbjct: 248 DSMNVGSMKLCENGCQAIADTGTSLIAGPSEE--VGKLNDALGAINIPGGTYYIDCSRVS 305
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
T+P V F+I K+ L P YIL+ +CISGFM ++ P GPLWI G+VF+G Y
Sbjct: 306 TLPPVQFSISGKLMQLDPSDYILRMTWFGKTICISGFMGINI--PGGPLWIFGEVFIGKY 363
Query: 490 HTVFDSGKLRIGFAEA 505
+T+FD G R+GFA A
Sbjct: 364 YTIFDVGNARVGFATA 379
>gi|148236737|ref|NP_001079044.1| cathepsin E-B precursor [Xenopus laevis]
gi|46395760|sp|Q805F2.1|CATEB_XENLA RecName: Full=Cathepsin E-B; Flags: Precursor
gi|28460655|dbj|BAC57454.1| cathepsin E2 [Xenopus laevis]
Length = 397
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 185/292 (63%), Gaps = 16/292 (5%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDIL--PLKN 79
GL R+ LK+++ + R T KE+ G+ V++ +++ PL N
Sbjct: 16 GLIRVPLKRQK-------SIRKTPKEKGKLSHVWTQQGIDMVQYTDSCNNDQAPSEPLIN 68
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD QYFGEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY
Sbjct: 69 YMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQLSSTYESN 127
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + + YG+GS+SG D+V V ++V++Q F E+ E TF+ A FDGI+GLG+
Sbjct: 128 GNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDASFDGILGLGYPS 187
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IAVG PV+DNM+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT
Sbjct: 188 IAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVT 247
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+GYWQ +L +I I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 248 NQGYWQIQLDNIQI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIG 298
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + MP ++FTI + ++P+QY L+ +G VC SGF D+ PP GPL
Sbjct: 304 GDYEVDCTVLNKMPTMTFTINGIGYQMTPQQYTLQDDDG---VCSSGFQGLDISPPAGPL 360
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
Length = 392
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 189/318 (59%), Gaps = 21/318 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPLKNFMDA 83
LRR+ ++R L+L + G G +LG +PL N+ D
Sbjct: 3 LRRVHPERRTLNL--------------LKGWGKPAKLPKLGAPSPGDKPTFVPLSNYWDV 48
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS 142
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 49 QYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPSASSSFKPNGTK 108
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +AV
Sbjct: 109 FAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILAV 168
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
P D +V+QGL+ + +FSF+LNRDP +GGE+V GG DP H+ T+VPVT
Sbjct: 169 EGVRPPLDVLVKQGLLDKPIFSFYLNRDPKVADGGELVLGGSDPAHYIPPLTFVPVTVPA 228
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 229 YWQIHMERVKVGSGLT-LCARGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYII 287
Query: 323 VVSQYGDL-IWDLLVSGL 339
S+ L LL++G+
Sbjct: 288 RCSEIPKLPAVSLLIAGV 305
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 75/143 (52%), Gaps = 7/143 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E+ V +G + C A++ + T+E + ++ +P GE II C
Sbjct: 234 MERVKVGSGLTLCARGCA-AILDTGTPVIVGPTEE--IRALHAAIGGIPLLAGEYIIRCS 290
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
IP +P VS I FNL+ + Y+++ +G +C+SGF A D+ P P+WILGDVF+
Sbjct: 291 EIPKLPAVSLLIAGVWFNLTAQDYVIQFAQGDVRLCLSGFRALDIASPPVPVWILGDVFL 350
Query: 487 GVYHTVFDSGKL----RIGFAEA 505
G Y VFD G + R+G A A
Sbjct: 351 GAYVAVFDRGDMKSGARVGLARA 373
>gi|440633873|gb|ELR03792.1| vacuolar protease A [Geomyces destructans 20631-21]
Length = 395
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 190/321 (59%), Gaps = 19/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+ + +LL ++S G+ ++ L+K +L+ ++ ++YMG + V +
Sbjct: 5 LFTAAMLLGSASAGVHKMKLQKIPLAEQLEFANVETHVRNLGQKYMGIRPQTHVDAVFQE 64
Query: 70 SDE-----DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S ++P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+CY
Sbjct: 65 SSSIKQGGHLVPVSNFLNAQYFSEITIGNPPQTFKVVLDTGSSNLWVPSQSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y S +S TY + G I YGSGS+SG+ SQD V +GD+V+KDQ+F EA E L F
Sbjct: 124 HSKYDSSESKTYEKNGTEFAIQYGSGSVSGYISQDQVTIGDLVIKDQLFGEAVEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLGF I+V VP + +M++QGL+ E+VFSF+L D E VFGG+
Sbjct: 184 AFGRFDGILGLGFDTISVNKVVPPFYSMIDQGLLDEKVFSFYLADDKSQSEA---VFGGI 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H+ G TY+P+ +K YW+ + I G+ + G I+D+GTSL P+ +
Sbjct: 241 DKSHYTGDLTYIPLRRKAYWEVDFDAISFGDVKADLDNTGV--ILDTGTSLNTLPSSLAE 298
Query: 305 EINHAIGGE----GVVSAECK 321
+N IG + G + +CK
Sbjct: 299 LLNKEIGAKKGYNGQYTIDCK 319
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + +P+++FT+ F LS Y L+ G C+S FM D+P P GPL
Sbjct: 312 GQYTIDCKKRDDLPDITFTLAGHDFALSAYDYTLEMGGS----CVSTFMGMDMPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D K +G A A
Sbjct: 368 AILGDAFLRRWYSVYDLEKGAVGLAAA 394
>gi|395531206|ref|XP_003767673.1| PREDICTED: cathepsin E [Sarcophilus harrisii]
Length = 395
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 133/298 (44%), Positives = 178/298 (59%), Gaps = 18/298 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLG----------DSDEDILPLKNFMDAQYF 86
RL L + R T +ER G H+L D E+ PL N++D +Y+
Sbjct: 22 RLPLKRHKSLRKTLRER--GQLSQFWETHKLDMLQFTDFCSQDQSEN-EPLINYLDMEYY 78
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
G I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+TY E G S I
Sbjct: 79 GVISIGSPPQNFTVIFDTGSSNLWVPSVYC-VSPACKNHNRFYPSQSSTYVENGNSFSIQ 137
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+SG D V V + V +Q F E+ E TF+ A FDGI+GL + +AVG
Sbjct: 138 YGTGSLSGIIGMDQVSVEGITVANQQFGESVSEPGSTFVNAEFDGILGLAYPSLAVGGVT 197
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ Q LV +FS ++ R+PD+ G E+VFGG D HF G +VPVTK+GYWQ
Sbjct: 198 PVFDNMIAQNLVDMPIFSVYMTRNPDSPTGSELVFGGYDHAHFTGSLNWVPVTKQGYWQI 257
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 321
L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG +G + EC
Sbjct: 258 ALDNIQVGG-TIMFCAEGCQAIVDTGTSLITGPSDKIKQLQNAIGAVLTDGEYAMECN 314
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 53/87 (60%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C+ + MP+V+FTI + L P+ Y L E C SGF D+ PP GPL
Sbjct: 307 GEYAMECNNLNVMPDVTFTINGIPYTLPPKAYTLTDFVDGMEFCTSGFQGLDIHPPAGPL 366
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G +++VFD G +G A A
Sbjct: 367 WILGDVFIGQFYSVFDRGNNLVGLAPA 393
>gi|296230510|ref|XP_002760737.1| PREDICTED: renin isoform 1 [Callithrix jacchus]
gi|50401196|sp|Q9TSZ1.1|RENI_CALJA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|6687184|emb|CAB64879.1| preprorenin [Callithrix jacchus]
Length = 400
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 123/301 (40%), Positives = 187/301 (62%), Gaps = 14/301 (4%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL 75
SC LP + +RI LK+ + + R + KER + A + R L + ++
Sbjct: 19 SCTFGLPTETTTFKRISLKR-------MPSIRESLKERGVDMARLGPERMALVNITSSVI 71
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S+
Sbjct: 72 -LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSS 130
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G + Y +G++SGF SQD + VG + V Q F E T +L F+LA FDG++G
Sbjct: 131 SYKHNGTELTLRYSTGTVSGFLSQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVG 189
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGK 252
+GF E A+G P++DN++ QGL+ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 190 MGFSEQAIGKVTPLFDNIISQGLLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGN 249
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+ + + G WQ + + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G
Sbjct: 250 FHYINLIRTGLWQIPMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 308
Query: 313 E 313
+
Sbjct: 309 K 309
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 49/84 (58%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 316 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 375
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD G RIGFA A
Sbjct: 376 GATFIRKFYTEFDRGNNRIGFALA 399
>gi|194756946|ref|XP_001960731.1| GF13504 [Drosophila ananassae]
gi|190622029|gb|EDV37553.1| GF13504 [Drosophila ananassae]
Length = 402
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 148/388 (38%), Positives = 211/388 (54%), Gaps = 42/388 (10%)
Query: 6 LRSVFCLWVLASCLL--LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
L V C W L S L P++ + ++G+ R+D L + + +ER G S
Sbjct: 13 LLPVTCNWELYSVPLRRFPSARHRFEKLGI---RMDRLRLKYSSESSEER-----GNSRT 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ + + L N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+ C + ++C
Sbjct: 65 KWNVKSTT-----LSNYLDAQYFGPITIGTPPQTFQVIFDTGSSNLWVPSATCSSTMVAC 119
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
HSRY +R+S +Y IG I+YGSGS++GF S D V V + ++DQVF EAT
Sbjct: 120 RVHSRYYARRSRSYRPIGDHFVIHYGSGSLAGFLSTDTVRVAGLEIEDQVFAEATNMPGP 179
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVF 241
FL A+FDGI GL +R I++ P + M+EQGL+ VFS +LNR + EEGG + F
Sbjct: 180 IFLAAKFDGIFGLAYRSISMQRIKPPFYAMIEQGLLPRAVFSVYLNRHLGNQEEGGVLFF 239
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG +P++++G TYVPV+++ YWQ ++ I + +C+ GC I+D+GTS LA P
Sbjct: 240 GGSNPEYYRGNFTYVPVSRRAYWQVKMDAATI--RKLELCQNGCEVIIDTGTSFLALPYD 297
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF--NGAEYV 359
IN +IGG + + Q DL ++ +G AF G EYV
Sbjct: 298 QAILINKSIGGRPSAYGQFSVPCDQVSDL-----------PRITFTMGGRAFFLEGHEYV 346
Query: 360 STGIKTVVEKENVSAGDSAVCSACEMAV 387
I D +CS+ +AV
Sbjct: 347 FRDI----------FKDQRICSSAFVAV 364
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN+ P+ G+ + CD++ +P ++FT+G + F L +Y+ + +C S F
Sbjct: 302 INKSIGGRPSAYGQFSVPCDQVSDLPRITFTMGGRAFFLEGHEYVFRDIFKDQRICSSAF 361
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P+GPLWILGDVF+G Y+T FD + RIGFA++
Sbjct: 362 VAVDLPSPQGPLWILGDVFLGKYYTEFDMERHRIGFADS 400
>gi|195382956|ref|XP_002050194.1| GJ22010 [Drosophila virilis]
gi|194144991|gb|EDW61387.1| GJ22010 [Drosophila virilis]
Length = 394
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 118/253 (46%), Positives = 159/253 (62%), Gaps = 6/253 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N++DAQYFG I IG+PPQ F+VIFDTGS+NLWVPS C+ ++C HSRY SR S
Sbjct: 65 VPLSNYLDAQYFGPISIGTPPQKFNVIFDTGSANLWVPSESCHQKLACQIHSRYNSRHSR 124
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y GK +I YGSGS++G+ SQD V V + + +Q F EAT FL A+FDGI G
Sbjct: 125 SYKSDGKQFDIQYGSGSLAGYLSQDTVRVAGLEITNQTFAEATEMPGPIFLAAKFDGIFG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L +R I++ + P + ++EQ L+ VFS +LNR + +GG + FGG P++++G T
Sbjct: 185 LAYRGISIQNIKPPFYAVMEQNLLKRPVFSVYLNRIASSRQGGYLFFGGSSPRYYRGNFT 244
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPVT + YWQ +L IG +C GC I+D+GTS LA P IN +IGG
Sbjct: 245 YVPVTHRAYWQVKLEAARIG--PLQLCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTP 302
Query: 314 ---GVVSAECKLV 323
G S C+ V
Sbjct: 303 AAYGQFSVPCEQV 315
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/99 (47%), Positives = 61/99 (61%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P G+ + C+++P +P +SFT+G + F L E Y+ VC S F
Sbjct: 294 INESIGGTPAAYGQFSVPCEQVPHLPTLSFTLGGRRFELKGEDYVFHDIFSDRTVCASAF 353
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 354 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 392
>gi|384490965|gb|EIE82161.1| hypothetical protein RO3G_06866 [Rhizopus delemar RA 99-880]
Length = 403
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 163/258 (63%), Gaps = 5/258 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+GEI IG+P Q F+VIFDTGSSNLWVPS+ C S +C H RY S KS
Sbjct: 78 VPLSNYMNAQYYGEIQIGTPAQTFTVIFDTGSSNLWVPSTHC-MSFACLMHRRYSSSKST 136
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + I YGSGS+ G SQD + VG + ++DQ F E+T E LTF +ARFDGI G
Sbjct: 137 TYRKNETDFVIRYGSGSLQGINSQDTLRVGGIEIRDQGFAESTVEPGLTFAMARFDGIFG 196
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGK 252
LG+ I+V VP + NM+ + L+ +E+FSFWL+ D GGE+ FGG+D F G
Sbjct: 197 LGYDTISVQQTVPPFYNMINKKLIDQEIFSFWLSDTNDGNNNLGGELAFGGIDEARFSGN 256
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+ PVT+KGYW+ EL + +Q + G A +D+GTSLL PT V +N+ IGG
Sbjct: 257 ITWSPVTRKGYWEIELQNTKFNDQPMNM--GSIGAAIDTGTSLLIAPTAVAEFVNNQIGG 314
Query: 313 EGVVSAECKLVVSQYGDL 330
+ + + S G+L
Sbjct: 315 QADAYGQYTVDCSSVGNL 332
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 4/103 (3%)
Query: 403 VLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
V ++N + G+ +DC + +P F K F L + YIL + C
Sbjct: 304 VAEFVNNQIGGQADAYGQYTVDCSSVGNLPEFCFQFSGKDFCLQGKDYILD----VDGQC 359
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+SGF+A D+PPP GPLWI+GDVF+ +++++D R+GFA++
Sbjct: 360 MSGFVALDIPPPAGPLWIVGDVFLRKFYSIYDLQNHRVGFAQS 402
>gi|398396710|ref|XP_003851813.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
gi|339471693|gb|EGP86789.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
Length = 398
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 193/322 (59%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+LAS L+ +S G+ ++ L+K +L+ +S+ ++YMG +
Sbjct: 6 LLASALVAGTASAGVHKMKLQKVPLSEQLEGYSIEEQVQHLGQKYMGIRPQGRINEMF-- 63
Query: 70 SDEDILPLK-------NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
++ P K NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C
Sbjct: 64 KEQSYKPNKGHPVGVSNFLNAQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSKDC-GSIAC 122
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y HS+Y SNTY + G I YGSGS+ G+ SQD V++GD+ +K+Q+F EAT E L
Sbjct: 123 YLHSKYNHGDSNTYKQNGSDFAIQYGSGSLEGYISQDTVQIGDLKIKNQLFAEATSEPGL 182
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V P + NM++QGL+ E+VF+F+L+ +E E +FG
Sbjct: 183 AFAFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLSSTDKGDE-SEAIFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ H+ GK T +P+ +K YW+ + I +G+Q+ + G AI+D+GTSL+A P+ +
Sbjct: 242 GVNKDHYTGKMTNIPLRRKAYWEVDFDAITLGDQTAELDSTG--AILDTGTSLIALPSTM 299
Query: 303 VTEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 300 AELLNKEIGAKKGYNGQYSVEC 321
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C ++P+++FT+ F +S YIL+ + CIS FM FD+P P GPL
Sbjct: 315 GQYSVECSARDSLPDLTFTLTGHNFTISAYDYILE----VQGSCISAFMGFDIPAPAGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D G +G A+A
Sbjct: 371 AILGDAFLRRYYSVYDLGNNAVGLAKA 397
>gi|147743015|sp|P85139.1|CARDH_CYNCA RecName: Full=Cardosin-H; Contains: RecName: Full=Cardosin-H heavy
chain; Contains: RecName: Full=Cardosin-H light chain
Length = 265
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/243 (53%), Positives = 153/243 (62%), Gaps = 46/243 (18%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 1 DSGSAVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSMY 52
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY K+Q FIEAT E FL
Sbjct: 53 ESSGSSTY--------------------------------KEQDFIEATDETDNVFLHRL 80
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I +VPVW NM+ QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 81 FDGILGLSFQTI----SVPVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 134
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 135 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 194
Query: 309 AIG 311
AIG
Sbjct: 195 AIG 197
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/66 (65%), Positives = 46/66 (69%), Gaps = 4/66 (6%)
Query: 441 KIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
+F L+PEQYILK G A CISGF A D GPLWILGDVFM YHTVFD G L +
Sbjct: 204 NVFGLTPEQYILK---GEATQCISGFTAMD-ATLLGPLWILGDVFMRPYHTVFDYGNLLV 259
Query: 501 GFAEAA 506
GFAEAA
Sbjct: 260 GFAEAA 265
>gi|296230582|ref|XP_002760770.1| PREDICTED: cathepsin E isoform 1 [Callithrix jacchus]
Length = 396
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L DI +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDDIQVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|301786118|ref|XP_002928474.1| PREDICTED: cathepsin E-like [Ailuropoda melanoleuca]
Length = 396
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 124/248 (50%), Positives = 160/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR+ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRFYPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V+V +VV Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSVLGSHFSIQYGTGSLSGIIGADQVDVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLHW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ V ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITGPSDKVKQLQKAIGAEPM 306
Query: 314 -GVVSAEC 320
G EC
Sbjct: 307 DGEYGVEC 314
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYGVECANLNVMPDVTFTINGISYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 364
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRRFYSVFDRGNNRVGLAPA 394
>gi|210109642|gb|ACJ07131.1| cathepsin D-like protein, partial [Homarus gammarus]
Length = 231
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 122/232 (52%), Positives = 157/232 (67%), Gaps = 5/232 (2%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKS 142
QY+G I IG+P Q F VIFDTGSSNLW+PS KC+ +++C H+RY S KS+TY E G +
Sbjct: 1 QYYGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLACRLHNRYDSTKSSTYIENGTA 60
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
+I YGSG++ GF S DNVE+G V Q F EAT+E L F++ +FDGI+G+ F EI+V
Sbjct: 61 FDIQYGSGALHGFLSSDNVEMGGVNAMGQTFAEATQEPGLAFIMGKFDGILGMAFTEISV 120
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVT 259
V+D MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+
Sbjct: 121 MGIPTVFDTMVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVS 180
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
K GYWQ I +G+ TG C C AIVD+GTSL+AGP V EI H +G
Sbjct: 181 KVGYWQVTAEAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVQEIVHMLG 231
>gi|426198518|gb|EKV48444.1| hypothetical protein AGABI2DRAFT_192052 [Agaricus bisporus var.
bisporus H97]
Length = 413
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/288 (45%), Positives = 176/288 (61%), Gaps = 24/288 (8%)
Query: 57 GAGVSGVR--HRLGDSDEDIL-------------PLKNFMDAQYFGEIGIGSPPQNFSVI 101
GAG +G R H DE +L PL NFM+AQYF EI IGSPPQ F VI
Sbjct: 59 GAGGTGRRIAHPSQQDDETLLWTQEHQVQGGHGVPLSNFMNAQYFTEIQIGSPPQTFKVI 118
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPS KC SI+C+ H++Y S +S+TY G + EI YGSG++ GF SQD +
Sbjct: 119 LDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSSTYKANGSTFEIQYGSGAMEGFVSQDQL 177
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
++GD+ +K Q F EAT+E L F +FDGI+GLG+ I+V VP + M+EQ L+ E
Sbjct: 178 QIGDLTIKGQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYKMIEQNLLDER 237
Query: 222 VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC 281
VFSF L E+GGE VFGG+D +KGK YVP+ +K YW+ +L I +G + +
Sbjct: 238 VFSFRLGSSD--EDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELE 295
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
G A +D+GTSL+A P+ + +N IG + G + +C V S
Sbjct: 296 NTGAA--IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVAS 341
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 54/88 (61%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC ++ ++P ++F G + F L E Y+L + CIS F D+ P G L
Sbjct: 330 GQYTIDCAKVASLPELTFHFGGRAFPLKGEDYVLN----VQGSCISSFTGLDINLPWGSL 385
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI+GDVF+ Y+TV+D G+ +GFAE+A
Sbjct: 386 WIIGDVFLRRYYTVYDLGRDAVGFAESA 413
>gi|126681053|gb|ABO26561.1| cathepsin D-like aspartic protease [Ixodes ricinus]
Length = 382
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 122/242 (50%), Positives = 161/242 (66%), Gaps = 6/242 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F VIFDTGS+NLW+PSSKC + C H RY S KS+T
Sbjct: 52 PLVNLLDVEYYGPISIGTPPQDFQVIFDTGSANLWLPSSKCT-TKYCLHHHRYDSSKSST 110
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I YGSG++ GF S+D +G V Q EA G + L A FDGI+GL
Sbjct: 111 YEADGRNFTIVYGSGNVEGFISKDVCRIGSAKVSGQPLGEALVVGGESLLEAPFDGILGL 170
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ IAV VPV+DNM++QGL+ E+ VFS +LNRDP ++EGGE++FGG+D H+KG T
Sbjct: 171 AYPSIAVDGVVPVFDNMMKQGLLGEQNVFSVYLNRDPSSKEGGEVLFGGIDHDHYKGSIT 230
Query: 255 YVPVTKKGYWQFELGDILIGNQSTG----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
YVPVT KGYWQF + + + S +C+ GC AI D+GTSL+ GP V +N +
Sbjct: 231 YVPVTAKGYWQFHVDGVKSVSASKSAPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYL 290
Query: 311 GG 312
GG
Sbjct: 291 GG 292
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 62/88 (70%), Gaps = 3/88 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++DCD++ ++PNV+FTI K F+L + Y+LK + +C+SGFM+ ++P PL
Sbjct: 298 GQYLLDCDKLESLPNVTFTISGKEFSLRSKDYVLKVNQQGQTLCVSGFMSLEMPQ---PL 354
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI GDVF+G Y+ +FD + R+GFAE A
Sbjct: 355 WIFGDVFLGPYYPIFDRDQDRVGFAEVA 382
>gi|23237802|dbj|BAC16370.1| aspartic proteinase 4 [Glycine max]
Length = 169
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 110/169 (65%), Positives = 138/169 (81%), Gaps = 3/169 (1%)
Query: 341 PEKVCQQIGLCAFNGAEYVSTGIKTVVEK-ENVSAGD--SAVCSACEMAVVWVQNQLKQK 397
P+K+C QIGLC F+G VS GI++VV+K E S+G A CSACEMAV+W+QNQL+Q
Sbjct: 1 PKKICSQIGLCTFDGTHGVSMGIESVVDKNERKSSGSIRDAGCSACEMAVIWMQNQLRQN 60
Query: 398 QTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEG 457
QT+++++ Y NELCD LPNPMG+S +DC+++ +MP VSFTIG K+F+LSP++YILK GEG
Sbjct: 61 QTEDRIIDYANELCDKLPNPMGQSSVDCEKLSSMPIVSFTIGGKVFDLSPQEYILKVGEG 120
Query: 458 IAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
CISGF A D+PPPRGPLWILGDVFMG YHT+FD GKLR+GFAEAA
Sbjct: 121 PEAQCISGFTALDVPPPRGPLWILGDVFMGRYHTIFDYGKLRVGFAEAA 169
>gi|336273300|ref|XP_003351405.1| hypothetical protein SMAC_03712 [Sordaria macrospora k-hell]
Length = 381
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 187/310 (60%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
+L + +LL ++ G+ + LKK L L + I + ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPL-AEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+D + +P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y+S +S+TY + G S EI YGSGS+SGF SQD + +GD+ + DQ+F EAT E L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDFDAIGYGEDIADL--EGHGVILDTGTSLIALPSQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 35/58 (60%), Gaps = 4/58 (6%)
Query: 448 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
E YIL+ + C+S FM D+P P GPL ILGD F+ Y+TV+D G +G A A
Sbjct: 326 EDYILEA----SGSCLSTFMGMDMPAPVGPLAILGDAFLRKYYTVYDLGADTVGIATA 379
>gi|367031892|ref|XP_003665229.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
gi|347012500|gb|AEO59984.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
Length = 397
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 190/313 (60%), Gaps = 20/313 (6%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
++L + +LL ++ + ++ L+K L L A I + ++G + G+R R +D
Sbjct: 5 FLLTAAVLLGSAQGAVHKMKLQKIPLS-EQLEAVPINTQLEHLGQKYM-GLRPRESQADA 62
Query: 73 -------DI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
D+ +P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI
Sbjct: 63 IFKGMVADVKGNHPIPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVEC-GSI 121
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY HS+Y S S+TY + G S EI YGSGS+SGF SQD V +GD+ ++ Q F EAT E
Sbjct: 122 ACYLHSKYDSSASSTYKKNGTSFEIRYGSGSLSGFVSQDTVSIGDITIQGQDFAEATSEP 181
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F RFDGI+GLG+ I+V VP + MVEQ L+ E VF+F+L D E+V
Sbjct: 182 GLAFAFGRFDGILGLGYDRISVNGIVPPFYKMVEQKLIDEPVFAFYL---ADTNGQSEVV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 239 FGGVDHDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGI--ILDTGTSLIALPS 296
Query: 301 PVVTEINHAIGGE 313
+ +N IG +
Sbjct: 297 QLAEMLNAQIGAK 309
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC++ ++ +V+F + F L P Y+L+ + CIS FM D P P GPL
Sbjct: 314 GQYTIDCNKRDSLKDVTFNLAGYNFTLGPYDYVLE----VQGSCISTFMGMDFPAPTGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G +G AEA
Sbjct: 370 AILGDAFLRRYYSIYDLGADTVGLAEA 396
>gi|397504905|ref|XP_003823019.1| PREDICTED: renin [Pan paniscus]
Length = 406
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGSVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|344276734|ref|XP_003410162.1| PREDICTED: LOW QUALITY PROTEIN: renin-like [Loxodonta africana]
Length = 409
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 195/326 (59%), Gaps = 19/326 (5%)
Query: 2 EQKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
++ R L + SC LPA S RRI LKK + + R + KER + A +
Sbjct: 4 HSRMARWGLLLVLWGSCTFGLPADSGTFRRIFLKK-------MPSVRESLKERGVDVAKL 56
Query: 61 S------GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
S R LG+ ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPSS
Sbjct: 57 STEWSQFSKRVSLGNGTSPMI-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSS 115
Query: 115 KCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC +C H+RY S +S++Y E INYGSG + GF SQD V +G + V Q F
Sbjct: 116 KCSPLYTACETHNRYDSSESSSYVENKMEFTINYGSGKVKGFLSQDVVTMGGITVT-QTF 174
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDGI+G+GF AV PV+DN++ QG++ E+VFS + +R+
Sbjct: 175 GEVTELPVIPFMLAKFDGILGMGFPAQAVSGVTPVFDNIISQGVLKEDVFSVYYSRNSHL 234
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
GGEIV GG DP++++G YV ++K G WQ ++ + + +T CE GCAA+VD+G
Sbjct: 235 -LGGEIVLGGSDPQYYQGNFHYVSLSKNGLWQIKMKGVSV-RSATLFCEEGCAAMVDTGA 292
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAE 319
S + GPT + + A+G + +++ E
Sbjct: 293 SFITGPTSSLKLLMDALGAKELITNE 318
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 54/91 (59%), Gaps = 5/91 (5%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV-----CISGFMAFDLPPP 474
E +++C+++PT+P++SF +G + + L+ Y+L+ G + V C D+PPP
Sbjct: 318 EYVVNCNQVPTLPDISFHLGGRAYTLTSADYVLQVRLGTSTVNDDDLCTLAIHGLDVPPP 377
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GP W+LG F+ ++T FD RIGFA A
Sbjct: 378 LGPXWVLGASFIRKFYTEFDRRNNRIGFALA 408
>gi|297662235|ref|XP_002809619.1| PREDICTED: renin [Pongo abelii]
Length = 406
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G YV + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYVNLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|380092926|emb|CCC09679.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 410
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 187/310 (60%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
+L + +LL ++ G+ + LKK L L + I + ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPL-AEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+D + +P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y+S +S+TY + G S EI YGSGS+SGF SQD + +GD+ + DQ+F EAT E L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDFDAIGYGEDIADL--EGHGVILDTGTSLIALPSQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
>gi|187608619|ref|NP_001120469.1| cathepsin E precursor [Xenopus (Silurana) tropicalis]
gi|170284872|gb|AAI61297.1| LOC100145572 protein [Xenopus (Silurana) tropicalis]
Length = 397
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 125/285 (43%), Positives = 175/285 (61%), Gaps = 2/285 (0%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYF 86
GL R+ LK+++ L G + ++ PL N+MD +YF
Sbjct: 16 GLIRVPLKRQKSIRKKLKEKGKLSHVWTQQGIDMIQYTDSCSNNQAPSEPLINYMDVEYF 75
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
GEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY G + +
Sbjct: 76 GEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQFSSTYQSNGNNFSLQ 134
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+SG D+V V ++V+ Q F E+ E TF+ A FDGI+GLG+ IAVGD
Sbjct: 135 YGTGSLSGIIGTDSVSVEGILVQSQQFGESVSEPGSTFVDAEFDGILGLGYPSIAVGDCT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ Q LV +FS +++R+P++ GGE+VFGG D F G+ +V VT +GYWQ
Sbjct: 195 PVFDNMMTQNLVELPMFSVYMSRNPNSPVGGELVFGGFDASRFSGQLNWVSVTNQGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+L +I I N C GGC AIVD+GTSL+ GP+ + ++ IG
Sbjct: 255 QLDNIQI-NGEVVFCTGGCQAIVDTGTSLITGPSSDIVQLQSIIG 298
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/85 (49%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + MP V+FTI + ++P+QY L+ G GI C SGF D+ PP GPL
Sbjct: 304 GDYEVDCSVLNEMPTVTFTINGIGYQMTPQQYTLQDGGGI---CSSGFQGLDISPPAGPL 360
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y++VFD G R+G A
Sbjct: 361 WILGDVFIGQYYSVFDRGNNRVGLA 385
>gi|402857430|ref|XP_003893258.1| PREDICTED: cathepsin E [Papio anubis]
Length = 396
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 165/248 (66%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G ++
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLSW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|309319873|pdb|2X0B|A Chain A, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319875|pdb|2X0B|C Chain C, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319877|pdb|2X0B|E Chain E, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319879|pdb|2X0B|G Chain G, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
Length = 383
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/302 (40%), Positives = 189/302 (62%), Gaps = 19/302 (6%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG------VRHRLGDSDEDI 74
LP + +RI LK+ + + R + KER + A + R LG++ +
Sbjct: 1 LPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLGPEWSQPMKRLTLGNTTSSV 53
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
+ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S
Sbjct: 54 I-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 112
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++
Sbjct: 113 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 171
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKG 251
G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 172 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 231
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G
Sbjct: 232 NFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 290
Query: 312 GE 313
+
Sbjct: 291 AK 292
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 280 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 339
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 340 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 382
>gi|384498765|gb|EIE89256.1| endopeptidase [Rhizopus delemar RA 99-880]
Length = 401
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/256 (47%), Positives = 166/256 (64%), Gaps = 8/256 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+GEI IG+PPQ F+V+FDTGSSNLWVPS+ C SI+C+ H RY S S
Sbjct: 77 VPLSNYLNAQYYGEIEIGTPPQPFTVVFDTGSSNLWVPSTHCT-SIACFLHKRYDSASSR 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY+E G I YG+GS+ GF SQD + VG + V+DQ F E+T+E LTF A+FDGI G
Sbjct: 136 TYSENGTEFAIQYGTGSLEGFISQDTLSVGGIQVEDQGFAESTKEPGLTFAFAKFDGIFG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN-RDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V +P + +MV + LV E +FSFWLN + D + GGE++FGGVD HF+G
Sbjct: 196 LGYDTISVKHTIPPFYHMVNRDLVDEPLFSFWLNDANKDQDNGGELIFGGVDEDHFEGDI 255
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ V +KGYW+ + +I G+ + G A +D+G+SLL PT V IN +G E
Sbjct: 256 HWSDVRRKGYWEITMENIKFGDDYVDIDPVGAA--IDTGSSLLVAPTTVAALINKELGAE 313
Query: 314 ----GVVSAECKLVVS 325
G +C V S
Sbjct: 314 KNWAGQYVVDCNKVPS 329
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 75/139 (53%), Gaps = 6/139 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ EN+ GD V A + + L T V + IN+ + N G+ ++DC+
Sbjct: 268 ITMENIKFGDDYVDIDPVGAAIDTGSSLLVAPTT--VAALINKELGAEKNWAGQYVVDCN 325
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++P++P F K F L + Y+L+ + CISGFM D+P P GPLWI+GDVF+
Sbjct: 326 KVPSLPEFCFVFNGKDFCLEGKDYVLE----VQGQCISGFMGMDIPEPAGPLWIVGDVFL 381
Query: 487 GVYHTVFDSGKLRIGFAEA 505
+++V+D G R+G A +
Sbjct: 382 RKFYSVYDLGNNRVGLAPS 400
>gi|4506475|ref|NP_000528.1| renin preproprotein [Homo sapiens]
gi|57114109|ref|NP_001009122.1| renin precursor [Pan troglodytes]
gi|132326|sp|P00797.1|RENI_HUMAN RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|38503275|sp|P60016.1|RENI_PANTR RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|11118368|gb|AAG30305.1|AF193456_1 renin [Pan troglodytes]
gi|190994|gb|AAA60363.1| renin [Homo sapiens]
gi|337340|gb|AAD03461.1| renin [Homo sapiens]
gi|29126911|gb|AAH47752.1| Renin [Homo sapiens]
gi|49168484|emb|CAG38737.1| REN [Homo sapiens]
gi|54311156|gb|AAH33474.1| Renin [Homo sapiens]
gi|166706825|gb|ABY87560.1| renin [Homo sapiens]
gi|208967276|dbj|BAG73652.1| renin [synthetic construct]
gi|312153236|gb|ADQ33130.1| renin [synthetic construct]
Length = 406
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|345797646|ref|XP_545694.3| PREDICTED: cathepsin E [Canis lupus familiaris]
Length = 396
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 123/255 (48%), Positives = 164/255 (64%), Gaps = 8/255 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
D++E PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+++
Sbjct: 65 DTNE---PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHAKF 120
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+SNTY+ +G I YG+GS+SG D V V +VV Q F E+ E TF+ A
Sbjct: 121 YPSQSNTYSALGNQFSIQYGTGSLSGIIGADQVNVEGLVVVGQQFGESVTEPGQTFVNAE 180
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D H
Sbjct: 181 FDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGTGSELIFGGYDHSH 240
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ +
Sbjct: 241 FSGNLNWVPVTKQGYWQIALDAIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDEIKQLQN 299
Query: 309 AIGGE---GVVSAEC 320
AIG E G EC
Sbjct: 300 AIGAEPMDGEYGVEC 314
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE ++C + MP+V+F I + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYGVECANLNVMPDVTFIINGVSYTLQPTAYTLLDYVDGMEFCSSGFQGLDIQPPA 364
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRKFYSVFDRGNNRVGLALA 394
>gi|85094599|ref|XP_959917.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|59802879|sp|Q01294.2|CARP_NEUCR RecName: Full=Vacuolar protease A; Flags: Precursor
gi|28921374|gb|EAA30681.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|40804614|emb|CAF05874.1| aspartic proteinase, pepstatin-sensitive [Neurospora crassa]
gi|336467530|gb|EGO55694.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2508]
gi|350287820|gb|EGZ69056.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2509]
Length = 396
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 187/309 (60%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +LL ++ G+ + LKK +L+ ++ ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPLAEQLESVPIDVQVQHLGQKYTGLRTESHTQAMFKA 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+D + +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S +S+TY + G S +I YGSGS+SGF SQD + +GD+ + DQ+F EAT E L F
Sbjct: 124 HNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGGV
Sbjct: 184 AFGRFDGILGLGYDRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 241 NKDRYTGKITTIPLRRKAYWEVDFDAIGYGKDFAEL--EGHGVILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 MLNAQIGAK 307
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++ +V+FT+ F L PE YIL+ + C+S FM D+P P GPL
Sbjct: 312 GQFTIDCGKKSSLEDVTFTLAGYNFTLGPEDYILEA----SGSCLSTFMGMDMPAPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G +G A A
Sbjct: 368 AILGDAFLRKYYSIYDLGADTVGIATA 394
>gi|281207795|gb|EFA81975.1| cathepsin D [Polysphondylium pallidum PN500]
Length = 390
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 197/325 (60%), Gaps = 28/325 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L F V++ +P S N R +++ ++ A R+ +G +G +
Sbjct: 7 LAVFFAFIVVSQAFTVPLSFNKASRQAIRRIPQNIQKKFAGRL------LGASGTT---- 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCYF 124
+P+ ++ DAQY+G I IG+P Q+F V+FDTGSSNLW+PS KC ++ +C
Sbjct: 57 ---------IPISDYEDAQYYGAITIGTPAQSFKVVFDTGSSNLWIPSKKCPVTVVACDL 107
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y S KS++Y G S I YGSG++SGF SQD V+VG + V++Q+F EAT E + F
Sbjct: 108 HSKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVQVGSLTVQNQLFAEATAEPGIAF 167
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
LA+FDGI+GL F+ I+V PV+ NM+ QGLV + VF+FWL++ P A GGE+ FG +
Sbjct: 168 DLAKFDGILGLAFQSISVNSIPPVFYNMMAQGLVQQPVFAFWLSKVPGA-NGGELTFGSI 226
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVV 303
D + G TYVP+T + YW+F++ D + S G C GC AI DSGTSL+AGP+ +
Sbjct: 227 DTTRYTGPITYVPLTNETYWEFKMDDFALNGNSLGYCGADGCHAICDSGTSLIAGPSAQI 286
Query: 304 TEINHAIG-----GEGVVSAECKLV 323
+N +G GEG+ ++ C ++
Sbjct: 287 NALNTKLGAVVMNGEGIFTS-CSVI 310
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 54/88 (61%), Gaps = 1/88 (1%)
Query: 419 GESII-DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE I C I T+PN+ T+ + F L+P Y+L+ C+SGFM D+P P GP
Sbjct: 300 GEGIFTSCSVISTLPNIEITVAGRQFLLTPTDYVLQVTSMGQTECLSGFMGIDIPAPIGP 359
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ Y+ +FD G ++GFA A
Sbjct: 360 LWILGDVFISTYYAIFDYGNRQVGFATA 387
>gi|409079719|gb|EKM80080.1| hypothetical protein AGABI1DRAFT_113304 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 413
Score = 238 bits (607), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 175/288 (60%), Gaps = 24/288 (8%)
Query: 57 GAGVSGVR--HRLGDSDEDIL-------------PLKNFMDAQYFGEIGIGSPPQNFSVI 101
GAG +G R H DE +L PL NFM+AQYF EI IGSPPQ F VI
Sbjct: 59 GAGGTGRRIAHPSQQDDETLLWTQEHQVQGGHGVPLSNFMNAQYFTEIQIGSPPQTFKVI 118
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPS KC SI+C+ H++Y S +S+TY G + EI YGSG++ GF SQD +
Sbjct: 119 LDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSSTYKANGSTFEIQYGSGAMEGFVSQDQL 177
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
++GD+ + Q F EAT+E L F +FDGI+GLG+ I+V VP + M+EQ L+ E
Sbjct: 178 QIGDLTINGQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYKMIEQNLLDER 237
Query: 222 VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC 281
VFSF L E+GGE VFGG+D +KGK YVP+ +K YW+ +L I +G + +
Sbjct: 238 VFSFRLGS--SDEDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELE 295
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
G A +D+GTSL+A P+ + +N IG + G + +C V S
Sbjct: 296 NTGAA--IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVAS 341
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 54/88 (61%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC ++ ++P ++F G + F L E Y+L + CIS F D+ P G L
Sbjct: 330 GQYTIDCAKVASLPELTFHFGGRAFPLKGEDYVLN----VQGSCISSFTGLDINLPWGSL 385
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI+GDVF+ Y+TV+D G+ +GFAE+A
Sbjct: 386 WIIGDVFLRRYYTVYDLGRDAVGFAESA 413
>gi|407924694|gb|EKG17726.1| Peptidase A1 [Macrophomina phaseolina MS6]
Length = 378
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 120/244 (49%), Positives = 165/244 (67%), Gaps = 7/244 (2%)
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
E +P+ NF++AQYF E+ +G+PPQ F VI DTGSSNLWVPSS+C SI+CY H++Y S
Sbjct: 52 EHPVPVTNFLNAQYFSEVSLGTPPQTFKVILDTGSSNLWVPSSEC-GSIACYLHTKYDSS 110
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY++ G + EI YGSGS+SGF S D +GD+ VKDQ F EAT E L F RFDG
Sbjct: 111 ASSTYSKNGSTFEIRYGSGSLSGFVSNDVFTIGDLTVKDQDFAEATSEPGLAFAFGRFDG 170
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV--FGGVDPKHF 249
I+GLG+ I+V VP + NM++QGL+ E VF+F+L+ D EG E V FGG+D H+
Sbjct: 171 ILGLGYDTISVNHIVPPFYNMIDQGLLDEPVFAFYLSDTND--EGSESVATFGGIDESHY 228
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
GK T +P+ +K YW+ +L I G+ + + G AI+D+GTSL+A P+ + +N
Sbjct: 229 TGKLTKIPLRRKAYWEVDLDSITFGDATAELDNTG--AILDTGTSLIALPSTLAELLNKE 286
Query: 310 IGGE 313
IG +
Sbjct: 287 IGAK 290
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+ +P+++FT+ F ++ YIL+ + CIS FM D P P GPL
Sbjct: 295 GQYTVDCDKRDGLPDLTFTLTGHNFTITSYDYILE----VQGSCISAFMGMDFPEPAGPL 350
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 351 AILGDAFLRKWYSVYDLGNDAVGIAKA 377
>gi|402857516|ref|XP_003893299.1| PREDICTED: renin [Papio anubis]
gi|62287423|sp|Q6DLS0.1|RENI_MACFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50346961|gb|AAT75162.1| renin [Macaca fascicularis]
Length = 406
Score = 238 bits (606), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLALGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRDSENA 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|109018632|ref|XP_001090284.1| PREDICTED: cathepsin E isoform 4 [Macaca mulatta]
Length = 396
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 164/248 (66%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|410986349|ref|XP_003999473.1| PREDICTED: cathepsin E [Felis catus]
Length = 396
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/248 (48%), Positives = 161/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 69 PLINYMDTEYFGSISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V V ++V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSALGNHFSIQYGTGSLSGIIGTDQVYVEGLLVVGQQFGESVTEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP++ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPESGVGSELIFGGYDHSHFSGTLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDVIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGAEPM 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 52/90 (57%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE ++C + MP+V+F I + L P Y L E C SGF D+ PP
Sbjct: 305 PMDGEYAVECANLNVMPDVTFIINGVSYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 364
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 365 GPLWILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|337347|gb|AAA60364.1| renin [Homo sapiens]
Length = 403
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 197/319 (61%), Gaps = 19/319 (5%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMASLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA+FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR+ +
Sbjct: 176 EVTEMPALPFMLAQFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRNSQS- 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S
Sbjct: 235 LGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGAS 293
Query: 295 LLAGPTPVVTEINHAIGGE 313
++G T + ++ A+G +
Sbjct: 294 YISGSTSCIEKLMEALGAK 312
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 49/86 (56%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ ++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W
Sbjct: 317 DYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTW 376
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
LG F+ ++T FD RIGFA A
Sbjct: 377 ALGATFIRKFYTEFDRRNNRIGFALA 402
>gi|195029909|ref|XP_001987814.1| GH19747 [Drosophila grimshawi]
gi|193903814|gb|EDW02681.1| GH19747 [Drosophila grimshawi]
Length = 390
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 179/298 (60%), Gaps = 19/298 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL-------GDSDEDILPLKNFMDAQYFGEI 89
R+ LH + R R +++ G+ R RL GDS + PL N++DAQYFG I
Sbjct: 22 RVPLHRFPSVR-HRFQQF----GIRMDRLRLKYSLRTRGDSLRSV-PLSNYLDAQYFGPI 75
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
IG+PPQ F+VIFDTGS+NLWVPS C+ ++C HSRY +++S +Y G +I YGS
Sbjct: 76 SIGTPPQTFNVIFDTGSANLWVPSETCHRKLACQIHSRYNAKRSRSYKSNGSQFDIQYGS 135
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GS++G+ SQD V + + + +Q F EAT FL A+FDGI GLG++ I++ + P +
Sbjct: 136 GSLTGYLSQDTVRMAGLELLNQTFAEATDMPGPIFLAAKFDGIFGLGYQAISIKNIKPPF 195
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
++EQ L+ VFS +LNRD + +GG + FGG ++++G TYVPVT + YWQ +L
Sbjct: 196 YAVMEQSLLERPVFSVYLNRDSTSLQGGYLFFGGSSRRYYRGNFTYVPVTHRAYWQVKLE 255
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
IG +C+ GC I+D+GTS +A P IN +IGG G S C+ V
Sbjct: 256 AAYIGKLQ--MCQKGCHVIIDTGTSFIAVPYEQAILINESIGGTPAAYGQFSVPCEQV 311
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 59/99 (59%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P G+ + C+++P +P +SF +G + F + E Y+ VC S F
Sbjct: 290 INESIGGTPAAYGQFSVPCEQVPHLPTLSFALGGRRFQMKGEDYVFHDIFADRTVCASAF 349
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+ Y+T FD G RIGFA++
Sbjct: 350 IAVDLPSPSGPLWILGDVFLSKYYTEFDMGNHRIGFADS 388
>gi|149707989|ref|XP_001491088.1| PREDICTED: cathepsin E [Equus caballus]
Length = 396
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SSACKTHTRFYPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSMVGSQFSIQYGTGSLSGIIGADQVSVEGLTVVGQRFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP + ++ AIG +
Sbjct: 248 VPVTKQGYWQIALDAIQVGG-TVMFCSQGCQAIVDTGTSLITGPPDKIKQLQEAIGAQPM 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 52/91 (57%), Gaps = 1/91 (1%)
Query: 416 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
PM GE ++C + MP+V+FTI + L P Y L + C SGF D+ PP
Sbjct: 304 QPMDGEYAVECVNLNVMPDVTFTINGVPYTLQPTAYTLLDFVDGMQFCSSGFQGLDIQPP 363
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G +G A A
Sbjct: 364 AGPLWILGDVFIRQFYSVFDRGNNLVGLAPA 394
>gi|426333405|ref|XP_004028268.1| PREDICTED: renin [Gorilla gorilla gorilla]
Length = 406
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWRQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENF 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|60816208|gb|AAX36374.1| cathepsin E [synthetic construct]
Length = 396
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 51/87 (58%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLTPA 394
>gi|443927046|gb|ELU45582.1| endopeptidase [Rhizoctonia solani AG-1 IA]
Length = 934
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 157/238 (65%), Gaps = 5/238 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +GSPPQ+F V+ DTGSSNLWVP C SI+C+ H++Y S SN
Sbjct: 121 VPLHNYLNAQYYADITLGSPPQSFKVVLDTGSSNLWVPGKSCT-SIACFLHAKYDSSASN 179
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+SGF SQD + +GD+ VK Q F EAT+E L F +FDGI+G
Sbjct: 180 TYKANGTEFAIQYGSGSLSGFMSQDTLTIGDIAVKHQDFAEATKEPGLAFAFGKFDGILG 239
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F I+V AVP NM++QGL+ E +F+F + ++GGE VFGG+D H+KGK
Sbjct: 240 LAFPRISVNGAVPPVYNMIDQGLIKEPLFTFRVGS--SEQDGGEAVFGGIDESHYKGKIH 297
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV ++ YW+ EL + +G + + G A +D+GTSL+A PT + IN IG
Sbjct: 298 YVPVRRQAYWEVELSSVSLGEDTLELENTGAA--IDTGTSLIALPTDIAEMINAQIGA 353
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 50/85 (58%), Gaps = 5/85 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP-PPRGP 477
G+ + CD++P++P+++F G K + L Y+L + CIS F D+ P G
Sbjct: 359 GQYTVPCDKVPSLPDLTFQFGGKPYALGGSDYVLN----VQGTCISAFTGLDINLPDGGS 414
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGF 502
+WI+GDVF+ Y TV+D G+ +GF
Sbjct: 415 IWIVGDVFLRKYFTVYDIGRDAVGF 439
>gi|4503145|ref|NP_001901.1| cathepsin E isoform a preproprotein [Homo sapiens]
gi|114572172|ref|XP_001163151.1| PREDICTED: cathepsin E isoform 2 [Pan troglodytes]
gi|181194|gb|AAA52130.1| cathepsin E precursor [Homo sapiens]
gi|181205|gb|AAA52300.1| cathepsin E [Homo sapiens]
gi|7339520|emb|CAB82850.1| procathepsin E [Homo sapiens]
gi|27502799|gb|AAH42537.1| Cathepsin E [Homo sapiens]
gi|61358295|gb|AAX41543.1| cathepsin E [synthetic construct]
gi|119611998|gb|EAW91592.1| cathepsin E, isoform CRA_a [Homo sapiens]
gi|158257546|dbj|BAF84746.1| unnamed protein product [Homo sapiens]
gi|325463731|gb|ADZ15636.1| cathepsin E [synthetic construct]
Length = 396
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|397504824|ref|XP_003822980.1| PREDICTED: cathepsin E [Pan paniscus]
Length = 396
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMLAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|115719|sp|P00795.2|CATD_PIG RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
Length = 345
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 178/254 (70%), Gaps = 12/254 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 66
Query: 136 YTEIGKSCEINYGSGSISGFFS-QDNVEV---------GDVVVKDQVFIEATREGSLTFL 185
Y + G + I+YGSGS+SG+ S QD V V G + V+ Q F EAT++ LTF+
Sbjct: 67 YVKNGTTFAIHYGSGSLSGYLSSQDTVSVPCNSALSGVGGIKVERQTFGEATKQPGLTFI 126
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + I+V + VPV+DN+++Q LV +++FSF+LNRDP A+ GGE++ GG+D
Sbjct: 127 AAKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKDIFSFYLNRDPGAQPGGELMLGGID 186
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ G V E
Sbjct: 187 SKYYKGSLDYHNVTRKAYWQIHMNQVAVGSSLT-LCKGGCEAIVDTGTSLIVGQPEEVRE 245
Query: 306 INHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 246 LGKAIGAVPLIQGE 259
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 84/141 (59%), Gaps = 3/141 (2%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDC 425
+ V+ G S +C A+V L Q +E + + + ++P GE +I C
Sbjct: 207 IHMNQVAVGSSLTLCKGGCEAIVDTGTSLIVGQPEE--VRELGKAIGAVPLIQGEYMIPC 264
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+++P++P+V+ T+G K + LS E Y LK + +C+SGFM D+PPP GPLWILGDVF
Sbjct: 265 EKVPSLPDVTVTLGGKKYKLSSENYTLKVSQAGQTICLSGFMGMDIPPPGGPLWILGDVF 324
Query: 486 MGVYHTVFDSGKLRIGFAEAA 506
+G Y+TVFD R+G AEAA
Sbjct: 325 IGRYYTVFDRDLNRVGLAEAA 345
>gi|118102416|ref|XP_001235024.1| PREDICTED: cathepsin E [Gallus gallus]
Length = 397
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/293 (41%), Positives = 180/293 (61%), Gaps = 8/293 (2%)
Query: 26 NGLRRIGLKKRRLDLHSL-NAARITR--KERYMGGAGVSGVRHRLGDSDEDILPLKNFMD 82
NGL+R+ L + R SL + ++++ K + S G+++E PL N++D
Sbjct: 20 NGLKRVTLTRHRSLRKSLRDRGQLSQFWKAHRLDMVQYSQDCSLFGEANE---PLINYLD 76
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
+YFG+I IG+PPQNF+V+FDTGSSNLWVPS C S +C H+R++ S+TY +G
Sbjct: 77 MEYFGQISIGTPPQNFTVVFDTGSSNLWVPSIYCT-SKACTKHARFQPSHSSTYQPLGIP 135
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V V + V +Q F E+ E TF + FDGI+GL + +AV
Sbjct: 136 VSIQYGTGSLTGIIGSDQVTVEGMTVYNQPFAESVSEPGKTFQDSEFDGILGLAYPSLAV 195
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
PV+DNM+ Q LV +FS +++ +PD+ GGE++FGG DP F G +VPVT++G
Sbjct: 196 DGVTPVFDNMMAQDLVEMPIFSVYMSANPDSSLGGEVLFGGFDPSRFLGTLHWVPVTQQG 255
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
YWQ +L ++ +G + C GC AIVD+GTSLL GPT + E+ IG +
Sbjct: 256 YWQIQLDNVQVGG-TVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAM 307
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 367 VEKENVSAGDS-AVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESII 423
++ +NV G + A C+ A+V L TK+ ++ YI GE I+
Sbjct: 259 IQLDNVQVGGTVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAMD-----GEYIV 313
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC R+ +MP V+FTI + LS + Y L ++C+SGF D+PPP GPLWILGD
Sbjct: 314 DCGRLSSMPIVTFTINGIPYVLSAQAYTLMEQSDGVDICLSGFQGMDVPPPAGPLWILGD 373
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
VF+ Y++VFD G R+GFA A
Sbjct: 374 VFIRQYYSVFDRGNNRVGFAPTA 396
>gi|426333516|ref|XP_004028322.1| PREDICTED: cathepsin E isoform 1 [Gorilla gorilla gorilla]
Length = 396
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGSAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|198457045|ref|XP_001360531.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
gi|198135836|gb|EAL25106.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
Length = 399
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/334 (39%), Positives = 187/334 (55%), Gaps = 36/334 (10%)
Query: 12 LWVLASCLLLP-----ASSNGLRRIGLKKR------------RLDLHSLNAARITRKERY 54
+W+L L+LP + S L R+ L++ R+D L +R+ + R
Sbjct: 1 MWLLFLSLILPPLVAPSPSTELYRVPLRRFPSARNRFVQFGIRMDRFRLKYSRVDGRSRP 60
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GG V PL N++DAQYFG I IGSPPQ F VIFDTGSSNLWVPS+
Sbjct: 61 RGGWEVRSE------------PLSNYLDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPST 108
Query: 115 KCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
C + ++C HSRY +R+S+++ G I+YGSGS++G+ S D V V + +++Q F
Sbjct: 109 SCAPTMVACMVHSRYNARQSSSHRRNGVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTF 168
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T FL A+FDGI GL ++ I++ D P + ++EQ L+S VFS +LNR +
Sbjct: 169 AEVTTMPGPIFLAAKFDGIFGLAYQSISMQDVKPPFYAIMEQKLLSNPVFSVYLNRQQEH 228
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
EGG + FGG +P++++G TYVPV+ + YWQ + I + +C+ GC I+D+GT
Sbjct: 229 PEGGALFFGGSNPRYYRGNFTYVPVSHRAYWQVRMEAATINDLR--LCQHGCEVIIDTGT 286
Query: 294 SLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
S LA P IN +IGG G S C V
Sbjct: 287 SFLALPYDQAILINESIGGTPSEYGQYSVPCDQV 320
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ + CD++P +P ++F +G + F L YI + E+C S
Sbjct: 299 INESIGGTPSEYGQYSVPCDQVPQLPRLTFQLGSQQFFLDGSNYIFRDVYQDREICFSAI 358
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ DLP P GPLWILGDVF+G Y+T FD G RIGFAEA
Sbjct: 359 IGVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFAEA 397
>gi|449299914|gb|EMC95927.1| hypothetical protein BAUCODRAFT_34686 [Baudoinia compniacensis UAMH
10762]
Length = 376
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 115/249 (46%), Positives = 164/249 (65%), Gaps = 8/249 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF +I IG+PPQ+F V+ DTGSSNLWVPS C SI+CY HS+Y S+TY
Sbjct: 56 VSNFLNAQYFSDISIGTPPQDFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDHSDSSTY 114
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G +I YGSG + G+ SQD V +GD+ +K+Q+F EAT E L F RFDGI+GLG
Sbjct: 115 KKNGSDFQIRYGSGELEGYISQDTVRIGDLSIKNQLFAEATSEPGLAFAFGRFDGIMGLG 174
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ QGL+ E+VF+F+L+ D + + E FGG+D H++GK T +
Sbjct: 175 YDTISVNHIVPPFYNMINQGLIDEQVFAFYLS-DTNKGDESEATFGGIDESHYEGKMTKI 233
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ +L I G+Q+ + G AI+D+GTSL+A PT + +N IG +
Sbjct: 234 PLRRKAYWEVDLDAITFGDQTAEIDSTG--AILDTGTSLIALPTTLAELLNREIGAKKSY 291
Query: 314 -GVVSAECK 321
G + EC
Sbjct: 292 NGQYTIECN 300
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+C++ ++P+++FT+ F + P YIL+ + CIS FM FD+P P GPL
Sbjct: 293 GQYTIECNKRDSLPDLTFTLTGYNFTIGPYDYILE----VQGSCISSFMGFDIPEPAGPL 348
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 349 AILGDAFLRKWYSVYDLGNNAVGLAKA 375
>gi|403294878|ref|XP_003938389.1| PREDICTED: cathepsin E [Saimiri boliviensis boliviensis]
Length = 396
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMEFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|351710945|gb|EHB13864.1| Cathepsin E, partial [Heterocephalus glaber]
Length = 391
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 158/248 (63%), Gaps = 6/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + SNT
Sbjct: 65 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSLSNT 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 124 YSEVGNPFSIQYGTGSLTGIIGADQVSVEGLTVVGQQFGESVKEPGQTFVHAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +++ +P GGE+ FGG DP HF G +
Sbjct: 184 GYPSLAAGGVTPVFDNMMAQNLVALPLFSVYMSSNPGG-SGGELTFGGYDPSHFSGSLNW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVTK+ YWQ L IL+G+ S C GC AIVD+GTSL+ GP P + ++ A+G V
Sbjct: 243 VPVTKQAYWQIALDGILVGD-SVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGATYV 301
Query: 316 ---VSAEC 320
+ EC
Sbjct: 302 DEEYAVEC 309
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 74/147 (50%), Gaps = 18/147 (12%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQL------KQKQTKEKV-LSYINELCDSLPNPM 418
+ + + GDS + CS A+V L K KQ +E + +Y++E
Sbjct: 253 IALDGILVGDSVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGATYVDE--------- 303
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
E ++C + M +V+F I ++ LSP Y L +VC +GF ++ PP GPL
Sbjct: 304 -EYAVECANLNMMQDVTFVINGVLYTLSPTAYTLLDYADGMQVCSTGFQGLEIQPPAGPL 362
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ ++ VFD G ++G A A
Sbjct: 363 WILGDVFIRQFYAVFDRGNNQVGLAPA 389
>gi|355681644|gb|AER96811.1| cathepsin E [Mustela putorius furo]
Length = 375
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 159/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I +GSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 48 PLINYLDMEYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFYPSQSST 106
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V V +VV Q F E+ E TF+ A FDGI+GL
Sbjct: 107 YSTLGSHFSIQYGTGSLSGILGADQVNVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGL 166
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 167 GYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLNW 226
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 227 VPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGAEPM 285
Query: 314 -GVVSAEC 320
G EC
Sbjct: 286 DGEYGVEC 293
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 284 PMDGEYGVECANLNVMPDVTFTINGVSYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 343
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 344 GPLWILGDVFIRQFYSVFDRGNNRVGLAPA 373
>gi|126723599|ref|NP_001075713.1| cathepsin E precursor [Oryctolagus cuniculus]
gi|1168791|sp|P43159.1|CATE_RABIT RecName: Full=Cathepsin E; Flags: Precursor
gi|402729|gb|AAC37308.1| procathepsin E [Oryctolagus cuniculus]
Length = 396
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/249 (47%), Positives = 161/249 (64%), Gaps = 5/249 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDT SSNLWVPS C S +C H +++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTVSSNLWVPSVYCT-SPACQMHPQFRPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YSEVGTPFSIAYGTGSLTGIIGADQVSVQGLTVVGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LVS +FS +++ +P+ G E+ FGG D HF G +
Sbjct: 188 GYPSLAAGGVTPVFDNMMAQNLVSLPMFSVYMSSNPEGGSGSELTFGGYDSSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G S C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 VPVTKQGYWQIALDEIQVGG-SPMFCPEGCQAIVDTGTSLITGPSDKIIQLQAAIGATPM 306
Query: 313 EGVVSAECK 321
+G + EC+
Sbjct: 307 DGEYAVECE 315
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 1/91 (1%)
Query: 416 NPM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
PM GE ++C+ + MP+V+F I + LS Y L + C SGF D+ PP
Sbjct: 304 TPMDGEYAVECENLNIMPDVTFVINGVPYTLSATAYTLPDFVDGMQFCGSGFQGLDIQPP 363
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 364 AGPLWILGDVFIRQFYSVFDRGSNRVGLAPA 394
>gi|169600915|ref|XP_001793880.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
gi|111068923|gb|EAT90043.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
Length = 347
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/250 (47%), Positives = 166/250 (66%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+
Sbjct: 25 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSSSS 83
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSG +SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 84 TYKKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 143
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+EQGL+ E VF+F+L D +A++ E FGG+D H+ GK
Sbjct: 144 LGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLG-DTNAQQESEATFGGIDESHYSGKLI 202
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ + + G I+D+GTSL+A P+ + +N IG +
Sbjct: 203 KLPLRRKAYWEVDLDAITFGKETAEMDDTGV--ILDTGTSLIALPSTIAELLNKEIGAKK 260
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 261 GFNGQYTVEC 270
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ +P+++FT+ F +S YIL+ + CIS FM D P P GPL
Sbjct: 264 GQYTVECDKRDGLPDLTFTLTGHNFTISAFDYILE----VQGSCISAFMGMDFPEPVGPL 319
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A++
Sbjct: 320 AILGDAFLRKWYSVYDVGNNAVGLAKS 346
>gi|195997417|ref|XP_002108577.1| hypothetical protein TRIADDRAFT_19349 [Trichoplax adhaerens]
gi|190589353|gb|EDV29375.1| hypothetical protein TRIADDRAFT_19349, partial [Trichoplax
adhaerens]
Length = 370
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 119/241 (49%), Positives = 159/241 (65%), Gaps = 3/241 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N++DA+YFG I IG+PPQ+F V+FDTGSS+ WVPSS+C S +C H RY KS+TY
Sbjct: 46 LNNYLDAEYFGPITIGTPPQDFLVLFDTGSSDFWVPSSECT-SQACEMHHRYDHSKSSTY 104
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GK I YGSGS GF S D V+V + V++ F E T F A+FDGI+GLG
Sbjct: 105 RPNGKRWSIEYGSGSAEGFLSTDVVKVAGITVQNVTFGEVTNLPGPIFAAAKFDGILGLG 164
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++V ++D M++QGL+ + VFS +LNR GGE+VFGG DP ++ G +YV
Sbjct: 165 FASLSVEGVKTIFDLMLQQGLIQKPVFSVYLNRQGTQNVGGELVFGGSDPNYYTGAFSYV 224
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P++K+GYWQFEL I N+ CEGGC A++D+GTSL+ GP V +INH IG + +
Sbjct: 225 PLSKEGYWQFELDGGTIENEF--FCEGGCQAVIDTGTSLIVGPNEEVAKINHLIGADSIQ 282
Query: 317 S 317
S
Sbjct: 283 S 283
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/86 (45%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+S+++C+ +P +P ++ TIG K ++LS ++YILK +G E+C SGF + G W
Sbjct: 282 QSLVNCNSMPELPVITLTIGGKEYSLSGQEYILKYRQGEQEICRSGFQGGNFEGI-GVQW 340
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGDVF+G Y+T FD G R+GFA+A
Sbjct: 341 ILGDVFIGTYYTEFDKGNGRLGFAKA 366
>gi|451853159|gb|EMD66453.1| hypothetical protein COCSADRAFT_34972 [Cochliobolus sativus ND90Pr]
Length = 399
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 165/250 (66%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF +I +G+PPQ+F VI DTGSSNLWVPS++C SI+CY H++Y S S+
Sbjct: 77 VPVSNFLNAQYFSDISLGTPPQSFKVILDTGSSNLWVPSTECS-SIACYLHTKYDSSASS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGSEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ QGL+ E VF+F+L D +E E FGG+D H+ GK T
Sbjct: 196 LGYDTISVNGIVPPFYNMLNQGLLDEPVFAFYLGDTKDGKE-SEATFGGIDESHYTGKLT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ + G AI+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVDLDAITFGKETAEMENIG--AILDTGTSLIALPSAIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 313 GFNGQYSVEC 322
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C++ ++PN++FT+ F + YIL+ + CIS FM D+P P GPL
Sbjct: 316 GQYSVECNKRDSLPNLTFTLTGHNFTIDAYDYILE----VQGSCISAFMGMDIPEPAGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G + A++
Sbjct: 372 AILGDAFLRKWYSVYDLGNSAVALAKS 398
>gi|74136391|ref|NP_001028088.1| renin precursor [Macaca mulatta]
gi|67461396|sp|Q6DLW5.2|RENI_MACMU RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|61699710|gb|AAT74864.2| prorenin [Macaca mulatta]
Length = 406
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 196/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLALGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRDSENA 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ + + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIPMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 322 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 381
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 382 GATFIRKFYTEFDRRNNRIGFALA 405
>gi|403294825|ref|XP_003938364.1| PREDICTED: renin [Saimiri boliviensis boliviensis]
Length = 400
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 185/296 (62%), Gaps = 13/296 (4%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
LP + +RI LK+ + + R + KER + A + R L + ++ L N+
Sbjct: 24 LPTDTITFKRISLKR-------MPSIRESLKERGVDMARLGPERMALVNVTSSVI-LTNY 75
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++Y
Sbjct: 76 MDTQYYGEIGIGTPPQIFKVVFDTGSSNVWVPSSKCSRLYTACAYHKLFDASDSSSYKHN 135
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + Y +G++SGF SQD + VG + V Q F E T +L F+LA FDG++G+GF E
Sbjct: 136 GTELTLRYSTGTVSGFLSQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVGMGFIE 194
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVP 257
A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+
Sbjct: 195 QAIGRVTPLFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYIN 254
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + G WQ + + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 255 LIRTGLWQIPMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 309
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P+++F +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 316 VVKCNEGPTLPDIAFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 375
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 376 GATFIRKFYTEFDRRNNRIGFALA 399
>gi|346973691|gb|EGY17143.1| vacuolar protease A [Verticillium dahliae VdLs.17]
Length = 398
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 163/250 (65%), Gaps = 7/250 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY H++Y S S+
Sbjct: 74 VPVSNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQQCS-SIACYLHTKYDSSDSS 132
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS++GF SQD V +GD+ +K+Q F EAT E L F RFDGI+G
Sbjct: 133 TYKANGSEFEIHYGSGSLTGFVSQDTVTIGDIKIKNQDFAEATSEPGLAFAFGRFDGILG 192
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MV Q V E VF+F+L + + E+VFGGVD H++GK T
Sbjct: 193 LGYDTISVNKIVPPFYQMVNQKAVDEPVFAFYLGDTNEQGDESEVVFGGVDESHYEGKIT 252
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I +G+ + + G AI+D+GTSL P+ + +N+ IG +
Sbjct: 253 TIPLRRKAYWEVDLDSISLGDNTAEL--DGHGAILDTGTSLNVLPSTLADMLNNEIGAKK 310
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 311 GYNGQWSVEC 320
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++F + F++S YIL+ ++ CIS F D P P GPL
Sbjct: 314 GQWSVECDKRASLPDITFNLAGYNFSISAYDYILE----VSGSCISTFQGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++++D GK +G A+A
Sbjct: 370 VILGDAFLRRWYSIYDLGKNTVGLAKA 396
>gi|73535294|pdb|1TZS|A Chain A, Crystal Structure Of An Activation Intermediate Of
Cathepsin E
Length = 351
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 16 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 74
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 75 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 134
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 135 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 194
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 195 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 253
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 254 DGEYAVEC 261
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 255 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 314
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 315 WILGDVFIRQFYSVFDRGNNRVGLAPA 341
>gi|405117936|gb|AFR92711.1| endopeptidase [Cryptococcus neoformans var. grubii H99]
Length = 438
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/255 (48%), Positives = 163/255 (63%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF + +G+P Q F V+ DTGSSNLWVPS KC SI+C+ H++Y S +S+
Sbjct: 117 VPLSNFMNAQYFATVELGTPFQTFKVVLDTGSSNLWVPSVKCT-SIACFLHNKYDSSQSS 175
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD + +GD+VVK Q F EAT+E L F +FDGI+G
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTLSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ Q L+ E VFSF L E+GGE +FGG+D + GK
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFSFRLGS--SDEDGGEAIFGGIDDSAYSGKLA 293
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+ PT V +N IG E
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTGAA--IDTGTSLIVMPTDVAELLNKEIGAEK 351
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 352 SWNGQYTVDCNTVSS 366
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 76/139 (54%), Gaps = 6/139 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E++S GD + A + L T V +N+ + + G+ +DC+
Sbjct: 305 VELESISFGDEELELENTGAAIDTGTSLIVMPTD--VAELLNKEIGAEKSWNGQYTVDCN 362
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+ ++P ++FT G K + LS + YIL G CIS F D+P P GPLWI+GDVF+
Sbjct: 363 TVSSLPELAFTFGGKDYTLSADDYILNAGG----TCISSFTGMDIPAPIGPLWIVGDVFL 418
Query: 487 GVYHTVFDSGKLRIGFAEA 505
Y+TV+D G+ +GFAE+
Sbjct: 419 RKYYTVYDLGRNAVGFAES 437
>gi|1507725|gb|AAB06575.1| aspartic protease, partial [Ancylostoma caninum]
Length = 442
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 117/227 (51%), Positives = 154/227 (67%), Gaps = 5/227 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS K F I+C RY S S+T
Sbjct: 80 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSEKMPFHDIACMLRHRYDSGASST 139
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G+ I YG+GS+ GF S+DNV + + ++Q F EAT E LTF+ A+FDGI+G+
Sbjct: 140 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGI 199
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ +EQ V VF+ WLNR+PD+E GGEI GG+D + + T+
Sbjct: 200 TFPEISVLGVPPVFHTFIEQKKVPSPVFALWLNRNPDSELGGEITLGGMDTRRYVEPITW 259
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEG---GCAAIVDSGTSLLAGP 299
PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP
Sbjct: 260 TPVTRRGYWQFKM-DKVQGGSTSIACPNEFSGCQAIADTGTSLIAGP 305
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/91 (49%), Positives = 58/91 (63%)
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P GE +I CD++P P +SF I + F L E Y+L G +C+SGFM D P
Sbjct: 320 PTYEGEYMIPCDKVPFPPRLSFVIEARTFTLKGEDYVLTVKAGGKSICLSGFMGMDFPER 379
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
G LWILGDVF+G Y+TVFD G+ R+GFA+A
Sbjct: 380 IGELWILGDVFIGKYYTVFDVGQARLGFAQA 410
>gi|58258949|ref|XP_566887.1| endopeptidase [Cryptococcus neoformans var. neoformans JEC21]
gi|134107071|ref|XP_777848.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260546|gb|EAL23201.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223024|gb|AAW41068.1| endopeptidase, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 438
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 163/255 (63%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF + IG+P Q F VI DTGSSNLWVPS KC SI+C+ HS+Y S +S+
Sbjct: 117 VPLSNYMNAQYFATMEIGTPFQTFKVILDTGSSNLWVPSVKCT-SIACFLHSKYDSSQSS 175
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD V +GD+VVK Q F EAT+E L F +FDGI+G
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTVSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ Q L+ E VFSF L E+GGE +FGG+D + G+
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFSFRLGS--SDEDGGEAIFGGIDDSAYSGELQ 293
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+ PT V +N IG E
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTGAA--IDTGTSLIVMPTDVAELLNKEIGAEK 351
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 352 SWNGQYTVDCSTVSS 366
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 75/139 (53%), Gaps = 6/139 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E++S GD + A + L T V +N+ + + G+ +DC
Sbjct: 305 VELESISFGDEELELENTGAAIDTGTSLIVMPTD--VAELLNKEIGAEKSWNGQYTVDCS 362
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+ ++P ++FT G K + L+ + YIL G CIS F D+P P GPLWI+GDVF+
Sbjct: 363 TVSSLPVLAFTFGGKDYKLTGDDYILNAGG----TCISSFTGMDIPAPIGPLWIVGDVFL 418
Query: 487 GVYHTVFDSGKLRIGFAEA 505
Y+TV+D GK +GFA++
Sbjct: 419 RKYYTVYDLGKNAVGFAKS 437
>gi|308809631|ref|XP_003082125.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
gi|116060592|emb|CAL55928.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
Length = 505
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 115/242 (47%), Positives = 153/242 (63%), Gaps = 9/242 (3%)
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S+ C H+++ S S TY G I YGSGS+SGF SQD+V VGD+ VK Q F EAT+
Sbjct: 91 SVPCDLHAKFDSAASETYEADGTPFAIQYGSGSLSGFLSQDDVTVGDITVKGQYFAEATK 150
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE---- 234
E + FL A+FDGI+GLGF I+V PV+ NM+EQ L+ + +FSFWLNR + +
Sbjct: 151 EPGIAFLFAKFDGILGLGFDTISVDKVKPVFYNMMEQKLIDKNMFSFWLNRTSNVDGTPS 210
Query: 235 -EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG--GCAAIVDS 291
GGE+VFGG DPKHF G+HTY PVT+ GYWQ ++ D + +S GVC+G GC I D+
Sbjct: 211 VTGGELVFGGSDPKHFVGEHTYAPVTRAGYWQIKMDDFKVAGRSLGVCKGENGCQVIADT 270
Query: 292 GTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD--LIWDLLVSGLLPEKVCQQIG 349
GTSLL GP VV +IN IG ++ EC++++ QY D + E++C IG
Sbjct: 271 GTSLLTGPADVVKKINDYIGAHSMLGEECRMLIDQYADEXXXXXXXLETYTSEQICTSIG 330
Query: 350 LC 351
C
Sbjct: 331 AC 332
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 34/60 (56%)
Query: 380 CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIG 439
C AC V + QN L + T + S + +CD +P+ G + +DC+ IP MPNV F IG
Sbjct: 434 CKACTTVVNYAQNLLSENATSRVIASEVKRVCDMIPSYGGTAAVDCEDIPHMPNVEFVIG 493
>gi|37790800|gb|AAR03502.1| renin [Homo sapiens]
gi|119611911|gb|EAW91505.1| renin [Homo sapiens]
Length = 403
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 196/319 (61%), Gaps = 19/319 (5%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR+ +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRNSQS- 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S
Sbjct: 235 LGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGAS 293
Query: 295 LLAGPTPVVTEINHAIGGE 313
++G T + ++ A+G +
Sbjct: 294 YISGSTSSIEKLMEALGAK 312
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 300 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 359
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 360 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 402
>gi|354478111|ref|XP_003501259.1| PREDICTED: cathepsin E-like isoform 1 [Cricetulus griseus]
Length = 396
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 69 PLINYLDVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YEEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E+ FGG DP HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L + +G+ + C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 IPVTKQGYWQIALDGVQVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGATPM 306
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 307 DGEYAVDC 314
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 42/90 (46%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE +DC + TMPNV+F + + LSP YIL + C SGF D+ PP
Sbjct: 305 PMDGEYAVDCANLNTMPNVAFILNGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPPS 364
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ ++ VFD G ++G A A
Sbjct: 365 GPLWILGDVFIRQFYAVFDRGNNQVGLAPA 394
>gi|1039445|gb|AAA79878.1| vacuolar protease A [Neurospora crassa]
Length = 396
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 188/309 (60%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHS--LNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +LL ++ G+ + LKK L +L S ++ ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPLADELESVPIDVQVQHLGQKYTGLRTESHTQAMFKA 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+D + +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S +S+TY + G S +I YGSGS+SGF SQD + +GD+ + DQ+F EAT E L F
Sbjct: 124 HNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ +AV P + MVEQ LV E VFSF+L D D E E+VFGGV
Sbjct: 184 AFGRFDGILGLGYDRLAVPGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 241 NKDRYTGKITTIPLRRKAYWEVDFDAIGYGKDFAEL--EGHGVILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 MLNAQIGAK 307
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++ +V+FT+ F L PE YIL+ + C+S FM D+P P GPL
Sbjct: 312 GQFTIDCGKKSSLEDVTFTLAGYNFTLGPEDYILEA----SGSCLSTFMGMDMPAPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G +G A A
Sbjct: 368 AILGDAFLRKYYSIYDLGADTVGIATA 394
>gi|403414885|emb|CCM01585.1| predicted protein [Fibroporia radiculosa]
Length = 414
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 148/375 (39%), Positives = 197/375 (52%), Gaps = 54/375 (14%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERY----------MGGAGVSGVRHRLGDSD- 71
A++NG+ ++ L+K L + E+Y GG G + V R D
Sbjct: 15 AAANGVHKLKLQKLPQSLGNPTLETAYLAEKYGGQAQMPLVGAGGLGRNMVLARPVHEDG 74
Query: 72 EDIL--------------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
ED+L PL NFM+AQYF EI +G+P Q+F VI DTGSSNLWVPSSKC
Sbjct: 75 EDLLWTQEEILVNGGHNVPLSNFMNAQYFAEIQLGTPAQSFKVILDTGSSNLWVPSSKCT 134
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
SI+C+ H++Y S S TY G I YGSGS+ GF SQD +++GD+ +K Q F EAT
Sbjct: 135 -SIACFLHAKYDSSSSTTYKANGSEFSIQYGSGSMEGFVSQDLLKIGDLSIKHQDFAEAT 193
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
+E L F +FDGI+GLG+ I+V P + MV Q L+ E VF+F L E+GG
Sbjct: 194 KEPGLAFAFGKFDGILGLGYDTISVNHMTPPFYEMVAQKLIDEPVFAFRLGS--SEEDGG 251
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E VFGG+D + G YVPV +K YW+ EL + +G+ + G A +D+GTSL+A
Sbjct: 252 EAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVALGDDELDLEHTGAA--IDTGTSLIA 309
Query: 298 GPTPVVTEINHAIGGE----GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
PT + IN IG + G + +C V S LPE V F
Sbjct: 310 LPTDIAEMINTQIGAQKQWNGQYTVDCSKVPS--------------LPELV------LTF 349
Query: 354 NGAEYVSTGIKTVVE 368
NG Y G V+E
Sbjct: 350 NGKPYPLKGTDYVLE 364
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 80/169 (47%), Gaps = 10/169 (5%)
Query: 342 EKVCQQIGLCAFNGA-EYVSTGIKTV--VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQ 398
E V I A+ G+ +YV K VE + V+ GD + A + L
Sbjct: 252 EAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVALGDDELDLEHTGAAIDTGTSLIALP 311
Query: 399 TKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI 458
T + IN + G+ +DC ++P++P + T K + L Y+L+ +
Sbjct: 312 TD--IAEMINTQIGAQKQWNGQYTVDCSKVPSLPELVLTFNGKPYPLKGTDYVLE----V 365
Query: 459 AEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C+S F D+ P G LWI+GDVF+ Y+TV+D G+ +GFAEAA
Sbjct: 366 QGTCMSAFTPMDIQMPGGDSLWIIGDVFLRRYYTVYDLGRNAVGFAEAA 414
>gi|116203505|ref|XP_001227563.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
gi|88175764|gb|EAQ83232.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
Length = 396
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 187/316 (59%), Gaps = 28/316 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +LL ++ + ++ L+K +L+ LN ++YMG VR R
Sbjct: 5 LLTAAVLLGSAQGAVHKMKLQKVPLSEQLEAVPLNTQLEQLGQKYMG------VRPRQSH 58
Query: 70 SD------------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
++ +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS +C
Sbjct: 59 ANAVFNGMVAEVKGNHPVPISNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSVEC- 117
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
SI+CY H++Y S S+TY + G + EI YGSGS+SGF SQD + +GD+ +K Q F EAT
Sbjct: 118 GSIACYLHTKYDSSASSTYKKNGTNFEIRYGSGSLSGFVSQDTMTIGDITIKGQDFAEAT 177
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
E L F RFDGI+GLG+ I+V VP + M+EQ L+ E VF+F+L D +
Sbjct: 178 SEPGLAFAFGRFDGILGLGYDTISVNGIVPPFYKMLEQKLIDEPVFAFYL---ADEKGQS 234
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E+VFGGVD +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL+A
Sbjct: 235 EVVFGGVDSDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGV--ILDTGTSLIA 292
Query: 298 GPTPVVTEINHAIGGE 313
P+ + +N IG +
Sbjct: 293 LPSQLAEMLNAQIGAK 308
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 56/99 (56%), Gaps = 4/99 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + N G+ IDC++ ++ +V+F + F L P YIL+ ++ CIS F
Sbjct: 301 LNAQIGAKKNYAGQYAIDCNKRDSLKDVTFNLAGYNFTLGPYDYILE----VSGSCISTF 356
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
M D P P GPL ILGD F+ Y++++D G +G AEA
Sbjct: 357 MGMDFPEPTGPLAILGDAFLRRYYSIYDLGANTVGLAEA 395
>gi|453084572|gb|EMF12616.1| aspartyl proteinase [Mycosphaerella populorum SO2202]
Length = 396
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 189/313 (60%), Gaps = 21/313 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHSLNAARITRK--ERYMGGAGVSGVRHRLGD 69
L + L+ + G+ ++ L+K L L +N ++ ++YMG + RL +
Sbjct: 4 ALMTSALVAGAQAGVHKMKLQKIPLSEQLEGMNIESQVQRLGQKYMG----IRAQGRLDE 59
Query: 70 --SDEDILP-------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ + P + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI
Sbjct: 60 MFKETSVAPEAGHPVAVSNFLNAQYFSEIAVGTPPQEFKVVLDTGSSNLWVPSSEC-GSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY HS+Y SNTY + G I YGSGS+ G+ SQD V++GD+ +KDQ+F EAT E
Sbjct: 119 ACYLHSKYNHGDSNTYKQNGSEFAIRYGSGSLEGYVSQDTVQIGDLKIKDQLFAEATSEP 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F RFDGI+GLG+ I+V P + NM++QGL+ E+VF+F+L+ +E E +
Sbjct: 179 GLAFAFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLSSTDKGDE-SEAI 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGV+ H+ G T +P+ +K YW+ +L I G Q+ + G AI+D+GTSL+A P+
Sbjct: 238 FGGVNKDHYTGDMTKIPLRRKAYWEVDLDAITFGKQTAEIDATG--AILDTGTSLIALPS 295
Query: 301 PVVTEINHAIGGE 313
+ +N IG +
Sbjct: 296 TLAELLNKEIGAK 308
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P+++FT+ F + YIL+ + CIS FM FD+P P GPL
Sbjct: 313 GQYTVDCSARDSLPDLTFTLTGHNFTIDSYDYILE----VQGSCISAFMGFDIPEPAGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D +G A+A
Sbjct: 369 AILGDAFLRKWYSVYDLENNAVGLAKA 395
>gi|340373429|ref|XP_003385244.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 382
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 180/313 (57%), Gaps = 19/313 (6%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L+V AS L L L R+ LH R + R + ++ D
Sbjct: 6 LFVFASLLTL----------TLAFVRVPLHRHVVPRSQTRARLLAKYPSYFSSFKVNDVP 55
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKS 130
E PL N++DA+Y+G I IG+PPQNF VIFDTGSSNLW+PSSKC +C H +Y
Sbjct: 56 E---PLTNYLDAEYYGNITIGTPPQNFLVIFDTGSSNLWIPSSKCDPKDKACQTHHQYNH 112
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+TY + I YG+G+++GF S D V + ++ V Q F EA + TF+ A+FD
Sbjct: 113 DHSSTYVKNDTKFAIQYGTGNLTGFLSVDTVTIANLTVPAQKFAEAVEQPGDTFVNAQFD 172
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+ + I+V +P ++N+V+Q LV++ VF F+L+RD + GGE+ GG DP H+K
Sbjct: 173 GILGMAWPSISVDGVIPFFNNLVQQSLVAQPVFGFYLDRDENGTLGGELALGGTDPSHYK 232
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
YVP++ K YWQF+L I +G T +C GC AI D+GTSLL GP+ V +I I
Sbjct: 233 APINYVPLSDKTYWQFKLDKIKVG--GTTLCSNGCQAIADTGTSLLVGPSVDVQKIMKEI 290
Query: 311 GG---EGVVSAEC 320
G +GV +C
Sbjct: 291 GAKNTDGVYMIDC 303
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/91 (47%), Positives = 56/91 (61%), Gaps = 5/91 (5%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G +IDC + +P VSF IG + LSP+QYI+K C+ GF + D +
Sbjct: 294 NTDGVYMIDCGNMSNLPTVSFVIGGAQYLLSPQQYIMKEEAEGQTFCLVGFDSLD----Q 349
Query: 476 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
G PLWILGDVF+G Y+T FD G+ R+GFA A
Sbjct: 350 GEPLWILGDVFIGYYYTEFDVGQGRVGFAPA 380
>gi|396499231|ref|XP_003845423.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
gi|21914374|gb|AAM81358.1|AF522873_1 aspartyl proteinase [Leptosphaeria maculans]
gi|312222004|emb|CBY01944.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
Length = 397
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 21/308 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHR------LGDSDEDILP 76
+ ++ LKK LD L A I + ++YM G + + + D + P
Sbjct: 19 VHKMPLKKVSLD-EQLKYASIQEQVSALSQKYMSGFKPTSHMEQVFKAPYIADGTHPV-P 76
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+TY
Sbjct: 77 VSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSASSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G S EI YGSG +SGF S D ++GD+ VK+Q F EAT E L F RFDGI+GLG
Sbjct: 136 KKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM++QGL+ E VF+F+L D + ++ E FGG+D H+ GK +
Sbjct: 196 YDTISVNHIVPPFYNMLDQGLLDEPVFAFYLG-DTNEQQESEATFGGIDESHYSGKLIKL 254
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ +L I G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 PLRRKAYWEVDLDAITFGKETAEMDNTGV--ILDTGTSLIALPSTMAELLNREIGAKKGF 312
Query: 314 -GVVSAEC 320
G S EC
Sbjct: 313 NGQYSVEC 320
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ +P+++FT+ F +S YIL+ + CIS FM D P P GPL
Sbjct: 314 GQYSVECDKRDGLPDLTFTLTGHNFTISAFDYILE----VQGSCISAFMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A++
Sbjct: 370 AILGDAFLRKWYSVYDLGNSAVGLAKS 396
>gi|353234557|emb|CCA66581.1| probable PEP4-aspartyl protease [Piriformospora indica DSM 11827]
Length = 411
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 190/323 (58%), Gaps = 33/323 (10%)
Query: 14 VLASCLLLP--ASSNGLRRIGLKK--RRLDLHSLNAARITRKERYMGG----AGVSGVRH 65
VL+S LL P +++G+ R+ L K R + AA + K GG AGV G+
Sbjct: 7 VLSSLLLAPFVHAADGVHRMKLNKMPRTAPGSAEEAALLAHK---YGGQVPLAGVGGLGR 63
Query: 66 RL------GDSD----EDIL-------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSN 108
+L GD +DI+ PL N+M+AQY+ +I IG+PPQ F V+ DTGSSN
Sbjct: 64 KLANPPTAGDDQMFWTQDIVANGGHGVPLNNYMNAQYYADITIGTPPQTFKVVLDTGSSN 123
Query: 109 LWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
LWVPS+ C SI+C+ H++Y S S+TY G I YGSGS+ GF SQD + +GD+ +
Sbjct: 124 LWVPSTSCT-SIACFLHTKYDSSASSTYKANGTEFAIRYGSGSLEGFVSQDTMTLGDLTI 182
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
K Q F EAT+E L F +FDGI+GL + I+V P + N ++QGL+ E+VF+F +
Sbjct: 183 KKQDFAEATKEPGLAFAFGKFDGILGLAYDTISVNHITPPFYNAIDQGLLKEKVFTFRVG 242
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
+GGE VFGG+D H+ GK TYVPV +KGYW+ EL + G+ + G A
Sbjct: 243 A--SEADGGEAVFGGIDSSHYTGKITYVPVRRKGYWEVELESVAFGDDELELENTGAA-- 298
Query: 289 VDSGTSLLAGPTPVVTEINHAIG 311
+D+GTSL+ PT + +N IG
Sbjct: 299 IDTGTSLIVMPTTIAEMLNSEIG 321
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 89/195 (45%), Gaps = 25/195 (12%)
Query: 331 IWDLLVSGLLPEKVCQ-QIGLCAFNGAEYVSTGIKTV------------------VEKEN 371
++ + GLL EKV ++G +G E V GI + VE E+
Sbjct: 223 FYNAIDQGLLKEKVFTFRVGASEADGGEAVFGGIDSSHYTGKITYVPVRRKGYWEVELES 282
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
V+ GD + A + L T + +N + + G+ + CD++P +
Sbjct: 283 VAFGDDELELENTGAAIDTGTSLIVMPTT--IAEMLNSEIGATRSWNGQYTLPCDKVPGL 340
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
P+ +F G K + ++ Y+L G C+S F D+ P G LWI+GDVF+ Y T
Sbjct: 341 PDFTFVFGGKPYPIASTDYVLNLGN----QCVSAFTGMDINLPGGELWIVGDVFLRKYFT 396
Query: 492 VFDSGKLRIGFAEAA 506
V+D G+ +GFA +A
Sbjct: 397 VYDLGRDAVGFAVSA 411
>gi|378731872|gb|EHY58331.1| vacuolar protease A [Exophiala dermatitidis NIH/UT8656]
Length = 398
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 179/306 (58%), Gaps = 17/306 (5%)
Query: 28 LRRIGLKKRRLDLH--------SLNAARITRKERYMGGAGVSGVRHRLGDSDE-DILPLK 78
+ R+ L+K L+ L A R ++ +GG RH D D +P++
Sbjct: 20 MHRMKLQKVPLEQQLSAANIGDHLRALRHKYTQKTLGGPAEDIFRHTSIDIDSPHEVPVE 79
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
NF++AQYF I +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H +Y S S+TY +
Sbjct: 80 NFLNAQYFSTIALGTPPQEFKVVLDTGSSNLWVPSSEC-GSIACYLHQKYDSSASSTYKK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G I YGSG ++GF SQD + +GD+ +KDQ+F EAT E L F RFDGI+GLG+
Sbjct: 139 NGSEFGIRYGSGEVAGFISQDILRIGDLKIKDQLFGEATSEPGLAFAFGRFDGILGLGYD 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
IAV P + NM++QGL+ E VF+F+L D E E FGG+D H+ GK +P+
Sbjct: 199 TIAVNHIPPPFYNMIDQGLLDEPVFAFYLGNTNDGTE-SEATFGGIDKDHYTGKMVKIPL 257
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----G 314
+K YW+ L I G ++ + G I+D+GTSL+A P+ + +N IG + G
Sbjct: 258 RRKAYWEVNLDAITFGKETADLDNTGV--ILDTGTSLIALPSTLAELLNKEIGAKKGFNG 315
Query: 315 VVSAEC 320
+ EC
Sbjct: 316 QYTVEC 321
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+VSFT+ F+++ YIL+ + CIS FM D P P GPL
Sbjct: 315 GQYTVECDKRDSLPDVSFTLSGYNFSITAYDYILE----VQGSCISSFMGMDFPAPTGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G + A +
Sbjct: 371 AILGDSFLRRWYSVYDLGNDAVALARS 397
>gi|390477486|ref|XP_003735302.1| PREDICTED: cathepsin E isoform 2 [Callithrix jacchus]
Length = 401
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 120/253 (47%), Positives = 163/253 (64%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y + G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSAFSWQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+ YWQ L DI +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQAYWQIALDDIQVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|354478113|ref|XP_003501260.1| PREDICTED: cathepsin E-like isoform 2 [Cricetulus griseus]
Length = 363
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 69 PLINYLDVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YEEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E+ FGG DP HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L + +G+ + C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 IPVTKQGYWQIALDGVQVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIG 302
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 31/46 (67%)
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ C SGF D+ PP GPLWILGDVF+ ++ VFD G ++G A A
Sbjct: 316 QFCGSGFQGLDIQPPSGPLWILGDVFIRQFYAVFDRGNNQVGLAPA 361
>gi|325087547|gb|EGC40857.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H88]
Length = 398
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 187/301 (62%), Gaps = 12/301 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG V+ GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNVQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV++QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP + M+ + L+ E +FSF+L ++ E+VFGG++ F G+ T +P+ +K Y
Sbjct: 203 KIVPPFYEMLNKDLLDEPMFSFYLGDANIDDDQSEVVFGGMNKDRFTGELTKIPLRRKAY 262
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAE 319
W+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G + E
Sbjct: 263 WEVDLDSITFGKQTAMMTNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVE 320
Query: 320 C 320
C
Sbjct: 321 C 321
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 4/85 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C + ++PN++F + F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYTVECAKRDSLPNLTFGLSGHNFTIGPYDYTLE----VQGTCISSFMGMDFPAPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
ILGD F+ Y+TV+D G +G A
Sbjct: 371 AILGDAFLRRYYTVYDLGNDAVGLA 395
>gi|166235886|ref|NP_031825.2| cathepsin E preproprotein [Mus musculus]
gi|341940308|sp|P70269.2|CATE_MOUSE RecName: Full=Cathepsin E; Flags: Precursor
gi|5748654|emb|CAA08880.2| cathepsin E protein [Mus musculus]
gi|74146932|dbj|BAE25449.1| unnamed protein product [Mus musculus]
gi|74192082|dbj|BAE34257.1| unnamed protein product [Mus musculus]
gi|74219155|dbj|BAE26716.1| unnamed protein product [Mus musculus]
gi|74222421|dbj|BAE38113.1| unnamed protein product [Mus musculus]
gi|148707758|gb|EDL39705.1| cathepsin E [Mus musculus]
Length = 397
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVEGMQFCGSGFQGLDIPPPAGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|332247693|ref|XP_003272996.1| PREDICTED: cathepsin E [Nomascus leucogenys]
Length = 396
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 117/248 (47%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNTIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 308 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|2288908|emb|CAA71859.1| cathepsin E [Mus musculus]
Length = 397
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVDGMQFCGSGFQGLDIPPPAGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|429860373|gb|ELA35113.1| vacuolar protease a [Colletotrichum gloeosporioides Nara gc5]
Length = 399
Score = 234 bits (597), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 194/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGG-----AGVSGV 63
+L + +LL A+ + ++ LKK + LNA I + ++YMG A
Sbjct: 5 LLTAAVLLGAAQADVHKLKLKKVPIS-EQLNAVPIEHQVRSLGQKYMGARPQNHADAMFN 63
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + + E +P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPIKSNGEHPVPVSNFMNAQYFSEISIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS+Y S S+TY G EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHSKYDSSSSSTYKSNGSEFEIHYGSGSLTGFVSQDDVSIGDIKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + MV Q + E VF+F+L D + E VFGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYQMVNQKAIDEPVFAFYLGDTNDEGDESEAVFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD H++GK TY+P+ +K YW+ +L I +G+++ + G AI+D+GTSL P+ +
Sbjct: 243 VDDSHYEGKITYIPLRRKAYWEVDLDAITLGDETADL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGFNGQYSVEC 321
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ +P+++FT+ F++S YIL+ ++ CIS F D P P GPL
Sbjct: 315 GQYSVECDKRAELPDITFTLAGYNFSISAYDYILE----VSGSCISTFQGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A+A
Sbjct: 371 VILGDAFLRRWYSVYDLGKNAVGLAKA 397
>gi|99031884|pdb|2BKS|A Chain A, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031885|pdb|2BKS|B Chain B, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031886|pdb|2BKT|A Chain A, Crystal Structure Of Renin-Pf00257567 Complex
gi|99031887|pdb|2BKT|B Chain B, Crystal Structure Of Renin-Pf00257567 Complex
gi|119390207|pdb|2IKO|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390208|pdb|2IKO|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390209|pdb|2IKU|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390210|pdb|2IKU|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390211|pdb|2IL2|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390212|pdb|2IL2|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|151568107|pdb|2V0Z|C Chain C, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568108|pdb|2V0Z|O Chain O, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568109|pdb|2V10|C Chain C, Crystal Structure Of Renin With Inhibitor 9
gi|151568110|pdb|2V10|O Chain O, Crystal Structure Of Renin With Inhibitor 9
gi|151568111|pdb|2V11|C Chain C, Crystal Structure Of Renin With Inhibitor 6
gi|151568112|pdb|2V11|O Chain O, Crystal Structure Of Renin With Inhibitor 6
gi|151568113|pdb|2V12|C Chain C, Crystal Structure Of Renin With Inhibitor 8
gi|151568114|pdb|2V12|O Chain O, Crystal Structure Of Renin With Inhibitor 8
gi|157830213|pdb|1BBS|A Chain A, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157830214|pdb|1BBS|B Chain B, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157833710|pdb|1RNE|A Chain A, The Crystal Structure Of Recombinant Glycosylated Human
Renin Alone And In Complex With A Transition State
Analog Inhibitor
gi|157836332|pdb|2REN|A Chain A, Structure Of Recombinant Human Renin, A Target For
Cardiovascular- Active Drugs, At 2.5 Angstroms
Resolution
gi|193885216|pdb|2V13|A Chain A, Crystal Structure Of Renin With Inhibitor 7
gi|193885217|pdb|2V16|C Chain C, Crystal Structure Of Renin With Inhibitor 3
gi|193885218|pdb|2V16|O Chain O, Crystal Structure Of Renin With Inhibitor 3
gi|242556522|pdb|3G72|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556523|pdb|3G72|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|308388162|pdb|3OQF|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|308388163|pdb|3OQF|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|310689956|pdb|3OOT|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689957|pdb|3OOT|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689958|pdb|3OQK|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689959|pdb|3OQK|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|342350963|pdb|3Q3T|A Chain A, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|342350964|pdb|3Q3T|B Chain B, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|345110923|pdb|3SFC|A Chain A, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|345110924|pdb|3SFC|B Chain B, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|358439749|pdb|3Q4B|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439750|pdb|3Q4B|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439751|pdb|3Q5H|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439752|pdb|3Q5H|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|400261138|pdb|3VSW|A Chain A, Human Renin In Complex With Compound 8
gi|400261139|pdb|3VSW|B Chain B, Human Renin In Complex With Compound 8
gi|400261140|pdb|3VSX|A Chain A, Human Renin In Complex With Compound 18
gi|400261141|pdb|3VSX|B Chain B, Human Renin In Complex With Compound 18
gi|430800765|pdb|3VYD|A Chain A, Human Renin In Complex With Inhibitor 6
gi|430800766|pdb|3VYD|B Chain B, Human Renin In Complex With Inhibitor 6
gi|430800767|pdb|3VYE|A Chain A, Human Renin In Complex With Inhibitor 7
gi|430800768|pdb|3VYE|B Chain B, Human Renin In Complex With Inhibitor 7
gi|430800769|pdb|3VYF|A Chain A, Human Renin In Complex With Inhibitor 9
gi|430800770|pdb|3VYF|B Chain B, Human Renin In Complex With Inhibitor 9
gi|449802496|pdb|4GJ8|A Chain A, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802497|pdb|4GJ8|B Chain B, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802498|pdb|4GJ9|A Chain A, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802499|pdb|4GJ9|B Chain B, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802500|pdb|4GJA|A Chain A, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802501|pdb|4GJA|B Chain B, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802502|pdb|4GJB|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802503|pdb|4GJB|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802504|pdb|4GJC|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802505|pdb|4GJC|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802506|pdb|4GJD|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
gi|449802507|pdb|4GJD|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
Length = 340
Score = 234 bits (597), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 169/250 (67%), Gaps = 6/250 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGG 243
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGG 180
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T +
Sbjct: 181 SDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSI 239
Query: 304 TEINHAIGGE 313
++ A+G +
Sbjct: 240 EKLMEALGAK 249
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 256 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 315
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 316 GATFIRKFYTEFDRRNNRIGFALA 339
>gi|358057753|dbj|GAA96408.1| hypothetical protein E5Q_03075 [Mixia osmundae IAM 14324]
Length = 453
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 161/264 (60%), Gaps = 9/264 (3%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD E +PL NF++AQYF +I +G+PPQ F V+ DTGSSNLWVPS++C SI+C+ H +
Sbjct: 121 GDKVEHGVPLSNFLNAQYFADITLGTPPQEFKVVLDTGSSNLWVPSTRCS-SIACFLHKK 179
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y + S+TY E G +I YGSGS+ G S D + +GD+ +K Q F E+T+E L F
Sbjct: 180 YDASASSTYKENGTEFKIQYGSGSLEGVISNDVMTIGDITIKKQDFAESTKEPGLAFAFG 239
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE--EGGEIVFGGVD 245
+FDGI+GL + IAV P + NM+ GLV + FSFWL D E GGE V GG D
Sbjct: 240 KFDGILGLAYDRIAVQHVTPPFYNMIADGLVDKAEFSFWLGDTADGEGAPGGEFVMGGTD 299
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+KGK + PV +KGYW+ EL I G + G A +D+GTSL+A P+ +
Sbjct: 300 PAHYKGKIQWAPVRRKGYWEVELSKIKFGKDELELESTGAA--IDTGTSLIALPSDLAEL 357
Query: 306 INHAIGGE----GVVSAECKLVVS 325
+N IG + G + +C + S
Sbjct: 358 LNKEIGAKKSWNGQYTVDCAAIPS 381
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 4/85 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC IP++P+++ + + ++ YIL+ CIS F D P GP+
Sbjct: 370 GQYTVDCAAIPSLPDLTMYFAGEPYTITGADYILQA----QGTCISAFTGLDFPESIGPI 425
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WI+GDVF+ + TV+ K +GFA
Sbjct: 426 WIVGDVFLRRFFTVYSLEKDAVGFA 450
>gi|190613737|pdb|3D91|A Chain A, Human Renin In Complex With Remikiren
gi|190613738|pdb|3D91|B Chain B, Human Renin In Complex With Remikiren
gi|242556515|pdb|3G6Z|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556516|pdb|3G6Z|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556519|pdb|3G70|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556520|pdb|3G70|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|290560276|pdb|3K1W|A Chain A, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|290560277|pdb|3K1W|B Chain B, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|315113750|pdb|3OWN|A Chain A, Potent Macrocyclic Renin Inhibitors
gi|315113751|pdb|3OWN|B Chain B, Potent Macrocyclic Renin Inhibitors
Length = 341
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 169/250 (67%), Gaps = 6/250 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGG 243
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGG 180
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T +
Sbjct: 181 SDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSI 239
Query: 304 TEINHAIGGE 313
++ A+G +
Sbjct: 240 EKLMEALGAK 249
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 237 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 296
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 297 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 339
>gi|301618285|ref|XP_002938556.1| PREDICTED: cathepsin E-A-like [Xenopus (Silurana) tropicalis]
Length = 402
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 169/254 (66%), Gaps = 2/254 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+GEI +G+PPQNFSV+FDTGSSN WVPSS C S +C H R+KS +S +Y
Sbjct: 73 LVDYMNAQYYGEISVGTPPQNFSVVFDTGSSNFWVPSSYC-LSEACQVHERFKSFESTSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ I+YG+G + G +D + + ++ ++ Q F E+ E TF+LA+FDG++GLG
Sbjct: 132 EHGGRPFSIHYGTGQLVGVTGRDTLRISNMSIEGQDFGESILEPGRTFVLAQFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AV AVPV+D +V Q LV +++FSF LNRD D+E GGE++FGG+D +KG+ ++
Sbjct: 192 YPSLAVAGAVPVFDRIVNQKLVEQQLFSFHLNRDYDSEYGGELIFGGIDHSLYKGQIHWI 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T+KGYWQ L ++ + ++ C+ C IVDSGTSL+ GP + ++ +G +
Sbjct: 252 PLTEKGYWQIRLDNVKVDGEAM-FCQSSCQVIVDSGTSLITGPKAEIKKLQELLGATPTL 310
Query: 317 SAECKLVVSQYGDL 330
E L S+ L
Sbjct: 311 FGEYILDCSRVSSL 324
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 65/99 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ EL + P GE I+DC R+ ++P V+FTIG + + L+PEQY +K ++ C++GF
Sbjct: 300 LQELLGATPTLFGEYILDCSRVSSLPRVTFTIGQRDYTLTPEQYTIKERSQKSDFCLTGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+ GPLWILGD+FM +++VFD RIG A++
Sbjct: 360 QAMDISTKDGPLWILGDIFMSKFYSVFDREHDRIGLAKS 398
>gi|109287596|emb|CAJ55260.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 234 bits (597), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 193/338 (57%), Gaps = 27/338 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-SGVRHRLGDSDE 72
+L SC L SS+ L+RI LKK + + R T +E M A V ++HR+ DE
Sbjct: 9 LLISCFLC-FSSDALQRISLKK-------MPSIRETLQEMGMKVADVLPSLKHRISYLDE 60
Query: 73 DI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
+ L NF D QY+GEI IG+P Q F V+FDTGSSNLWVPS +C +C H
Sbjct: 61 GLHNKTASTILTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSRQCSPLYSACVSH 120
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+RY S +S+TY G + Y G I GFFSQD V V D+ + Q F EA S+ F+
Sbjct: 121 NRYDSSESSTYKPKGTKITLTYAQGYIKGFFSQDIVRVADIPII-QFFTEAIALPSIPFI 179
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
ARFDG++G+G+ + A+G +PV+DN++ + ++SE VFS + +R ++ GGEI+ GG D
Sbjct: 180 FARFDGVLGMGYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSD 239
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ G YV +++GYW +L + I N+ +C GC A +D+GTS ++GP ++
Sbjct: 240 PSHYTGDFHYVSTSREGYWHVDLKGVSIENKIV-LCHDGCTATIDTGTSFISGPASSISV 298
Query: 306 INHAIGG---EGVVSAECKL------VVSQYGDLIWDL 334
+ IG +G +CK + GD+ + L
Sbjct: 299 LMETIGATLSDGDYVIDCKKINLLPDITFHLGDMTYSL 336
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 54/87 (62%), Gaps = 2/87 (2%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +IDC +I +P+++F +GD ++LS Y+LK + C FMA D+PPP GPL
Sbjct: 310 GDYVIDCKKINLLPDITFHLGDMTYSLSSSTYVLKFSDETE--CTVAFMAVDIPPPLGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
W+LG F+ Y+ FD RIGFA +
Sbjct: 368 WLLGATFIKQYYIEFDRQNNRIGFATS 394
>gi|344277046|ref|XP_003410316.1| PREDICTED: cathepsin E [Loxodonta africana]
Length = 396
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 159/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+ D +YFG I IGSP QNF+VIFDTGSSNLWVPS C S +C H R+ +S+T
Sbjct: 69 PLINYFDTEYFGAISIGSPSQNFTVIFDTGSSNLWVPSVYCT-SQACQTHPRFYPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I+YG+GS+SG D V V + V DQ F E+ +E TF+ + FDGI+GL
Sbjct: 128 YSSLGSPFSISYGTGSLSGIIGTDQVSVEGLTVIDQQFGESVKEPGQTFVDSAFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSDPAGGMGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSNNIKQLQRAIGAEPE 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 NGEYAVEC 314
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 51/87 (58%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L C SGF D+ PP GPL
Sbjct: 308 GEYAVECVNLNVMPDVTFTINGVSYTLSPTAYTLLDSADGMNFCSSGFQGLDIQPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G ++G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNQVGLAPA 394
>gi|1657354|emb|CAA66056.1| procathepsin E [Mus musculus]
gi|13529380|gb|AAH05432.1| Cathepsin E [Mus musculus]
gi|71059833|emb|CAJ18460.1| Ctse [Mus musculus]
Length = 397
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 156/248 (62%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + + AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKHLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 55/87 (63%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +DC + TMPNV+F I + + L+P YIL + C SGF D+PPP GPL
Sbjct: 309 GEYAVDCATLDTMPNVTFLINEVSYTLNPTDYILPDLVDGMQFCGSGFQGLDIPPPAGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G ++G A A
Sbjct: 369 WILGDVFIRQFYSVFDRGNNQVGLAPA 395
>gi|432090679|gb|ELK24020.1| Renin [Myotis davidii]
Length = 404
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 197/334 (58%), Gaps = 16/334 (4%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGG 57
++ R L + SC+ LP + RRI LKK L ++ AR+ R E G
Sbjct: 5 RMSRWALLLLLWGSCISSLPVDTGAFRRIFLKKMPSVRESLKERGVDVARLLRAE----G 60
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
+ SG R +S ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 61 SQFSG-RPPFTNSTAPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCS 118
Query: 118 -FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+C HS Y S +S+TY E G I YGSG ++GF SQD V VG + V Q F E
Sbjct: 119 PLYTACEIHSLYDSLESSTYMENGTEFTIQYGSGKVNGFLSQDAVTVGGITVT-QTFGEV 177
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T + F+LA+FDG++G+GF AV PV+D+++ Q ++ E+VFS + +R+ G
Sbjct: 178 TELPLMPFMLAKFDGVLGMGFPAQAVAGVTPVFDHILSQRVLKEDVFSVYYSRNSHL-LG 236
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GEIV GG DP++++G YV ++K G WQ ++ + + ST +CE GC A+VD+G S +
Sbjct: 237 GEIVLGGSDPQYYQGNFHYVSISKTGSWQIKMKGVSV-RSSTLLCEEGCMAVVDTGASYI 295
Query: 297 AGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+GPT + + +G + + + E + +Q L
Sbjct: 296 SGPTSSLRLLMETLGAKELSTDEYVVSCNQVPSL 329
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 53/86 (61%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E ++ C+++P++P++SF +G + + L+ Y+L+ ++C D+PPP GP+W
Sbjct: 318 EYVVSCNQVPSLPDISFHLGGRAYTLTSADYVLQDPYSNDDLCTLALHGLDIPPPTGPVW 377
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 378 VLGASFIRKFYTEFDRRNNRIGFALA 403
>gi|224085770|ref|XP_002189383.1| PREDICTED: cathepsin E [Taeniopygia guttata]
Length = 435
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 173/296 (58%), Gaps = 6/296 (2%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RR+ L RR L ++ R + G E PL ++D +YFG+
Sbjct: 62 RRVPLSCRRY-LRTMMRERGQLSHLWRAPGGPEASSEDCAAFLESSEPLIIYLDMEYFGQ 120
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
I IG+PPQNF+V+FDTGSSNLWVPS C S +C H+R+ +S+TY IG I YG
Sbjct: 121 ISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACTEHTRFHPTQSSTYQVIGTPFSIQYG 179
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS++G D V V + V +Q F E+ E FL A FDGI+GL + +AV PV
Sbjct: 180 TGSLTGIIGSDQVAVEGLAVSNQQFAESISEPGKAFLDAEFDGILGLAYPSLAVDGVTPV 239
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+DNM+ Q LV +FS +++ +PD+ +GGE++FGG D F G +VPVT++GYWQ +L
Sbjct: 240 FDNMMAQNLVELPIFSVYMSSNPDSPQGGEVLFGGFDTSRFTGTLNWVPVTQQGYWQIQL 299
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 321
+I +G T C GC AIVD+GTSL+ GPT + ++ + IG +G + +C
Sbjct: 300 DNIQLGGTVT-FCANGCQAIVDTGTSLITGPTKEIKKLQNLIGAVSVDGEYTVDCS 354
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 57/88 (64%), Gaps = 2/88 (2%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY-ILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE +DC + +MP+++ TI + LS + Y +++ +G+A C SGF D+PPP GP
Sbjct: 347 GEYTVDCSNLSSMPDLTITINGLPYTLSAQAYTLMEYADGMA-FCTSGFQGSDIPPPTGP 405
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ +++VFD G +G A A
Sbjct: 406 LWILGDVFIRQFYSVFDRGNNMVGLAPA 433
>gi|389640809|ref|XP_003718037.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|58257401|gb|AAW69322.1| vacuolar protease A-like protein [Magnaporthe grisea]
gi|351640590|gb|EHA48453.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|440487134|gb|ELQ66940.1| vacuolar protease A [Magnaporthe oryzae P131]
Length = 395
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 191/320 (59%), Gaps = 19/320 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL + G+ ++ +KK +L LNA ++Y+G S + +
Sbjct: 5 MMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFSN 64
Query: 70 -----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S +P+ NFM+AQYF EI IG+PPQNF VI DTGSSNLWVPSS C SI+CY
Sbjct: 65 DAVQASGNHPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSSSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S S+TY + G +I YGSGS+ GF S D + +GD+ +K+ F EAT+E L F
Sbjct: 124 HNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDVMTIGDLKIKNLDFAEATKEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+G+GF ++V VP + MV+Q L+ E VF+F+L D + E+VFGGV
Sbjct: 184 AFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEKSESEVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H GK T +P+ +K YW+ +L I +G++ + G I+D+GTSL+A P+ +
Sbjct: 241 NKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 299 LLNSQIGAKKGYNGQYSIDC 318
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDCD+ +P+++F + F +S YIL+ ++ CIS FMA D+P P GPL
Sbjct: 312 GQYSIDCDKRKDLPDITFRLSGYDFPISAYDYILE----VSGSCISTFMAMDIPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D GK +G A+A
Sbjct: 368 AILGDAFLRRYYSIYDLGKGTVGLAKA 394
>gi|340966614|gb|EGS22121.1| aspartic-type endopeptidase-like protein [Chaetomium thermophilum
var. thermophilum DSM 1495]
Length = 396
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 184/312 (58%), Gaps = 20/312 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD-- 71
+L + +LL ++ + ++ L+K L L+A I + + +G + G R R SD
Sbjct: 5 LLTAAVLLGSAQGAVHKLKLQKVPLS-EQLDAVPIEIQVQQLGQKYM-GTRSRQSHSDAV 62
Query: 72 ----------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS C SI+
Sbjct: 63 WKGMMPEAMGSHPVPISNFMNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSVDC-GSIA 121
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY H++Y S S+TY G EI YGSGS+SGF SQD + +GD+ VK Q F EAT E
Sbjct: 122 CYLHTKYDSSASSTYKPNGTKFEIRYGSGSLSGFVSQDVLRIGDITVKGQDFAEATSEPG 181
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLG+ I+V VP + NM+EQ ++ E VF+F+L+ D E+ F
Sbjct: 182 LAFAFGRFDGILGLGYDTISVNRIVPPFYNMIEQKVIDEPVFAFYLS---DTSGQSEVTF 238
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+D +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 239 GGIDKTKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGV--ILDTGTSLIALPSQ 296
Query: 302 VVTEINHAIGGE 313
+ +N +G +
Sbjct: 297 LAEMLNAQLGAK 308
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 57/100 (57%), Gaps = 4/100 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + N G+ IDC + + +++FT+ F L+P YIL+ ++ CIS F
Sbjct: 301 LNAQLGAKKNFAGQYTIDCAKRDALKDITFTLAGYNFTLTPYDYILE----VSGSCISTF 356
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D P P GPL ILGD F+ Y++++D G +G AEAA
Sbjct: 357 MGMDFPAPTGPLAILGDAFLRKYYSIYDLGANTVGLAEAA 396
>gi|2851407|sp|P16228.3|CATE_RAT RecName: Full=Cathepsin E; Flags: Precursor
gi|1113086|dbj|BAA08128.1| cathepsin E precursor [Rattus rattus]
gi|149058663|gb|EDM09820.1| cathepsin E, isoform CRA_a [Rattus norvegicus]
Length = 398
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPM 308
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 309 DGEYAVDC 316
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 75/142 (52%), Gaps = 8/142 (5%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL-PNPM-GESII 423
+ + + GD+ + CS A+V L K+ I +L +++ PM GE +
Sbjct: 260 IALDGIQVGDTVMFCSEGCQAIVDTGTSLITGPPKK-----IKQLQEAIGATPMDGEYAV 314
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC + MPNV+F I + LSP YIL + C SGF D+ PP GPLWILGD
Sbjct: 315 DCATLNMMPNVTFLINGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPPAGPLWILGD 374
Query: 484 VFMGVYHTVFDSGKLRIGFAEA 505
VF+ +++VFD G ++G A A
Sbjct: 375 VFIRKFYSVFDRGNNQVGLAPA 396
>gi|440475206|gb|ELQ43907.1| vacuolar protease A [Magnaporthe oryzae Y34]
Length = 395
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 191/320 (59%), Gaps = 19/320 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL + G+ ++ +KK +L LNA ++Y+G S + +
Sbjct: 5 MMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFSN 64
Query: 70 -----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S +P+ NFM+AQYF EI IG+PPQNF VI DTGSSNLWVPSS C SI+CY
Sbjct: 65 DAVQASGNHPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSSSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S S+TY + G +I YGSGS+ GF S D + +GD+ +K+ F EAT+E L F
Sbjct: 124 HNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDFMTIGDLKIKNLDFAEATKEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+G+GF ++V VP + MV+Q L+ E VF+F+L D + E+VFGGV
Sbjct: 184 AFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEKSESEVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H GK T +P+ +K YW+ +L I +G++ + G I+D+GTSL+A P+ +
Sbjct: 241 NKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 299 LLNSQIGAKKGYNGQYSIDC 318
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDCD+ +P+++F + F +S YIL+ ++ CIS FMA D+P P GPL
Sbjct: 312 GQYSIDCDKRKDLPDITFRLSGYDFPISAYDYILE----VSGSCISTFMAMDIPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D GK +G A+A
Sbjct: 368 AILGDAFLRRYYSIYDLGKGTVGLAKA 394
>gi|38303893|gb|AAH62002.1| Ctse protein [Rattus norvegicus]
Length = 398
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPM 308
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 309 DGEYAVDC 316
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 75/142 (52%), Gaps = 8/142 (5%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL-PNPM-GESII 423
+ + + GD+ + CS A+V L K+ I +L +++ PM GE +
Sbjct: 260 IALDGIQVGDTVMFCSEGCQAIVDTGTSLITGPPKK-----IKQLQEAIGATPMDGEYAV 314
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC + MPNV+F I + LSP YIL + C SGF D+ PP GPLWILGD
Sbjct: 315 DCATLNMMPNVTFLINGVSYTLSPTAYILPDLVDGMQFCGSGFQGLDIQPPAGPLWILGD 374
Query: 484 VFMGVYHTVFDSGKLRIGFAEA 505
VF+ +++VFD G ++G A A
Sbjct: 375 VFIRKFYSVFDRGNNQVGLAPA 396
>gi|195150257|ref|XP_002016071.1| GL10692 [Drosophila persimilis]
gi|194109918|gb|EDW31961.1| GL10692 [Drosophila persimilis]
Length = 399
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 179/308 (58%), Gaps = 22/308 (7%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
P++ N + G+ R+D L +R+ + R GG V PL N+
Sbjct: 30 FPSARNRFVQFGI---RMDRFRLKYSRVDGRSRPRGGWEVRSE------------PLSNY 74
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
+DAQYFG I IGSPPQ F VIFDTGSSNLWVPS+ C + ++C HSRY +R+S+++
Sbjct: 75 LDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPSTSCAPTMVACMVHSRYNARQSSSHRRN 134
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS++G+ S D V V + +++Q F E T FL A+FDGI GL ++
Sbjct: 135 GVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTFAEVTTMPGPIFLAAKFDGIFGLAYQS 194
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I++ P + ++EQ L+S VFS +LNR+ + EGG + FGG +P++++G TYVPV+
Sbjct: 195 ISMQGVKPPFYAIMEQKLLSNPVFSVYLNREQEHPEGGALFFGGSNPRYYRGNFTYVPVS 254
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GV 315
++ YWQ + I + +C+ GC I+D+GTS LA P IN +IGG G
Sbjct: 255 RRAYWQVRMEAATINDLR--LCQHGCEVIIDTGTSFLALPYDQAILINESIGGTPSEYGQ 312
Query: 316 VSAECKLV 323
S C V
Sbjct: 313 YSVPCDQV 320
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 60/99 (60%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ + CD++P +P ++F +G + F L YI + E+C S
Sbjct: 299 INESIGGTPSEYGQYSVPCDQVPQLPRLTFQLGSQQFFLDGSNYIFRDVYQDREICFSAI 358
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ DLP P GPLWILGDVF+G Y+T FD G RIGFAEA
Sbjct: 359 IGVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFAEA 397
>gi|302899226|ref|XP_003048007.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728939|gb|EEU42294.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 396
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 164/251 (65%), Gaps = 12/251 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPISNFMNAQYFSEITIGNPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI+YGSGS+SGF S D+V +GD+ +K Q F EAT+E L F RFDGI+G
Sbjct: 135 TYKQNGSEFEIHYGSGSLSGFISNDDVSIGDLKIKGQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MV Q L+ + VF+F+L D E E+VFGGVD H++G
Sbjct: 195 LGYDTISVNHIVPPFYQMVNQKLLDDPVFAFYL---ADQEGESEVVFGGVDKSHYEGDIE 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I +G++ V E AI+D+GTSL P+ + +N IG +
Sbjct: 252 YIPLRRKAYWEVDLDAIALGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGAK 308
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 309 KGYNGQYTVEC 319
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ T+P+++FT+ ++L YIL+ ++ CIS F D P P GPL
Sbjct: 313 GQYTVECDKRQTLPDITFTLAGSNYSLPATDYILE----VSGSCISTFQGMDFPEPVGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A +
Sbjct: 369 VILGDAFLRRYYSVYDLGKNAVGLARS 395
>gi|189211129|ref|XP_001941895.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977988|gb|EDU44614.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 399
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 117/250 (46%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F VI DTGSSNLWVPSS C SI+CY H++Y S S+
Sbjct: 77 VPVTNFLNAQYFSEISLGTPPQTFKVILDTGSSNLWVPSSSCN-SIACYLHTKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+EQGL+ E VF+F+L D + ++ E FGG+D + GK
Sbjct: 196 LGYDTISVKGIVPPFYNMLEQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ EL + G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 313 SFNGQYTVEC 322
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++FT+ F +S YIL+ + CIS M D P P GPL
Sbjct: 316 GQYTVECDKRDSLPDLTFTLTGHNFTISAYDYILE----VQGSCISALMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 372 AILGDAFLRKWYSVYDLGNSAVGLAKA 398
>gi|431892878|gb|ELK03306.1| Cathepsin E [Pteropus alecto]
Length = 396
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 118/248 (47%), Positives = 159/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I +GSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 69 PLINYLDMEYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I+YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSTVGSHFSIHYGTGSLSGIIGADQVSVEGLTVVSQQFGESVTEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ D + G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDLEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDTIQVGG-AVIFCSEGCQAIVDTGTSLITGPSEEIKQLQKAIGAEPT 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 NGEYAVEC 314
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++CD + MP+V+FTI + L P Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECDNLNVMPDVTFTINGVPYTLQPTAYTLPDSVDETEFCFSGFQGLDIQPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGNNRVGLAPA 394
>gi|326933745|ref|XP_003212960.1| PREDICTED: cathepsin E-like [Meleagris gallopavo]
Length = 403
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 186/318 (58%), Gaps = 24/318 (7%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------------GDS 70
A +GL+R L + L H + R + ++R G HRL G++
Sbjct: 18 APCSGLKRPALCRVTLTRH--RSLRKSLRDR--GQLSQFWKAHRLDMVQYTQDCSLFGEA 73
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
+E PL N++D +YFG+I IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++
Sbjct: 74 NE---PLINYLDMEYFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT-SKACTNHARFQP 129
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+S+TY +G + YG+GS++G D V V + V +Q F E+ E F + FD
Sbjct: 130 SRSSTYQPLGLPISLQYGTGSLTGIIGSDQVTVEGMTVCNQPFAESVSEPGKAFQDSEFD 189
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + +AV PV+DNM+ Q LV +FS +++ +PD+ GGE++FGG DP F
Sbjct: 190 GILGLAYPSLAVDGVTPVFDNMMAQDLVELPIFSVYMSANPDSSLGGEVLFGGFDPSRFL 249
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVT +GYWQ +L ++ +G + C GC AIVD+GTSLL GPT + E+ I
Sbjct: 250 GTLHWVPVTVQGYWQIQLDNVQVGG-TVVFCANGCQAIVDTGTSLLTGPTKDIKEMQRYI 308
Query: 311 GG---EGVVSAECKLVVS 325
G +G +C L+ S
Sbjct: 309 GATPMDGEYVVDCSLLSS 326
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 57/144 (39%), Positives = 81/144 (56%), Gaps = 10/144 (6%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPM-GESI 422
++ +NV G + V C+ A+V L TK+ ++ YI PM GE +
Sbjct: 265 IQLDNVQVGGTVVFCANGCQAIVDTGTSLLTGPTKDIKEMQRYIGA------TPMDGEYV 318
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
+DC + +MP V+FTI + LS + Y L ++C+SGF D+PPP GPLWILG
Sbjct: 319 VDCSLLSSMPIVTFTINGMPYLLSAQAYTLMEQSDGMDICLSGFQGMDVPPPAGPLWILG 378
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVF+ Y++VFD G R+GFA AA
Sbjct: 379 DVFIRQYYSVFDRGNNRVGFAPAA 402
>gi|149725197|ref|XP_001502028.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 163/251 (64%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 63 DSEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 122 SSTYQATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLG+ I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 182 LGLGYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDSSYYTGS 239
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPVT +GYWQ + I I +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 240 LHWVPVTTEGYWQIAVDSITINGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGA 298
Query: 312 -----GEGVVS 317
GEGV+S
Sbjct: 299 RKDLLGEGVIS 309
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + + SYI D L GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLL----GEGVISCSAIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FT+ F L P YILK + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTMNGVEFPLPPSAYILKEDDS----CISGFEGVDLDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|355745980|gb|EHH50605.1| hypothetical protein EGM_01462 [Macaca fascicularis]
Length = 401
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 165/253 (65%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLDWVPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|291409618|ref|XP_002721075.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 117/247 (47%), Positives = 163/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GLG
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLG 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I I N T C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSISI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 64/131 (48%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YILK + CISG +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CISGLEGMNLDTSTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|110590169|pdb|2G24|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590170|pdb|2G24|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590171|pdb|2G26|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590172|pdb|2G26|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590173|pdb|2G27|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590174|pdb|2G27|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591465|pdb|2FS4|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591466|pdb|2FS4|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591524|pdb|2G1N|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591525|pdb|2G1N|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591526|pdb|2G1O|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591527|pdb|2G1O|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591528|pdb|2G1R|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591529|pdb|2G1R|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591530|pdb|2G1S|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591531|pdb|2G1S|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591532|pdb|2G1Y|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591533|pdb|2G1Y|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591534|pdb|2G20|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591535|pdb|2G20|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591536|pdb|2G21|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591537|pdb|2G21|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591538|pdb|2G22|A Chain A, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591539|pdb|2G22|B Chain B, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
Length = 333
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 5 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 64
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 65 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 123
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 124 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 184 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 242
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 249 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 308
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 309 GATFIRKFYTEFDRRNNRIGFALA 332
>gi|387915422|gb|AFK11320.1| cathepsin E-A-like protein [Callorhinchus milii]
Length = 401
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/332 (39%), Positives = 195/332 (58%), Gaps = 19/332 (5%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRR-----LDLHSLNAARITRKERYMGG 57
K+ +V L CL+ +P + R L++R L H A E+Y
Sbjct: 2 KVFVTVLLFIHLTECLIRIPLTRFKPIRKVLRERDQLKEFLRHHQFEAF----AEKYQSC 57
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
V+ G + E L N+MDAQY+GEIGIG+P Q F+V+FDTGSSNLWVPS+ C
Sbjct: 58 YPSKLVKTHEGTAFEH---LSNYMDAQYYGEIGIGTPLQKFTVVFDTGSSNLWVPSAYC- 113
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
S +C H ++KS S TY G I YG+G ++G +D V +G++ ++ Q F E+
Sbjct: 114 ISEACKMHEQFKSFHSTTYAPRGNQFSIRYGTGQLAGVLGKDMVRIGNITIRAQEFGESV 173
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
E TF +A+FDGI+GLG+ IA G A+PV+D M+ Q LV E +FS +NR+ D++ GG
Sbjct: 174 FEPGSTFAVAQFDGILGLGYPSIAEGGALPVFDRMMHQNLVVEPIFSVLINREMDSDYGG 233
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E++ GG++ + + G +VPVT++GYWQ + ++ I T +C GCAAIVD+GTSL+
Sbjct: 234 ELLLGGINHECYTGSINWVPVTERGYWQIRMDNVKIDGMLT-LCINGCAAIVDTGTSLIT 292
Query: 298 GPTPVVTEINHAIG----GEGVVSAECKLVVS 325
GP + +++ +G G+G +CK + S
Sbjct: 293 GPEKEIRKLHKQLGAMSVGDGEYVVDCKRISS 324
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 68/105 (64%), Gaps = 1/105 (0%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
EK + +++ ++ GE ++DC RI +M +V+FTIG+ F+LSP Y+ K +G
Sbjct: 295 EKEIRKLHKQLGAMSVGDGEYVVDCKRISSMASVTFTIGEVEFSLSPNDYV-KKFQGDHS 353
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+C+SGF D+ GPLWILGDVF+ ++T+FD G R+GFA +
Sbjct: 354 LCLSGFQEMDMVTRAGPLWILGDVFLTKFYTIFDRGNDRVGFARS 398
>gi|118138205|pdb|2I4Q|A Chain A, Human ReninPF02342674 COMPLEX
gi|118138206|pdb|2I4Q|B Chain B, Human ReninPF02342674 COMPLEX
Length = 336
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 8 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 67
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 68 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 126
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 127 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 186
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 187 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 245
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 252 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 311
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 312 GATFIRKFYTEFDRRNNRIGFALA 335
>gi|1065326|pdb|1HRN|A Chain A, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1065327|pdb|1HRN|B Chain B, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1310896|pdb|1BIM|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310897|pdb|1BIM|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310898|pdb|1BIL|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310899|pdb|1BIL|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|241913388|pdb|3GW5|A Chain A, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|241913389|pdb|3GW5|B Chain B, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|283807203|pdb|3KM4|A Chain A, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
gi|283807204|pdb|3KM4|B Chain B, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
Length = 337
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 9 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 68
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 69 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 127
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 128 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 187
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 188 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 246
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 48/84 (57%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
++ C+ PT+P++SF +G K + L+ Y+ + ++C A D+PPP GP W L
Sbjct: 253 VVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWAL 312
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 313 GATFIRKFYTEFDRRNNRIGFALA 336
>gi|6978719|ref|NP_037070.1| cathepsin E precursor [Rattus norvegicus]
gi|1113084|dbj|BAA07285.1| cathepsin E precursor [Rattus norvegicus]
Length = 365
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SSACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ C SGF D+ PP GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 318 QFCGSGFQGLDIQPPAGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 363
>gi|149058665|gb|EDM09822.1| cathepsin E, isoform CRA_c [Rattus norvegicus]
Length = 365
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/46 (52%), Positives = 32/46 (69%)
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ C SGF D+ PP GPLWILGDVF+ +++VFD G ++G A A
Sbjct: 318 QFCGSGFQGLDIQPPAGPLWILGDVFIRKFYSVFDRGNNQVGLAPA 363
>gi|355558837|gb|EHH15617.1| hypothetical protein EGK_01732 [Macaca mulatta]
Length = 401
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 165/253 (65%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|46397366|sp|P14091.2|CATE_HUMAN RecName: Full=Cathepsin E; Contains: RecName: Full=Cathepsin E form
I; Contains: RecName: Full=Cathepsin E form II; Flags:
Precursor
Length = 401
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 164/253 (64%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFATQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++C + MP+V+FTI + LSP Y L + C SGF D+ PP GPL
Sbjct: 313 GEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPAGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 373 WILGDVFIRQFYSVFDRGNNRVGLAPA 399
>gi|388326405|pdb|3VCM|A Chain A, Crystal Structure Of Human Prorenin
gi|388326406|pdb|3VCM|B Chain B, Crystal Structure Of Human Prorenin
Length = 335
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 110/248 (44%), Positives = 167/248 (67%), Gaps = 7/248 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD GG+IV GG D
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRD---SLGGQIVLGGSD 177
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + +
Sbjct: 178 PQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEK 236
Query: 306 INHAIGGE 313
+ A+G +
Sbjct: 237 LMEALGAK 244
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 232 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 291
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 292 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 334
>gi|392586802|gb|EIW76137.1| Asp-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 409
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 190/340 (55%), Gaps = 31/340 (9%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK--RRLDLHSLNAARITRK-------ERYMGGAGVSG 62
L +A +LLP +S G+ ++ L+K + H+ ++ K + + GAG +G
Sbjct: 3 LSAIAPLILLPFASAGVHKLKLQKLPQITPGHTHETTYLSHKYGGQVAQQVPLMGAGGAG 62
Query: 63 VRHRLGDSDEDI-------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNL 109
R D+D+ +PL NFM+AQYF EI +GSP Q F VI DTGSSNL
Sbjct: 63 RNFRPSPHDDDLFWTQEVAVEGGHTVPLSNFMNAQYFTEIELGSPAQTFKVILDTGSSNL 122
Query: 110 WVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
WVPS++C SI+C+ H++Y S S +Y G I YG+GS+ GF SQD +++GDV +
Sbjct: 123 WVPSAQCT-SIACFLHAKYDSSSSASYKANGTEFSIQYGTGSMEGFVSQDTLKIGDVSIS 181
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
Q F EAT+E LTF +FDGI+GLG+ I+V P NM+ QGL+ E +FSF L
Sbjct: 182 HQDFAEATKEPGLTFAFGKFDGILGLGYDTISVNHITPPVYNMINQGLLDEPLFSFRLGS 241
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+GGE VFGG+D + G YVPV +K YW+ EL + G + G A +
Sbjct: 242 --SESDGGEAVFGGIDHSAYTGDIEYVPVRRKAYWEVELEKVSFGGDELELESTGAA--I 297
Query: 290 DSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
D+GTSL+A PT V +N IG + G + +C V S
Sbjct: 298 DTGTSLIALPTDVAEMLNTQIGAKRSWNGQYTIDCSKVPS 337
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC ++P++P+ +F G K + L YIL+ ++ CIS F D+ P G L
Sbjct: 326 GQYTIDCSKVPSLPDFTFYFGGKPYPLKGSDYILE----VSGTCISSFTGMDINLPGGAL 381
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y+TV+D GK +GFA+A
Sbjct: 382 WIVGDVFLRRYYTVYDLGKDAVGFAKA 408
>gi|281339451|gb|EFB15035.1| hypothetical protein PANDA_018433 [Ailuropoda melanoleuca]
Length = 388
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 124/258 (48%), Positives = 160/258 (62%), Gaps = 15/258 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR+ +SNT
Sbjct: 51 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRFYPSQSNT 109
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG----------DVVVKDQVFIEATREGSLTFL 185
Y+ +G I YG+GS+SG D V+V +VV Q F E+ E TF+
Sbjct: 110 YSVLGSHFSIQYGTGSLSGIIGADQVDVTFFWVFSRQVEGLVVVGQQFGESVTEPGQTFV 169
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D
Sbjct: 170 NAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYD 229
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
HF G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ V +
Sbjct: 230 HSHFSGNLHWVPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITGPSDKVKQ 288
Query: 306 INHAIGGE---GVVSAEC 320
+ AIG E G EC
Sbjct: 289 LQKAIGAEPMDGEYGVEC 306
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 1/90 (1%)
Query: 417 PM-GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
PM GE ++C + MP+V+FTI + L P Y L E C SGF D+ PP
Sbjct: 297 PMDGEYGVECANLNVMPDVTFTINGISYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPA 356
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
GPLWILGDVF+ +++VFD G R+G A A
Sbjct: 357 GPLWILGDVFIRRFYSVFDRGNNRVGLAPA 386
>gi|291416270|ref|XP_002724368.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 116/247 (46%), Positives = 163/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GLG
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLG 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 64/131 (48%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENVISCSAISSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YILK + CISG +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CISGLEGMNLDTSTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|326523981|dbj|BAJ97001.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 160/246 (65%), Gaps = 3/246 (1%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD +PL ++M+AQY+ EIGIG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H R
Sbjct: 86 GDHPHHGVPLTDYMNAQYYAEIGIGTPPQPFGVVMDTGSSNLWVPSTRCS-SIACWLHRR 144
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ + KS+T+ E G I YGSGS+ G S D V +GD+ + + F E+T+E + F L
Sbjct: 145 FDATKSSTFKENGTDFAIRYGSGSLEGVISTDTVTIGDLELTETDFGESTKEPGIAFALG 204
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDP 246
+FDGI+GLG+ IAV VP + M+ Q L+ + +F+FWL + + DAE GGE+VFG +D
Sbjct: 205 KFDGIMGLGYDTIAVQQVVPPFYQMINQKLIDKPLFTFWLGDTNKDAENGGELVFGEIDK 264
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
H++G Y PV +KGYW+ + ++LI ++ G A +D+GTSL+A PT I
Sbjct: 265 DHYEGDIVYAPVVRKGYWEVKFNELLINDEPADFL-GNATAAIDTGTSLIACPTEAAETI 323
Query: 307 NHAIGG 312
N +G
Sbjct: 324 NTMLGA 329
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 63/105 (60%), Gaps = 6/105 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEV---- 461
IN + + N +G+ +DC + ++P ++FT G F L+P Y+L+ +G I
Sbjct: 323 INTMLGATKNFLGQWTLDCATLDSLPTLTFTFGGHKFPLAPTDYVLQVSGSPIGGGGGEA 382
Query: 462 -CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
CISGFM D+PP G LWI+GDVF+ Y TV+D G R+GFA A
Sbjct: 383 QCISGFMGIDMPPQLGQLWIVGDVFLRRYFTVYDKGNNRVGFATA 427
>gi|169770745|ref|XP_001819842.1| vacuolar protease A [Aspergillus oryzae RIB40]
gi|238486794|ref|XP_002374635.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|21392388|dbj|BAC00850.1| pepsinogen [Aspergillus oryzae]
gi|83767701|dbj|BAE57840.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699514|gb|EED55853.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|391867458|gb|EIT76704.1| aspartyl protease [Aspergillus oryzae 3.042]
Length = 397
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 192/322 (59%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL +S + ++ L K + +LH+++ ++YMG ++ L +
Sbjct: 5 LVTASVLLGCASAEVHKLKLNKVPVSEQFNLHNIDTHVQALGQKYMGIR--PNIKQDLLN 62
Query: 70 SD-------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D+L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 ENPINDMGRHDVL-VDNFLNAQYFSEIEIGTPPQKFKVVLDTGSSNLWVPSSECG-SIAC 120
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VKDQ+F EAT E L
Sbjct: 121 YLHNKYDSSSSSTYQKNGSEFAIKYGSGSLSGFVSQDTLKIGDLKVKDQLFAEATSEPGL 180
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLGF I+V P + +M++QGL+ E VF+F+L + FG
Sbjct: 181 AFAFGRFDGILGLGFDTISVNKIPPPFYSMLDQGLLDEPVFAFYLGDTNKEGDDSVATFG 240
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ +P+ +K YW+ +L I +G+ + G I+D+GTSL+A PT +
Sbjct: 241 GVDKDHYTGELVKIPLRRKAYWEVDLDAIALGDSVAELDNTGV--ILDTGTSLIALPTTL 298
Query: 303 VTEINHAIGGE----GVVSAEC 320
IN IG + G S +C
Sbjct: 299 AELINKEIGAKKGFTGQYSVDC 320
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+ ++P+++FT+ F + P Y L+ + CIS FM D P P GPL
Sbjct: 314 GQYSVDCDKRDSLPDLTFTLSGYNFTIGPYDYTLE----VQGSCISAFMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 370 AILGDAFLRKWYSVYDLGNGAVGLAKA 396
>gi|171679543|ref|XP_001904718.1| hypothetical protein [Podospora anserina S mat+]
gi|170939397|emb|CAP64625.1| unnamed protein product [Podospora anserina S mat+]
Length = 397
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 184/312 (58%), Gaps = 20/312 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE- 72
+ A+ LL A + G ++ LKK L L A + + +++G + G+R + ++
Sbjct: 6 LTAAVLLGAAQAGGTHKLKLKKVPL-AEQLEAVPLETQMKHLGQKYM-GIRPQQSHANAV 63
Query: 73 -----------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS C SI+
Sbjct: 64 FQGSLADPKGIHPVPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVDC-GSIA 122
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HS+Y S S+T+ G S EI YGSGS+SG+ SQD + +GD+ +K+Q F EAT E
Sbjct: 123 CYLHSKYDSSASSTFKANGSSFEIRYGSGSLSGYVSQDTMTIGDIKIKEQDFAEATSEPG 182
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLGF I+V VP + M+EQ L+ E VF+F L D E E+ F
Sbjct: 183 LAFAFGRFDGIMGLGFDRISVNGIVPPFYKMIEQKLIDEPVFAFKL---ADTEGESEVTF 239
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD +KGK +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 240 GGVDKDAYKGKLITIPLRRKAYWEVDFDAISYGDDTADLENTGI--ILDTGTSLIALPSQ 297
Query: 302 VVTEINHAIGGE 313
+ +N IG +
Sbjct: 298 LAEMLNAQIGAK 309
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + +M +V+F + F L P Y+L+ G CIS F D+P P GPL
Sbjct: 314 GQYTVDCAKRDSMKDVTFNLAGYNFTLGPYDYVLEAGSS----CISSFFPMDMPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G + AEA
Sbjct: 370 AILGDSFLRRYYSIYDLGANTVSLAEA 396
>gi|119491657|ref|XP_001263323.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
gi|119411483|gb|EAW21426.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
Length = 398
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 186/322 (57%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL ++S + ++ L K LD H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMG---IRPNVHQELL 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
L D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVP S C SI+C
Sbjct: 63 EENSLNDMSRHDVLVDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
+ H++Y S S+TY G I YGSG +SGF SQD +++GD+ V Q F EAT E L
Sbjct: 122 FLHNKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V VP + NM+EQGL+ E VF+F+L + E FG
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLGDTNKEGDNSEASFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ T +P+ +K YW+ + I +G+ + G I+D+GTSL+A P+ +
Sbjct: 242 GVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGV--ILDTGTSLIALPSTL 299
Query: 303 VTEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 300 ADLLNKEIGAKKGFTGQYSIEC 321
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+CD+ ++P+++FT+ F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYSIECDKRDSLPDLTFTLAGHNFTIGPYDYTLE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G AEA
Sbjct: 371 AILGDAFLRKWYSVYDLGNNAVGLAEA 397
>gi|444731560|gb|ELW71913.1| Cathepsin D [Tupaia chinensis]
Length = 684
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 192/347 (55%), Gaps = 61/347 (17%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGD--SDEDILP----------LKNFMDAQ 84
R+ LH + R T E MGG + + H S E P LKN+MDAQ
Sbjct: 23 RIPLHKFPSIRRTLTE--MGGPVENLIAHEPISKYSQEAPTPAATKGPVPEILKNYMDAQ 80
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+VIFDTGS+NLWVPS C +C+FH +Y S+KS+TY + G S
Sbjct: 81 YYGEIGIGTPPQCFTVIFDTGSANLWVPSIHCGMLDFACWFHHKYNSKKSSTYAKNGSSF 140
Query: 144 EINYGSGS--------------------------------ISGFFSQDNVEVG------- 164
+I+Y SGS +S SQ + E
Sbjct: 141 DIHYRSGSQWLRQPLRVPEPGHRVGTDIDPVLRDQELWGNMSRGDSQPHTEPSCWKVPCH 200
Query: 165 --DVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
V V Q F EAT++ +TFL A+FDGI+G+ + I+V + VPV+DN+++Q LV + +
Sbjct: 201 TVSVRVDKQTFGEATKQPGITFLAAKFDGILGMAYPRISVDNVVPVFDNLMKQKLVEKNI 260
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
F+F+LNRDP + GGE++ GGVD K++ G Y VT+K YWQ + + +G+ T +C+
Sbjct: 261 FAFYLNRDPSGQPGGELMLGGVDTKYYTGSLDYYNVTRKAYWQIHMDKLEVGDGLT-LCQ 319
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
GC IVD+GTSL+ GP V E++ A+G ++ E C+ V S
Sbjct: 320 EGCEVIVDTGTSLIVGPVDEVRELHKAMGAVPLIQGEYMIPCEKVAS 366
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 38/98 (38%), Positives = 64/98 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++ ++P GE +I C+++ ++P ++ +G+K ++L E+Y +K +G + +SGF
Sbjct: 343 LHKAMGAVPLIQGEYMIPCEKVASLPQITIRLGNKDYHLKGEEYTIKVSQGGKPLGLSGF 402
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
M +PPP GPLWILGDVF+G Y+ VFD R+G E
Sbjct: 403 MGMHIPPPAGPLWILGDVFIGCYYAVFDRDNNRVGPLE 440
>gi|109287598|emb|CAJ55261.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 125/307 (40%), Positives = 180/307 (58%), Gaps = 18/307 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-SGVRHRLGDSDE 72
+L SC L SS+ L+RI LKK + + R T +E M A V ++HR DE
Sbjct: 9 LLISCFLC-FSSDALQRISLKK-------MPSIRETLQEMGMKVADVLPSLKHRFSYLDE 60
Query: 73 DI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
+ L NF D QY+GEI IG+P Q F V+FDTGSSNLWVPS +C +C H
Sbjct: 61 GLHNKTASTILTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSHQCSPLYSACVSH 120
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+RY S +S+TY G + YG G I GF SQD V V D+ + Q F EA S+ F+
Sbjct: 121 NRYDSSESSTYKPKGTKITLTYGQGYIEGFLSQDIVRVADIPIT-QFFTEAIALPSIPFM 179
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDG++G+G+ + A+G +PV+DN++ + ++SE VFS + +R ++ GGEI+ GG D
Sbjct: 180 YAHFDGVLGMGYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSD 239
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ G YV +++GYW +L + I N+ +C GC A +D+GTS ++GP ++
Sbjct: 240 PSHYTGDFHYVSTSREGYWHVDLKGVSIENK-IALCHDGCTATIDTGTSFISGPASSISV 298
Query: 306 INHAIGG 312
+ IG
Sbjct: 299 LMETIGA 305
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 55/87 (63%), Gaps = 2/87 (2%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +IDC++I +P++SF +GD ++LS Y+LK + C F A D+PPPRGPL
Sbjct: 310 GDYVIDCNQINLLPDISFHLGDMTYSLSSSTYVLKYSDETE--CTVAFSAIDIPPPRGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
W+LG F+ Y+ FD RIGFA +
Sbjct: 368 WLLGATFIKQYYIEFDRQNNRIGFATS 394
>gi|154284392|ref|XP_001542991.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
gi|150406632|gb|EDN02173.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
Length = 398
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 183/301 (60%), Gaps = 12/301 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG + GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV+ QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVESQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP + M+ L+ E +FSF+L + E+VFGG++ F GK T +P+ +K Y
Sbjct: 203 KIVPPFYEMLNNNLLDEPMFSFYLGDANVDSDDSEVVFGGMNEDRFTGKLTKIPLRRKAY 262
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAE 319
W+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G + E
Sbjct: 263 WEVDLDSITFGKQTALMSNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVE 320
Query: 320 C 320
C
Sbjct: 321 C 321
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 48/85 (56%), Gaps = 4/85 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C + ++PN++F + F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYTVECAKRDSLPNLTFGLSGHNFTIGPYDYTLE----VQGTCISSFMGMDFPAPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
ILGD F+ Y+TV+D G +G A
Sbjct: 371 AILGDAFLRRYYTVYDLGNDAVGLA 395
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 193/339 (56%), Gaps = 34/339 (10%)
Query: 14 VLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-------GAGVSGVRH 65
V A LLLP A++ G+ ++ L K + + + E+Y G GAG +G +
Sbjct: 5 VFAPLLLLPFATAAGVHKLKLHKIQRENANPYLETAYLSEKYGGDSQLPLMGAGGAGRQL 64
Query: 66 RLG-----DSDEDIL------------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSN 108
RL + E++L PL NFM+AQYF I +G+PPQ F VI DTGSSN
Sbjct: 65 RLARPSVNEEGENLLWTQEMINGGHNVPLTNFMNAQYFTTITLGTPPQEFKVILDTGSSN 124
Query: 109 LWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
LWVPS+KC SI+C+ H++Y S S+T+ + G S +I YGSGS+ GF S D + +GD+ +
Sbjct: 125 LWVPSTKCT-SIACFLHAKYDSSASSTHKKNGTSFKIEYGSGSMEGFVSNDVLSIGDLKI 183
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
DQ F EAT+E L F +FDGI+GLG+ I+V P + +MV +GL+ VFSF L
Sbjct: 184 HDQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHITPPFYSMVNKGLLDAPVFSFRLG 243
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
E+GGE VFGG+D + GK Y PV +K YW+ EL + G+ + G A
Sbjct: 244 S--SEEDGGEAVFGGIDESAYSGKINYAPVRRKAYWEVELPKVAFGDDVLELENTGAA-- 299
Query: 289 VDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 323
+D+GTSL+A P+ V +N IG G + +CK V
Sbjct: 300 IDTGTSLIALPSDVAEMLNAQIGATKSWNGQYTVDCKKV 338
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P +P+ + + + L YIL+ + CIS F D+ P G L
Sbjct: 329 GQYTVDCKKVPDLPDFTLWFNGQAYPLKGSDYILE----VQGTCISSFTGLDINVPGGSL 384
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y TV+D G+ +GFA +
Sbjct: 385 WIIGDVFLRRYFTVYDHGRDAVGFANS 411
>gi|336373584|gb|EGO01922.1| hypothetical protein SERLA73DRAFT_177556 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386403|gb|EGO27549.1| hypothetical protein SERLADRAFT_461213 [Serpula lacrymans var.
lacrymans S7.9]
Length = 413
Score = 231 bits (589), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 131/299 (43%), Positives = 175/299 (58%), Gaps = 25/299 (8%)
Query: 46 ARITRKERYMGGAGVSGVRHRLGDSDEDI---------------LPLKNFMDAQYFGEIG 90
A T ++ + GAG +G RH D ED +PL NFM+AQY+ EI
Sbjct: 49 AETTYQQLPLMGAGGAG-RHIRPDRPEDSDLFWTQEELVKGGHGVPLTNFMNAQYYTEIT 107
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
+GSP Q F VI DTGSSNLWVPSSKC SI+C+ H++Y S S+TY G I YGSG
Sbjct: 108 LGSPAQTFKVILDTGSSNLWVPSSKCT-SIACFLHTKYDSSSSSTYKANGTEFSIQYGSG 166
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S+ GF SQ+++++GD+ ++ Q F EAT+E L F +FDGI+GLG+ I+V P +
Sbjct: 167 SMEGFVSQESMKIGDLSIQHQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHITPPFY 226
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NM++QGL+ E +FSF L D +GGE VFGG+D + G TYVPV +K YW+ EL
Sbjct: 227 NMIDQGLLDEPLFSFRLGSSED--DGGEAVFGGIDSSAYTGSITYVPVRRKAYWEVELEK 284
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 325
+ G + G A +D+GTSL+A PT V +N IG G +C V S
Sbjct: 285 VSFGGDELDLENTGAA--IDTGTSLIALPTDVAEMLNTQIGATRSWNGQYQVDCAKVPS 341
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 52/88 (59%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P++P +SF G K + L YIL + CIS F D+ P G L
Sbjct: 330 GQYQVDCAKVPSLPELSFYFGGKPYPLKGTDYILN----VQGTCISAFTGLDINLPGGAL 385
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI+GDVF+ Y TV+D G+ +GFA AA
Sbjct: 386 WIIGDVFLRRYFTVYDLGRDAVGFATAA 413
>gi|255639243|gb|ACU19920.1| unknown [Glycine max]
Length = 177
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 102/166 (61%), Positives = 135/166 (81%), Gaps = 2/166 (1%)
Query: 342 EKVCQQIGLCAFNGAEYVSTGIKTVVEKEN--VSAGDSAVCSACEMAVVWVQNQLKQKQT 399
+ +C Q+GLC+ E S GI+ V EKE ++A D+ +CS+C+M V+W+QNQLKQK T
Sbjct: 11 DDICSQVGLCSSKRHESKSAGIEMVTEKEQGELTARDNPLCSSCQMLVLWIQNQLKQKAT 70
Query: 400 KEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIA 459
K++V +Y+N+LC+SLP+P GES+I C+ + MPN++FTIG+K F L+PEQYILKTGEGI
Sbjct: 71 KDRVFNYVNQLCESLPSPSGESVISCNSLSKMPNITFTIGNKPFVLTPEQYILKTGEGIT 130
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
EVC+SGF+AFD+PPP+GPLWILGDVFM YHTVFD G L++GFAEA
Sbjct: 131 EVCLSGFIAFDVPPPKGPLWILGDVFMRAYHTVFDYGNLQVGFAEA 176
>gi|195134378|ref|XP_002011614.1| GI11124 [Drosophila mojavensis]
gi|193906737|gb|EDW05604.1| GI11124 [Drosophila mojavensis]
Length = 373
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 123/293 (41%), Positives = 180/293 (61%), Gaps = 17/293 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L+S+ L V+ L +S L R+ + K + +E +
Sbjct: 1 MLKSITVLAVV-----LAVASAELHRVPILKHE--------NFVKTRENVKAEKAYLRAK 47
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
+ L ++ + L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS C +F ++C
Sbjct: 48 YNLPNARLNEEELSNSINMAYYGTISIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACM 107
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y KS+TY G+S I YGSGS+SGF S D V+V +V+K Q F EAT E +
Sbjct: 108 NHNQYDHDKSSTYEANGESFSIQYGSGSLSGFLSTDTVDVNGLVIKKQTFAEATSEPGNS 167
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F ++FDGI+G+ ++ +AV + VP + NMV QGLV E VFSF+L RD + EGGE++FGG
Sbjct: 168 FTNSKFDGILGMAYQSLAVDNVVPPFYNMVSQGLVDESVFSFYLARDGTSNEGGELIFGG 227
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
D + G+ TYVP++++GYWQF + I I Q+ +C+ C AI D+GTSLL
Sbjct: 228 SDSSLYTGELTYVPISQQGYWQFAVDSISIDGQT--LCD-NCQAIADTGTSLL 277
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF-- 466
++ ++L N + ++DC + +MP ++ IG F L P QYI+++ C SGF
Sbjct: 285 DILNNLLNVDEDGLVDCSAVDSMPVLNLNIGGTKFTLEPAQYIIQSDGD----CQSGFEF 340
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 341 MGTDF-------WILGDVFIGKYYTEFDLGNNRIGFAPVA 373
>gi|225556537|gb|EEH04825.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus G186AR]
Length = 398
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 189/302 (62%), Gaps = 14/302 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG + GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV++QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VP + M+ + L+ E +FSF+L + + D +E E+VFGG++ F G+ T +P+ +K
Sbjct: 203 KIVPPFYEMLNKNLLDEPMFSFYLGDANVDGDE-SEVVFGGMNKNRFMGELTKIPLRRKA 261
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSA 318
YW+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G +
Sbjct: 262 YWEVDLDSITFGKQTAMMANTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTI 319
Query: 319 EC 320
EC
Sbjct: 320 EC 321
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 48/85 (56%), Gaps = 4/85 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+C + ++PN++F + F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYTIECAKRDSLPNLTFGLSGHNFTIGPYDYTLE----VQGTCISSFMGMDFPAPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
ILGD F+ Y+TV+D G +G A
Sbjct: 371 AILGDAFLRRYYTVYDLGNDAVGLA 395
>gi|355558869|gb|EHH15649.1| Renin [Macaca mulatta]
gi|355746005|gb|EHH50630.1| Renin [Macaca fascicularis]
Length = 406
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 196/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLALGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR-DPDA 233
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR +A
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRWGLNA 235
Query: 234 EE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ + + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIPMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 57/103 (55%), Gaps = 2/103 (1%)
Query: 405 SYINELCDSL--PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
S I +L ++L + + ++ C+ PT+P++SF +G K + L+ Y+ + ++C
Sbjct: 303 SSIEKLMEALGAKKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLC 362
Query: 463 ISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
A D+PPP GP W LG F+ ++T FD RIGFA A
Sbjct: 363 TLAIHAMDIPPPTGPTWALGATFIRKFYTEFDRRNNRIGFALA 405
>gi|115396430|ref|XP_001213854.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
gi|114193423|gb|EAU35123.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
Length = 397
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 134/325 (41%), Positives = 199/325 (61%), Gaps = 26/325 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +L+ +S + ++ L K LD +++A ++YMG + LGD
Sbjct: 6 LLTASVLVGCASAEVHKLKLNKLPLDEQLFTQNIDAHIHALGQKYMGVR--PNQQEPLGD 63
Query: 70 S------DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + ++L + NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 NPVNDLGNHNVL-VDNFMNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACY 121
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G I YGSGS+SGF S+D +++GD+ +K+Q+F EAT E L
Sbjct: 122 LHNKYDSSASSTYKKNGTEFSIRYGSGSLSGFVSEDTLKIGDLTIKEQLFAEATNEPGLA 181
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA-EEGGEIV-- 240
F RFDGI+GLGF I+V P + MV QGL+ E VF+F+L DA +EG E V
Sbjct: 182 FAFGRFDGILGLGFDTISVNRIEPPFYKMVNQGLLDEPVFAFYLG---DANKEGDESVAT 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+ G+ +P+ +K YW+ +L I +G+++ + G I+D+GTSL+A P+
Sbjct: 239 FGGVDKSHYTGELIKIPLRRKAYWEVDLDAITLGDETADLENTGV--ILDTGTSLIALPS 296
Query: 301 PVVTEINHAIGGE----GVVSAECK 321
+ IN IG + G S +C+
Sbjct: 297 NLAEMINAQIGAKKGFTGQYSVDCE 321
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC++ ++P+++F + F + P Y L+ + CIS FM D P P GPL
Sbjct: 314 GQYSVDCEKRSSLPDITFALSGHNFTIGPYDYTLE----VQGSCISAFMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 370 AILGDAFLRKWYSVYDLGNGAVGLAKA 396
>gi|283806592|ref|NP_001164549.1| pepsin II-1 precursor [Oryctolagus cuniculus]
gi|129777|sp|P28712.1|PEPA1_RABIT RecName: Full=Pepsin II-1; AltName: Full=Pepsin A; Flags: Precursor
gi|22218074|dbj|BAC07514.1| pepsinogen II-1 [Oryctolagus cuniculus]
Length = 387
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 112/247 (45%), Positives = 165/247 (66%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C+ H R+ S+T+
Sbjct: 67 LENYLDAEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACFLHKRFNPDDSSTF 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG++ +Q+F + E +TFL+A FDGI+GL
Sbjct: 126 QATSETLSITYGTGSMTGILGYDTVKVGNIEDTNQIFGLSKTEPGITFLVAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVSE++FS +L+ + E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSEDLFSVYLSS--NGEKGSMVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I I N T C C A+VD+GTSLLAGPT +++I IG
Sbjct: 244 PVSHEGYWQITMDSITI-NGETIACADSCQAVVDTGTSLLAGPTSAISKIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE ++S
Sbjct: 303 LGENIIS 309
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 69/134 (51%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G++ C+ AVV L T K+ SYI + N +GE+II C I ++P
Sbjct: 262 GETIACADSCQAVVDTGTSLLAGPTSAISKIQSYIG----ASKNLLGENIISCSAIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + L YILK + C+SGF +L G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINNVQYPLPASAYILKEDDD----CLSGFDGMNLDTSYGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A AA
Sbjct: 374 FDRANNQVGLAAAA 387
>gi|342882947|gb|EGU83511.1| hypothetical protein FOXB_05921 [Fusarium oxysporum Fo5176]
Length = 396
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 122/251 (48%), Positives = 160/251 (63%), Gaps = 12/251 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G EI+YGSGS+SGF S D V +GD+ +KDQ F EAT+E L F RFDGI+G
Sbjct: 135 TYKENGTEFEIHYGSGSLSGFVSNDVVSIGDLEIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ IAV VP + MV Q L+ E VF+F+L+ D E E FGG+D F G
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYLD---DQEGESEATFGGIDKSKFTGDIE 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I G++ V E AI+D+GTSL P+ + +N IG +
Sbjct: 252 YIPLRRKAYWEVDLEAIAFGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGAK 308
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 309 KGYNGQYTIEC 319
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+CD+ ++P+++F + ++L YIL+ + CIS F D P P GPL
Sbjct: 313 GQYTIECDKRASLPDITFNLAGSNYSLPATDYILE----VQGSCISTFQGMDFPEPVGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A A
Sbjct: 369 VILGDAFLRRYYSVYDLGKNAVGLARA 395
>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
Length = 408
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 160/255 (62%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI IG+PPQ+F VI DTGSSNLWVPS KC SI+C+ H++Y S S+
Sbjct: 87 VPLSNFMNAQYFTEISIGNPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSASSS 145
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ G I+YGSGS+ GF S D + +GD+ +K Q F EA +E L F +FDGI+G
Sbjct: 146 TFKANGSEFSIHYGSGSMEGFVSNDLLSIGDITIKGQDFAEAVKEPGLAFAFGKFDGILG 205
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V +P + +M+ QGL+ VFSF L E+GGE VFGG+D +KGK T
Sbjct: 206 LGYDTISVNHIIPPFYSMINQGLIDSPVFSFRLGS--SEEDGGEAVFGGIDESAYKGKIT 263
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +K YW+ EL + GN + G A +D+GTSL+ PT + +N IG +
Sbjct: 264 YVPVRRKAYWEVELEKVSFGNDDLELESTGAA--IDTGTSLIVLPTDIAEMLNTQIGAKK 321
Query: 314 ---GVVSAECKLVVS 325
G +C V S
Sbjct: 322 SWNGQYQVDCAKVPS 336
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 53/88 (60%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P++P +SF G K + L YIL+ + CIS F DL P G L
Sbjct: 325 GQYQVDCAKVPSLPELSFYFGGKPYPLKGTDYILE----VQGTCISAFTGMDLNLPGGSL 380
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI+GD F+ Y TV+D G+ +GFAEAA
Sbjct: 381 WIIGDAFLRRYFTVYDLGRNAVGFAEAA 408
>gi|50978946|ref|NP_001003194.1| renin precursor [Canis lupus familiaris]
gi|62287424|sp|Q6DYE7.1|RENI_CANFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50058380|gb|AAT68959.1| preprorenin [Canis lupus familiaris]
Length = 403
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 191/319 (59%), Gaps = 21/319 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG----- 62
+ LW SC LPA + RRI LKK + + R + KER + AG+
Sbjct: 12 LLVLW--GSCTFGLPADTGAFRRIFLKK-------MPSIRESLKERGVDVAGLGAEWNQF 62
Query: 63 -VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
R G+S ++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS++C
Sbjct: 63 TKRLSSGNSTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTRCSPLYT 121
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H Y S +S++Y E G + I YGSG + GF SQD V VG + V Q F E T
Sbjct: 122 ACEIHCLYDSSESSSYMENGTTFTIRYGSGKVKGFLSQDMVTVGGITVT-QTFGEVTELP 180
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + +R+ GGE+V
Sbjct: 181 LIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYSRNSHL-LGGEVV 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP++++G YV ++K G WQ ++ + + +T VCE GC +VD+G S ++GPT
Sbjct: 240 LGGSDPQYYQGNFHYVSISKTGSWQIKMKGVSV-RSATLVCEEGCMVVVDTGASYISGPT 298
Query: 301 PVVTEINHAIGGEGVVSAE 319
+ + +G + + + E
Sbjct: 299 SSLRLLMDTLGAQELSTNE 317
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 56/86 (65%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C+++PT+P++SF +G + + L+ + Y+L+ G ++C D+PPP GP+W
Sbjct: 317 EYVVNCNQVPTLPDISFHLGGRAYTLTSKDYVLQDPYGNEDLCTLALHGLDVPPPTGPVW 376
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 377 VLGASFIRKFYTEFDRHNNRIGFALA 402
>gi|145232965|ref|XP_001399855.1| vacuolar protease A [Aspergillus niger CBS 513.88]
gi|134056777|emb|CAK37685.1| aspartic protease pepE-Aspergillus niger
Length = 398
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 188/313 (60%), Gaps = 21/313 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 EENPINDMSRHDVLVDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E L
Sbjct: 122 YLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-- 240
F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L +EG E V
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGD--TNKEGDESVAT 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+ G+ +P+ +K YW+ EL I +G+ + G I+D+GTSL+A P
Sbjct: 240 FGGVDKDHYTGELIKIPLRRKAYWEVELDAIALGDDVAEMENTGV--ILDTGTSLIALPA 297
Query: 301 PVVTEINHAIGGE 313
+ IN IG +
Sbjct: 298 DLAEMINAQIGAK 310
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+ ++P+V+FT+ F +S Y L+ + C+S FM D P P GPL
Sbjct: 315 GQYTVDCDKRSSLPDVTFTLAGHNFTISSYDYTLE----VQGSCVSAFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 371 AILGDAFLRKWYSVYDLGNSAVGLAKA 397
>gi|530795|gb|AAA20876.1| pepsinogen [Aspergillus niger]
gi|350634685|gb|EHA23047.1| extracellular aspartic protease [Aspergillus niger ATCC 1015]
Length = 398
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 188/313 (60%), Gaps = 21/313 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 EENPINDMSRHDVLVDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E L
Sbjct: 122 YLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFVSQDTLKIGDLKVKGQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-- 240
F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L +EG E V
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGD--TNKEGDESVAT 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+ G+ +P+ +K YW+ EL I +G+ + G I+D+GTSL+A P
Sbjct: 240 FGGVDKDHYTGELIKIPLRRKAYWEVELDAIALGDDVAEMENTGV--ILDTGTSLIALPA 297
Query: 301 PVVTEINHAIGGE 313
+ IN IG +
Sbjct: 298 DLAEMINAQIGAK 310
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+ ++P+V+FT+ F +S Y L+ + C+S FM D P P GPL
Sbjct: 315 GQYTVDCDKRSSLPDVTFTLAGHNFTISSYDYTLE----VQGSCVSAFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 371 AILGDAFLRKWYSVYDLGNSAVGLAKA 397
>gi|46138535|ref|XP_390958.1| hypothetical protein FG10782.1 [Gibberella zeae PH-1]
gi|408391598|gb|EKJ70970.1| hypothetical protein FPSE_08829 [Fusarium pseudograminearum CS3096]
Length = 396
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 124/257 (48%), Positives = 163/257 (63%), Gaps = 14/257 (5%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI+YGSGS+SGF S D V +GD+ +KDQ F EAT+E L F RFDGI+G
Sbjct: 135 TYKKNGSEFEIHYGSGSLSGFVSNDVVSIGDLKIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG-GEIVFGGVDPKHFKGKH 253
LG+ IAV VP + MV Q L+ E VF+F+L D +EG E FGGVD + G
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYL----DGQEGQSEATFGGVDKSKYTGDL 250
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+P+ +K YW+ +L I G++ V E AI+D+GTSL P+ + +N IG
Sbjct: 251 EYIPLRRKAYWEVDLDAIAFGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGA 307
Query: 313 E----GVVSAECKLVVS 325
+ G + EC V S
Sbjct: 308 KKGYNGQYTIECDKVSS 324
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+CD++ ++P+++FT+ ++L YIL+ + CIS F D P P GPL
Sbjct: 313 GQYTIECDKVSSLPDITFTLAGSNYSLPSTDYILE----VQGSCISTFQGMDFPEPVGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A A
Sbjct: 369 VILGDAFLRRYYSVYDLGKNAVGLARA 395
>gi|195399277|ref|XP_002058247.1| GJ15982 [Drosophila virilis]
gi|194150671|gb|EDW66355.1| GJ15982 [Drosophila virilis]
Length = 374
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 116/235 (49%), Positives = 160/235 (68%), Gaps = 9/235 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS C +F ++C H++Y KS+T
Sbjct: 61 LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACMNHNQYDHDKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YT G+S I YGSGS+SGF S D V+V +V+K Q F EAT E +F A+FDGI+G+
Sbjct: 121 YTSNGESFSIQYGSGSLSGFLSTDTVDVNGLVIKSQTFAEATSEPGTSFNNAKFDGILGM 180
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
++ +AV + VP + NMV QGLV + VFSF+L RD + +GGE++FGG D + G TY
Sbjct: 181 AYQSLAVDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTY 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
VP++++GYWQF + I QS +C+ C AI D+GTSLL VV+E + I
Sbjct: 241 VPISEQGYWQFTMAGASIDGQS--LCD-NCQAIADTGTSLL-----VVSEAAYDI 287
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 15/101 (14%)
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF- 466
++ +++ N ++DC + +P ++ IG F L P QYI+++ G+ C S F
Sbjct: 286 DILNNVLNVDENGLVDCSTVDKLPVLNLNIGGGKFTLEPAQYIIQSDGQ-----CQSSFE 340
Query: 467 -MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 341 YMGTDF-------WILGDVFIGKYYTEFDLGNNRIGFAPVA 374
>gi|392568782|gb|EIW61956.1| aspartic peptidase A1 [Trametes versicolor FP-101664 SS1]
Length = 415
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 148/382 (38%), Positives = 207/382 (54%), Gaps = 47/382 (12%)
Query: 12 LWVLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV---SGV---- 63
L A LLP ++G+ R+ LKK + + E+Y GG+ V G+
Sbjct: 6 LASFAPLALLPFVVADGVHRMKLKKLPPAISNPQLESAYLAEKYGGGSQVPLGGGIGRNV 65
Query: 64 ---RHRLGDSDE-------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
R + D +E +PL NFM+AQYF EI +G+PPQ+F VI DTGSS
Sbjct: 66 RVSRPTVKDGEELFWTQDEFSTEGGHTVPLSNFMNAQYFAEITLGTPPQSFKVILDTGSS 125
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPS+KC SI+C+ H++Y S S+TY G I YGSGS+ GF S+D + +GD+
Sbjct: 126 NLWVPSTKCT-SIACFLHAKYDSSASSTYKANGSEFSIQYGSGSMEGFVSRDVLTIGDLT 184
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
VK+ F EAT+E L F +FDGI+GLG+ I+V VP + +V QGL+ VFSF L
Sbjct: 185 VKNLDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYALVNQGLLDSPVFSFRL 244
Query: 228 NRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCA 286
D+EE GGE +FGG+D + GK YVPV +K YW+ EL I +G++ + G A
Sbjct: 245 G---DSEEDGGEAIFGGIDDSAYSGKIEYVPVRRKAYWEVELEKIRLGDEELELENTGAA 301
Query: 287 AIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQ 346
+D+GTSL+A P+ + +N IG + + + + ++ DL DL
Sbjct: 302 --IDTGTSLIALPSDLAEMLNAQIGAKKSWNGQYTVDCAKVPDLP-DLTF---------- 348
Query: 347 QIGLCAFNGAEYVSTGIKTVVE 368
FNG YV G V+E
Sbjct: 349 -----FFNGKPYVLKGTDYVLE 365
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP-PPRGP 477
G+ +DC ++P +P+++F K + L Y+L+ + C+S F D+ P G
Sbjct: 331 GQYTVDCAKVPDLPDLTFFFNGKPYVLKGTDYVLE----VQGTCMSSFTGIDINLPGGGA 386
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFA 503
LWI+GDVF+ Y TV+D G+ +GFA
Sbjct: 387 LWIVGDVFLRKYFTVYDLGRDAVGFA 412
>gi|326475448|gb|EGD99457.1| aspartyl proteinase [Trichophyton tonsurans CBS 112818]
gi|326477485|gb|EGE01495.1| vacuolar protease A [Trichophyton equinum CBS 127.97]
Length = 400
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 180/306 (58%), Gaps = 18/306 (5%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-------GDS 70
C S L+++ LK++ L+ ++ + ++YMG +H +S
Sbjct: 15 CTSAKLHSLKLKKVSLKEQ-LERADIDVQVKSLGQKYMGIRPEQHEQHMFKEQTPIEAES 73
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
++L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S
Sbjct: 74 GHNVL-IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDS 131
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+TY++ G I YGSGS+ GF S+DNV++GD+ +K Q+F EAT E L F RFD
Sbjct: 132 SASSTYSKNGTKFAIRYGSGSLEGFVSRDNVKIGDMTIKKQLFAEATSEPGLAFAFGRFD 191
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPK 247
GI+G+GF I+V P + NM++QGL+ E VFSF+L N+D D + FGG D
Sbjct: 192 GIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQS---VVTFGGSDAS 248
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
HF G T +P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN
Sbjct: 249 HFTGDMTTIPLRRKAYWEVDFDAISLGEDTAALENTGV--ILDTGTSLIALPTTLAEMIN 306
Query: 308 HAIGGE 313
IG +
Sbjct: 307 TQIGAK 312
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+V+FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 317 GQYTLDCSKRDSLPDVTFTLSGHNFTIGPHDYTLE----VSGTCISSFMGMDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 373 AILGDSFLRRYYSVYDLGKGTVGLAKA 399
>gi|452981069|gb|EME80829.1| hypothetical protein MYCFIDRAFT_89289 [Pseudocercospora fijiensis
CIRAD86]
Length = 396
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 185/309 (59%), Gaps = 13/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHSLNAARITRK--ERYMGGAGVSGVRHRLGD 69
L + L + G+ ++ L+K L L LN R ++YMG + + +
Sbjct: 4 ALLTSALAAGAQAGVHKMKLQKISLSEQLEGLNIEDHVRHLGQKYMGVRPQNPLSEMFKE 63
Query: 70 SD---EDILPL--KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ ED P+ NF++AQYF +I IG+PPQ F V+ DTGSSNLWVPS C SI+CY
Sbjct: 64 TSVHAEDGHPVAVDNFLNAQYFSQIAIGTPPQEFKVVLDTGSSNLWVPSQDC-GSIACYL 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y +S TY + G I YGSGS+ G+ SQD V++GD+ +K+Q+F EAT E L F
Sbjct: 123 HSKYDHGESTTYKQNGSDFAIRYGSGSLEGYVSQDTVQIGDLKIKNQLFAEATSEPGLAF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V P + NM++QGL+ E+ F+F+L+ +E E +FGGV
Sbjct: 183 AFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKKFAFYLSSTDKGDE-SEAIFGGV 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK +P+ +K YW+ +L I G+Q+ + G AI+D+GTSL+A P+ +
Sbjct: 242 NEDHYTGKMINIPLRRKAYWEVDLDAITFGDQTAEIDATG--AILDTGTSLIALPSTLAE 299
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 300 LLNKEIGAK 308
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++FT+ F + YIL+ + CIS FM FD+P P GPL
Sbjct: 313 GQYTVDCSKRDSLPDLTFTLTGHNFTIDSYDYILE----VQGSCISAFMGFDIPEPAGPL 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 369 AILGDAFLRKWYSVYDLGSNSVGLAKA 395
>gi|195121164|ref|XP_002005091.1| GI20282 [Drosophila mojavensis]
gi|193910159|gb|EDW09026.1| GI20282 [Drosophila mojavensis]
Length = 392
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/256 (43%), Positives = 158/256 (61%), Gaps = 2/256 (0%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N++DAQYFG I IG+P Q F+VIFDTGS+NLWVPS C ++C HSR+ ++KS+
Sbjct: 63 VPLSNYLDAQYFGPISIGTPQQTFNVIFDTGSANLWVPSESCQKKLACQIHSRFNAKKSS 122
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y GK +I YGSGS++G+ S D V V + + +Q F EAT FL A+FDGI G
Sbjct: 123 SYRSNGKRFDIQYGSGSLAGYLSHDTVRVAGLEIPNQTFAEATDMPGPIFLAAKFDGIFG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+R I++ + P + ++EQ L+ VFS +LNR+ + +GG + FGG ++++G T
Sbjct: 183 LGYRGISIQNIKPPFYAIMEQNLLKRPVFSVYLNRELGSNQGGYLFFGGSSSRYYRGNFT 242
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT + YWQ +L IG +C GC I+D+GTS LA P IN +IGG
Sbjct: 243 YVPVTHRAYWQVKLETARIGKLQ--LCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTP 300
Query: 315 VVSAECKLVVSQYGDL 330
+ + Q L
Sbjct: 301 AAYGQFSVPCDQVAHL 316
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 61/99 (61%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P G+ + CD++ +P ++FT+G++ F L E Y+ VC S F
Sbjct: 292 INESIGGTPAAYGQFSVPCDQVAHLPTLTFTLGNRRFQLKGEDYVFHDIFPDRTVCASAF 351
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD G RIGFA+A
Sbjct: 352 IAVDLPSPSGPLWILGDVFLGKYYTEFDMGNHRIGFADA 390
>gi|388579370|gb|EIM19694.1| aspartyl proteinase [Wallemia sebi CBS 633.66]
Length = 411
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 113/256 (44%), Positives = 165/256 (64%), Gaps = 4/256 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
LP+ NF++AQY+ EIG+GSP Q F+V+ DTGSSNLWVPS+KC SI+C+ H ++ +S
Sbjct: 89 LPVSNFLNAQYYAEIGLGSPEQKFNVVLDTGSSNLWVPSNKC-MSIACFLHRKFNPEESK 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G EI YGSGS+ G QD + + D+ VK+Q+F EAT E L F +FDGI+G
Sbjct: 148 SYKANGTDFEIRYGSGSLKGIVGQDTLAIDDLHVKNQLFAEATSEPGLAFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V D P + N+++QGL+ E VFSF+L + +E + VFGG+D H+KG+
Sbjct: 208 LGYDTISVNDIPPPFYNLIDQGLLDEPVFSFYLTDEQSGKE-SQAVFGGIDHDHYKGQLH 266
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVP+ +KGYW+ EL + G+ + G A +D+GTSL+A PT + +N IG +
Sbjct: 267 YVPLRRKGYWEVELEKLTFGDDEVELENTGAA--IDTGTSLIAIPTDMAEMLNKMIGAKK 324
Query: 315 VVSAECKLVVSQYGDL 330
S + + ++ DL
Sbjct: 325 SWSGQYTVDCNKVDDL 340
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 75/139 (53%), Gaps = 6/139 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E ++ GD V A + L T + +N++ + + G+ +DC+
Sbjct: 278 VELEKLTFGDDEVELENTGAAIDTGTSLIAIPTD--MAEMLNKMIGAKKSWSGQYTVDCN 335
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ +P +SFT G K + LS + YIL + C+S F D+P P GP++I+GDVF+
Sbjct: 336 KVDDLPELSFTFGGKKYPLSGKDYILN----LQGTCVSAFTGLDIPEPLGPIYIIGDVFL 391
Query: 487 GVYHTVFDSGKLRIGFAEA 505
Y TV+D G+ +GFAE+
Sbjct: 392 RRYFTVYDLGRDAVGFAES 410
>gi|451992127|gb|EMD84649.1| hypothetical protein COCHEDRAFT_1189444 [Cochliobolus
heterostrophus C5]
gi|452004574|gb|EMD97030.1| hypothetical protein COCHEDRAFT_1189956 [Cochliobolus
heterostrophus C5]
Length = 399
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 114/250 (45%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ N+++AQYF EI +G+PPQ+F VI DTGSSNLWVPS++C SI+C+ H +Y S S+
Sbjct: 77 VPVSNYLNAQYFSEISLGTPPQSFKVILDTGSSNLWVPSTQCT-SIACFLHDKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ GF S D +++GD+ VK+Q F EAT E L F +FDGI+G
Sbjct: 136 TYQKNGSDFEIRYGSGSMKGFVSNDVLQIGDLKVKNQDFAEATSEPGLAFAFGKFDGILG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ QGL+ E VF+F+L D ++G E FGG+D H+ GK
Sbjct: 196 LGYDTISVNHIVPPFYNMINQGLLDEPVFAFYLGDVAD-KQGSEATFGGIDESHYTGKLI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVDLDAITFGKETAETENVGV--ILDTGTSLIALPSAMAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 313 GFNGQYSVEC 322
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++FT+ F +S YIL+ I+ CIS M D+P P GPL
Sbjct: 316 GQYSVECDKRDSLPDLTFTLTGHNFTISAYDYILE----ISGSCISALMGMDIPEPAGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G + A++
Sbjct: 372 AILGDAFLRKWYSVYDLGNSAVALAKS 398
>gi|121705756|ref|XP_001271141.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
gi|119399287|gb|EAW09715.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
Length = 398
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 187/322 (58%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASALLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPNIHKELL 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 EENSFNDMSRHDVLVDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSEC-GSIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSG +SGF SQDN+++GD+ ++ Q F EAT E L
Sbjct: 122 YLHTKYDSSASSTYKKNGTEFAIRYGSGELSGFVSQDNLKIGDLKIEKQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V VP + NM+ QGL+ E VF+F+L + FG
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLNQGLLDEPVFAFYLGDANKEGDSSVATFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G+D HF G+ T +P+ +K YW+ +L I +G+ + G I+D+GTSL+A P+ +
Sbjct: 242 GIDKDHFTGELTKIPLRRKAYWEVDLDAIALGDNVAELDNTGV--ILDTGTSLIALPSTL 299
Query: 303 VTEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 300 ADLLNKEIGAKKGFTGQYSVEC 321
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++FT+ F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYSVECDKRDSLPDLTFTLSGHNFTIGPYDYTLE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D G +G A+A
Sbjct: 371 AILGDAFLRKYYSVYDLGNHAVGLAKA 397
>gi|194218276|ref|XP_001501986.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 119/251 (47%), Positives = 162/251 (64%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 63 DSEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 122 SSTYQATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLG+ I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 182 LGLGYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDSSYYTGS 239
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPVT +GYWQ + I I +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 240 LHWVPVTTEGYWQIAVDSITINGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGA 298
Query: 312 -----GEGVVS 317
GE V+S
Sbjct: 299 RKDLLGEEVIS 309
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + + SYI D L GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLL----GEEVISCSAIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FT+ F L P YILK + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTMNGVEFPLPPSAYILKEDDS----CISGFEGVDLDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|452840489|gb|EME42427.1| hypothetical protein DOTSEDRAFT_73302 [Dothistroma septosporum
NZE10]
Length = 398
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 111/237 (46%), Positives = 160/237 (67%), Gaps = 4/237 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+CY HS+Y +S+TY
Sbjct: 78 VDNFLNAQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDHSESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+ G+ S+D V++GD+ +KDQ+F EAT E L F RFDGI+GLG
Sbjct: 137 KKNGSDFAIRYGSGSLEGYVSKDTVQIGDLKIKDQLFAEATSEPGLAFAFGRFDGILGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V P + NM++Q L+ E+VF+F+L+ D + + E +FGGV+ H+ G+ T +
Sbjct: 197 YDTISVNGIPPPFYNMIDQDLLDEKVFAFYLS-DTNKGDESEAIFGGVNKDHYTGEMTKI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
P+ +K YW+ +L I G+Q+ + G AI+D+GTSLLA P+ + +N IG +
Sbjct: 256 PLRRKAYWEVDLDAITFGDQTAEIDSTG--AILDTGTSLLALPSTLAELLNKEIGAK 310
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++FT+ F + YIL+ + CIS FM FD+P P GPL
Sbjct: 315 GQYTVDCSKRDSLPDLTFTLTGHNFTIDAYDYILE----VQGSCISAFMGFDIPEPAGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D +G A+A
Sbjct: 371 AILGDAFLRKYYSVYDLENNAVGLAKA 397
>gi|330930051|ref|XP_003302872.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
gi|311321500|gb|EFQ89048.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
Length = 399
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 114/250 (45%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPS+ C SI+CY H++Y S S+
Sbjct: 77 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSTSCN-SIACYLHTKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM++QGL+ E VF+F+L D + ++ E FGG+D + GK
Sbjct: 196 LGYDTISVKGIVPPFYNMLDQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ EL + G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 313 SFNGQYTVEC 322
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C++ ++P+++FT+ F +S YIL+ + CIS M D P P GPL
Sbjct: 316 GQYTVECNKRDSLPDLTFTLSGHNFTISAYDYILE----VQGSCISALMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 372 AILGDAFLRKWYSVYDLGNSVVGLAKA 398
>gi|302657131|ref|XP_003020295.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
gi|306531031|sp|D4DEN7.1|CARP_TRIVH RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291184114|gb|EFE39677.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
Length = 400
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 179/305 (58%), Gaps = 18/305 (5%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL---------- 67
C S L+++ LK++ L+ ++ + ++YMG + +H
Sbjct: 15 CTSAKLHSLKLKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPEQHEQQMFKEQTPIE 70
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
+S ++L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS
Sbjct: 71 AESGHNVL-IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHST 128
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S S+TY++ G I YGSGS+ GF SQD+V++GD+ +K+Q+F EAT E L F
Sbjct: 129 YDSSASSTYSKNGTKFAIRYGSGSLEGFVSQDSVKIGDMTIKNQLFAEATSEPGLAFAFG 188
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
RFDGI+G+GF I+V P + NM++QGL+ E VFSF+L + + FGG D K
Sbjct: 189 RFDGIMGMGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDTK 248
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
HF G T +P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN
Sbjct: 249 HFTGDMTTIPLRRKAYWEVDFDAISLGEDTAALENTGI--ILDTGTSLIALPTTLAEMIN 306
Query: 308 HAIGG 312
IG
Sbjct: 307 TQIGA 311
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+V+FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 317 GQYTLDCAKRDSLPDVTFTVSGHNFTIGPHDYTLE----VSGTCISSFMGMDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 373 AILGDSFLRRYYSVYDLGKGTVGLAKA 399
>gi|70999520|ref|XP_754479.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|74675969|sp|O42630.1|CARP_ASPFU RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase pep2; AltName: Full=Aspartic protease
pep2; Flags: Precursor
gi|2664292|emb|CAA75754.1| cellular aspartic protease [Aspergillus fumigatus]
gi|4200293|emb|CAA10674.1| aspartic protease [Aspergillus fumigatus]
gi|66852116|gb|EAL92441.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|159127496|gb|EDP52611.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus A1163]
Length = 398
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 186/322 (57%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL ++S + ++ L K LD H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMG---IRPNVHQELL 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
L D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVP S C SI+C
Sbjct: 63 EENSLNDMSRHDVLVDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
+ H++Y S S+TY G I YGSG +SGF SQD +++GD+ V Q F EAT E L
Sbjct: 122 FLHNKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L + E FG
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDNSEASFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ T +P+ +K YW+ + I +G+ + G I+D+GTSL+A P+ +
Sbjct: 242 GVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGI--ILDTGTSLIALPSTL 299
Query: 303 VTEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 300 ADLLNKEIGAKKGFTGQYSIEC 321
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+CD+ ++P+++FT+ F + P Y L+ + CIS FM D P P GPL
Sbjct: 315 GQYSIECDKRDSLPDLTFTLAGHNFTIGPYDYTLE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 371 AILGDAFLRKWYSVYDLGNNAVGLAKA 397
>gi|449280945|gb|EMC88160.1| Cathepsin E, partial [Columba livia]
Length = 374
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 113/249 (45%), Positives = 159/249 (63%), Gaps = 5/249 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG+I IG+PPQNF+V+FDTGSSNLWVPS C S +C H++++ +S+T
Sbjct: 47 PLINYLDMEYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACAEHAKFQPSQSST 105
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y IG I YG+GS++G D V V + V +Q F E+ E FL A FDG++GL
Sbjct: 106 YQAIGTPFSIQYGTGSLTGVIGSDQVVVEGLTVNNQQFAESISEPGKAFLDAPFDGVLGL 165
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AV PV+DNM+ Q LV +FS +L+ +P++ GGE++FGG DP F G +
Sbjct: 166 AYPSLAVDGVTPVFDNMMAQNLVELPIFSVYLSTNPESSLGGELLFGGFDPSRFMGTLNW 225
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVT++GYWQ +L +I + + C GC AIVD+GTSL+ GPT V + IG
Sbjct: 226 VPVTQQGYWQIQLDNIQLAG-TVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGATPV 284
Query: 313 EGVVSAECK 321
+G + EC
Sbjct: 285 DGEYAVECN 293
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 76/143 (53%), Gaps = 10/143 (6%)
Query: 367 VEKENVS-AGDSAVCSACEMAVVWVQNQLKQKQTKE-KVLS-YINELCDSLPNPM-GESI 422
++ +N+ AG A C+ A+V L TK+ KVL YI P+ GE
Sbjct: 236 IQLDNIQLAGTVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGA------TPVDGEYA 289
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
++C+ + MP+V+FTI + LS + Y L C SGF D+ PP GPLWILG
Sbjct: 290 VECNNLNVMPDVTFTINGLPYLLSAQAYTLVENSDGMAFCTSGFQGLDIAPPYGPLWILG 349
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
DVF+ +++VFD G R+G A A
Sbjct: 350 DVFIRQFYSVFDRGNNRVGLAPA 372
>gi|367047895|ref|XP_003654327.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
gi|347001590|gb|AEO67991.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
Length = 396
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/239 (48%), Positives = 157/239 (65%), Gaps = 6/239 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ N+M+AQYF EI +G+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPISNYMNAQYFSEITLGTPPQSFKVVLDTGSSNLWVPSVEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S +I YGSGS+SGF SQD + +GD+ VK Q F EAT E L F RFDGI+G
Sbjct: 135 TYKKNGTSFDIRYGSGSLSGFVSQDTLSIGDITVKGQDFAEATSEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MVEQ LV E VF+F+L D E+VFGGVD +KGK T
Sbjct: 195 LGYDTISVNGIVPPFYKMVEQKLVDEPVFAFYL---ADTNGESEVVFGGVDKDRYKGKIT 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P+ +K YW+ + + G+ + G AI+D+GTSL+ P+ + +N +G +
Sbjct: 252 TIPLRRKAYWEVDFESLSYGDDTADFENTG--AILDTGTSLITLPSQLAEMLNAQLGAK 308
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 56/99 (56%), Gaps = 4/99 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + N G+ ++DC + ++ +++F + F L P+ YIL+ I+ C+S F
Sbjct: 301 LNAQLGAKKNFAGQYVLDCSKRDSLEDITFNLAGYNFTLGPQDYILE----ISGSCMSTF 356
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
D P P GPL ILGD F+ Y++++D G +G AEA
Sbjct: 357 TPMDFPAPTGPLAILGDAFLRRYYSIYDLGANTVGLAEA 395
>gi|410986287|ref|XP_003999442.1| PREDICTED: renin [Felis catus]
Length = 407
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 26/339 (7%)
Query: 1 MEQ--KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGG 57
M+Q ++ R L + +SC LPA S RRI LKK + + R + KER +
Sbjct: 1 MDQGSRMPRWGLLLVLCSSCTFGLPADSGAFRRIFLKK-------MPSIRESLKERGVDV 53
Query: 58 AGVSG------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
A + R G+S ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWV
Sbjct: 54 ARLGAEWSQFTKRFSFGNSTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWV 112
Query: 112 PSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKD 170
PS+KC +C HS Y S +S++Y E G + I+YGSG + GF SQD V VG + V
Sbjct: 113 PSTKCSPLYTACEIHSLYDSSESSSYMENGTAFAIHYGSGKVKGFLSQDEVTVGGITVT- 171
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
Q F E T + F+LA+FDGI+G+GF AVG PV+D+++ QG++ E+VFS + +R+
Sbjct: 172 QTFGEVTELPLIPFMLAKFDGILGMGFPAQAVGGVTPVFDHILSQGVLKEDVFSVYYSRN 231
Query: 231 PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
GGE+V GG DP++++G YV V+K G WQ ++ + + +T VCE GC +
Sbjct: 232 SKNSHLLGGEVVLGGSDPQYYQGNFHYVSVSKTGSWQIKMKGVSV-RSATVVCEEGCMVV 290
Query: 289 VDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
VD+G S ++GPT + + +G + + E CK V
Sbjct: 291 VDTGASYISGPTSSLRLLMETLGAKELSRNEYVVNCKQV 329
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 53/86 (61%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + L+ Y+LK G +C D+PPP GP+W
Sbjct: 321 EYVVNCKQVPTLPDISFHLGGRAYTLTSADYVLKDPYGNDGLCTLALHGLDVPPPTGPVW 380
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 381 VLGASFIRKFYTEFDRHNNRIGFALA 406
>gi|315051426|ref|XP_003175087.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
gi|311340402|gb|EFQ99604.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
Length = 401
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/235 (47%), Positives = 149/235 (63%), Gaps = 3/235 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY
Sbjct: 80 IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTY 138
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+ GF SQD+V++GD+ +KDQ+F EAT E L F RFDGI+G+G
Sbjct: 139 HKNGTKFAIRYGSGSLEGFVSQDDVKIGDMTIKDQLFAEATSEPGLAFAFGRFDGIMGMG 198
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F I+V P + M++QGL+ E VFSF+L + + FGG D HF GK T +
Sbjct: 199 FSSISVNGITPPFYKMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDESHFTGKMTTI 258
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
P+ +K YW+ E I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 259 PLRRKAYWEVEFNAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINSQIG 311
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+V+FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 318 GQYTLDCAKRDSLPDVTFTLSGHNFTIGPHDYTLE----VSGTCISSFMGMDFPEPVGPL 373
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A+A
Sbjct: 374 AILGDSFLRRWYSVYDLGKGTVGLAKA 400
>gi|358372259|dbj|GAA88863.1| aspartic protease (PepE) [Aspergillus kawachii IFO 4308]
Length = 398
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 184/311 (59%), Gaps = 17/311 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 EENPINDMSRHDVLVDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E L
Sbjct: 122 YLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L + FG
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDDSVATFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ +P+ +K YW+ +L I +G+ + G I+D+GTSL+A P +
Sbjct: 242 GVDKDHYTGELIKIPLRRKAYWEVDLDAIALGDDVAELDNTGV--ILDTGTSLIALPADL 299
Query: 303 VTEINHAIGGE 313
IN IG +
Sbjct: 300 AEMINAQIGAK 310
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+ ++P+V+FT+ F ++ Y L+ + C+S FM D P P GPL
Sbjct: 315 GQYTVDCDKRSSLPDVTFTLAGHNFTITSYDYTLE----VQGSCVSAFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 371 AILGDAFLRKWYSVYDLGNSAVGLAKA 397
>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 416
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 186/331 (56%), Gaps = 36/331 (10%)
Query: 24 SSNGLRRIGLKK--RRLDLHSLNAARITRK-------ERYMGGAGVSGVRHRL---GDSD 71
S++G+ ++ LKK + L +A + K + + G+ + R R G SD
Sbjct: 17 SASGIHKLKLKKLPQVASNQHLESAYLAEKYGAQAPAQMPLAGSADAAGRMRFSRPGQSD 76
Query: 72 EDI---------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC 116
+D+ +PL NFM+AQY+ EI IG+PPQ F VI DTGSSNLWVPSS+C
Sbjct: 77 DDLFWTQEESIIANGGHGVPLTNFMNAQYYTEIDIGTPPQTFKVILDTGSSNLWVPSSQC 136
Query: 117 YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
SI+C+ H++Y S S++Y G I YGSGS+ GF S D++ GD+ + F EA
Sbjct: 137 T-SIACFLHTKYDSSASSSYKANGTEFSIQYGSGSMEGFVSNDDIVFGDMSLSSVDFAEA 195
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T+E L F +FDGI+GL + IAV PV+ +V QG++SE VFSF L D +G
Sbjct: 196 TKEPGLAFAFGKFDGILGLAYDTIAVNHITPVFYELVNQGIISEPVFSFRLGSSED--DG 253
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GE +FGG+DP + GK Y PV +K YW+ EL + G+ + G A +D+GTSL+
Sbjct: 254 GEAIFGGIDPSAYSGKIDYAPVRRKAYWEVELEKVSFGDDDLELENTGAA--IDTGTSLI 311
Query: 297 AGPTPVVTEINHAIGGE----GVVSAECKLV 323
A PT V +N IG + G + +C V
Sbjct: 312 ALPTDVAEMLNTQIGAKKSWNGQYTVDCAKV 342
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 70/140 (50%), Gaps = 6/140 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E VS GD + A + L T V +N + + G+ +DC
Sbjct: 283 VELEKVSFGDDDLELENTGAAIDTGTSLIALPTD--VAEMLNTQIGAKKSWNGQYTVDCA 340
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++P +P+++F +K + L Y+L+ + CIS F D+ P G LWI+GDVF+
Sbjct: 341 KVPDLPDLTFYFNEKPYPLKGTDYVLE----VQGTCISAFTGLDINLPGGSLWIIGDVFL 396
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
Y TV+D G+ +GFA +A
Sbjct: 397 RRYFTVYDLGRDAVGFATSA 416
>gi|449481456|ref|XP_002189698.2| PREDICTED: cathepsin E-A-like [Taeniopygia guttata]
Length = 405
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 108/236 (45%), Positives = 161/236 (68%), Gaps = 2/236 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G + +G+PPQ+F+V+FDTGSSN WVPS+ C S +C H ++KS KS++Y
Sbjct: 73 LYDYMNAQYYGVVSVGTPPQSFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFKSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YGSG + G +D +++ ++ +K Q F E+ E TF+LA FDG++GLG
Sbjct: 132 EHGGEAFSLQYGSGQLLGIAGKDTLQISNISIKGQDFGESVFEPGATFVLAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQHLVEEPVFSFYLKRGEDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
PVT+K YWQ + +I I + T C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHMNNIKIQGRVT-FCSHGCEAIVDSGTSLITGPSSQIRRLQAYIGA 306
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 62/91 (68%)
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P+ GE ++DC R+ ++P++SFTIG + + L+ EQYI+K C+SGF + D+P
Sbjct: 308 PSNTGEFLVDCRRLSSLPHISFTIGHREYKLAAEQYIIKESIDDQTFCMSGFQSLDIPTR 367
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
G LWILGDVFM ++ +FD G R+GFA+A
Sbjct: 368 TGSLWILGDVFMSAFYCIFDRGNDRVGFAKA 398
>gi|241687194|ref|XP_002412838.1| aspartyl protease, putative [Ixodes scapularis]
gi|215506640|gb|EEC16134.1| aspartyl protease, putative [Ixodes scapularis]
Length = 320
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 118/230 (51%), Positives = 155/230 (67%), Gaps = 3/230 (1%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+Y+G I IG+PPQ+F VIFDTGS+NLW+PSSKC + C H RY S +S+TY G++
Sbjct: 3 EYYGPITIGTPPQDFQVIFDTGSANLWLPSSKCT-TKYCLHHHRYDSSRSSTYEADGRNF 61
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSG++ GF S+D +G V Q EA G + L A FDGI+GL + IAV
Sbjct: 62 TIVYGSGNVEGFISKDVCRIGSAKVSGQPLGEALVVGGESLLEAPFDGILGLAYPSIAVD 121
Query: 204 DAVPVWDNMVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VPV+DNM++QGL+ E+ VFS +LNRDP ++EGGEI+FGG+D H+KG TYVPVT KG
Sbjct: 122 GVVPVFDNMMKQGLLGEQNVFSVYLNRDPSSKEGGEILFGGIDHDHYKGSITYVPVTAKG 181
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQF + D + +C+ GC AI D+GTSL+ GP V +N +GG
Sbjct: 182 YWQFHV-DGASKSVPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYLGG 230
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ G+ ++DCD++ ++PNV+FTI K F+L + Y+LK + +C+SGF
Sbjct: 224 LNQYLGGTKTEGGQYLLDCDKLESLPNVTFTISGKEFSLRSKDYVLKINQQGQTLCVSGF 283
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M +P PLWILGDVF+G Y+T+FD + R+GFAE A
Sbjct: 284 MGLGMPQ---PLWILGDVFLGPYYTIFDRDQDRVGFAEVA 320
>gi|407260952|ref|XP_003946102.1| PREDICTED: renin-1-like [Mus musculus]
Length = 400
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 189/322 (58%), Gaps = 30/322 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 3 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 53
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 54 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 108
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 109 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 167
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 168 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRKTKG 227
Query: 234 EE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+
Sbjct: 228 SHLLGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDT 286
Query: 292 GTSLLAGPTPVVTEINHAIGGE 313
G+S ++ PT + I A+G +
Sbjct: 287 GSSFISAPTSSLKLIMQALGAK 308
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 314 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 373
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 374 VLGATFIRKFYTEFDRHNNRIGFALA 399
>gi|283806612|ref|NP_001164557.1| pepsin II-2/3 precursor [Oryctolagus cuniculus]
gi|129781|sp|P27821.1|PEPA2_RABIT RecName: Full=Pepsin II-2/3; AltName: Full=Pepsin A; Flags:
Precursor
gi|165600|gb|AAA85369.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 162/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
++N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 MENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E SLTFL A FDGI+GL
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPSLTFLFAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISSSDATPVFDNMWNEGLVSQDLFSVYLSS--DDEKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLL GPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLTGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YILK + C SG ++ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CTSGLEGMNVDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|30575834|gb|AAP32823.1| aspartyl proteinase [Paracoccidioides brasiliensis]
Length = 400
Score = 228 bits (581), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 125/291 (42%), Positives = 176/291 (60%), Gaps = 10/291 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPLKNFMDA 83
L +I L ++ LD ++ ++YMG + D+ + + + NF++A
Sbjct: 26 LNKISLSQQ-LDHANIETQVKALGQKYMGVRPSQHLNEMFKDTSKASGGHSVLVDNFLNA 84
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY HS+Y S S+T+ + G
Sbjct: 85 QYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACYLHSKYDSSASSTHRKNGTEF 143
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS+SGF SQD + +GD+ V+ Q F EAT E L F RFDGI+GLG+ I+V
Sbjct: 144 AIRYGSGSLSGFVSQDVLRIGDMTVESQDFAEATSEPGLAFAFGRFDGILGLGYDTISVN 203
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VP + MV QGL+ E VFSF+L N D D ++ E FGG+D H+ G T + + +K
Sbjct: 204 RIVPTFYLMVNQGLLDEPVFSFYLGNSDTDGDD-SEATFGGIDKDHYTGNLTMISLRRKA 262
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YW+ +L I G+++ + G I+D+GTSLLA P+ V +N IG +
Sbjct: 263 YWEVDLDAITFGSETAELENTGV--ILDTGTSLLALPSTVAEILNQKIGAK 311
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + + P+++FT+ F + YIL+ + CIS FM D P P GPL
Sbjct: 316 GQYTVDCSKRSSFPDITFTLAGHNFTIGSYDYILE----VQGSCISSFMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +IG A+A
Sbjct: 372 AILGDAFLRRWYSVYDLGNHQIGLAKA 398
>gi|390601248|gb|EIN10642.1| endopeptidase [Punctularia strigosozonata HHB-11173 SS5]
Length = 412
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 118/253 (46%), Positives = 159/253 (62%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI +G+PPQ+F VI DTGSSNLWVPS KC SI+C+ H +Y S +S+
Sbjct: 91 VPLSNFMNAQYFSEITLGTPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHQKYDSSQSS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ GF S+D + +GD+ +K Q F EAT+E L F +FDGI+G
Sbjct: 150 SYKANGSEFSIQYGSGSMEGFVSRDTLTIGDLTIKGQDFAEATKEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + +M+ L+ + VFSF L E+GGE VFGG+D ++GK T
Sbjct: 210 LGYDTISVNHITPPFYSMINAALLDDPVFSFRLGS--SEEDGGEAVFGGIDSSAYEGKIT 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV +K YW+ EL I G+ + G A +D+GTSL+A PT + +N IG
Sbjct: 268 YVPVRRKAYWEVELEKIKFGDDELELENTGAA--IDTGTSLIALPTDLAEMLNAQIGATK 325
Query: 313 --EGVVSAECKLV 323
G + EC V
Sbjct: 326 SWNGQYTVECSKV 338
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C ++P +P +SF + + L YIL+ + C+S F D+ P G L
Sbjct: 329 GQYTVECSKVPDLPELSFYFDGQAYPLKGTDYILE----VQGTCMSAFTGLDINLPGGSL 384
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y TV+D G+ +GFA++
Sbjct: 385 WIVGDVFLRKYFTVYDLGRDAVGFAKS 411
>gi|291409616|ref|XP_002721074.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 126/298 (42%), Positives = 178/298 (59%), Gaps = 22/298 (7%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL + L HS N A +Y A + E ++N+MDA+YFG I I
Sbjct: 36 GLLQDYLKTHSPNPAT-----KYFPNAAYA---------KESTEKMENYMDAEYFGTISI 81
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ+F+VIFDTGSSNLWVPS C S++C FH ++ +KS+TY K+ I YG+GS
Sbjct: 82 GTPPQDFTVIFDTGSSNLWVPSIYCS-SLACAFHKQFNPKKSSTYQATDKTVSIAYGTGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
++G D V+VG + Q+F + E TF+ A FDGI+GLG+ I+ DA PV+DN
Sbjct: 141 MTGILGYDIVKVGSIDDTHQIFGLSETEPGDTFVFAPFDGILGLGYPSISSSDATPVFDN 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M + LVSE++FS +L+ D ++G ++FGG+D ++KG +VPV+ +GYWQF + +
Sbjct: 201 MWDHRLVSEDLFSVYLSS--DDKKGSLVMFGGIDESYYKGSLHWVPVSYEGYWQFTMDSV 258
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI----GGEGVVSAECKLVVS 325
I N T C C AI+D+GTSLLAGPT +++I I EG +C V S
Sbjct: 259 TI-NGKTIACADSCQAIIDTGTSLLAGPTNAISKIQRHIRAYDNSEGEAIVKCSDVKS 315
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + C+ A++ L T +S I + N GE+I+ C + ++P+V
Sbjct: 262 GKTIACADSCQAIIDTGTSLLAGPTN--AISKIQRHIRAYDNSEGEAIVKCSDVKSLPDV 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YILK + VC SGF DL G LWILGDVF+ Y TVFD
Sbjct: 320 VFTIHGVKYPLPASAYILKEDD----VCTSGFEGMDLDTSSGELWILGDVFIRKYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNKLGLAPA 386
>gi|425767355|gb|EKV05929.1| Vacuolar protease A [Penicillium digitatum PHI26]
gi|425779798|gb|EKV17829.1| Vacuolar protease A [Penicillium digitatum Pd1]
Length = 399
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 186/311 (59%), Gaps = 27/311 (8%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------- 76
+ R+ L K +L+ H+++A ++YMG + +H+ D + P
Sbjct: 21 VHRLKLNKVPLSEQLNTHNIDAHLHNLGQKYMG---IRPEKHQDLFHDTSLNPASGHDVL 77
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 78 VDNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G EI YGSGS+SGF S+D +++GD+ V+ Q F EAT E L F RFDGI+GLG
Sbjct: 137 QKNGTDFEIRYGSGSLSGFVSRDTLQIGDLKVEGQDFAEATNEPGLAFAFGRFDGILGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE---IVFGGVDPKHFKGKH 253
+ I+V VP + M++Q LV E VF+F+L DA + G+ FGG+D H+ G+
Sbjct: 197 YDTISVNKMVPPFYQMIKQKLVDEPVFAFYLG---DANKDGDNSVATFGGIDESHYTGEL 253
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
+PV +K YW+ EL I +GN + + G I+D+GTSL+A P+ + +N IG
Sbjct: 254 IKIPVRRKAYWEVELNSIALGNNVAELDDTGV--ILDTGTSLIALPSTMAELLNKEIGAT 311
Query: 313 ---EGVVSAEC 320
G S EC
Sbjct: 312 KGFTGQYSVEC 322
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++FT+G F + P YIL+ + CIS FM D P P GPL
Sbjct: 316 GQYSVECDKRDSLPDLTFTLGGHNFTIGPHDYILE----VQGSCISSFMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 372 AILGDAFLRRWYSVYDVGNNAVGLAKA 398
>gi|326911558|ref|XP_003202125.1| PREDICTED: cathepsin E-A-like [Meleagris gallopavo]
Length = 404
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G I +G+PPQ+F+V+FDTGSSN WVPS C S +C H R+KS S++Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D +++ ++ +K Q F E+ E +TF LA FDG++GLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 317 SAE 319
S E
Sbjct: 311 SGE 313
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 42/100 (42%), Positives = 65/100 (65%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 300 LQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHEYKLTAEQYVVKESIDDQTFCMSGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ D+P G LWILGDVFM ++ +FD G R+GFA++A
Sbjct: 360 QSLDIPTRSGSLWILGDVFMSAFYCIFDRGNDRVGFAKSA 399
>gi|13676837|ref|NP_112469.1| renin-1 precursor [Mus musculus]
gi|132327|sp|P06281.1|RENI1_MOUSE RecName: Full=Renin-1; AltName: Full=Angiotensinogenase; AltName:
Full=Kidney renin; Flags: Precursor
gi|53931|emb|CAA34636.1| unnamed protein product [Mus musculus]
gi|26342875|dbj|BAC35094.1| unnamed protein product [Mus musculus]
gi|26351563|dbj|BAC39418.1| unnamed protein product [Mus musculus]
gi|38512029|gb|AAH61053.1| Renin 1 structural [Mus musculus]
gi|148707703|gb|EDL39650.1| mCG131545 [Mus musculus]
Length = 402
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 9/310 (2%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L ++ LW + C LP + RI LKK + + R R GV R
Sbjct: 8 LWALLLLW--SPCTFSLPTRTATFERIPLKKMP-SVREILEERGVDMTRLSAEWGVFTKR 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 PSLTNLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACG 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T +
Sbjct: 124 IHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIP 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR GGE+V GG
Sbjct: 183 FMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL-LGGEVVLGG 241
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+S ++ PT +
Sbjct: 242 SDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGSSFISAPTSSL 300
Query: 304 TEINHAIGGE 313
I A+G +
Sbjct: 301 KLIMQALGAK 310
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 316 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVW 375
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 376 VLGATFIRKFYTEFDRHNNRIGFALA 401
>gi|395821502|ref|XP_003784077.1| PREDICTED: gastricsin-like [Otolemur garnettii]
Length = 390
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 183/302 (60%), Gaps = 4/302 (1%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L ++ +CL L S GL R+ L+K + ++ + + G ++ G+
Sbjct: 4 LVLILACLYL---SEGLERVILRKGKSIRQAMEEQGVLEEYLKNHPKGDPVAKYHFGNYA 60
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
P+ N+M++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H +
Sbjct: 61 VAYEPITNYMESFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHHVFNPS 119
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S+T++ G++ ++YGSGS++ D V + ++VV +Q F + E ++ F + FDG
Sbjct: 120 QSSTFSNNGQTYTLSYGSGSLTVVMGYDTVTIQNIVVNNQEFGLSENEPTVPFYYSAFDG 179
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+ + IAVG+A V +M++Q +++ +FSF+ +R P A+ GGE++ GGVD + + G
Sbjct: 180 ILGMAYPAIAVGNAPTVVQDMLQQNQLTQPIFSFYFSRQPTAQYGGELILGGVDSQLYSG 239
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ + PVT++ YWQ + + IGNQ+TG+C GC IVD+GTSLL P ++ A G
Sbjct: 240 EIVWTPVTQEMYWQIAIQEFSIGNQATGLCSQGCQGIVDTGTSLLTVPQQYISSFVEATG 299
Query: 312 GE 313
+
Sbjct: 300 AQ 301
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 45/89 (50%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++ C + MP ++FTIG L P Y+L C G L G P
Sbjct: 306 GDFVVSCSNVQNMPTIAFTIGGAQLPLPPSTYVLNNNG----YCTLGIEPTYLSSQSGEP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++VFD +GFA +A
Sbjct: 362 LWILGDVFLREYYSVFDMANNMVGFALSA 390
>gi|283806610|ref|NP_001164556.1| pepsin II-4 precursor [Oryctolagus cuniculus]
gi|129787|sp|P28713.1|PEPA4_RABIT RecName: Full=Pepsin II-4; AltName: Full=Pepsin A; Flags: Precursor
gi|22218076|dbj|BAC07515.1| pepsinogen II-4 [Oryctolagus cuniculus]
Length = 387
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 161/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GL
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISSSDATPVFDNMWNEGLVSQDLFSVYLSSDD--EKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLL GPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N +GE++I C I ++P++
Sbjct: 262 GETIACADSCQAIVDTGTSLLTGPTS--AISNIQSYIGASKNLLGENVISCSAIDSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YILK + C SG ++ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILKEDDD----CTSGLEGMNVDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNQLGLAAA 386
>gi|409050032|gb|EKM59509.1| hypothetical protein PHACADRAFT_250062 [Phanerochaete carnosa
HHB-10118-sp]
Length = 407
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 135/334 (40%), Positives = 186/334 (55%), Gaps = 30/334 (8%)
Query: 15 LASCLLLP--ASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
LA ++LP A++ G+ + L K + S + A + M GAG +G RL
Sbjct: 6 LAPLVILPFAAAAAGVHKFKLHKLPPVSQDFAFESAHLAEKYGGQVPMLGAGGAGRNVRL 65
Query: 68 GDSDED--------------ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS 113
D LPL+NFM+AQYF I IG+PPQ+F+VI DTGSSNLWVPS
Sbjct: 66 SRPTPDDGLFRTQEEFTSGHTLPLQNFMNAQYFTTIEIGTPPQSFNVILDTGSSNLWVPS 125
Query: 114 SKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
++C SI+C+ H +Y S S+TY G I YGSGS+ GF S+D + +GD+ + Q F
Sbjct: 126 TQCT-SIACFLHKKYDSGSSSTYKPNGSEFSIQYGSGSMEGFVSRDVLTMGDITIGQQDF 184
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EAT+E L F +FDGI+GL + IAV P NM E+GL+ + VF+F L
Sbjct: 185 AEATKEPGLAFAFGKFDGILGLAYDTIAVNHITPPHYNMFEKGLIEKPVFAFRLGS--TE 242
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
E+ GE FGG+D F+GK VPV +K YW+ EL + +G+ + + G A +D+GT
Sbjct: 243 EDAGEATFGGIDESAFEGKLHRVPVRRKAYWEVELEKVRLGDDELELEDTGAA--IDTGT 300
Query: 294 SLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
SL+A PT + IN IG + G + EC V
Sbjct: 301 SLIALPTDMAEMINAQIGAKRGWNGQYTVECSTV 334
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C +P +P ++ K + L YIL+ + C+S F D+P L
Sbjct: 325 GQYTVECSTVPDLPALTLYFDSKPYVLQGTDYILE----VQGTCMSSFTPLDMPNGMN-L 379
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ ++TV+D G +GFA+A
Sbjct: 380 WIIGDVFLRKFYTVYDFGDDTVGFAKA 406
>gi|119567604|gb|ABL84270.1| aspartic protease [Musca domestica]
Length = 379
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 120/262 (45%), Positives = 167/262 (63%), Gaps = 12/262 (4%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
SDE PL+N ++ +Y+G+I IG+PPQ F V+FDTGSSNLWVPSS C+ + I+C H++Y
Sbjct: 57 SDE---PLENSLNMKYYGDITIGTPPQKFVVLFDTGSSNLWVPSSHCWIWDIACKKHNQY 113
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S+TY + G+ I+YGSGS+SGF SQD+V V + +K+QVF EA E +F A
Sbjct: 114 NHDDSSTYVKNGELISISYGSGSMSGFLSQDDVTVEGLTIKNQVFAEAMNEPGNSFTDAN 173
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI G+ ++ +A + VP + NM QGLV +FSF LNRD + +GG+++ GGVD
Sbjct: 174 FDGIFGMAYQSLAEDNVVPPFYNMFAQGLVDANMFSFLLNRDGTSTDGGQMILGGVDSSL 233
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G TYVPV+ +GYWQFE+ I QS +C+ C AI D+GTSL+ P+ +N
Sbjct: 234 YTGDITYVPVSSQGYWQFEVTSGAIKGQS--ICD-NCQAIADTGTSLIVAPSDAYNTLNA 290
Query: 309 AIGG-----EGVVSAECKLVVS 325
IG +G +C V S
Sbjct: 291 EIGATYNEDDGNYYVDCSAVDS 312
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 48/89 (53%), Gaps = 13/89 (14%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF--MAFDLPPPRG 476
G +DC + ++P+V+F IG F L YI+ + C+S F M D
Sbjct: 301 GNYYVDCSAVDSLPDVTFVIGGTTFTLPASAYIVT----VDGNCMSSFTYMGTDF----- 351
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+G Y+TVFD R+GFAEA
Sbjct: 352 --WILGDVFIGKYYTVFDFANNRVGFAEA 378
>gi|449549767|gb|EMD40732.1| hypothetical protein CERSUDRAFT_44393 [Ceriporiopsis subvermispora
B]
Length = 413
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 187/329 (56%), Gaps = 34/329 (10%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-------GAGVSGVRHRLGD------- 69
+++G+ R+ L K + E+Y G GAG G RLG
Sbjct: 15 AADGVHRLKLHKVPPTTSNPALESAYLAEKYGGQAQSPLMGAGGYGRNVRLGRPTHQDGE 74
Query: 70 ----SDEDIL-------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+ ED++ PL NFM+AQYF EI +G+PPQ+F V+ DTGSSNLWVPS+KC
Sbjct: 75 ELFWTQEDLVTEGGHTVPLSNFMNAQYFAEITLGTPPQSFKVVLDTGSSNLWVPSTKCT- 133
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+C+ H++Y S S++Y G EI+YGSGS+ GF SQD + +GD+ + + F EAT+
Sbjct: 134 SIACFLHAKYDSSASSSYKANGTEFEIHYGSGSMEGFISQDVLSIGDISINNLDFAEATK 193
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F +FDGI+GL + I+V VP + +MV + L+ VFSF L E+GGE
Sbjct: 194 EPGLAFAFGKFDGILGLAYDTISVNHVVPPFYHMVNKNLIDSPVFSFRLGS--SEEDGGE 251
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGGVD + GK YVPV +K YW+ EL I +G+ + G A +D+GTSL+A
Sbjct: 252 AIFGGVDESAYTGKIDYVPVRRKAYWEVELQKISLGDDELELENTGAA--IDTGTSLIAL 309
Query: 299 PTPVVTEINHAIGGE----GVVSAECKLV 323
P+ + +N IG + G + EC+ V
Sbjct: 310 PSDMAEMLNTQIGAKRSWNGQYTVECEKV 338
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 55/88 (62%), Gaps = 5/88 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP- 477
G+ ++C+++P +P+++FT K + L YIL+ + C+S F D+ P G
Sbjct: 329 GQYTVECEKVPDLPDLTFTFDGKDYPLKGTDYILE----VQGTCMSAFTGLDINMPDGSQ 384
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+WI+GDVF+ Y+TV+D G+ +GFA+A
Sbjct: 385 IWIVGDVFLRRYYTVYDLGRDAVGFAKA 412
>gi|321250483|ref|XP_003191823.1| endopeptidase [Cryptococcus gattii WM276]
gi|317458290|gb|ADV20036.1| Endopeptidase, putative [Cryptococcus gattii WM276]
Length = 432
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 118/255 (46%), Positives = 159/255 (62%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF +I +G+P Q F VI DTGSSNLWVPS C SI+C+ HS+Y S +S+
Sbjct: 111 VPLSNYMNAQYFAQIELGTPAQTFKVILDTGSSNLWVPSVGCT-SIACFLHSKYDSSQSS 169
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD + +GD+ +K Q F EAT+E L F +FDGI+G
Sbjct: 170 TYKANGSDFEIHYGSGSLEGFISQDTLAIGDLAIKGQDFAEATKEPGLAFAFGKFDGILG 229
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + NM+ Q L+ + VFSF L + +GGE +FGG+D + G
Sbjct: 230 LAYDTISVNHIVPPFYNMLNQDLLDDPVFSFRLGSSEN--DGGEAIFGGIDKSAYSGSLH 287
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G+ + G A +D+GTSL+ PT V +N IG E
Sbjct: 288 YVPVRRKGYWEVELESISFGDDELELENTGAA--IDTGTSLIVMPTDVAEMLNKEIGAEK 345
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 346 SWNGQYTVDCNTVPS 360
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 77/139 (55%), Gaps = 6/139 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E++S GD + A + L T V +N+ + + G+ +DC+
Sbjct: 299 VELESISFGDDELELENTGAAIDTGTSLIVMPTD--VAEMLNKEIGAEKSWNGQYTVDCN 356
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
+P++P ++FT K + LS E YIL G CIS F D+PPP GPL+I+GDVF+
Sbjct: 357 TVPSLPELAFTFDGKAYKLSGEDYILNAGG----TCISSFTGMDIPPPMGPLYIVGDVFL 412
Query: 487 GVYHTVFDSGKLRIGFAEA 505
Y+TV+D G+ +GFA++
Sbjct: 413 RKYYTVYDLGRNAVGFAKS 431
>gi|146386352|gb|ABQ23964.1| cathepsin D [Oryctolagus cuniculus]
Length = 292
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 108/222 (48%), Positives = 157/222 (70%), Gaps = 7/222 (3%)
Query: 104 TGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVE 162
TGSSNLWVPS C I+C+ H +Y S+KS+TY + G + +I+YGSGS+SG+ SQD V
Sbjct: 1 TGSSNLWVPSVHCKLLDIACWIHHKYNSKKSSTYVKNGTTFDIHYGSGSLSGYLSQDTVS 60
Query: 163 V-----GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGL 217
V + V+ Q+F EAT++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q L
Sbjct: 61 VPCTASSSIQVQKQIFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKL 120
Query: 218 VSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQS 277
V + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y+ VT+K YWQ + + +G+
Sbjct: 121 VEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQLNVGSGL 180
Query: 278 TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
T +CEGGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 181 T-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPLIQGE 221
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 55/79 (69%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE II C+++ ++P V+ +G + + LS E Y LK +G +C+SGFM D+P
Sbjct: 214 AVPLIQGEYIIPCEKVSSLPPVTLKLGGRDYTLSSEDYTLKVSQGGKTICLSGFMGMDIP 273
Query: 473 PPRGPLWILGDVFMGVYHT 491
PP GPLWILGDVF+G Y+T
Sbjct: 274 PPAGPLWILGDVFIGRYYT 292
>gi|24653643|ref|NP_610961.1| CG10104 [Drosophila melanogaster]
gi|7303185|gb|AAF58249.1| CG10104 [Drosophila melanogaster]
Length = 404
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 185/318 (58%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W+L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWLLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F
Sbjct: 116 CASTMVACRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R+ + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
+GG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 DGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCDSVPDLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
>gi|200688|gb|AAA40043.1| renin (Ren-1-d) [Mus musculus]
gi|148669208|gb|EDL01155.1| mCG129412 [Mus musculus]
Length = 402
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 189/320 (59%), Gaps = 29/320 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 8 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 58
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 59 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 113
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 114 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 172
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 173 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL 232
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+
Sbjct: 233 -LGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGS 290
Query: 294 SLLAGPTPVVTEINHAIGGE 313
S ++ PT + I A+G +
Sbjct: 291 SFISAPTSSLKLIMQALGAK 310
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 316 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 375
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 376 VLGATFIRKFYTEFDRHNNRIGFALA 401
>gi|118082412|ref|XP_416090.2| PREDICTED: cathepsin E-A-like [Gallus gallus]
Length = 404
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G I +G+PPQ+F+V+FDTGSSN WVPS C S +C H R+KS S++Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D +++ ++ +K Q F E+ E +TF LA FDG++GLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHLNNIKIQGRVV-FCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 317 SAE 319
S E
Sbjct: 311 SGE 313
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/100 (43%), Positives = 66/100 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 300 LQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHDYKLTAEQYVVKESIDDQTFCMSGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ D+P GPLWILGDVFM ++ +FD G R+GFA++A
Sbjct: 360 QSLDIPTHNGPLWILGDVFMSAFYCIFDRGNDRVGFAKSA 399
>gi|195485971|ref|XP_002091310.1| GE13586 [Drosophila yakuba]
gi|194177411|gb|EDW91022.1| GE13586 [Drosophila yakuba]
Length = 404
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 173/293 (59%), Gaps = 19/293 (6%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
P++ + ++G++ RL L R E G+ + PL N+
Sbjct: 36 FPSARHRFEKLGIRMDRLRLKYAEEVSQFRGE---------------GNLEVKSTPLSNY 80
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
+DAQYFG I IG+PPQ+F VIFDTGSSNLWVPS+ C ++C H+RY +++S ++
Sbjct: 81 LDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSATCASRMVACRVHNRYFAKRSTSHQVR 140
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS+ GF S D V V + ++DQ F EAT FL A+FDGI GLG+R
Sbjct: 141 GDRFAIHYGSGSLFGFLSTDTVRVAGLEIRDQTFAEATEMPGPIFLAAKFDGIFGLGYRS 200
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I++ P + M+EQGL+++ +FS +L+R + +EGG I FGG +P ++ G TYV V+
Sbjct: 201 ISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHGE-KEGGAIFFGGSNPHYYTGNFTYVQVS 259
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ YWQ ++ +I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 260 HRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 62/99 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ ++ C+ I +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCENISALPKITFTLGGRTFFLESHEYVFRDIYQDRRICSSAF 363
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
Length = 412
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 156/255 (61%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI +G+PPQ F VI DTGSSNLWVPS KC SI+C+ H++Y S S+
Sbjct: 91 VPLSNFMNAQYFAEISLGTPPQTFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSSSSS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF SQD +GD+ V F EAT+E L F +FDGI+G
Sbjct: 150 TYKANGTEFSIQYGSGSMEGFVSQDTFRIGDLTVDGLDFAEATKEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + IAV P + +++ +GLV E VFSF L D +GGE +FGGVD + GK
Sbjct: 210 LAYDTIAVNHITPPFYHLINKGLVDEPVFSFRLGSSED--DGGEAIFGGVDDSAYTGKIQ 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV +K YW+ EL + +G+ + G A +D+GTSL+A PT + IN IG
Sbjct: 268 YVPVRRKAYWEVELEKVSLGDDVLELESTGAA--IDTGTSLIALPTDIAEMINTQIGATK 325
Query: 313 --EGVVSAECKLVVS 325
G + +C V S
Sbjct: 326 SWNGQYTVDCAKVPS 340
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 72/140 (51%), Gaps = 6/140 (4%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E VS GD + A + L T + IN + + G+ +DC
Sbjct: 279 VELEKVSLGDDVLELESTGAAIDTGTSLIALPTD--IAEMINTQIGATKSWNGQYTVDCA 336
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++P++P+++FT G + L YIL+ + CIS F D+ P G LWI+GDVF+
Sbjct: 337 KVPSLPDLTFTFGGNPYVLKGTDYILE----VQGTCISSFTGLDINVPGGSLWIVGDVFL 392
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
Y+TV+D G+ +GFA AA
Sbjct: 393 RKYYTVYDHGRDAVGFALAA 412
>gi|46395759|sp|Q800A0.1|CATE_RANCA RecName: Full=Cathepsin E; Flags: Precursor
gi|29647357|dbj|BAC75398.1| cathepsin E [Rana catesbeiana]
Length = 397
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 155/236 (65%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG+I IG+PPQ F+VIFDTGSSNLWVPS C S +C H+RY+ +S T
Sbjct: 65 PLMNYLDVEYFGQISIGTPPQQFTVIFDTGSSNLWVPSIYCT-SQACTKHNRYRPSESTT 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I YG+G+++G D V V + V+ Q F E+ E TF + FDGI+GL
Sbjct: 124 YVSNGEAFFIQYGTGNLTGILGIDQVTVQGITVQSQTFAESVSEPGSTFQDSNFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AV + +PV+DNM+ Q LV +F ++NRDP++ +GGE+V GG D F G+ +
Sbjct: 184 AYPNLAVDNCIPVFDNMIAQNLVELPLFGVYMNRDPNSADGGELVLGGFDTSRFSGQLNW 243
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VP+T +GYWQ ++ I + Q C GC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 244 VPITVQGYWQIQVDSIQVAGQVI-FCSDGCQAIVDTGTSLITGPSGDIEQLQNYIG 298
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/103 (46%), Positives = 66/103 (64%), Gaps = 8/103 (7%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
E++ +YI + N GE + C + MP+V+FTI ++L+PEQY+L+ G G
Sbjct: 291 EQLQNYI-----GVTNTNGEYGVSCSTLSLMPSVTFTINGLDYSLTPEQYMLEDGGG--- 342
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
C SGF D+ PP GPLWILGDVF+G Y++VFD G R+GFA
Sbjct: 343 YCSSGFQGLDISPPSGPLWILGDVFIGQYYSVFDRGNNRVGFA 385
>gi|149245862|ref|XP_001472682.1| PREDICTED: renin-1-like isoform 1 [Mus musculus]
Length = 425
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 189/320 (59%), Gaps = 29/320 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 31 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 81
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 82 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 136
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 137 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 195
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 196 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL 255
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+
Sbjct: 256 -LGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGS 313
Query: 294 SLLAGPTPVVTEINHAIGGE 313
S ++ PT + I A+G +
Sbjct: 314 SFISAPTSSLKLIMQALGAK 333
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 339 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPYRRDKLCTLALHAMDIPPPTGPVW 398
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 399 VLGATFIRKFYTEFDRHNNRIGFALA 424
>gi|307175238|gb|EFN65290.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 357
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/256 (46%), Positives = 151/256 (58%), Gaps = 6/256 (2%)
Query: 67 LGDSDEDI--LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
L DSD+D + L N+ + Y+G I IG+PPQ F VIFDTGS+NLW+PS KC + +C
Sbjct: 21 LNDSDDDFPSVILSNYQNINYYGVITIGTPPQEFKVIFDTGSANLWIPSKKCNLT-ACLI 79
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR-EGSLT 183
H++Y S SNTY +I Y + I G S D V V V++Q F E T
Sbjct: 80 HNQYNSTASNTYIAKNALIQIKYFNSIIDGLISTDIVNVAGFNVQNQTFAELTNMSNEEL 139
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A FDGI+GL + I+ + +PV+DNMV Q LVS +FSF+LNRDP AE GE + GG
Sbjct: 140 FLPAPFDGILGLAYSYISDNNIIPVFDNMVNQNLVSSHIFSFYLNRDPSAELDGEFILGG 199
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H+ G TYVPVT KG+WQF + I + N S +C+ C AI D+G GPT V
Sbjct: 200 SDPAHYDGNFTYVPVTHKGFWQFTMDKIEVNNIS--LCQSSCQAIADTGMGETYGPTSDV 257
Query: 304 TEINHAIGGEGVVSAE 319
IN IG + E
Sbjct: 258 KTINELIGTTNIDGME 273
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 44/85 (51%), Gaps = 4/85 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INEL + N G ++C RIP +P + F +G K FNL+ + YI++ + C S F
Sbjct: 260 INELIGT-TNIDGMERVNCSRIPELPTIRFILGGKAFNLTGKDYIIQFPDEGNTSCRSSF 318
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHT 491
+ +D W LG F+G+ T
Sbjct: 319 LGYDFKEFN---WELGVAFIGIVFT 340
>gi|261194088|ref|XP_002623449.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239588463|gb|EEQ71106.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239606974|gb|EEQ83961.1| aspartyl proteinase [Ajellomyces dermatitidis ER-3]
gi|327354563|gb|EGE83420.1| aspartyl proteinase [Ajellomyces dermatitidis ATCC 18188]
Length = 398
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 161/262 (61%), Gaps = 12/262 (4%)
Query: 52 ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
E + G A SG L D NF++AQY+ EI IG+PPQ F V+ DTGSSNLWV
Sbjct: 61 EMFKGAAQASGGHSVLVD---------NFLNAQYYSEITIGTPPQTFKVVLDTGSSNLWV 111
Query: 112 PSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQ 171
PSS+C SI+CY H++Y S S+TY + G I YGSGS+SGF SQD V +GD+ +K Q
Sbjct: 112 PSSEC-GSIACYLHNKYDSSTSSTYQKNGSEFAIRYGSGSLSGFVSQDTVRIGDLTIKSQ 170
Query: 172 VFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP 231
+F EAT E L F RFDGI+GLG+ I+V P + MV QGL+ E VFSF+L
Sbjct: 171 LFAEATNEPGLAFAFGRFDGILGLGYDTISVNKIPPPFYEMVNQGLLDEPVFSFYLGDAN 230
Query: 232 DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
++ E VFGG++ H+ G+ +P+ +K YW+ +L I G ++ + G I+D+
Sbjct: 231 IEDDDSEAVFGGINKDHYTGELVMIPLRRKAYWEVDLDAITFGKETAQLENTGV--ILDT 288
Query: 292 GTSLLAGPTPVVTEINHAIGGE 313
GTSL+A P+ + +N IG +
Sbjct: 289 GTSLIALPSTLAELLNKEIGAK 310
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + +P+++FT+ F + P YIL+ + CIS FM D P P GPL
Sbjct: 315 GQYTIDCTKRDGLPDLTFTLTGHNFTIGPYDYILE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAE 504
ILGD F+ Y++V+D G +G A+
Sbjct: 371 AILGDAFLRRYYSVYDMGNHSVGLAK 396
>gi|358385852|gb|EHK23448.1| hypothetical protein TRIVIDRAFT_215801 [Trichoderma virens Gv29-8]
Length = 395
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
++A+ L+ ++ G+ ++ L+K L+ L + I + ++YMG S
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLE-QQLEGSTIESQVQHLGQKYMGVRPTSRADVMFN 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
D I +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 64 DKLPKIQGGHPVPVTNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCN-SIACF 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+ Y S S+TY + G EI+YGSGS++GF S D V +GD+ ++ Q F EAT E L
Sbjct: 123 LHATYDSSSSSTYKQNGSDFEIHYGSGSLTGFISNDVVTIGDLKIQKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V +P + MV Q L+ E VF+F+L +EG E VFGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNGIIPPFYQMVNQKLLDEPVFAFYLGS---GDEGSEAVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD H+ GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 240 VDESHYSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 ELLNAEIGAK 307
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++F++ ++L YI++ ++ CIS F D P P GPL
Sbjct: 312 GQYTVDCSKRDSLPDITFSLAGSKYSLPATDYIIE----MSGNCISSFQGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 368 VILGDAFLRRYYSVYDLGKNAVGLAKA 394
>gi|290543422|ref|NP_001166408.1| cathepsin E precursor [Cavia porcellus]
gi|115721|sp|P25796.1|CATE_CAVPO RecName: Full=Cathepsin E; Flags: Precursor
gi|191295|gb|AAA37052.1| procathepsin E [Cavia porcellus]
gi|1246041|gb|AAB35844.1| procathepsin E [Cavia]
Length = 391
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 151/236 (63%), Gaps = 3/236 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + S+T
Sbjct: 65 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACQTHPVFHPSLSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G S I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 124 YREVGNSFSIQYGTGSLTGIIGADQVSVEGLTVVGQQFGESVQEPGKTFVHAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +++ +P G E+ FGG DP HF G +
Sbjct: 184 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYMSSNPGG-SGSELTFGGYDPSHFSGSLNW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VPVTK+ YWQ L I +G+ S C GC AIVD+GTSL+ GP + ++ A+G
Sbjct: 243 VPVTKQAYWQIALDGIQVGD-SVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALG 297
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 18/147 (12%)
Query: 367 VEKENVSAGDSAV-CSACEMAVVWVQNQL------KQKQTKEKV-LSYINELCDSLPNPM 418
+ + + GDS + CS A+V L K KQ +E + +Y++E
Sbjct: 253 IALDGIQVGDSVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALGATYVDE--------- 303
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G S+ C + M +V+F I + L+P Y L +VC +GF ++ PP GPL
Sbjct: 304 GYSV-QCANLNMMLDVTFIINGVPYTLNPTAYTLLDFVDGMQVCSTGFEGLEIQPPAGPL 362
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ ++ VFD G R+G A A
Sbjct: 363 WILGDVFIRQFYAVFDRGNNRVGLAPA 389
>gi|67524891|ref|XP_660507.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|40744298|gb|EAA63474.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|259486160|tpe|CBF83780.1| TPA: vacuolar aspartyl protease (proteinase A) (Eurofung)
[Aspergillus nidulans FGSC A4]
Length = 394
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 185/316 (58%), Gaps = 29/316 (9%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK---------ERYMGGAGVSGVRH 65
+ + LL + G + K +L+ L ITR ++YMG +H
Sbjct: 1 MKASLLTASVLLGYASAEVHKLKLNKVPLTEQFITRNIADHANALGQKYMG----QFQQH 56
Query: 66 RLGDSD------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
L D D+L + NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C S
Sbjct: 57 VLEDEPVNAMRGHDVL-VDNFMNAQYFSEIQLGTPPQTFKVVLDTGSSNLWVPSSECG-S 114
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H ++ S S+TY + G I YGSGS+SGF S+DN+++GD+ VK Q F EAT E
Sbjct: 115 IACYLHQKFDSSASSTYKKNGSEFAIKYGSGSLSGFVSRDNLQIGDLKVKGQDFAEATSE 174
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEG 236
L F RFDGI+GLGF I+V VP + NM+ QGL+ E VF+F+L N+D D+
Sbjct: 175 PGLAFAFGRFDGILGLGFDTISVNRIVPPFYNMIHQGLLDEPVFAFYLGDANKDGDSSVA 234
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
FGG+D H++G+ +P+ +K YW+ +L I +G++ + G I+D+GTSL+
Sbjct: 235 ---TFGGIDKDHYEGELIKIPLRRKAYWEVDLDAIALGDEVAELENTGV--ILDTGTSLI 289
Query: 297 AGPTPVVTEINHAIGG 312
A P+ + IN IG
Sbjct: 290 ALPSNLAEMINTEIGA 305
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P+++FT+ F + P Y L+ + CIS FM D P P GPL
Sbjct: 311 GQYTIDCAKRDSLPDLTFTLTGHNFTIGPYDYTLE----VQGSCISAFMGMDFPEPVGPL 366
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 367 AILGDAFLRKWYSVYDLGNGAVGLAKA 393
>gi|110277433|gb|ABG57251.1| vacuolar protease A [Trichoderma atroviride]
gi|358394485|gb|EHK43878.1| hypothetical protein TRIATDRAFT_137844 [Trichoderma atroviride IMI
206040]
Length = 395
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 128/309 (41%), Positives = 184/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++A+ L+ ++ G+ ++ L+K ++L+ S+ A ++YMG S V D
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLEQQLEGSSIEAQVQQLGQKYMGVRPTSRVDVMFND 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ + +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 NVPKVKGGHPVPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S++Y + G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F
Sbjct: 124 HSTYDSSSSSSYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLQIKGQDFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + MV Q L+ E VF+F+L +EG FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYLGS---GDEGSVATFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H+ GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 241 DESHYSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGIAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNAEIGAK 307
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P+++F++ ++L YIL+ ++ CIS F D P P GPL
Sbjct: 312 GQYTIDCAKRDSLPDITFSLAGSKYSLPASDYILE----VSGSCISTFQGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 368 VILGDAFLRRYYSVYDLGKGAVGLAKA 394
>gi|50557048|ref|XP_505932.1| YALI0F27071p [Yarrowia lipolytica]
gi|49651802|emb|CAG78744.1| YALI0F27071p [Yarrowia lipolytica CLIB122]
Length = 396
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/306 (41%), Positives = 181/306 (59%), Gaps = 20/306 (6%)
Query: 37 RLDLHSLNAARITRKER------YMGGAGVSGVRHRLGDSDE-----DIL--PLKNFMDA 83
++ ++ ++ A + KE M G G +LG+ +E D+ PL N+++A
Sbjct: 22 KVSINKMSTAELLGKENGFEDHLRMMGQKYMGKFQKLGEFNELASIQDVSNSPLTNYLNA 81
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+ EI IG+PPQ F+VI DTGSSNLWVPS +C SI+CY H +Y S S++Y G +
Sbjct: 82 QYYTEIEIGTPPQKFNVILDTGSSNLWVPSVQCN-SIACYLHQKYDSAASSSYKANGTAF 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
EI YGSGS+ GF SQD +++G +V+ +Q F EAT E L F +FDGI+GL + I+V
Sbjct: 141 EIQYGSGSMEGFVSQDTLKLGSLVLPEQDFAEATSEPGLAFAFGKFDGILGLAYDTISVN 200
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP N V +GL+ + FSF+L +GG FGGVD +F+GK T++PV +K Y
Sbjct: 201 KIVPPVYNAVNRGLLDKNQFSFFLGDTNKGTDGGVATFGGVDEDYFEGKITWLPVRRKAY 260
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAE 319
W+ E I +G+Q+ + G A +D+GTSLLA P+ + +N IG G + E
Sbjct: 261 WEVEFNSITLGDQTAELVNTGAA--IDTGTSLLALPSGLAEVLNSEIGATKGWSGQYTVE 318
Query: 320 CKLVVS 325
C V S
Sbjct: 319 CDKVDS 324
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD++ ++P+++F F + P Y L+ ++ C+S F FD+P P GP+
Sbjct: 313 GQYTVECDKVDSLPDLTFNFAGYNFTIGPRDYTLE----LSGSCVSAFTGFDIPAPVGPI 368
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D +G A+A
Sbjct: 369 AIIGDAFLRRYYSVYDLDHDAVGLAKA 395
>gi|393246119|gb|EJD53628.1| aspartic peptidase A1 [Auricularia delicata TFB-10046 SS5]
Length = 415
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 154/243 (63%), Gaps = 5/243 (2%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
D +PL NF +AQYF EI +GSP QNF V+ DTGSSNLWVPSS C SI+C+ H++Y S
Sbjct: 90 DGHKVPLSNFANAQYFAEISLGSPAQNFKVVLDTGSSNLWVPSSGCT-SIACFLHAKYDS 148
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+TY + G S EI+YGSGS+ GF SQD +++GD+ + Q F EA +E L F +FD
Sbjct: 149 SASSTYKKNGSSFEIHYGSGSMEGFISQDTLKIGDISIPGQDFAEAMKEPGLAFAFGKFD 208
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + IAV P + NMV + L+ + VFSF L +GG VFGGVD H+K
Sbjct: 209 GILGLAYDTIAVNHITPPFYNMVNKKLLDQPVFSFRLG--ASESDGGSAVFGGVDSSHYK 266
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+ TYVPV +K YW+ EL I +G+ G A +D+GTSL+ P + IN I
Sbjct: 267 GQITYVPVRRKAYWEVELEGIKLGDDEVDFENTGAA--IDTGTSLIVLPVDIGEMINAQI 324
Query: 311 GGE 313
G +
Sbjct: 325 GAK 327
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ + C++ ++PN +F G K + L+ E Y+L+ ++ C+S F D P G L
Sbjct: 332 GQYTVPCEKRSSLPNFTFNFGGKPYVLTGEDYVLE----LSGTCVSAFTPMDFNVPGGDL 387
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y TV+D G+ +GFAE+
Sbjct: 388 WIVGDVFLRKYFTVYDLGRNAVGFAES 414
>gi|327296035|ref|XP_003232712.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
gi|326465023|gb|EGD90476.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
Length = 400
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 178/298 (59%), Gaps = 24/298 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL----------GDSDEDILPL 77
L+++ LK++ L+ ++ + ++YMG + +H +S ++L +
Sbjct: 25 LKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPEQHEQQMFKEQTPIEAESGHNVL-I 79
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY+
Sbjct: 80 DNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTYS 138
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G I YGSGS+ GF S+DNV++GD+ +K+Q+F EAT E L F RFDGI+G+GF
Sbjct: 139 RNGTKFAIRYGSGSLEGFVSRDNVKIGDLTIKNQLFAEATSEPGLAFAFGRFDGIMGMGF 198
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
I+V P + NM++QGL+ E VFSF+L N+D D + FGG D HF G T
Sbjct: 199 SSISVNGIPPPFYNMIDQGLLDEPVFSFYLGDTNKDGDQS---VVTFGGSDTNHFTGDMT 255
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 256 TIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGA 311
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+V+FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 317 GQYTLDCAKRDSLPDVTFTLSGHNFTIGPHDYTLE----VSGTCISSFMGMDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 373 AILGDSFLRRYYSVYDLGKGTVGLAKA 399
>gi|338712318|ref|XP_001501960.2| PREDICTED: pepsin II-1-like [Equus caballus]
Length = 397
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 113/251 (45%), Positives = 160/251 (63%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 73 DTEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 131
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 132 SSTYRATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 191
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FGG+D ++ G
Sbjct: 192 LGLAYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--NDESGSVVMFGGIDSSYYTGS 249
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPV+ +GYWQ + I + +S C GGC A+VD+GTSLL GPT + I IG
Sbjct: 250 LHWVPVSHEGYWQITVDSITVNGESIA-CSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGA 308
Query: 312 -----GEGVVS 317
GE V+S
Sbjct: 309 RKDLLGEAVIS 319
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 69/134 (51%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS AVV L T + + SYI D L GE++I C I ++P
Sbjct: 272 GESIACSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGARKDLL----GEAVISCSSIDSLP 327
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI F L+P YIL+ + +CISGF +L G LWILGDVF+ Y TV
Sbjct: 328 DIVFTINGVEFPLTPSAYILEEDD----ICISGFKGMNLDTSSGELWILGDVFIRQYFTV 383
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 384 FDRANNQVGLASVA 397
>gi|340374170|ref|XP_003385611.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 389
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 108/245 (44%), Positives = 159/245 (64%), Gaps = 3/245 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
P+K+++ AQY+G I +G+P Q+F+ +FDTGSSNLWVPS KC I+C H++Y S KS+
Sbjct: 63 PMKDYLMAQYYGPISLGTPDQDFNCMFDTGSSNLWVPSKKCGLLDIACRLHNKYDSTKSS 122
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G + YGSG+ SGFFS DN+++G+ + Q EAT E + F+ A+FDGI G
Sbjct: 123 TYIANGTKFSLQYGSGATSGFFSTDNMKIGNSTITKQSIGEATHEPGVAFVAAKFDGICG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+ P +DNM+ Q LV+ +F +L+ D A GG++ GG + K++ G
Sbjct: 183 MAYPAISAERQTPFFDNMISQNLVNAGMFGVFLSADTSASLGGDLNLGGPNEKYYTGDFN 242
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVP+T K Y+ ++ + GN S +C+GGC IVD+GTSL+AGPT VT+I AIG +
Sbjct: 243 YVPLTSKTYYMIKVDGMNAGNLS--LCDGGCNGIVDTGTSLIAGPTAEVTKIATAIGAKS 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 TLAGE 305
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 78/151 (51%), Gaps = 8/151 (5%)
Query: 357 EYVSTGIKT--VVEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSL 414
YV KT +++ + ++AG+ ++C +V L T E ++ I +
Sbjct: 242 NYVPLTSKTYYMIKVDGMNAGNLSLCDGGCNGIVDTGTSLIAGPTAE--VTKIATAIGAK 299
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
GE IDC ++P++P+V+ TI + + L+ + Y+L + C+ GFM +LP
Sbjct: 300 STLAGEYTIDCTKVPSLPDVTITIAGQKYTLTGKDYVLN----VEGQCLLGFMGINLPDQ 355
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDV + VY+TVFD R+GFA +
Sbjct: 356 LKNSWILGDVLIRVYYTVFDYSGGRVGFAPS 386
>gi|21063965|gb|AAM29212.1| AT05209p [Drosophila melanogaster]
Length = 404
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 184/318 (57%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWPLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F
Sbjct: 116 CASTMVACRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R+ + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
+GG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 DGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCDSVPDLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 402
>gi|345568347|gb|EGX51242.1| hypothetical protein AOL_s00054g478 [Arthrobotrys oligospora ATCC
24927]
Length = 392
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 182/311 (58%), Gaps = 19/311 (6%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITR----KERYMGGAGVSGVRHRLGDSDED---ILP 76
+S G+ ++ LKK ++ L T+ ++Y+ AG + D + D +P
Sbjct: 16 ASAGVHKMSLKKIPVEDTMLGQNFQTQVQALAQKYINRAG--NQQAFTNDVNADGGHSVP 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQY+ EI +G+PPQ F V+ DTGSSNLWVPS C SI+C+ H++Y S +S+TY
Sbjct: 74 VNNFLNAQYYSEITLGTPPQTFKVVLDTGSSNLWVPSKSCS-SIACFLHTKYDSSESSTY 132
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YGSGS+ GF SQD + +GD+ +K+Q+F EAT+E L F +FDGI+GLG
Sbjct: 133 KANGTEFSIQYGSGSMEGFISQDTLTIGDLTIKNQLFAEATKEPGLAFAFGKFDGILGLG 192
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V P + M+ Q LV E VF+F+L R+ D E VFGG+D H+ G T+V
Sbjct: 193 YDTISVNKIPPPFYQMISQKLVDEPVFAFYLGREEDESEA---VFGGIDKSHYTGDITWV 249
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---- 312
V +K YW+ I G+Q+ + G A++D+GTSL+ P+ +N AIG
Sbjct: 250 DVRRKAYWEVPFDSISFGDQTAELDSWG--AVLDTGTSLITLPSDYAEMLNSAIGATKGW 307
Query: 313 EGVVSAECKLV 323
G S C+ V
Sbjct: 308 NGQYSVPCEKV 318
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ + C+++P +P+++F +G F + Y L + CIS D+P GP+
Sbjct: 309 GQYSVPCEKVPDLPSLTFNLGGTNFTIEGSDYTLN----LQGSCISAITPLDMPARLGPM 364
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G R G A+A
Sbjct: 365 AILGDAFLRKYYSIYDLGNNRAGLAKA 391
>gi|296479430|tpg|DAA21545.1| TPA: renin [Bos taurus]
Length = 401
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 184/307 (59%), Gaps = 20/307 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 13 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 65
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 66 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 124
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 125 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 183
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDP 246
FDG++G+GF AVG PV+D+++ Q +++++VFS + +RD GGEIV GG DP
Sbjct: 184 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRDSKNSHLLGGEIVLGGSDP 243
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 244 QYYQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLL 302
Query: 307 NHAIGGE 313
A+G +
Sbjct: 303 MEALGAK 309
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 53/84 (63%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C+++PT+P++SF +G K + L+ Y+L+ ++C D+PPP GP+W+L
Sbjct: 317 VVNCNQMPTLPDISFHLGGKAYTLTSADYVLQDPYNNDDLCTLALHGMDIPPPTGPVWVL 376
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 377 GATFIRKFYTEFDRRNNRIGFALA 400
>gi|68051036|emb|CAI46901.1| nothepsin [Podarcis siculus]
Length = 414
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 186/320 (58%), Gaps = 13/320 (4%)
Query: 6 LRSVFCLWVLASCLL----LPASSNGLRRIGLKKRRLDLHSLNAARITR--KERYMGGAG 59
+R + WV CL +P + R L+KR +LH L R +RY
Sbjct: 1 MRVLLAFWVYIPCLTAVVRIPLTRFESIRGKLRKRG-ELHKLLEDRQPDIFGQRY-PHCL 58
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
S + G + E L ++M+AQY+GE+ +G+PPQ F+V+FDTGSS+ WVPS++CY S
Sbjct: 59 PSDINLSQGLATER---LYDYMNAQYYGEVSVGTPPQRFTVVFDTGSSDFWVPSARCY-S 114
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H R++S S +Y ++G+ + YG+GS+ G ++D V+ ++ ++ Q F E E
Sbjct: 115 KACSMHKRFESFMSYSYAQVGEPFYLQYGTGSLIGVTAKDTVQFSNLSIEAQDFGEVRYE 174
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
LTF A FDG++GLG+ ++V +PV+D M+ Q L+ E VFSF LNR + E GGE+
Sbjct: 175 PDLTFTFAHFDGVLGLGYPSLSVLHGLPVFDGMLRQQLIEEPVFSFILNRGGNTENGGEL 234
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGG+D +KG +VPVT++ YW+ + ++ I C+ GCAAIVDSGTSL+ GP
Sbjct: 235 IFGGIDHSLYKGSIHWVPVTEQKYWKIHMDNVKIQGH-IAACKDGCAAIVDSGTSLITGP 293
Query: 300 TPVVTEINHAIGGEGVVSAE 319
+ + IG E
Sbjct: 294 PSQIIRLQQKIGAHPAPHGE 313
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 44/90 (48%), Positives = 61/90 (67%)
Query: 415 PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPP 474
P P GE I+DC R+ ++P ++FTIG + + ++ +QYI+K G C+SGF A DL P
Sbjct: 308 PAPHGEFIVDCRRLSSLPPITFTIGQREYTITSKQYIIKQTSGGEAFCLSGFQALDLGPR 367
Query: 475 RGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
P+WILGDVF+G Y+TVFD R+GFA
Sbjct: 368 SKPMWILGDVFIGQYYTVFDRANDRVGFAR 397
>gi|255936729|ref|XP_002559391.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584011|emb|CAP92037.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 398
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 184/311 (59%), Gaps = 27/311 (8%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------- 76
+ R+ L K +L+ H+++A ++YMG + +H+ D P
Sbjct: 20 VHRLKLNKVPLAEQLNTHNIDAHVHNLGQKYMG---IRPEKHQDLFHDTSFNPAAGHDVL 76
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 77 VDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G EI YGSGS+SGF S+D +++GD+ VK Q F EAT E L F RFDGI+GLG
Sbjct: 136 EKNGTEFEIRYGSGSLSGFVSRDTLQIGDLKVKGQDFAEATNEPGLAFAFGRFDGILGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE---IVFGGVDPKHFKGKH 253
+ I+V VP + +M+ Q LV E VF+F+L DA + G+ FGG+D H+ G+
Sbjct: 196 YDTISVNKMVPPFYHMINQKLVDEPVFAFYLG---DANKDGDNSVATFGGIDESHYTGEL 252
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
+P+ +K YW+ EL I +G+ + G I+D+GTSL+A P+ + +N IG
Sbjct: 253 IKIPLRRKAYWEVELNSIALGDNVAELENTGV--ILDTGTSLIALPSTMAELLNKEIGAT 310
Query: 313 ---EGVVSAEC 320
G S EC
Sbjct: 311 KGFTGQYSVEC 321
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ ++P+++FT+G F + P Y+L+ + CIS FM D P P GPL
Sbjct: 315 GQYSVECDKRDSLPDLTFTLGGHKFTIGPYDYVLE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 371 AILGDAFLRRWYSVYDVGNNAVGLAKA 397
>gi|74136511|ref|NP_001028152.1| gastricsin precursor [Monodelphis domestica]
gi|73621388|sp|Q689Z7.1|PEPC_MONDO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|51534970|dbj|BAD36918.1| pepsinogen C [Monodelphis domestica]
Length = 391
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 107/238 (44%), Positives = 160/238 (67%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N++D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+R+ +S+T
Sbjct: 66 PITNYLDSFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNRFSPSQSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+T G++ ++YGSGS++ D V V ++VV +Q F + E + F + FDGI+G+
Sbjct: 125 FTNGGQTYTLSYGSGSLTVVLGYDTVTVQNIVVSNQEFGLSESEPTSPFYYSDFDGILGM 184
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AVG++ V M++QG +SE +FSF+ +R P + GGE++ GGVDP+ + G+ T+
Sbjct: 185 AYPAMAVGNSPTVMQGMLQQGQLSEPIFSFYFSRQPTHQYGGELILGGVDPQLYSGQITW 244
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + IGNQ+TG C GC AIVD+GT LLA P ++ A G +
Sbjct: 245 TPVTQEVYWQIGIEEFAIGNQATGWCSQGCQAIVDTGTFLLAVPQQYMSAFLQATGAQ 302
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ +++C+ I MP ++F I F L P Y+ C G A LP P G P
Sbjct: 307 GDFMVNCNYIQDMPTITFVINGSQFPLPPSAYVFNNNG----YCRLGIEATYLPSPNGQP 362
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 363 LWILGDVFLKEYYSVYDMANNRVGFAYSA 391
>gi|402072590|gb|EJT68339.1| vacuolar protease A [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 396
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMG------GAGVSGV 63
+L + +LL + G+ ++ +KK +L+ L A ++Y+G V
Sbjct: 5 LLTAAVLLGSVDAGVHKLKMKKVPLSEQLETVPLTAQLRGLGQKYLGLRPDSHAQAVFES 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
R + + P+ NFM+AQY+ EI +G+PPQ+F V+ DTGSSNLWVPS C SI+CY
Sbjct: 65 RPIRAQGNHPV-PVSNFMNAQYYSEITVGTPPQSFKVVLDTGSSNLWVPSQSC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS+Y S S+TY + G EI YGSGS+SGF S D +++GD+ +K+Q F EAT+E L
Sbjct: 123 LHSKYDSSASSTYKKNGTEFEITYGSGSLSGFVSNDVMQIGDIKIKNQDFAEATKEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLGF ++V VP + M++Q L+ E VF+F+L D ++ E +FGG
Sbjct: 183 FAFGRFDGILGLGFDRLSVNKMVPPFYQMIDQKLIDEPVFAFYL---ADQDDESEAIFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
++ H GK +P+ +K YW+ + I +G++ G E I+D+GTSL PT +
Sbjct: 240 INKDHIDGKIIEIPLRRKAYWEVDFDAIALGDE-VGELE-NTGVILDTGTSLNVLPTQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 56/87 (64%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDCD+ ++P+V+FT+ F+++ YIL+ + CIS FM D+ PP GPL
Sbjct: 312 GQYTIDCDKRKSLPDVTFTLTGHNFSITAYDYILEA----SGTCISTFMGMDIAPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D GK +G A++
Sbjct: 368 AILGDAFLRRYYSIYDLGKGTVGLAKS 394
>gi|302497761|ref|XP_003010880.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
gi|306531030|sp|D4B385.1|CARP_ARTBC RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291174425|gb|EFE30240.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
Length = 400
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 152/239 (63%), Gaps = 9/239 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY
Sbjct: 79 IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTY 137
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ G I YGSGS+ GF S+D+V++GD+ +K Q+F EAT E L F RFDGI+G+G
Sbjct: 138 SKNGTKFAIRYGSGSLEGFVSRDSVKIGDMTIKKQLFAEATSEPGLAFAFGRFDGIMGMG 197
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPKHFKGKH 253
F I+V P + NM++QGL+ E VFSF+L N+D D + FGG D HF G
Sbjct: 198 FSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQS---VVTFGGSDTNHFTGDM 254
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T +P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 255 TTIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGA 311
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+V+FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 317 GQYTLDCAKRDSLPDVTFTLSGHNFTIGPHDYTLE----VSGTCISSFMGMDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 373 AILGDSFLRRYYSVYDLGKGTVGLAKA 399
>gi|125984612|ref|XP_001356070.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
gi|54644388|gb|EAL33129.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
Length = 387
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 118/261 (45%), Positives = 163/261 (62%), Gaps = 8/261 (3%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
S E L+N ++ +Y+G IGIG+P Q F V+FDTGS+NLWVPS+KC +++C H++Y
Sbjct: 56 SSESTETLQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQY 115
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS++GF S+D V V + ++ Q F EA E TF+ A
Sbjct: 116 HSEQSSTYVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAP 175
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P +DNM+ QGL+ E V SF+L R A +GGE++ GGVDP
Sbjct: 176 FAGIMGLAFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSL 235
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G TYVPV+ GYWQF++ + G +C GC AI D+GTSL+ P +IN
Sbjct: 236 YTGNLTYVPVSVAGYWQFKVNSVKSGGFL--LCS-GCQAIADTGTSLIVVPEAAYAKINS 292
Query: 309 AIG----GEGVVSAECKLVVS 325
+G GEG +C V S
Sbjct: 293 LLGATDNGEGEAFVKCADVSS 313
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+V +G +CS C+ A+ L E + IN L + N GE+ + C +
Sbjct: 256 NSVKSGGFLLCSGCQ-AIADTGTSLIV--VPEAAYAKINSLLGATDNGEGEAFVKCADVS 312
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P V+ IG IF L+P+ Y++K E C+S F LWILGDVF+G +
Sbjct: 313 SLPKVNLNIGGTIFTLAPKDYVVKLTEAGQTRCMSSFTTMS----GNTLWILGDVFIGKF 368
Query: 490 HTVFDSGKLRIGFAEAA 506
+TVFD G RIGFA A
Sbjct: 369 YTVFDKGNNRIGFARVA 385
>gi|21629629|gb|AAM61957.1| synthetic renin 2/1d [Mus musculus]
Length = 401
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 186/313 (59%), Gaps = 12/313 (3%)
Query: 7 RSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG---V 63
R LW L LLL + G R+ L + + R +ER + +S V
Sbjct: 3 RRRMPLWAL---LLLWSPCTFSLPTGTTFERIPLKKMPSVREILEERGVDMTRLSAEWDV 59
Query: 64 RHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
R + I P L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +
Sbjct: 60 RTKRSSLTNLISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYL 119
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F E T
Sbjct: 120 ACGIHSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEVTELP 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR P GGE+V
Sbjct: 179 LIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGPHL-LGGEVV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT
Sbjct: 238 LGGSDPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPT 296
Query: 301 PVVTEINHAIGGE 313
+ I A+G +
Sbjct: 297 SSLKLIMQALGAK 309
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRIGFALA 400
>gi|327271207|ref|XP_003220379.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 224 bits (571), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 119/305 (39%), Positives = 187/305 (61%), Gaps = 12/305 (3%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVR-HRLG 68
L ++ +C L S GL + LKK + S+ I + E Y+ + R +
Sbjct: 4 LMLMLACFQL---SEGLVTVPLKKGK----SIRETMIEKGVLEDYLKHHNLDPARKYHFN 56
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
+ + P+ +MDA Y+G+IGIG+P QNF V+FDTGSSNLWVPS C + +C H+R+
Sbjct: 57 EYNVAYEPMA-YMDASYYGQIGIGTPAQNFLVLFDTGSSNLWVPSIYCN-TEACTRHARF 114
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S+TY+ G++ + YGSG+++GFF D + + ++VV +Q F + E F+ A
Sbjct: 115 NPSQSSTYSTNGQTFFLQYGSGNLAGFFGYDTLTLQNIVVTNQEFGLSKNEPGANFIYAE 174
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+ + +AVG A + M+++ L+S+ VFSF+L+R P+++ GGE+VFGGVD +
Sbjct: 175 FDGILGMAYPSLAVGGATTALERMLQENLLSQSVFSFYLSRQPNSQYGGEVVFGGVDTRL 234
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G+ + PVT++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++
Sbjct: 235 YSGEIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSNFLS 294
Query: 309 AIGGE 313
A+G +
Sbjct: 295 AVGAQ 299
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 50/92 (54%), Gaps = 5/92 (5%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++C+ + +P +SFTI F L P YIL C G LP
Sbjct: 301 NQYGQYAVNCNNVQNLPTISFTINGVSFPLPPSAYILNNNG----YCTVGIEPTYLPSQN 356
Query: 476 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G PLWILGD+F+ Y++V+D G R+GFA +A
Sbjct: 357 GQPLWILGDIFLREYYSVYDMGNNRVGFATSA 388
>gi|149725292|ref|XP_001501875.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/249 (46%), Positives = 160/249 (64%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H+R+ S+T
Sbjct: 66 PLENYMDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSNHNRFNPEDSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y +S I YG+GS++G D V VG + +Q+F + T GS + A FDGI+G
Sbjct: 125 YEATSESVSITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DNM +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 184 LAYPSISASGATPVFDNMWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDSSYYSGSLN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ + I + +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 242 WVPVSNEGYWQITMDSITMNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASE 300
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 301 DSSGESVIS 309
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + + SYI DS GES+I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASEDS----SGESVISCSSIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FT+ F LSP YIL+ + CISGF D+ G LWILGDVF+ Y TV
Sbjct: 318 DIVFTLNGVEFPLSPSAYILQEDDS----CISGFEGMDVDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|118344572|ref|NP_001072053.1| cathepsin D2 precursor [Takifugu rubripes]
gi|55771084|dbj|BAD69802.1| cathepsin D2 [Takifugu rubripes]
Length = 386
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/294 (39%), Positives = 178/294 (60%), Gaps = 14/294 (4%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKN 79
LL S + I L + R L R++ +R + S D + + L N
Sbjct: 13 LLITESAAITSISLHRARSLL-----TRMSNNQRSLLRVAASST-----DPESPAVRLIN 62
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTE 138
D QYFG+I IG+PPQ F+V+FDTGSS+LWVPS C ++C H Y+S +S+TY +
Sbjct: 63 IYDLQYFGKISIGTPPQEFTVLFDTGSSDLWVPSVYCSPLYLACGLHRHYRSYRSSTYVQ 122
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
+ I Y SG +SGF S+D + +G + V Q+F EA R+ TF+ +FDGI+G+ +
Sbjct: 123 CDRGFFIEYQSGRLSGFVSKDTLSIGGLQVPGQLFGEAVRQPGETFIYTQFDGILGMAYP 182
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+ PV+D ++ L+ + VFSF+LNRDP+A GG+++ GG++P+H+ G+ YV V
Sbjct: 183 SIST--IAPVFDRIMAAKLLPQNVFSFYLNRDPEAAIGGQLILGGLNPEHYAGELHYVNV 240
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+K YWQ E+ I +G+Q + +C+ C IVD+GTSL+ GP+ + +++AI G
Sbjct: 241 TRKAYWQIEVNRINVGDQLS-LCKPSCQTIVDTGTSLITGPSEEIRALHNAIPG 293
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 93/178 (52%), Gaps = 19/178 (10%)
Query: 334 LLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV--VEKENVSAGDS-AVCSACEMAVVWV 390
L++ GL PE ++ YV+ K +E ++ GD ++C +V
Sbjct: 221 LILGGLNPEHYAGEL--------HYVNVTRKAYWQIEVNRINVGDQLSLCKPSCQTIVDT 272
Query: 391 QNQLKQKQTKEKVLSYINELCDSLP---NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSP 447
L ++E I L +++P E+IIDC++IP+MP +SF IG K+F L+P
Sbjct: 273 GTSLITGPSEE-----IRALHNAIPGMSRQKDENIIDCEQIPSMPVISFNIGGKLFPLNP 327
Query: 448 EQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
E YI K + C S FMA D+ PP PLW LGDVF+ Y+TVFD R+GFA A
Sbjct: 328 EDYIWKEMDRGTAFCQSRFMALDMGPPAAPLWNLGDVFIMKYYTVFDRDADRVGFALA 385
>gi|301786581|ref|XP_002928699.1| PREDICTED: pepsin A-like isoform 1 [Ailuropoda melanoleuca]
gi|281347483|gb|EFB23067.1| hypothetical protein PANDA_018738 [Ailuropoda melanoleuca]
Length = 385
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 130/330 (39%), Positives = 185/330 (56%), Gaps = 28/330 (8%)
Query: 9 VFCLWVLASCLLLPAS-------SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+ L L+ CL++ L+ GL K L HS N A + + A V
Sbjct: 6 LISLVALSECLIIKVPLVKKKSLRKNLKEHGLLKDFLKNHSPNPA----SKYFPQEAAVM 61
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ PL+N+MD +YFG IGIG+PPQ F+VIFDTGSSNLWVPS C S +
Sbjct: 62 ATQ-----------PLENYMDMEYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPA 109
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREG 180
C H+R+ ++S+TY ++ I YG+GS++G D V+VG + +Q+F + T G
Sbjct: 110 CSNHNRFNPQQSSTYEGTSQTVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPG 169
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S + A FDGI+GL + +I+ A PV+DNM QGLVS+++FS +L+ D + G ++
Sbjct: 170 SFLYY-APFDGILGLAYPQISSSGATPVFDNMWNQGLVSQDLFSVYLSS--DDQSGSVVM 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D +F G +VPV+ +GYWQ + + I Q+ C GC AIVD+GTSLLAGPT
Sbjct: 227 FGGIDSSYFTGNLNWVPVSVEGYWQITMDSVTINGQAIA-CSQGCQAIVDTGTSLLAGPT 285
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+ I IG + E + S DL
Sbjct: 286 NSIANIQSYIGASEDSNGEMTISCSAINDL 315
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 62/134 (46%), Gaps = 11/134 (8%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G + CS A+V L T + SYI DS GE I C I +P
Sbjct: 261 GQAIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDS----NGEMTISCSAINDLP 316
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + L P YIL+ + C+SGF +LP G LWILGD+F+ Y V
Sbjct: 317 DIVFTINGIQYPLPPSAYILQNQD-----CVSGFQGMNLPTASGELWILGDIFIRQYFAV 371
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 372 FDRANNQVGLAPVA 385
>gi|301786583|ref|XP_002928700.1| PREDICTED: pepsin A-like isoform 2 [Ailuropoda melanoleuca]
Length = 393
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 177/304 (58%), Gaps = 21/304 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL K L HS N A + + A V + PL+N+MD +YFG
Sbjct: 32 LKEHGLLKDFLKNHSPNPA----SKYFPQEAAVMATQ-----------PLENYMDMEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY ++ I Y
Sbjct: 77 TIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYEGTSQTVSIAY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + +I+ A
Sbjct: 136 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPQISSSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM QGLVS+++FS +L+ D + G ++FGG+D +F G +VPV+ +GYWQ
Sbjct: 195 PVFDNMWNQGLVSQDLFSVYLSS--DDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + I Q+ C GC AIVD+GTSLLAGPT + I IG + E + S
Sbjct: 253 TMDSVTINGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSA 311
Query: 327 YGDL 330
DL
Sbjct: 312 INDL 315
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 63/137 (45%), Gaps = 9/137 (6%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G + CS A+V L T + SYI DS GE I C I +P
Sbjct: 261 GQAIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDS----NGEMTISCSAINDLP 316
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIA---EVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++ FTI + L P YIL+ A + C+SGF +LP G LWILGD+F+ Y
Sbjct: 317 DIVFTINGIQYPLPPSAYILQVSGLWASRLQDCVSGFQGMNLPTASGELWILGDIFIRQY 376
Query: 490 HTVFDSGKLRIGFAEAA 506
VFD ++G A A
Sbjct: 377 FAVFDRANNQVGLAPVA 393
>gi|195161645|ref|XP_002021673.1| GL26637 [Drosophila persimilis]
gi|194103473|gb|EDW25516.1| GL26637 [Drosophila persimilis]
Length = 387
Score = 224 bits (571), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 119/263 (45%), Positives = 165/263 (62%), Gaps = 12/263 (4%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
S E L+N ++ +Y+G IGIG+P Q F V+FDTGS+NLWVPS+KC +++C H++Y
Sbjct: 56 SSESTETLQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQY 115
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS++GF S+D V V + ++ Q F EA E TF+ A
Sbjct: 116 HSGQSSTYVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAP 175
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P +DNM+ QGL+ E V SF+L R A +GGE++ GGVDP
Sbjct: 176 FAGIMGLAFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSL 235
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ G TYVPV+ GYWQF++ + +S G+ C GC AI D+GTSL+ P +I
Sbjct: 236 YTGNLTYVPVSVAGYWQFKVNSV----KSGGILLCS-GCQAIADTGTSLIVVPEAAYAKI 290
Query: 307 NHAIG----GEGVVSAECKLVVS 325
N +G GEG +C V S
Sbjct: 291 NSLLGATDNGEGEAFVKCADVSS 313
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 70/137 (51%), Gaps = 7/137 (5%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
+V +G +CS C+ A+ L E + IN L + N GE+ + C +
Sbjct: 256 NSVKSGGILLCSGCQ-AIADTGTSLIV--VPEAAYAKINSLLGATDNGEGEAFVKCADVS 312
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P V+ IG IF L+P+ Y++K E C+S F + LWILGDVF+G +
Sbjct: 313 SLPKVNLNIGGTIFTLAPKDYVVKLTEAGQTRCMSSFTSMS----GNTLWILGDVFIGKF 368
Query: 490 HTVFDSGKLRIGFAEAA 506
+TVFD G IGFA A
Sbjct: 369 YTVFDKGNNTIGFARVA 385
>gi|449282010|gb|EMC88940.1| Cathepsin E-B, partial [Columba livia]
Length = 387
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G + +G+PPQ F+V+FDTGSSN WVPS+ C S +C H ++KS S++Y
Sbjct: 55 LYDYMNAQYYGVVSVGTPPQRFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFLSDSY 113
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YG+G + G +D +++ ++ +K Q F E+ E TF+ A FDG++GLG
Sbjct: 114 EHGGEAFSLQYGTGQLLGVAGKDTLQISNISIKGQDFGESVFEPGSTFVFAHFDGVLGLG 173
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E +FSF+L R+ D E GGE++ GG+D +KG +V
Sbjct: 174 YPSLAVGNALPVFDSIMNQQLVEEPIFSFYLKREDDTENGGELILGGIDHSLYKGSIHWV 233
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 234 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSH 292
Query: 317 SAE 319
S E
Sbjct: 293 SGE 295
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 64/99 (64%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ E + P+ GE ++DC R+ ++P++SFTIG + L+ EQY++K C+SGF
Sbjct: 282 LQEYIGASPSHSGEFLVDCRRLSSLPHISFTIGHHEYKLTAEQYVVKESIEDQTFCMSGF 341
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D+ GPLWILGDVFM ++ +FD G R+GFA++
Sbjct: 342 QSLDITTRAGPLWILGDVFMSAFYCIFDRGNDRVGFAKS 380
>gi|194883084|ref|XP_001975634.1| GG20455 [Drosophila erecta]
gi|190658821|gb|EDV56034.1| GG20455 [Drosophila erecta]
Length = 404
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 182/318 (57%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W+L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWLLVSLLPVLFILPVQFQPPVSCTLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ+F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSEVKATPLSNYLDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C ++C H+RY +++S ++ G I+YGSGS+ GF S D V V + + DQ F
Sbjct: 116 CASRMVACRVHNRYFAKRSTSHQVRGDRFAIHYGSGSLFGFLSTDTVRVAGLEIHDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 EGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 62/99 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ ++ C+ I +P ++FT+G + F L +Y+ + +C S F
Sbjct: 304 INESIGGTPSSFGQFLVPCESIAGLPKITFTLGGRRFFLESHEYVFRDIYQDRRICSSAF 363
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD K RIGFA+A
Sbjct: 364 IAVDLPSPSGPLWILGDVFLGKYYTEFDMEKHRIGFADA 402
>gi|291409611|ref|XP_002721072.1| PREDICTED: pepsin-3-like isoform 2 [Oryctolagus cuniculus]
Length = 387
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTLNLAT-----KYLPKAAFDSVPTE---------SLENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TISIGTPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 VDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|402226359|gb|EJU06419.1| endopeptidase [Dacryopinax sp. DJM-731 SS1]
Length = 413
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/301 (42%), Positives = 171/301 (56%), Gaps = 31/301 (10%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +FM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS KC SI+C+ H +Y S S+
Sbjct: 92 VPLTDFMNAQYFAEITLGTPPQTFKVVLDTGSSNLWVPSIKCT-SIACFLHQKYDSAASS 150
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G + EI+YGSGS+ GF S D + +GD+ V+ F EAT+E L F L RFDGI+G
Sbjct: 151 TYKSNGTAFEIHYGSGSMEGFVSNDLLTIGDLQVQKLDFAEATKEPGLAFALGRFDGILG 210
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V PV+ M+ Q L+ VF+F L N D D GGE FGG+D + GK
Sbjct: 211 LAYDTISVLHMTPVFYQMINQKLLENPVFAFRLGNSDAD---GGEATFGGIDESAYTGKI 267
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
YVPV +KGYW+ EL I +G + + G A +D+GTSL+A P+ + +N IG
Sbjct: 268 DYVPVRRKGYWEIELDKISLGGEDLELESTGAA--IDTGTSLIALPSDIAEMLNKEIGAT 325
Query: 313 ---EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEK 369
+ EC V S LPE FNG Y +G ++E
Sbjct: 326 KSWNNQYTVECSTVDS--------------LPELTFY------FNGKPYPLSGRDYILEA 365
Query: 370 E 370
+
Sbjct: 366 Q 366
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 34/84 (40%), Positives = 53/84 (63%), Gaps = 4/84 (4%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
++C + ++P ++F K + LS YIL+ +G CIS F D+PPP GP+WI+G
Sbjct: 334 VECSTVDSLPELTFYFNGKPYPLSGRDYILE-AQG---TCISSFTGLDIPPPLGPIWIVG 389
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVF+ Y++V+D G+ +G A AA
Sbjct: 390 DVFLRKYYSVYDLGRNAVGLASAA 413
>gi|194762106|ref|XP_001963199.1| GF19728 [Drosophila ananassae]
gi|190616896|gb|EDV32420.1| GF19728 [Drosophila ananassae]
Length = 390
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/246 (47%), Positives = 155/246 (63%), Gaps = 6/246 (2%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRY 128
+ E L + D +Y+G + IG+P QNF+++FDTGS+NLWVPS+KC S +C H++Y
Sbjct: 60 ASEGTETLHDSADREYYGLLSIGTPKQNFNILFDTGSANLWVPSAKCSASNKACQKHNKY 119
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS+SGF S D VEV + +K Q F EAT E TF A+
Sbjct: 120 HSGESSTYVANGESFSIEYGTGSLSGFLSTDTVEVAGIQIKSQTFAEATNEPGSTFTDAK 179
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P WDNM+EQ L+ E V SF+L A +GGE++ GG+D
Sbjct: 180 FAGILGLAFKSIAVDGVTPPWDNMIEQKLLDEPVISFYLKLKGTAVQGGEMILGGIDSSL 239
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+KG T+VPVTK YWQF+L I ++ GV AI D+GTSL+ P T IN
Sbjct: 240 YKGSLTWVPVTKAAYWQFKLTAI----KTKGVFISRNTQAIADTGTSLIVLPKAAYTRIN 295
Query: 308 HAIGGE 313
+ IG E
Sbjct: 296 NLIGAE 301
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 56/100 (56%), Gaps = 4/100 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN L + N GE+ + C R+ +PNV+ IGD+ F L+P YI++ E C+S F
Sbjct: 294 INNLIGAEDNGEGEAFVRCGRVSALPNVNLHIGDRFFTLTPSDYIIRITESGETYCMSVF 353
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ L ILGD F+G ++TVFD G RIGFA A
Sbjct: 354 TYME----GNTLTILGDAFIGKFYTVFDKGNNRIGFAPVA 389
>gi|432116085|gb|ELK37212.1| Cathepsin E [Myotis davidii]
Length = 396
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H R+ +S+T
Sbjct: 69 PLVNYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPRFSPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G I YG+GS+SG +D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSSPGSHFFIQYGTGSLSGVIGEDQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDTIQVGG-AVMFCSEGCQAIVDTGTSLITGPPAEIKQLQKAIGAEPV 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 52/87 (59%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++CD + MP+V+FTI + L P Y L E C SGF D+ PP GPL
Sbjct: 308 GEYAVECDNLNVMPDVTFTINGVPYTLQPTAYTLLDFVDGMEFCSSGFQGLDIQPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ +++VFD G R+G A A
Sbjct: 368 WILGDVFIRQFYSVFDRGDNRVGLAPA 394
>gi|291409609|ref|XP_002721071.1| PREDICTED: pepsin-3-like isoform 1 [Oryctolagus cuniculus]
Length = 387
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTLNLAT-----KYLPKAAFDSVPTE---------SLENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TISIGTPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 VDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ + C+SGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDD----CLSGFDGMNLDTSYGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQVGLAAAA 387
>gi|57164325|ref|NP_001009299.1| renin precursor [Ovis aries]
gi|1710090|sp|P52115.1|RENI_SHEEP RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|896318|gb|AAA69809.1| renin [Ovis aries]
Length = 400
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 181/306 (59%), Gaps = 19/306 (6%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GDS 70
S LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 14 STFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMAQLGAEWSQLTKTLSFGNR 66
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYK 129
++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +C HS Y
Sbjct: 67 TSPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYD 125
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S++Y E G I YGSG + GF SQD V VG + V Q F E T F+LA+F
Sbjct: 126 SLESSSYVENGTEFTIYYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLRPFMLAKF 184
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPK 247
DG++G+GF AVG PV+D+++ Q +++E+VFS + +RD GGEIV GG DP+
Sbjct: 185 DGVLGMGFPAQAVGGVTPVFDHILAQRVLTEDVFSVYYSRDSKNSHLLGGEIVLGGSDPQ 244
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+++ YV ++K G WQ + + + +T +CE GC +VD+G S ++GPT + +
Sbjct: 245 YYQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVVVDTGASYISGPTSSLRLLM 303
Query: 308 HAIGGE 313
A+G +
Sbjct: 304 EALGAK 309
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 32/86 (37%), Positives = 52/86 (60%), Gaps = 1/86 (1%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C+++PT+P++SF +G K + L+ Y+L+ C D+PPP GP+W
Sbjct: 315 EYVVNCNQMPTLPDISFHLGGKAYTLTSADYVLQDPYNNIS-CTLALHGMDIPPPTGPVW 373
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 374 VLGATFIRKFYTEFDRRNNRIGFALA 399
>gi|242781757|ref|XP_002479865.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
gi|218720012|gb|EED19431.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
Length = 395
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 165/279 (59%), Gaps = 6/279 (2%)
Query: 36 RRLDLHSLNAARITRKERYMG--GAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
+ D S+N + ++YMG GV + D+L + NF++AQYF EI IG+
Sbjct: 32 EQFDKRSMNDHMRSLGQKYMGVVPEGVYEDTSIRPEGGHDVL-VDNFLNAQYFSEITIGT 90
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PPQNF V+ DTGSSNLWVPS+ C SI+CY H++Y S S+TY + G I YGSGS+
Sbjct: 91 PPQNFKVVLDTGSSNLWVPSASCN-SIACYLHNKYDSSSSSTYKKNGSEFAIQYGSGSLE 149
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S+D V +GD+ +KDQ F EAT E L F RFDGI+GLGF I+V VP + NM+
Sbjct: 150 GFVSRDVVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILGLGFDTISVNKIVPPFYNML 209
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q + E VF+F+L + E FGG+D H+ G+ +P+ +K YW+ + +
Sbjct: 210 NQKTLDEPVFAFYLGDSNKEGDNSEATFGGIDKSHYTGELVKIPLRRKAYWEVDFDAVAF 269
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
G+ + G I+D+GTSL+A P+ + +N IG
Sbjct: 270 GDNVAELENTGV--ILDTGTSLIALPSTLAELLNKEIGA 306
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++ T+ F+++ Y+L+ + CIS FM D P P GPL
Sbjct: 312 GQYTVDCTKRDSLPDLTVTLSGHNFSITAHDYVLE----VQGSCISAFMGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +G A+A
Sbjct: 368 AILGDAFLRKWYSVYDLGNGAVGLAKA 394
>gi|354487263|ref|XP_003505793.1| PREDICTED: renin-like [Cricetulus griseus]
Length = 403
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 186/317 (58%), Gaps = 29/317 (9%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERYMGG 57
+ LW +SC LP + RI LKK R +D+ L+A +R+ G
Sbjct: 12 LLILW--SSCAFSLPTDTAAFGRILLKKMPSVREILKERGVDMTKLSAEWGKFTKRFSFG 69
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
G S V L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 70 NGTSPVI------------LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCS 117
Query: 118 -FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+C HS Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E
Sbjct: 118 PLYSACEIHSLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDIVTVGGIIVT-QTFGEV 176
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T + F+LA+FDG++G+GF AVG PV+D+++ Q ++ EEVFS + +RD G
Sbjct: 177 TELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQRVLKEEVFSVYYSRDSHL-LG 235
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GE+V GG DP+H++G YV V++ G W+ + + +G+ +T +CE GC +VD+G S +
Sbjct: 236 GEVVLGGSDPQHYQGNFHYVSVSRTGSWEIAMKGVSVGS-ATLLCEEGCVVVVDTGASYI 294
Query: 297 AGPTPVVTEINHAIGGE 313
+GPT + I +G +
Sbjct: 295 SGPTSSLKLIMQTLGAK 311
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 52/86 (60%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ ++DC ++P++P++SF +G + + L+ Y+L+ + C D+PPP GP+W
Sbjct: 317 DYVVDCSQVPSLPDISFHLGGRAYTLTSADYVLQNPYRNDDQCTLALHGLDIPPPTGPVW 376
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 377 VLGASFIRKFYTEFDRHNNRIGFALA 402
>gi|291416142|ref|XP_002724306.1| PREDICTED: cathepsin D [Oryctolagus cuniculus]
Length = 377
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 112/244 (45%), Positives = 154/244 (63%), Gaps = 35/244 (14%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S+KS+T
Sbjct: 80 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSKKSST 139
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + +I+YGSGS+SG+ SQD V V
Sbjct: 140 YVKNGTTFDIHYGSGSLSGYLSQDTVSXXXXXXXXNV----------------------- 176
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y
Sbjct: 177 ----------LPVFDNLMQQKLVEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSY 226
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ + + +G+ T +CEGGC AIVD+GTSLL GP V E+ AIG +
Sbjct: 227 LNVTRKAYWQVHMDQLNVGSGLT-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPL 285
Query: 316 VSAE 319
+ E
Sbjct: 286 IQGE 289
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 83/140 (59%), Gaps = 3/140 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+++ NV +G + CE A+V L E + + ++P GE II C+
Sbjct: 239 MDQLNVGSGLTLCEGGCE-AIVDTGTSLLVGPVDE--VRELQRAIGAVPLIQGEYIIPCE 295
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFM 486
++ ++P V+ +G + + LS E Y LK +G +C+SGFM D+PPP GPLWILGDVF+
Sbjct: 296 KVSSLPPVTLKLGGRDYTLSSEDYTLKVSQGGKTICLSGFMGMDIPPPAGPLWILGDVFI 355
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
G Y+TVFD R+GFAEAA
Sbjct: 356 GRYYTVFDRDGNRVGFAEAA 375
>gi|320588396|gb|EFX00865.1| aspartic endopeptidase pep2 [Grosmannia clavigera kw1407]
Length = 401
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 189/312 (60%), Gaps = 18/312 (5%)
Query: 23 ASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-----DED 73
A ++G++++ LKK ++L+ ++A ++YMG S + D
Sbjct: 16 AQASGIQKLKLKKVPLAKQLESIPIDAQIRGLGQKYMGARLGSHADEMFKTAVVETDDNH 75
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
LP+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY H++Y S S
Sbjct: 76 PLPVSNFLNAQYFAEISIGTPPQSFKVVLDTGSSNLWVPSSQCG-SIACYLHTKYDSESS 134
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
++Y G + YGSGS+SGF SQD V +GD+ + Q F EAT E L F ARFDGI+
Sbjct: 135 SSYKSNGSAFAAQYGSGSLSGFVSQDTVSIGDLKIVKQDFAEATEEPGLAFAFARFDGIL 194
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGK 252
GLGF I+V VP + N++ Q L+ VF+F+L N D D ++ E VFGGVD H+ GK
Sbjct: 195 GLGFDTISVNHIVPPFYNLINQKLIDSGVFAFYLGNADSDGDD-SEAVFGGVDKAHYTGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T +P+ +K YW+ +L I +G + + G I+D+GTSL+A P+ + +N IG
Sbjct: 254 ITTIPLRRKAYWEVDLDSISLGEDTAELENTGV--ILDTGTSLIALPSSLAEMLNAQIGA 311
Query: 313 E----GVVSAEC 320
+ G S +C
Sbjct: 312 KKGYNGQYSVDC 323
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 50/87 (57%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC R ++P+V+FT+ F+L YIL+ ++ CIS F D P P GPL
Sbjct: 317 GQYSVDCSRKSSLPDVTFTLSGYNFSLPASDYILE----VSGSCISTFTGVDFPEPVGPL 372
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D +G A A
Sbjct: 373 AILGDAFLRRYYSIYDLDNNTVGLALA 399
>gi|410968030|ref|XP_003990516.1| PREDICTED: pepsin B-like [Felis catus]
Length = 390
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 179/308 (58%), Gaps = 13/308 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDA 83
S G+ RI LKK + + + R + V L ++D P N++++
Sbjct: 14 SEGVERIILKKGK-SIRQVMEERGVLQTFLKNHPKVDPAAKYLFNNDAVAYEPFTNYLNS 72
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY G++
Sbjct: 73 YYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCK-SQACSNHNTFNPSMSSTYQNNGQTY 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+ + +AVG
Sbjct: 132 TLYYGSGSLTVLLGYDTVTVQNIVIHNQEFGLSEIEPSNPFYYANFDGILGMAYPNLAVG 191
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
++ V ++M++QG ++ +FSF+ +R P E GGE++ GG++ + + G+ + PVT++ Y
Sbjct: 192 NSPTVMESMMQQGQLTSPIFSFYFSRQPTYEYGGELILGGMNSQFYSGEIVWTPVTRELY 251
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQ + + L+GNQ TG+C GC AIVD+GT +LA P + A G E
Sbjct: 252 WQVAIDEFLVGNQPTGLCSQGCQAIVDTGTYVLAVPQQYMNSFLQATGAE---------- 301
Query: 324 VSQYGDLI 331
VSQYGD +
Sbjct: 302 VSQYGDFV 309
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 48/86 (55%), Gaps = 5/86 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ +++C+ I +MP ++F I L P Y+L C G A LP P G P
Sbjct: 306 GDFVVNCNSIQSMPTITFVISGSPLPLPPSAYVLNNNG----YCTLGIEATYLPSPSGQP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFA 503
LW LGDVF+ Y+T++D G R+GFA
Sbjct: 362 LWTLGDVFLKEYYTIYDMGNNRMGFA 387
>gi|164657049|ref|XP_001729651.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
gi|159103544|gb|EDP42437.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
Length = 419
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 155/255 (60%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +GSPPQ+F VI DTGS+NLWVPS C SI+C H +Y + S
Sbjct: 96 VPLTDFLNAQYFADIELGSPPQSFKVILDTGSANLWVPSESCT-SIACLLHKKYDNSLSK 154
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G +I+YGSGS+ GF S+D + +GD+ VKDQ F EA +E L F +FDGI+G
Sbjct: 155 TYQANGSEFQIHYGSGSMEGFVSRDTLRIGDLDVKDQDFAEAIKEPGLAFAFGKFDGILG 214
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + M EQ L+ + F F+L EGGE FGGVDP F+G
Sbjct: 215 LAYDTISVNKIVPPFYRMKEQNLLDQNQFGFYLGS--SESEGGEATFGGVDPSRFEGPIV 272
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y PV ++GYW+ L I GN+ + G A +D+GTSL+A PT V +N IG +
Sbjct: 273 YAPVRRRGYWEVALNKIGFGNEELVLTRTGAA--IDTGTSLIAMPTDVAEILNKEIGAKR 330
Query: 314 ---GVVSAECKLVVS 325
G S +C V S
Sbjct: 331 SWTGQYSVDCSKVPS 345
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P++P ++F + +K + L YI + CIS FM DLP P GPL
Sbjct: 334 GQYSVDCSKVPSLPALTFYLDNKPYTLEGRDYIFN----VQGTCISPFMGMDLPEPVGPL 389
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ ++TV+D K +GFA+A
Sbjct: 390 WIVGDVFLRKFYTVYDLDKDAVGFAKA 416
>gi|212526768|ref|XP_002143541.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
gi|210072939|gb|EEA27026.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
Length = 395
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/291 (41%), Positives = 168/291 (57%), Gaps = 10/291 (3%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMG--GAGVSGVRHRLGDSDEDILPLKNFM 81
+ R+ L K + D S+N + ++YMG G + D+L + NF+
Sbjct: 20 VHRLKLDKLSLSEQFDKRSMNDHMRSLSQKYMGVVPEGTYQDTSIRPEGGHDVL-VDNFL 78
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
+AQYF EI IG+PPQNF V+ DTGSSNLWVPSS C SI+CY HS+Y S S+TY + G
Sbjct: 79 NAQYFSEITIGTPPQNFKVVLDTGSSNLWVPSSSCN-SIACYLHSKYDSSSSSTYKKNGS 137
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YGSGS+ GF S+D V +GD+ +KDQ F EAT E L F RFDGI+GLGF I+
Sbjct: 138 DFAIQYGSGSLEGFVSRDTVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILGLGFDTIS 197
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V VP + NM+ Q + E VF+F+L + E FGG+D H+ G+ +P+ +K
Sbjct: 198 VNKIVPPFYNMLNQKSLDEPVFAFYLGDSNKEGDASEATFGGIDKSHYTGELVKIPLRRK 257
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YW+ + I G + G I+D+GTSL+A P+ + +N IG
Sbjct: 258 AYWEVDFDAIAFGENVAELENTGV--ILDTGTSLIALPSTLAELLNKEIGA 306
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++ T+ F+++ Y+L+ + CIS FM D P P GPL
Sbjct: 312 GQYTVDCAKRDSLPDLTVTLSGHNFSITAFDYVLE----VQGSCISAFMGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++++D G +G A+A
Sbjct: 368 AILGDAFLRKWYSIYDLGNGAVGLAKA 394
>gi|296810640|ref|XP_002845658.1| vacuolar protease A [Arthroderma otae CBS 113480]
gi|263406266|sp|C5FS55.1|CARP_NANOT RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|238843046|gb|EEQ32708.1| vacuolar protease A [Arthroderma otae CBS 113480]
Length = 395
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 175/296 (59%), Gaps = 24/296 (8%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL---------- 67
C S L+++ LK++ L+ ++ + ++YMG + +H
Sbjct: 15 CTSAKLHSLKLKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPGQHEQQMFKEQTPIE 70
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
+S ++L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS
Sbjct: 71 AESGHNVL-IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHST 128
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S S+T+T G S I YGSGS+ GF SQDNV++GD+ +K+Q+F EAT E L F
Sbjct: 129 YDSSASSTFTRNGTSFAIRYGSGSLEGFVSQDNVQIGDMKIKNQLFAEATSEPGLAFAFG 188
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGV 244
RFDGI+G+G+ I+V P + MVEQGLV E VFSF+L N+D D + FGG
Sbjct: 189 RFDGILGMGYDTISVNKITPPFYKMVEQGLVDEPVFSFYLGDTNKDGDQS---VVTFGGA 245
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
D H+ G T +P+ +K YW+ E I +G + + G I+D+GTSL+A PT
Sbjct: 246 DKSHYTGDITTIPLRRKAYWEVEFNAITLGKDTATLDNTGI--ILDTGTSLIALPT 299
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P+++FT+ F + P Y L+ ++ CIS FM D P P GPL
Sbjct: 312 GQYTIDCAKRDSLPDLTFTLSGHNFTIGPYDYTLE----VSGTCISSFMGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D GK +G A+A
Sbjct: 368 AILGDSFLRRWYSVYDLGKGTVGLAKA 394
>gi|169861123|ref|XP_001837196.1| endopeptidase [Coprinopsis cinerea okayama7#130]
gi|116501918|gb|EAU84813.1| endopeptidase [Coprinopsis cinerea okayama7#130]
Length = 411
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 155/255 (60%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQY+ EI +G+PPQ F VI DTGSSNLWVPS KC SI+C+ H++Y S +S
Sbjct: 89 VPLTNFMNAQYYTEITLGTPPQTFKVILDTGSSNLWVPSIKCT-SIACFLHTKYDSSQST 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF SQD + +GD+ +K Q F EA +E L F +FDGI+G
Sbjct: 148 TYKANGTEFSIQYGSGSMEGFVSQDTLGIGDLTIKGQDFAEALKEPGLAFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + NM+ Q L+ VF+F + E+GGE FGG+D + + GK
Sbjct: 208 LAYDTISVNRIVPPFYNMINQKLIDSPVFAFRIGS--SEEDGGEATFGGIDHEAYTGKLH 265
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +K YW+ EL I G+ + G A +D+GTSL+A PT + +N IG
Sbjct: 266 YVPVRRKAYWEVELEKISFGDDELELEHTGAA--IDTGTSLIALPTDMAEMLNTQIGARK 323
Query: 314 ---GVVSAECKLVVS 325
G +C V S
Sbjct: 324 SWNGQYQVDCNKVPS 338
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 53/89 (59%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ +DC+++P++P+++F G K + L YIL + CIS F D+ P G
Sbjct: 327 GQYQVDCNKVPSLPDLTFQFGGKPYPLKGSDYILN----VQGTCISAFTGMDINMPGGDS 382
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWI+GDVF+ Y+TV+D G +GFA A
Sbjct: 383 LWIVGDVFLRKYYTVYDLGNDAVGFAPVA 411
>gi|440903924|gb|ELR54511.1| Renin, partial [Bos grunniens mutus]
Length = 404
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/305 (40%), Positives = 184/305 (60%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 19 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 71
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 72 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 130
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 131 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 189
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF AVG PV+D+++ Q +++++VFS + +R+ GGEIV GG DP++
Sbjct: 190 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNSHL-LGGEIVLGGSDPQY 248
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 249 YQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLLME 307
Query: 309 AIGGE 313
A+G +
Sbjct: 308 ALGAK 312
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 53/84 (63%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C+++PT+P++SF +G K + L+ Y+L+ ++C D+PPP GP+W+L
Sbjct: 320 VVNCNQMPTLPDISFHLGGKAYTLTSADYVLQDPYNNDDLCTLALHGMDIPPPTGPVWVL 379
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 380 GATFIRKFYTEFDRRNNRIGFALA 403
>gi|148669271|gb|EDL01218.1| mCG6933 [Mus musculus]
Length = 401
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRIGFALA 400
>gi|15079273|gb|AAH11473.1| Ren2 protein [Mus musculus]
Length = 401
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T F+ RIGFA A
Sbjct: 373 VWVLGATFIRKFYTEFERHNNRIGFALA 400
>gi|132329|sp|P00796.1|RENI2_MOUSE RecName: Full=Renin-2; AltName: Full=Angiotensinogenase; AltName:
Full=Submandibular gland renin; Contains: RecName:
Full=Renin-2 heavy chain; Contains: RecName:
Full=Renin-2 light chain; Flags: Precursor
gi|15029868|gb|AAH11157.1| Ren2 protein [Mus musculus]
Length = 401
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRIGFALA 400
>gi|330688453|ref|NP_001193438.1| renin precursor [Bos taurus]
Length = 398
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/305 (40%), Positives = 184/305 (60%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 13 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 65
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 66 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 124
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 125 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 183
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF AVG PV+D+++ Q +++++VFS + +R+ GGEIV GG DP++
Sbjct: 184 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNSHL-LGGEIVLGGSDPQY 242
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 243 YQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLLME 301
Query: 309 AIGGE 313
A+G +
Sbjct: 302 ALGAK 306
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 53/84 (63%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C+++PT+P++SF +G K + L+ Y+L+ ++C D+PPP GP+W+L
Sbjct: 314 VVNCNQMPTLPDISFHLGGKAYTLTSADYVLQDPYNNDDLCTLALHGMDIPPPTGPVWVL 373
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 374 GATFIRKFYTEFDRRNNRIGFALA 397
>gi|118150650|ref|NP_112470.2| renin-2 [Mus musculus]
Length = 424
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 31 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 87
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 88 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 146
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 147 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 205
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 206 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 264
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 265 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 323
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 324 LIMQALGAK 332
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 336 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 395
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 396 VWVLGATFIRKFYTEFDRHNNRIGFALA 423
>gi|328860092|gb|EGG09199.1| hypothetical protein MELLADRAFT_42703 [Melampsora larici-populina
98AG31]
Length = 429
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 108/242 (44%), Positives = 157/242 (64%), Gaps = 6/242 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF EI IG+PPQ+F VI DTGSSNLWVPS++C SI+C+ HS+Y S+
Sbjct: 103 VPLSNYLNAQYFSEITIGTPPQSFKVILDTGSSNLWVPSTRCT-SIACFLHSKYDCEASS 161
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G +I YGSGS+ G S D V +GD+ ++D F E+T+E L F +FDGI+G
Sbjct: 162 SYKANGTEFQIRYGSGSLEGVISNDVVRIGDLEIRDTDFAESTKEPGLAFAFGKFDGILG 221
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA---EEGGEIVFGGVDPKHFKG 251
LG+ I+V VP + M+EQGL+ E VF+F+L ++ +GGE +FGG+D H++G
Sbjct: 222 LGYDTISVLHTVPPFYEMIEQGLLDEPVFAFYLGTSHESGVDNQGGEAIFGGIDEAHYEG 281
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y PV ++GYW+ L + G + + G A +D+GTSL+A PT IN ++G
Sbjct: 282 DIHYAPVRRRGYWEVALEGVRFGKEEMKLVNVGAA--IDTGTSLIALPTDTAEIINASLG 339
Query: 312 GE 313
+
Sbjct: 340 AK 341
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 58/87 (66%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+IPT+P+++FT K F ++ E YIL+ + CIS F D+PP G L
Sbjct: 346 GQYTVDCDKIPTLPDLTFTFAGKDFTITAEDYILQ----VQGTCISSFSGLDMPPNVGEL 401
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GD F+ ++TV+D G+ +GFA+A
Sbjct: 402 WIIGDTFLRKWYTVYDLGRNAVGFAKA 428
>gi|291223845|ref|XP_002731921.1| PREDICTED: expressed hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 959
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 110/235 (46%), Positives = 158/235 (67%), Gaps = 4/235 (1%)
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTY 136
++DA Y+GEIGIG+PP F V+FDTGSS LWVPS+ C S ++C FH+ Y + KS+TY
Sbjct: 634 NTYIDASYYGEIGIGTPPATFLVLFDTGSSYLWVPSAMCPESNMACAFHNSYDNLKSSTY 693
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T +S I YGSGS+SG S+D + +GDV +++Q+F E T + +LARFDGI+GLG
Sbjct: 694 TATRESFNITYGSGSVSGVISRDTIVIGDVRIENQLFGETTAWPDTSIVLARFDGILGLG 753
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ + +PV+DNM+ Q L+SE VFS ++ D + GE++ GG D H+ G+ TY+
Sbjct: 754 YPNLQTRSILPVFDNMLAQHLISEPVFSVYVRGDGNK---GELILGGSDQHHYSGEFTYL 810
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
PVT KGYWQF + I + ++ + C GC A+VD+GTS++AGP + +N IG
Sbjct: 811 PVTIKGYWQFTMDSIHVYDKPSQYCLDGCQAVVDTGTSVIAGPMEDIETLNTEIG 865
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/85 (38%), Positives = 51/85 (60%), Gaps = 2/85 (2%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ +I+C + ++P++SF +G K+F L P YI + G +E+C+S + GP+W
Sbjct: 873 QFVINCHLVDSLPDISFVLGGKLFALEPRDYIEQDNTGDSEICLSNLVGHG--NGIGPIW 930
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAE 504
ILG VF Y+ FD GK R+GFA
Sbjct: 931 ILGAVFTRKYYVEFDRGKDRVGFAN 955
>gi|366991455|ref|XP_003675493.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
gi|342301358|emb|CCC69126.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
Length = 406
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 163/273 (59%), Gaps = 4/273 (1%)
Query: 42 SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI 101
SL + ER S + D +PL N+++AQYF +I +G+PPQNF VI
Sbjct: 49 SLGHKYMNHFERANPEVSFSRDHPFFAEGDGHNVPLTNYLNAQYFADISVGTPPQNFKVI 108
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPSS+C S++C+ HS+Y S++Y G I YGSGS+ G+ SQD +
Sbjct: 109 LDTGSSNLWVPSSECN-SLACFLHSKYDHDASSSYKANGTKFAIQYGSGSLEGYISQDTL 167
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
+GD+ + Q F EAT E LTF +FDGI+GL + I+V VP + N +EQGL+ E+
Sbjct: 168 NIGDLTIPKQDFAEATSEPGLTFAFGKFDGILGLAYDTISVDKVVPPFYNAIEQGLLDEK 227
Query: 222 VFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
F+F+L + D + GGEI GG+D FKG ++PV +K YW+ + I +G+Q +
Sbjct: 228 KFAFYLGDTKKDEKNGGEITIGGIDESKFKGDIEWLPVRRKAYWEVKFEGIALGDQYAAL 287
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
G A +D+GTSL+ P+ + IN IG +
Sbjct: 288 ENHGAA--IDTGTSLITLPSGLAEIINTEIGAK 318
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD +P+++F K F +SP Y L+ ++ CIS M D P P GP+
Sbjct: 323 GQYTLDCDTRDGLPDLTFNFNGKNFTISPFDYTLE----VSGSCISAIMPMDFPEPMGPM 378
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D +G AEA
Sbjct: 379 AIVGDAFLRKYYSIYDLDNHAVGLAEA 405
>gi|206611|gb|AAA42031.1| renin [Rattus norvegicus]
Length = 352
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 185/312 (59%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
>gi|332024604|gb|EGI64802.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 361
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 124/292 (42%), Positives = 170/292 (58%), Gaps = 17/292 (5%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH ++AR + Y G S VR PL NF +AQY+G I IG+P Q
Sbjct: 3 RILLHKTSSARKSIGIDYRQGNLTSIVRE----------PLLNFRNAQYYGVISIGTPRQ 52
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F V+FDTGS+NLWVPS C I+C H +Y +R S TY G +I Y G++SG+
Sbjct: 53 RFKVLFDTGSANLWVPSVHCNLEDITCLSHRKYNNRTSRTYIPNGTLFDIQYEYGTLSGY 112
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V V + + +Q F EA E + FL A+FDGI+G+G+ I++ PV+ NMV+Q
Sbjct: 113 LSTDVVNVAGLNIINQTFGEAINEPGIAFLYAKFDGILGMGYPNISILGVTPVFTNMVQQ 172
Query: 216 GLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
GLVS +FSF+LNR+ D+ G ++ GG DP + G+ TYV VT KGYWQF + I +
Sbjct: 173 GLVSSPIFSFYLNRNLLDSSAGSVLILGGSDPALYDGELTYVNVTHKGYWQFTMDKIQME 232
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAECKLV 323
N++ +C GC AI D+G S LAGP + I I + GVV +C +
Sbjct: 233 NET--LCVNGCQAIADTGFSRLAGPPTDIAIITSRIAIDDFNGVVYVDCDQI 282
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/89 (41%), Positives = 50/89 (56%), Gaps = 3/89 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYIL--KTGEGIAEVCISGFMAFDLPPPRG 476
G +DCD+I +PNV+F + K F L+ E YI+ K + VC S F G
Sbjct: 273 GVVYVDCDQISNLPNVTFFLSGKPFVLTAEDYIIVRKIDKKGTPVCYSAF-EIAAQSEFG 331
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LGD F+G Y+T FD G R+GFA A
Sbjct: 332 IMWVLGDSFLGRYYTEFDMGNDRVGFAPA 360
>gi|311260416|ref|XP_003128442.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 394
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 193/331 (58%), Gaps = 28/331 (8%)
Query: 5 LLRSVFCLWVLASCLL------LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
++ ++ CL +L + ++ L + ++ GL + L H + A+
Sbjct: 9 MVVALVCLQLLEASVIKVPLKKLKSIRQAMKEKGLLEEFLKTHKYDPAQ----------- 57
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
R+R GD + P+ +++A YFGEI IG+PPQNF V+FDTGSSNLWVPS C
Sbjct: 58 -----RYRFGDFSVALEPMA-YLEAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCK- 110
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S++C H+R+ KS+TY+ ++ + YGSGS++GFF D +++ + V DQ F +
Sbjct: 111 SLACTTHARFNPSKSSTYSTDRQTFSLQYGSGSLTGFFGYDTLKIQSIQVPDQEFGLSET 170
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E +FL A+FDGI+GL + +++ G A ++++ ++ VFSF+L+ +++GGE
Sbjct: 171 EPGTSFLYAQFDGIMGLAYPDLSAGGATTAMQGLLQEDALTSPVFSFYLSNQQSSQDGGE 230
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVD + G+ + PVT++ YWQ + + LIG++++G C GC AIVD+GTSLL
Sbjct: 231 LVLGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGDEASGWCSEGCQAIVDTGTSLLTV 290
Query: 299 PTPVVTEINHAIGGE----GVVSAECKLVVS 325
P ++++ A G E G +CK + S
Sbjct: 291 PQDYLSDLVQATGAEENEYGEFLVDCKDIQS 321
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 52/92 (56%), Gaps = 5/92 (5%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++DC I ++P +F I F L P YIL+ +G C+ G +
Sbjct: 307 NEYGEFLVDCKDIQSLPTFTFIINGVEFPLPPSAYILEE-DGF---CMVGVEPTYVSSQN 362
Query: 476 G-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G PLWILGDVF+ Y++VFD G R+GFA AA
Sbjct: 363 GQPLWILGDVFLRSYYSVFDLGNNRVGFATAA 394
>gi|148747255|ref|NP_036774.4| renin precursor [Rattus norvegicus]
gi|1350571|sp|P08424.2|RENI_RAT RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|30027675|gb|AAP13916.1| renin [Rattus sp.]
gi|51261221|gb|AAH78878.1| Renin [Rattus norvegicus]
gi|149058615|gb|EDM09772.1| renin 1, isoform CRA_b [Rattus norvegicus]
Length = 402
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 185/312 (59%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C ++PT+P++SF +G + + LS Y+ K ++CI D+PPP GP+W+L
Sbjct: 318 VVNCSQVPTLPDISFYLGGRTYTLSNMDYVQKNPFRNDDLCILALQGLDIPPPTGPVWVL 377
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 378 GATFIRKFYTEFDRHNNRIGFALA 401
>gi|296219067|ref|XP_002755720.1| PREDICTED: cathepsin D [Callithrix jacchus]
Length = 392
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 176/296 (59%), Gaps = 27/296 (9%)
Query: 37 RLDLHSLNAARITRKERYMGG--------AGVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQEMPAMPGGPIPEILKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS--C 143
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C + + S + G C
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACSALGQGGRKWSQLCLDPGPPVPC 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ + ++ G V V+ QVF EAT++ +TF+ A+FDGI+G+ + I+V
Sbjct: 141 RSSLSASALGG-----------VKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVN 189
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
+ +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG Y+ VT+K Y
Sbjct: 190 NVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLFYLNVTRKAY 249
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
WQ + + + + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 250 WQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAMPLIQGE 304
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 64/94 (68%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
++P GE +I C+++ T+P + +G K + LSP+ Y LK + +C+SGFM D+P
Sbjct: 297 AMPLIQGEYMIPCEKVSTLPVIMLKLGGKDYELSPQDYTLKVSQAGKTICLSGFMGMDIP 356
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PP GPLWILGDVF+G Y+TVFD R+GFA+A
Sbjct: 357 PPSGPLWILGDVFIGRYYTVFDRDNNRVGFAQAT 390
>gi|126310959|ref|XP_001372683.1| PREDICTED: chymosin-like [Monodelphis domestica]
Length = 383
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 121/306 (39%), Positives = 178/306 (58%), Gaps = 17/306 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
L++LA ++ S RRI L K + L H L + + + +Y +
Sbjct: 5 LFLLA---VIAISECAFRRIPLTKGKTLRKVLKEHGLLESFL-KSHKYSPSSKYQLYGEA 60
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+DE PL N++D+QYFG+I IG+PPQ F+V+FDTGSSNLWVPS C S +C H
Sbjct: 61 AKVTDE---PLTNYLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN-SDACQNHH 116
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
R+ S T+ + I YG+GS+ G D V V +VV DQ+F +T+E F
Sbjct: 117 RFNPASSTTFRSTQEPLSIQYGTGSMEGVLGYDTVTVSQIVVPDQIFGLSTQEPGEIFTY 176
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+ FDGI+GLG+ +A A PV+DNM+ + LV++++FS +++RD +G ++ G +DP
Sbjct: 177 SEFDGILGLGYPSLAEDQATPVFDNMMNKNLVAQDLFSVYMSRD---SQGSMLILGAIDP 233
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++ G +VPVT++GYWQF + I + Q CEGGC AI+D+GTSLL GP+ + I
Sbjct: 234 SYYTGSLHWVPVTEQGYWQFSVDSITVNGQVVA-CEGGCQAILDTGTSLLVGPSYDIANI 292
Query: 307 NHAIGG 312
IG
Sbjct: 293 QSIIGA 298
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE I+C + +MP V I + + L P Y +G+ C SGF + L
Sbjct: 304 GEYDINCSNLSSMPTVVVHINGRQYPLPPSAYT-NQDQGL---CSSGFQS----EGSDQL 355
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ Y++VFD G R+G A A
Sbjct: 356 WILGDVFIREYYSVFDRGNNRVGLATA 382
>gi|73620985|sp|P81498.2|PEPC_SUNMU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C-1; Flags:
Precursor
gi|9798662|dbj|BAB11753.1| pepsinogen C [Suncus murinus]
Length = 389
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 109/266 (40%), Positives = 165/266 (62%), Gaps = 6/266 (2%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++ GD P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 53 KYHFGDFSVAYEPMA-YMDASYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 110
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ +S+TY+ G++ + YGSGS++GFF D + V ++ V Q F + E
Sbjct: 111 GHARFNPNQSSTYSTNGQTFSLQYGSGSLTGFFGYDTMTVQNIKVPHQEFGLSQNEPGTN 170
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +A+G A M+++G ++ VFSF+L+ ++ GG ++FGG
Sbjct: 171 FIYAQFDGIMGMAYPSLAMGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGG 230
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + + LIG Q+TG C+ GC AIVD+GTSLL P +
Sbjct: 231 VDNSLYTGQIFWAPVTQELYWQIGVEEFLIGGQATGWCQQGCQAIVDTGTSLLTVPQQFM 290
Query: 304 TEINHAIGGE----GVVSAECKLVVS 325
+ + A G + G ++ C + S
Sbjct: 291 SALQQATGAQQDQYGQLAVNCNSIQS 316
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ ++C+ I ++P ++F I F L P Y+L T
Sbjct: 287 QQFMSALQQATGAQQDQYGQLAVNCNSIQSLPTLTFIINGVQFPLPPSAYVLNTNG---- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCFLGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|494607|pdb|1SMR|A Chain A, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880102|pdb|1SMR|C Chain C, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880104|pdb|1SMR|E Chain E, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880106|pdb|1SMR|G Chain G, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
Length = 335
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 162/238 (68%), Gaps = 4/238 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 9 LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 68
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD+V VG + V Q F E T+ + F+LA+FDG++G+
Sbjct: 69 YMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTQLPLIPFMLAQFDGVLGM 127
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG DP+H++G Y
Sbjct: 128 GFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGSDPQHYQGDFHY 186
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
V ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT + I A+G +
Sbjct: 187 VSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGAK 243
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 247 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 306
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 307 VWVLGATFIRKFYTEFDRHNNRIGFALA 334
>gi|223468|prf||0807285A renin precursor
Length = 401
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 183/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+G AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD R+GFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRVGFALA 400
>gi|45643446|gb|AAS72876.1| aspartyl protease [Triatoma infestans]
Length = 387
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 108/232 (46%), Positives = 155/232 (66%), Gaps = 3/232 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N ++ QY+G + +G+PPQ +V+FDTGS+NLWVP + C S +C H+ Y ++S+TY
Sbjct: 63 LRNSLNTQYYGNVTLGTPPQELTVVFDTGSANLWVPLANCP-SFACIIHNTYDHKQSSTY 121
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GK+ INYG+GSI+G S D +++GD+ VK+Q+F EA + + F ++ DGI+GL
Sbjct: 122 QPNGKALRINYGTGSITGEMSSDVLQIGDLQVKNQLFGEAPQVSNSPFGRSKADGILGLA 181
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTY 255
F IA G A+P + NM++QGL+ + VFS +LNR+PD E GGEI+FGGVD K F K T
Sbjct: 182 FPPIAKGQAIPPFFNMIDQGLLDKPVFSVYLNRNPDEEVGGEIIFGGVDEKRFNKESLTT 241
Query: 256 VPVTKKGYWQFELGDILI-GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
VP+T YW F++ ++ G C+ GC A D+GTS + GPT V EI
Sbjct: 242 VPLTNPTYWMFKMDEVSTSGTNGKSWCQNGCRATADTGTSFIVGPTKEVAEI 293
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G + CD + +P+++F + K + L E Y+L+ E + CI GF + LP P
Sbjct: 304 GVGYVPCDELHKLPDITFHLNGKGYTLKAEDYVLEMTEAGEKACIVGFAS--LP---QPF 358
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y+T+F+ + FA
Sbjct: 359 WILGDVFLGKYYTIFNVEDRTVSFA 383
>gi|384485237|gb|EIE77417.1| hypothetical protein RO3G_02121 [Rhizopus delemar RA 99-880]
Length = 399
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 114/261 (43%), Positives = 162/261 (62%), Gaps = 10/261 (3%)
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
E +PL N+++AQY+GEI +G+PPQ FSV+FDTGSSN WVPS++C FS++C H RY +
Sbjct: 67 EHGVPLANYLNAQYYGEISLGTPPQIFSVVFDTGSSNTWVPSTRC-FSLACLTHRRYSAS 125
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S+TY G I YG+G++ G SQD + VG + + +Q F E+T E LTF+ A+FDG
Sbjct: 126 RSSTYVRNGTQFSITYGTGALQGVISQDTLRVGGIQIDNQQFAESTIEPGLTFIYAQFDG 185
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR---DPDAEEGGEIVFGGVDPKH 248
I GLG+ I+V VP + NMV + L+SE VFSFW+N + + GGEI FG +D
Sbjct: 186 IFGLGYDTISVQRVVPPFYNMVNRNLISESVFSFWINDINVQAENDIGGEIAFGEIDQTR 245
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G + PV +KGYW+ + + +G + V A +D+GTSL+ PT V EI+
Sbjct: 246 YTGDLIWSPVQRKGYWEIAIDNFRVG--ADPVNPSSLTAAIDTGTSLILVPTSVSIEIHA 303
Query: 309 AIG----GEGVVSAECKLVVS 325
+G G G+ C V S
Sbjct: 304 RLGAQLSGNGLYIFSCATVSS 324
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 33/68 (48%), Gaps = 7/68 (10%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G I C + ++P + T F L Y+++ I C SGF D+PPP GPL
Sbjct: 313 GLYIFSCATVSSLPEICVTFSGVDFCLQGPDYVIE----IDGQCYSGFGPLDIPPPAGPL 368
Query: 479 WILGDVFM 486
W+ VFM
Sbjct: 369 WV---VFM 373
>gi|327278828|ref|XP_003224162.1| PREDICTED: pepsin A-like isoform 2 [Anolis carolinensis]
Length = 386
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 171/287 (59%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L++ ++ L H L + + +G G+ + + PL+N+MD +Y G
Sbjct: 22 LKKTKSLRQNLKEHGLLEKYLQKHHHNLGSKYFPGLAN-----ENAAEPLENYMDIEYIG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F V+FDTGSSNLWVPS C S +C H+R+ + S+TY +S + Y
Sbjct: 77 TISIGTPAQQFVVLFDTGSSNLWVPSVYCS-SSACSNHNRFNPQDSSTYQATSQSVSVTY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++GF + D V+VG +VV +Q+F + T GS + + FDGI+GL F IA A
Sbjct: 136 GTGSMTGFLAYDTVQVGSIVVTNQIFGLSETEPGSFLYY-SPFDGILGLAFPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ
Sbjct: 195 PVFDNMMSEGLVSQDLFSVYLSS--DDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I + QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 253 TLDSITLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G +I C+ + ++P++ FTI F L YIL G C GF D+P G L
Sbjct: 303 GGYMISCNAMNSLPDIIFTINGIEFPLPASAYIL----GQNGYCTPGFEGIDIPTQSGEL 358
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y+ VFD ++G A A
Sbjct: 359 WILGDVFIRQYYCVFDRANNQVGLAPVA 386
>gi|194218271|ref|XP_001501895.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 115/252 (45%), Positives = 160/252 (63%), Gaps = 12/252 (4%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N+MD YFG I IG+P Q F+VIFDTGSSNLWVPS C S++C H+R+
Sbjct: 63 DTQPLENYMDEAYFGTISIGTPAQEFTVIFDTGSSNLWVPSIYCS-SLACSDHNRFNPED 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY +S I YG+GS++G D V VG + +Q+F + T GS + A FDG
Sbjct: 122 SSTYRATSESVSITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYY-APFDG 180
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+DP ++ G
Sbjct: 181 ILGLAYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDPSYYTG 238
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPV+ +GYWQ + + + +S C GGC AIVD+GTSLLAGPT + I +G
Sbjct: 239 SLHWVPVSNEGYWQITMDSVTVNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYLG 297
Query: 312 ------GEGVVS 317
GEGV+S
Sbjct: 298 FSEDSSGEGVIS 309
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + + SY+ DS GE +I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYLGFSEDS----SGEGVISCSSIYSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FT+ F L P YIL+ + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTLNGVEFPLRPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD +IG A A
Sbjct: 374 FDRANNQIGLASVA 387
>gi|73915318|gb|AAZ92540.1| aspartyl protease 1 [Coccidioides posadasii]
gi|73915320|gb|AAZ92541.1| aspartyl protease 1 [Coccidioides posadasii]
Length = 399
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 117/278 (42%), Positives = 166/278 (59%), Gaps = 11/278 (3%)
Query: 52 ERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
++Y G S + L D +D + + NF++AQYF EI IG+PPQNF V+ DTGSS
Sbjct: 48 QKYFGSLPSSQQQTVLSDEYSTTDGHNVLVDNFLNAQYFSEISIGNPPQNFKVVLDTGSS 107
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPSS+C SI+CY H++Y S S+TY + G I YGSGS+SGF SQD + +GD+
Sbjct: 108 NLWVPSSEC-GSIACYLHNKYDSSASSTYKKNGTEFAIRYGSGSLSGFVSQDTLRIGDLT 166
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
++ Q F EAT E L F RFDGI+GLG+ I+V VP + NM+ +GL+ E VF F+L
Sbjct: 167 IEGQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNKIVPPFYNMINEGLIDEPVFGFYL 226
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
+ FGGVD F G+ +P+ +K YW+ + I GN+ + + G
Sbjct: 227 GDTNKEGDDSYATFGGVDSSLFSGEMIKIPLRRKAYWEVDFDAIAFGNERAELEDTGI-- 284
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECK 321
I+D+GTSL+A P+ + +N IG + G + +C
Sbjct: 285 ILDTGTSLIALPSTLAELLNREIGAKKSWNGQYTVDCN 322
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC++ P++P+++FT+ F + P YIL+ + CIS FM D P P GPL
Sbjct: 315 GQYTVDCNKRPSLPDLTFTLSGHNFTIGPYDYILE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ ++T++D G +G A+A
Sbjct: 371 AILGDAFLRRFYTMYDLGNNLVGLAKA 397
>gi|195046656|ref|XP_001992194.1| GH24344 [Drosophila grimshawi]
gi|193893035|gb|EDV91901.1| GH24344 [Drosophila grimshawi]
Length = 373
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 115/259 (44%), Positives = 165/259 (63%), Gaps = 8/259 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSR 127
D+DE+ L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS+C+F I+C H++
Sbjct: 54 DADEE---LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSRCFFLDIACQNHNK 110
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y KS+TY G+S I YGSGS+SGF S D+V+V + +K Q F EAT E +F A
Sbjct: 111 YDHDKSSTYVANGESFSIQYGSGSLSGFLSTDDVDVSGLTIKSQTFAEATNEPGTSFNNA 170
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDP 246
+FDGI+G+ ++ I+ + VP + NMV QGLV + VFSF+L RD +GGE++FGG DP
Sbjct: 171 KFDGILGMAYQSISSDNVVPPFYNMVSQGLVDDSVFSFYLARDGTSTTDGGELIFGGSDP 230
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ G +YVP++++GYWQF + I Q+ G AI D+GTSLL + +
Sbjct: 231 AKYTGDLSYVPISEQGYWQFAVDSATIDGQTLGES---FQAIADTGTSLLVVSSDAYDIL 287
Query: 307 NHAIGGEGVVSAECKLVVS 325
N+ + + +C V S
Sbjct: 288 NNLLNVDEDGLVDCSTVDS 306
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 60/101 (59%), Gaps = 15/101 (14%)
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF- 466
++ ++L N + ++DC + +MP ++FTIG K + L P QYI+++ GE C SGF
Sbjct: 285 DILNNLLNVDEDGLVDCSTVDSMPVLTFTIGGKQYPLEPAQYIIQSDGE-----CQSGFE 339
Query: 467 -MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 340 YMGTDF-------WILGDVFIGQYYTEFDLGNNRIGFAPVA 373
>gi|327278826|ref|XP_003224161.1| PREDICTED: pepsin A-like isoform 1 [Anolis carolinensis]
Length = 387
Score = 221 bits (562), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 171/287 (59%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L++ ++ L H L + + +G G+ + + PL+N+MD +Y G
Sbjct: 22 LKKTKSLRQNLKEHGLLEKYLQKHHHNLGSKYFPGLAN-----ENAAEPLENYMDIEYIG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F V+FDTGSSNLWVPS C S +C H+R+ + S+TY +S + Y
Sbjct: 77 TISIGTPAQQFVVLFDTGSSNLWVPSVYCS-SSACSNHNRFNPQDSSTYQATSQSVSVTY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++GF + D V+VG +VV +Q+F + T GS + + FDGI+GL F IA A
Sbjct: 136 GTGSMTGFLAYDTVQVGSIVVTNQIFGLSETEPGSFLYY-SPFDGILGLAFPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ
Sbjct: 195 PVFDNMMSEGLVSQDLFSVYLSS--DDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I + QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 253 TLDSITLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G +I C+ + ++P++ FTI F L YI + G C GF D+P G L
Sbjct: 303 GGYMISCNAMNSLPDIIFTINGIEFPLPASAYIRQGQNG---YCTPGFEGIDIPTQSGEL 359
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y+ VFD ++G A A
Sbjct: 360 WILGDVFIRQYYCVFDRANNQVGLAPVA 387
>gi|392575952|gb|EIW69084.1| hypothetical protein TREMEDRAFT_39371 [Tremella mesenterica DSM
1558]
Length = 446
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 120/268 (44%), Positives = 162/268 (60%), Gaps = 15/268 (5%)
Query: 68 GDSDEDIL------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
GDS++ +L PL ++M+AQY+ I IG+PPQ F V+ DTGSSNLWVPSS C SI+
Sbjct: 112 GDSEKRVLKGGHGVPLSDYMNAQYYAPITIGTPPQEFKVVLDTGSSNLWVPSSSCT-SIA 170
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ HS+Y S S+TY G I YGSGS+ GF S D V + D+ +K Q F EAT+E
Sbjct: 171 CFLHSKYDSSASSTYKANGSDFAIRYGSGSLEGFVSSDTVTIADLSLKHQDFAEATKEPG 230
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F +FDGI+GL + I+V VP + M+ +GL+ E VFSF L D + +GGE +F
Sbjct: 231 LAFAFGKFDGIMGLAYDTISVNHIVPPFYTMLNRGLLDEPVFSFRLGSDEN--DGGECIF 288
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD + GK YVP+ +KGYW+ EL I G + + G A +D+GTSL+ P+
Sbjct: 289 GGVDDSAYTGKIQYVPIRRKGYWEVELEKIGFGEEELELENTGAA--IDTGTSLIVMPSD 346
Query: 302 VVTEINHAIGG----EGVVSAECKLVVS 325
V +N IG G + +C V S
Sbjct: 347 VAEMLNKEIGATKSWNGQYTVDCNTVPS 374
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++P +S T+G + L E Y+L G CIS F D+P P GPL
Sbjct: 363 GQYTVDCNTVPSLPELSLTMGGIDWVLKGEDYVLNAGG----TCISSFTGMDIPAPIGPL 418
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ TV+D G+ +GFA A
Sbjct: 419 WIVGDVFLRKVVTVYDLGRNAVGFAAA 445
>gi|327279867|ref|XP_003224677.1| PREDICTED: cathepsin E-A-like [Anolis carolinensis]
Length = 406
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 162/253 (64%), Gaps = 6/253 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+ +Y+GE+ IG+P Q F+VIFDTGS++ WVPS+ C S +C H ++K+ S +Y
Sbjct: 73 LCDYMNTEYYGEVSIGTPAQKFTVIFDTGSADFWVPSAYC-ISDACELHQKFKAFSSESY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D V++G++ ++DQ F E+ E +TF A FDG++GLG
Sbjct: 132 AHGGQKFTLQYGTGRLMGIVAKDKVQIGNITIEDQAFGESVFEPGMTFAFAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ ++V +++PV+DN+++Q LV E +FSF LNR+ + + GG ++ GG+D F G +
Sbjct: 192 YPTLSVTNSMPVFDNIIKQHLVEEPLFSFSLNREHNVDNGGVLILGGIDHSLFTGPIHWF 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVTKKGYWQ + + I Q T C GC AIVDSGTSL+ GP + + +IG
Sbjct: 252 PVTKKGYWQIHMNSVKIQGQVTS-CISGCEAIVDSGTSLITGPLSQIVRLQQSIGAFPTA 310
Query: 317 SAE----CKLVVS 325
+ E C+ V S
Sbjct: 311 TGEFLVDCRRVSS 323
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 63/94 (67%)
Query: 413 SLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLP 472
+ P GE ++DC R+ ++P V+F+IG++ F L+ E YI+K +G +C+SGF A D+
Sbjct: 306 AFPTATGEFLVDCRRVSSLPPVTFSIGEREFTLTAENYIIKEFDGKENLCLSGFQAQDIS 365
Query: 473 PPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLWILGDVFM ++ VFD G R+GFA+ A
Sbjct: 366 SHNMPLWILGDVFMSAFYCVFDRGNDRVGFAKPA 399
>gi|430811193|emb|CCJ31368.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 411
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 135/332 (40%), Positives = 183/332 (55%), Gaps = 35/332 (10%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKK-----RRLDLHS-LNAARITRKERYMGGA 58
++ + L+VL C S GL R+ L+K R +H+ + A + RK
Sbjct: 1 MVSIAYWLYVLFVCQT--GVSRGLHRLELRKIPGDHRVNKVHNDIEAYSLARKYTLFYSY 58
Query: 59 GVSGVRHR--------LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI-FDTGSSNL 109
G +++ LG + ++ L NF +AQ +I IG+PPQ F V+ DTGSSNL
Sbjct: 59 GRDERKNKEPIIHGKPLGTNAHEV-SLTNFFNAQCRIDITIGTPPQTFKVVVLDTGSSNL 117
Query: 110 WVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
WVPSSKC S++C HS+Y S S+TY G EI YGSGSISGF S D V D+V+
Sbjct: 118 WVPSSKCT-SLACIIHSKYDSSLSSTYIANGSKFEIRYGSGSISGFISTDKFSVSDIVLP 176
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
Q F EA E TF RFDGI+GLG+ IAV +P + NMVEQ ++E VF+FW+
Sbjct: 177 AQEFAEAMSEPGFTFTFGRFDGILGLGYSSIAVNGIIPPFYNMVEQNAINEPVFAFWMGN 236
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ---------FELGDILIGNQSTGV 280
EGGE FGG+DP H++G TY+PV +K YW+ F G IG ++ G
Sbjct: 237 IEKDIEGGECTFGGIDPMHYEGDLTYIPVRRKAYWEAFCLVDLSFFAYGKDFIGMENVG- 295
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
AI+D+GTSL+ P + +N+AIG
Sbjct: 296 ------AILDTGTSLIVMPKNIADLLNNAIGA 321
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/88 (45%), Positives = 61/88 (69%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+DC++IPT+P+++F G F+L P +YI+K I C++ F D+PPP GPL
Sbjct: 327 GDYILDCNKIPTLPDITFGFGHHNFSLGPNEYIIK----IQSKCMTTFTGMDIPPPAGPL 382
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WI+GDVF+ Y++V+D GK +G A+A
Sbjct: 383 WIIGDVFLRKYYSVYDLGKNMVGLAKAT 410
>gi|322708430|gb|EFZ00008.1| vacuolar protease A [Metarhizium anisopliae ARSEF 23]
Length = 395
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/264 (45%), Positives = 168/264 (63%), Gaps = 11/264 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS C SI+CY HS Y S S+
Sbjct: 75 VPVSNFMNAQYFSEITVGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACYLHSTYDSSSSS 133
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSGS+SGF SQD V +GD+ +KDQ F EAT E L F +FDGI+G
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVTIGDLKIKDQDFAEATSEPGLAFAFGKFDGILG 193
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ ++V VP + M+ Q L+ E VF+F+L +EEG E VFGG+D H+ GK
Sbjct: 194 LGYDTLSVNKIVPPFYQMINQKLLDEPVFAFYLG---SSEEGSEAVFGGIDKDHYTGKIE 250
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+P+ +K YW+ ++ I G+ + G AI+D+GTSL P+ + +N IG +
Sbjct: 251 YIPLRRKAYWEVDIHSIAFGDDVAELDRTG--AILDTGTSLNVLPSTLAELLNKEIGAKK 308
Query: 314 ---GVVSAECKLVVSQYGDLIWDL 334
G + +C + S D++++L
Sbjct: 309 SWNGQYTVDCAQIKS-LPDIVFNL 331
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC +I ++P++ F + ++L YIL+ + CIS F D+P P GPL
Sbjct: 312 GQYTVDCAQIKSLPDIVFNLAGSNYSLPASDYILE----LQGTCISTFQGMDIPEPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++++D G+ +G A +
Sbjct: 368 IILGDAFLRRYYSIYDLGRNAVGLARS 394
>gi|307167891|gb|EFN61280.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 431
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/363 (34%), Positives = 186/363 (51%), Gaps = 61/363 (16%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+F L+++A+ L + I + +R+ LH ++ R + + G+ +
Sbjct: 1 MFRLFLMATALFV--------LIDAQLQRIQLHKMDPIR-----KRLRKIGIDLQQINFT 47
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSR 127
S+ L N++D++Y+G I IG+PPQ F V+FDTGSSNLW+PS C + ++C H++
Sbjct: 48 KSNPSSQSLYNYLDSEYYGNITIGTPPQQFKVLFDTGSSNLWIPSILCSTANVACALHNK 107
Query: 128 YKSRKSNTYTEIGKSCEINY-------GSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
Y S KS TY C + Y SGS+SGF S D V V + V+ Q F EA E
Sbjct: 108 YDSTKSRTYKVNNTICSLQYDITSIPFNSGSVSGFLSTDVVNVAGLNVQGQTFAEAIDEL 167
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN------------ 228
L ++A FDGI+G+G+ IAV PV+ N+++Q LV + VFSF+LN
Sbjct: 168 VLALVVAEFDGILGMGYSTIAVDGVTPVFYNLIKQKLVPQPVFSFYLNRHVFSYSIFKSI 227
Query: 229 ------------------------RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
RDP A+ GGE++ GG DP ++ G YV VTKKGYW
Sbjct: 228 SNKYIYNKKKYIYIAILKRIYNVYRDPSAKVGGELILGGSDPAYYTGHFKYVDVTKKGYW 287
Query: 265 QFELGDILIG----NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
QF + + I N+ +C GGC AI D+G SL+ GPT + IN IG +
Sbjct: 288 QFLMDRVRITRTKFNKGRTLCMGGCQAIADTGMSLIVGPTSEIDIINKYIGANKTTDSSG 347
Query: 321 KLV 323
++
Sbjct: 348 NII 350
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/98 (41%), Positives = 58/98 (59%), Gaps = 7/98 (7%)
Query: 408 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 467
N+ DS N + ++++C+ I +P + F +G K F L+ YILK E C SGF+
Sbjct: 340 NKTTDSSGNII--NVVNCNTIHKLPIIRFILGGKRFPLNSNNYILKNTEYGITTCTSGFV 397
Query: 468 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
PLWILGDVF+G Y+T FD GK R+GFA++
Sbjct: 398 G-----SNSPLWILGDVFIGRYYTEFDLGKNRVGFAQS 430
>gi|14193251|gb|AAK55849.1|AF266465_1 aspartic protease [Manihot esculenta]
Length = 159
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/159 (64%), Positives = 124/159 (77%), Gaps = 3/159 (1%)
Query: 351 CAFNGAEYVSTGIKTVVEKENVSAGDS---AVCSACEMAVVWVQNQLKQKQTKEKVLSYI 407
C F+G+ VS I++VV + + S A+CS CEMAV+W+QNQLKQ T E++L+Y
Sbjct: 1 CTFDGSRGVSMTIESVVNENSQEVAGSLHDAMCSTCEMAVIWMQNQLKQNATLERILNYA 60
Query: 408 NELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFM 467
NELC+ LP+PMGES +DC + TMPNVSFTIG K+F+LSPEQY+LK GEG A CISGF
Sbjct: 61 NELCERLPSPMGESAVDCGSLSTMPNVSFTIGGKVFDLSPEQYVLKVGEGEAAQCISGFT 120
Query: 468 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
A D+PPPRGPLWILGDVFMG +HTVFD G LR+GFAEAA
Sbjct: 121 ALDVPPPRGPLWILGDVFMGRFHTVFDYGNLRVGFAEAA 159
>gi|226288833|gb|EEH44345.1| vacuolar protease A [Paracoccidioides brasiliensis Pb18]
Length = 400
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 119/291 (40%), Positives = 174/291 (59%), Gaps = 10/291 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPLKNFMDA 83
L +I L ++ LD ++ ++YMG D+ + + + NF++A
Sbjct: 26 LNKISLSQQ-LDHANIETQVKALGQKYMGVRPSQHFNEMFKDTSKASGGHSVLVDNFLNA 84
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EI IG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H++Y S S+T+ + G
Sbjct: 85 QYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSSTHRKNGTEF 143
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS+SGF SQD V +GD+ V +Q F EAT E L F RFDGI+GLG+ I+V
Sbjct: 144 AIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILGLGYDTISVN 203
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VP++ M+ Q L+ VF F+L N D D ++ E FGG+D HF G+ T + + ++
Sbjct: 204 HIVPLFYQMINQKLLDMPVFGFYLGNSDVDGDD-SEATFGGIDESHFTGELTTISLRRRA 262
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YW+ +L I+ GN+ + G I+D+GTSLLA P+ + +N IG +
Sbjct: 263 YWEVDLDAIIFGNEMAELENTGV--ILDTGTSLLALPSTIAELLNKQIGAK 311
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + T P+++FT+ F + YIL+ + CIS FM D P P GPL
Sbjct: 316 GQYTVDCTKRSTFPDITFTLAGHNFTIGSYDYILE----VQGSCISSFMGMDFPEPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +IG A+A
Sbjct: 372 AILGDAFLRRWYSVYDLGNHQIGLAKA 398
>gi|119187279|ref|XP_001244246.1| hypothetical protein CIMG_03687 [Coccidioides immitis RS]
gi|303317132|ref|XP_003068568.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|6760077|gb|AAF28186.1|AF162132_1 aspartyl proteinase [Coccidioides posadasii]
gi|240108249|gb|EER26423.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|392870962|gb|EAS32810.2| vacuolar protease A [Coccidioides immitis RS]
Length = 399
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/249 (44%), Positives = 155/249 (62%), Gaps = 7/249 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQNF V+ DTGSSNLWVPSS+C SI+CY H++Y S S+TY
Sbjct: 77 VDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSEC-GSIACYLHNKYDSSASSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD + +GD+ ++ Q F EAT E L F RFDGI+GLG
Sbjct: 136 KKNGTEFAIRYGSGSLSGFVSQDTLRIGDLTIEGQDFAEATNEPGLAFAFGRFDGILGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ +GL+ E VF F+L + FGGVD F G+ +
Sbjct: 196 YDTISVNKIVPPFYNMINEGLIDEPVFGFYLGDTNKEGDDSYATFGGVDSSLFSGEMIKI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ + I GN+ + + G I+D+GTSL+A P+ + +N IG +
Sbjct: 256 PLRRKAYWEVDFDAIAFGNERAELEDTGI--ILDTGTSLIALPSTLAELLNREIGAKKSW 313
Query: 314 -GVVSAECK 321
G + +C
Sbjct: 314 NGQYTVDCN 322
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC++ P++P+++FT+ F + P YIL+ + CIS FM D P P GPL
Sbjct: 315 GQYTVDCNKRPSLPDLTFTLSGHNFTIGPYDYILE----VQGSCISSFMGMDFPEPVGPL 370
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ ++T++D G +G A+A
Sbjct: 371 AILGDAFLRRFYTMYDLGNNLVGLAKA 397
>gi|200702|gb|AAA40050.1| renin [Mus musculus]
Length = 401
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/309 (40%), Positives = 183/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVMFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+G AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 313 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPNRRDKLCTVALHAMDIPPPTGP 372
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD R+GFA A
Sbjct: 373 VWVLGATFIRKFYTEFDRHNNRVGFALA 400
>gi|149058614|gb|EDM09771.1| renin 1, isoform CRA_a [Rattus norvegicus]
Length = 366
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/236 (46%), Positives = 158/236 (66%), Gaps = 4/236 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC +C H+ Y S +S++
Sbjct: 40 LTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSESSS 99
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD V VG ++V Q F E T + F+LA+FDG++G+
Sbjct: 100 YMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVLGM 158
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+V GG DP+H++G Y
Sbjct: 159 GFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEVVLGGSDPQHYQGNFHY 217
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
V ++K G WQ + + +G +T +CE GC A+VD+GTS ++GPT + I A+G
Sbjct: 218 VSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALG 272
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C ++PT+P++SF +G + + LS Y+ K ++CI D+PPP GP+W+L
Sbjct: 282 VVNCSQVPTLPDISFYLGGRTYTLSNMDYVQKNPFRNDDLCILALQGLDIPPPTGPVWVL 341
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 342 GATFIRKFYTEFDRHNNRIGFALA 365
>gi|258563860|ref|XP_002582675.1| vacuolar protease A [Uncinocarpus reesii 1704]
gi|237908182|gb|EEP82583.1| vacuolar protease A [Uncinocarpus reesii 1704]
Length = 400
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/248 (45%), Positives = 152/248 (61%), Gaps = 7/248 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQNF V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 76 VDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSQCG-SIACFLHSKYDSSASSTY 134
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD + +GD+VVK+Q F EAT E L F RFDGI+GLG
Sbjct: 135 KKNGTEFSIRYGSGSLSGFVSQDTLRIGDLVVKEQDFAEATNEPGLAFAFGRFDGILGLG 194
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ Q L+ E VF F+L + FGGVD F +
Sbjct: 195 YDTISVNKIVPPFYNMLNQKLIDEPVFGFYLGDTNKEGDDSYATFGGVDDSLFSDDMIKI 254
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ + + GN + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 PLRRKAYWEVDFDAVTFGNDRAELENTGI--ILDTGTSLIALPSTLAELLNKEIGAKKSW 312
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 313 NGQYTVEC 320
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 55/87 (63%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD+ P++P+++FT+ F + P YIL+ + CIS FM D P P GPL
Sbjct: 314 GQYTVECDKRPSLPDLTFTLSGHNFTIGPNDYILE----VQGSCISSFMGMDFPEPVGPL 369
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ ++T++D G ++G A+A
Sbjct: 370 AILGDAFLRRFYTMYDLGNNQVGLAKA 396
>gi|291409620|ref|XP_002721076.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 172/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|871442|emb|CAA25391.1| renin [Mus musculus]
Length = 387
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/240 (46%), Positives = 159/240 (66%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 58 LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 117
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDV--VVKDQVFIEATREGSLTFLLARFDGII 193
Y E G I+YGSG + GF SQD+V V V + Q F E T + F+LA+FDG++
Sbjct: 118 YMENGSDFTIHYGSGRVKGFLSQDSVTVSRVGGITVTQTFGEVTELPLIPFMLAKFDGVL 177
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+GF AVG PV+D+++ QG++ EEVFS + NR GGE+V GG DP+H++G
Sbjct: 178 GMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL-LGGEVVLGGSDPQHYQGNF 236
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YV ++K WQ + + +G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 237 HYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 295
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP+W
Sbjct: 301 EYVVNCSQVPTLPDISFDLGGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDIPPPTGPVW 360
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 361 VLGATFIRKFYTEFDRHNNRIGFALA 386
>gi|291409605|ref|XP_002721070.1| PREDICTED: pepsin II-1-like [Oryctolagus cuniculus]
Length = 387
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/247 (43%), Positives = 159/247 (64%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H R+ S+T+
Sbjct: 67 LENYLDTEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACILHKRFNPDDSSTF 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E L L+A FDGI+GL
Sbjct: 126 QATSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLFLLVAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM QGLVS+++FS +L+ D ++G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNQGLVSQDLFSVYLSS--DEQKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I + + T C C A+VD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSHEGYWQITVDSITMDGE-TIACADSCQAVVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE ++S
Sbjct: 303 LGENIIS 309
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 66/131 (50%), Gaps = 6/131 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ AVV L T +S I + N +GE+II C I ++P++
Sbjct: 262 GETIACADSCQAVVDTGTSLLAGPTS--AISNIQSYIGASKNLLGENIISCSAIDSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + L YILK + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINNVQYPLPASAYILKEDDD----CISGFEGMNLDTSYGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEA 505
++G A A
Sbjct: 376 RANNQVGLAAA 386
>gi|223891|prf||1004236A renin
Length = 336
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 161/238 (67%), Gaps = 4/238 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 12 LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 71
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD+V VG + V Q F E T + F+LA+FDG++G+
Sbjct: 72 YMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPFMLAQFDGVLGM 130
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG DP+H++G Y
Sbjct: 131 GFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGSDPEHYQGDFGY 189
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
V ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT + I A+G +
Sbjct: 190 VSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGAK 246
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%), Gaps = 2/88 (2%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
+ E ++ C ++PT+P++SF +G + + LS Y+L+ ++C A D+PPP GP
Sbjct: 250 LHEYVVSCSQVPTLPDISFNLGGRAYTLSSTDYVLQYPND--KLCTVALHAMDIPPPTGP 307
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+W+LG F+ ++T FD RIGFA A
Sbjct: 308 VWVLGATFIRKFYTEFDRHNNRIGFALA 335
>gi|348502999|ref|XP_003439054.1| PREDICTED: renin-like [Oreochromis niloticus]
Length = 396
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 180/310 (58%), Gaps = 21/310 (6%)
Query: 13 WV-LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV-----RHR 66
W+ L + L +S+ LRRI LKK + + R T +E G V V +
Sbjct: 8 WIYLVALSLTVTTSHALRRIALKK-------MPSIRETLQEL---GVSVEQVMTELAQKS 57
Query: 67 LGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCY 123
+ D++ +P L N++D QYFGEI IGSP Q F+V+FDTGS+NLWVPS C FS +C+
Sbjct: 58 IADTNNGTVPTPLTNYLDTQYFGEISIGSPAQMFNVVFDTGSANLWVPSQSCSPFSTACF 117
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+RY + KS TY E G I Y SG++ GF S+D V V + QVF EAT ++
Sbjct: 118 THNRYDASKSRTYIENGTGFSIKYASGNVRGFLSEDVVVV-GGIPVVQVFAEATALSAMP 176
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDG++G+G+ +A+ PV+D ++ Q ++ EEVFS + +RDP GGE+V GG
Sbjct: 177 FIFAKFDGVLGMGYPNVAIDGITPVFDRIMSQHVLKEEVFSIYYSRDPKRSPGGELVLGG 236
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP ++ G Y+ + G W+ + + +G + C GC A++D+G+S + GP V
Sbjct: 237 TDPNYYTGSFNYINTRQTGKWELTMKGVSVGREMM-FCAEGCTAVIDTGSSYITGPASSV 295
Query: 304 TEINHAIGGE 313
+ + IG +
Sbjct: 296 SVLMKTIGAQ 305
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 53/83 (63%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
++CD + T+P+V+F +G + ++L+ E YIL + EVC F D+PPP GP+WILG
Sbjct: 313 VNCDTVKTLPSVTFHLGGQEYSLTQEDYILWQSQIEGEVCTVTFRGLDVPPPTGPIWILG 372
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
F+ Y+T FD RIGFA A
Sbjct: 373 ANFIARYYTEFDRRNNRIGFATA 395
>gi|331215715|ref|XP_003320537.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
gi|309299527|gb|EFP76118.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 430
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 195/364 (53%), Gaps = 43/364 (11%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLD--------LHSLNAARITRKERYM 55
K +V + LA+ +P+++ G R+ L K + L++L + ++Y
Sbjct: 2 KSTSAVVAITALAAVASIPSATAGKHRMKLHKMPITSSANSQTILNNLQSQTAWVSQKYF 61
Query: 56 GGAGVSGVR-----HRLGDSDE--DI----------------LPLKNFMDAQYFGEIGIG 92
G + + H L E D+ +PL N+++AQYF EI +G
Sbjct: 62 GVDDTASEKKFRYGHALKQPKEGDDVSIQMIEEAELASAGHEVPLSNYLNAQYFSEISLG 121
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
+PPQ+F V+ DTGSSNLWVPS++C SI+C+ HS+Y S TY G +I YGSGS+
Sbjct: 122 TPPQSFKVVLDTGSSNLWVPSTRCT-SIACFLHSKYDCEASETYQANGTEFKIRYGSGSL 180
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
G S D + +GD+ V D F E+T+E L F +FDGI GLG+ I+V VP + M
Sbjct: 181 EGVISNDVLTIGDLTVPDVDFAESTKEPGLAFAFGKFDGIFGLGYDTISVLHTVPPFYKM 240
Query: 213 VEQGLVSEEVFSFWL------NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
+E G++ + VF+F+L DP+ GGE+VFGGVD H++G+ Y PV ++GYW+
Sbjct: 241 MENGMLDDPVFAFYLGSAQGNKADPN---GGEVVFGGVDEAHYEGEIFYAPVRRRGYWEV 297
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
EL + G + + G A +D+GTSL+A PT IN IG S + + S+
Sbjct: 298 ELKSVKFGKEEMKLHNVGAA--IDTGTSLIALPTDTAEIINAEIGATKSWSGQYTVDCSR 355
Query: 327 YGDL 330
+L
Sbjct: 356 IPEL 359
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 37/87 (42%), Positives = 58/87 (66%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC RIP +P+++F G K F ++ E YIL+ ++ C+S F D+PP G L
Sbjct: 347 GQYTVDCSRIPELPDLTFNFGGKEFTITGEDYILQ----VSGTCVSAFTGLDMPPNIGEL 402
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ ++TV+D G+ +GFA+A
Sbjct: 403 WIVGDVFLRKWYTVYDWGRDAVGFAKA 429
>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
Length = 379
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 182/300 (60%), Gaps = 14/300 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
VL S LLL ++ L ++ + K + N R K +Y GG + +R D
Sbjct: 7 VLWSALLLAEAT--LVQVPITKVKETKSKANEIR-KLKAKY-GGTPKAEIR------DLV 56
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRK 132
+ L N++D Y+G+I IG+P Q F V+FDTGSSNLWVP + C + +C H+ Y
Sbjct: 57 VEKLFNYVDDSYYGKITIGTPGQEFLVLFDTGSSNLWVPVAPCSADNAACENHNTYDPSA 116
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+T+ + G+S I YGSGS+SG+ +D V+V + +K QVF AT E TF+ A FDGI
Sbjct: 117 SSTHVKKGESFSIQYGSGSLSGYLVEDTVDVEGLKIKKQVFAAATNEPGETFVYAPFDGI 176
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+GF+ IAV D P W NM+ Q L+SE+VFSF+L R ++EGG +V GG D ++++G
Sbjct: 177 MGMGFKSIAVDDVTPPWYNMISQHLISEKVFSFYLARRGTSDEGGVMVVGGNDDRYYEGD 236
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV+++GYWQFE+ + + +C+ C AI D+GTSL+A PT EI IG
Sbjct: 237 FHYVPVSEQGYWQFEMAEAHV--NGVRICD-RCQAIADTGTSLIAVPTDKYEEIQKEIGA 293
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 8/86 (9%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E ++DC +I +P V+F +GD F L Y++K+ + C S F W
Sbjct: 301 EYMLDCSKIDDLPVVTFRLGDGTFTLEGRDYVIKSDDN---QCSSAF-----EDGGTDFW 352
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGDVF+G Y+T FD+ R+GFA A
Sbjct: 353 ILGDVFIGKYYTTFDAEHNRVGFALA 378
>gi|195399279|ref|XP_002058248.1| GJ15983 [Drosophila virilis]
gi|194150672|gb|EDW66356.1| GJ15983 [Drosophila virilis]
Length = 372
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/249 (44%), Positives = 155/249 (62%), Gaps = 4/249 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N M+ Y+G I IG+PPQ+F V+FD+GSSNLWVPS C S +C H++Y S S+TY
Sbjct: 61 LSNSMNMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSKTCS-SYACEVHNQYDSSASSTY 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SG + D V V + V+ Q F EAT E F A FDGI+G+G
Sbjct: 120 QANGESFSIQYGTGSLSGILATDIVNVNGLSVESQTFAEATNEPGTNFNDANFDGILGMG 179
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
++ IA + VP + NMV QGLV + VFSF+L RD + +GGE++FGG D + G TYV
Sbjct: 180 YQSIAQDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTYV 239
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P++++GYWQF + I QS +C+ C AI D+GTSL+ P ++N + +
Sbjct: 240 PISEQGYWQFTMAGASIDGQS--LCD-NCQAIADTGTSLIVAPANAYMQLNDILNVDDQG 296
Query: 317 SAECKLVVS 325
+C V S
Sbjct: 297 LVDCSSVSS 305
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 15/90 (16%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF--MAFDLPPPRG 476
+ ++DC + +MP ++F IG F+L P QYI+++ GE C S F M D
Sbjct: 295 QGLVDCSSVSSMPVITFNIGGTNFDLEPAQYIIQSDGE-----CQSSFEYMGTDF----- 344
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+G Y+T FD G RIGFA A
Sbjct: 345 --WILGDVFIGQYYTEFDLGNNRIGFAPVA 372
>gi|206609|gb|AAA42030.1| preprorenin (EC 3.4.99.19) [Rattus norvegicus]
Length = 402
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 184/312 (58%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSHEVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C ++PT+P++SF +G + + LS Y+ K ++CI D+PPP GP+W+L
Sbjct: 318 VVNCSQVPTLPDISFYLGGRTYTLSNMDYVQKNPFRNDDLCILALQGLDIPPPTGPVWVL 377
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 378 GATFIRKFYTEFDRHNNRIGFALA 401
>gi|328771090|gb|EGF81130.1| hypothetical protein BATDEDRAFT_16209 [Batrachochytrium
dendrobatidis JAM81]
Length = 400
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 142/370 (38%), Positives = 201/370 (54%), Gaps = 29/370 (7%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNA----ARITRKERYMGGAGVSGVRH 65
+W++A+ ++ A I LKKR +LNA + R + S ++
Sbjct: 4 LLVWLVAAASVVSAHKG--NTIKLKKRPHTQDTLNALFSNVQSVYSNRLAFQSETSEDQY 61
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
LG E +PL +F +AQYFGEI IG+PPQ F+VIFDTGSSNLWVPS++C SI+C+ H
Sbjct: 62 ILGGGAEHSVPLTDFANAQYFGEIQIGTPPQPFTVIFDTGSSNLWVPSTRCS-SIACWMH 120
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY + +S+TY G I YG+G++ G SQD V +G + +++Q F E+ +E +TF
Sbjct: 121 RRYDASESSTYVNNGTEFAIQYGTGALEGVISQDTVTIGGLTIENQGFGESVKEPGITFA 180
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
+ RFDGI+GLGF I+V VP N++ + +F WL EEGGEIVFG V+
Sbjct: 181 VGRFDGILGLGFDTISVQKVVPPMYNLINNHQLDTPLFGVWLGSS-SGEEGGEIVFGAVN 239
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
HFKG T+VPV +K YW+ EL + IG + + A +D+G+SL A P
Sbjct: 240 HDHFKGAVTWVPVVRKAYWEVELEGVTIGGKKLAIKS--SRAAIDTGSSLFALPVAEADA 297
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKT 365
IN +GG+ + G I D LPE Q F G ++V TG
Sbjct: 298 INGILGGKK----------NWNGQFIVDCATIDSLPELTLQ------FGGQKFVITGSDY 341
Query: 366 VVEKENVSAG 375
+++ VSAG
Sbjct: 342 ILQ---VSAG 348
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 75/143 (52%), Gaps = 5/143 (3%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
VE E V+ G + A + + L E IN + N G+ I+DC
Sbjct: 260 VELEGVTIGGKKLAIKSSRAAIDTGSSLFALPVAEA--DAINGILGGKKNWNGQFIVDCA 317
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKTGEGI---AEVCISGFMAFDLPPPRGPLWILGD 483
I ++P ++ G + F ++ YIL+ G + CISGFM D+P P GPLWI+GD
Sbjct: 318 TIDSLPELTLQFGGQKFVITGSDYILQVSAGPVGGGDQCISGFMGLDIPAPAGPLWIVGD 377
Query: 484 VFMGVYHTVFDSGKLRIGFAEAA 506
VF+ ++T++D G R+GFAEAA
Sbjct: 378 VFLRKFYTIYDVGNARVGFAEAA 400
>gi|198451348|ref|XP_001358330.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
gi|198131448|gb|EAL27468.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
Length = 393
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 174/295 (58%), Gaps = 10/295 (3%)
Query: 43 LNAARITRKERYMGGAGVSGVRHRLGDSDEDIL------PLKNFMDAQYFGEIGIGSPPQ 96
L A+ + + ++ G + ++ L +S+ PL N ++ +Y G I IG+P Q
Sbjct: 31 LQASFMATRRQHRAGKQLLYAKYNLANSEASQSSGGASEPLDNRLNLEYAGPISIGTPRQ 90
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+++FDTGS+NLWVPS++C +++C H RY + S+++ G+ I YG+GS+SG
Sbjct: 91 PFNMLFDTGSANLWVPSAECSARNVACQHHHRYNASASSSHVPDGRRFAIAYGTGSLSGR 150
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
+QD V VG +VV++Q F A E TF+ F GI+GL FR IA A P++ NM +Q
Sbjct: 151 LAQDTVSVGRLVVQNQTFGMAIHEPGSTFVDTNFAGIVGLAFRSIAEQQATPLFQNMCDQ 210
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ + +
Sbjct: 211 GLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVG 270
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 271 KTI---SQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 55/101 (54%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTG-EGIAEVCISG 465
IN L LP GE ++ C I +P V F IG + F L P Y+++ + + VC+S
Sbjct: 298 INSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGLQPRDYVMQVANDDGSSVCLSA 357
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD + RIGFA AA
Sbjct: 358 FTLMD-----ADFWILGDVFIGRYYTAFDVAQRRIGFAPAA 393
>gi|57046|emb|CAA30082.1| unnamed protein product [Rattus norvegicus]
Length = 402
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 184/312 (58%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF V +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQVVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSLG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWIL 481
+++C ++PT+P++SF +G + + LS Y+ K ++CI D+PPP GP+W+L
Sbjct: 318 VVNCSQVPTLPDISFYLGGRTYTLSNMDYVQKNPFRNDDLCILALQGLDIPPPTGPVWVL 377
Query: 482 GDVFMGVYHTVFDSGKLRIGFAEA 505
G F+ ++T FD RIGFA A
Sbjct: 378 GATFIRKFYTEFDRHNNRIGFALA 401
>gi|432103960|gb|ELK30793.1| Gastricsin [Myotis davidii]
Length = 390
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 170/289 (58%), Gaps = 1/289 (0%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S G+ RI LKK + ++ + K ++ + P+ N++DA
Sbjct: 14 SEGVERIILKKGKSIRQTMEEKGVLEKFLKNHRKEDPAAKYHFNNDAVAYEPITNYLDAF 73
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCE 144
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+R+ S+T+ G++
Sbjct: 74 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNRFNPSLSSTFRNNGQTYT 132
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
++YGSGS+S D V V ++VV +Q F + E + F + FDGI+G+ + +AVGD
Sbjct: 133 LSYGSGSLSVVLGYDTVTVQNIVVNNQEFGLSENEPNDPFYYSDFDGILGMAYPNMAVGD 192
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
A V M++QG ++ +FSF+ +R P + GGE++ GGVD + + G+ + PVT++ YW
Sbjct: 193 APTVMQGMLQQGQLTLPIFSFYFSRQPTRQYGGELILGGVDQQLYSGQIVWAPVTQELYW 252
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Q + + IG+Q+TG C GC AIVD+GT LLA P + A G E
Sbjct: 253 QIAIQEFAIGDQATGWCSQGCQAIVDTGTFLLAVPQQYMGSFLQATGAE 301
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++ C+ + ++P ++FTI F L P Y+L C G A LP P G P
Sbjct: 306 GDFVVACNSVESLPTITFTISGSQFPLPPSAYVLNNNG----YCRLGIEATYLPSPNGQP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D R+GFA AA
Sbjct: 362 LWILGDVFLKEYYSVYDMAHNRVGFAFAA 390
>gi|351707611|gb|EHB10530.1| Renin [Heterocephalus glaber]
Length = 397
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 186/315 (59%), Gaps = 16/315 (5%)
Query: 5 LLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKER--YMG--GAG 59
+LR F L + S LP + RRI LKK + + R + KER MG GA
Sbjct: 1 MLRWGFLLLLWGSYTFGLPTDTAAFRRIFLKK-------MPSVRDSLKERGVDMGRLGAK 53
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-F 118
RL D+ + L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 54 WGEFAKRLSDNSTSPVVLTNYLNTQYYGEIGIGTPPQAFKVIFDTGSANLWVPSTKCSPL 113
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+C HS Y S +S++Y E G I YGSG + GF SQD V VG + V Q F E T
Sbjct: 114 YTACEIHSLYDSAESSSYIENGTEFSIRYGSGKVKGFLSQDVVTVGGITVT-QTFGEVTE 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ F+LA+FDG++G+GF AVG PV+D+++ Q ++ E+VFS + +RD G
Sbjct: 173 LPLIPFMLAKFDGVLGMGFPAQAVGGITPVFDHILSQRVLKEDVFSVYYSRDSHLLGGEL 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ G DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+G S ++G
Sbjct: 233 LLGGS-DPQHYQGNFHYVSISKSGSWQITMKGVSVGF-ATLLCEEGCMAVVDTGASYISG 290
Query: 299 PTPVVTEINHAIGGE 313
PT + I A+G +
Sbjct: 291 PTSSLRLIMEALGAK 305
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 37/102 (36%), Positives = 59/102 (57%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
L I E + + E +++C+R+PT+P++SF +GD+ + L+ Y+L+ ++C
Sbjct: 295 LRLIMEALGAKEHSTDEYVVNCNRVPTLPDISFHLGDRAYTLTSADYVLQDPYRNDDLCT 354
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
D+PPP GPLW LG F+ ++T FD RIGFA A
Sbjct: 355 LALHGLDIPPPTGPLWALGASFIRKFYTEFDRHNNRIGFALA 396
>gi|198475392|ref|XP_001357030.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
gi|198138802|gb|EAL34096.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
Length = 401
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 181/314 (57%), Gaps = 18/314 (5%)
Query: 13 WVLASCLLLPASSNGLRRIGLKK---RRLDLHSLNAARITRK------ERYM----GGAG 59
W + C+L AS+ L+RI + K +R H R R E Y+ G
Sbjct: 4 WFVLLCVLALASAE-LQRIKIHKSEHKRSRHHVRQEVRSLRHKYQQLIENYVVYDYGQPD 62
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+ D L N M+ Y+G+I IG+PPQ F+V+FDTGSSNLW+PS++C +
Sbjct: 63 YGNDYPSNSEPDYTTEELGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLST 122
Query: 120 -ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
++C H++Y + S+TY ++ I YG+GS++G+ + D V + + + +Q F EA
Sbjct: 123 DVACQQHNQYNASASSTYVANSQNFSIQYGTGSVTGYLAMDTVTINGLAIANQTFGEAVS 182
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +F FDGI+G+G++ IAV VP + N+ EQGL+ E F F+L R+ +EEGG+
Sbjct: 183 QPGSSFTDVAFDGILGMGYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSEEGGQ 242
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GGVD G TYVPV+++GYWQF + + I T +C+ GC AI D+GTSLLA
Sbjct: 243 LLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN--ISWNGTVLCD-GCQAIADTGTSLLAC 299
Query: 299 PTPVVTEINHAIGG 312
P V T+IN IG
Sbjct: 300 PQAVYTQINQLIGA 313
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
N+S + +C C+ + L Q V + IN+L ++ G + I C +
Sbjct: 273 NNISWNGTVLCDGCQAIADTGTSLLACPQ---AVYTQINQLIGAVL-IEGSNYIPCATLD 328
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P +SF IG F+L YI + C+S F W+LGDVF+G Y
Sbjct: 329 SLPVLSFNIGGTTFDLPASAYISVFHDEGYTSCMSTFTDIGTD-----FWVLGDVFLGQY 383
Query: 490 HTVFDSGKLRIGFA 503
+T FD G+ R+GFA
Sbjct: 384 YTQFDFGQNRVGFA 397
>gi|253762217|gb|ACT35560.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 164/251 (65%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + ++ A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAYMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + N GE+++ C I +MP V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPAV 312
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQGG----SDQLWILGDVFIREYYVVFD 364
Query: 495 SGKLRIGFAEAA 506
+ +G A +A
Sbjct: 365 AQAQYVGLASSA 376
>gi|193735605|gb|ACF20292.1| vacuolar protease A [Trichoderma aureoviride]
gi|226374420|gb|ACO52389.1| vacuolar protease A [Trichoderma aureoviride]
Length = 395
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 185/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++A+ L+ ++ G+ ++ L+K ++L+ S+ A ++YMG S D
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLEQQLEGSSIEAQVQQLGQKYMGVRPTSRADVMFND 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ I +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 NLPKIKGGHPVPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S+TY + G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F
Sbjct: 124 HSTYDSSSSSTYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + MV Q L+ E VF+F+L +++EG FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYLG---NSDEGSVATFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D HF GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 241 DESHFSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGIAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNAEIGAK 307
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P+++F++ ++L YIL+ ++ CIS F D P P GPL
Sbjct: 312 GQYTIDCAKRDSLPDITFSLAGSKYSLPASDYILE----VSGSCISTFQGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A+A
Sbjct: 368 VILGDAFLRRYYSVYDLGKGAVGLAKA 394
>gi|126309849|ref|XP_001370462.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 104/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H ++ KS+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 390
>gi|444706374|gb|ELW47716.1| Renin [Tupaia chinensis]
Length = 401
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 183/299 (61%), Gaps = 20/299 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH-- 65
+ LW SC LPA +NG +RI LKK + + R + KER A + +
Sbjct: 7 LLVLW--GSCTFGLPADANGFQRIFLKK-------MPSVRESLKERGADAARLVAKWNLS 57
Query: 66 ---RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSIS 121
LG+S ++ L N++D QY+GEIGIG+P Q F V+FDTGS+NLWVPS+KC +
Sbjct: 58 KTLSLGNSTSPVV-LTNYLDTQYYGEIGIGTPAQTFKVVFDTGSANLWVPSTKCSPLYTA 116
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C HS Y S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T
Sbjct: 117 CEIHSLYDSSESSSYMENGTEFAIHYGSGKVRGFLSQDVVTVGGITVT-QTFGEVTELPV 175
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+LA+FDG++G+G AVG PV+D+++ Q ++ E+VFS + +++ GGEIV
Sbjct: 176 IPFMLAKFDGVLGMGLPAQAVGGVTPVFDHILSQRVLKEDVFSVYYSKNSHV-LGGEIVL 234
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP++++G YV V+ G WQ ++ + + +T +CE GC A+VD+GTS ++GPT
Sbjct: 235 GGSDPQYYQGHFHYVSVSSTGSWQVKMKGVSV-RSATLLCENGCMAVVDTGTSYISGPT 292
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 52/86 (60%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ +++C+ +PT+P++SF +G + L+ Y+L+ E+C D+PPP GP+W
Sbjct: 315 QYVVNCNEVPTLPDISFHLGGHAYTLTSADYVLQDPYSNDELCTLALHGLDVPPPTGPIW 374
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 375 VLGASFIRKFYTEFDRRNNRIGFALA 400
>gi|50294061|ref|XP_449442.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528756|emb|CAG62418.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 160/257 (62%), Gaps = 13/257 (5%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+MDAQYF +I +G+PPQ F VI DTGSSNLWVPS C S++C+ H++Y +S+
Sbjct: 81 VPLSNYMDAQYFADISLGTPPQKFKVILDTGSSNLWVPSVDC-GSLACFLHNKYDHSQSS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G+ I+YGSGSI G+ S+DN+++GD+ +++Q F E T E L F +FDGI+G
Sbjct: 140 TYIKDGRPLSISYGSGSIEGYISEDNLQIGDLTIQNQKFGETTSEPGLAFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN--RDPDAE-----EGGEIVFGGVDPK 247
L + IA D P + + ++Q L+ E FSF+L DP AE +GG GGVD
Sbjct: 200 LAYDTIAQDDITPPFYSAIQQHLLDESKFSFYLKSVNDPAAEGGSASDGGVFTLGGVDSS 259
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
FKG + V ++ YW+ L I +G+QSTG E AAI D+GTSL+ P+ + IN
Sbjct: 260 KFKGDLIPLHVRRQAYWEVPLNAIKLGDQSTGKLENTGAAI-DTGTSLITLPSDMAEIIN 318
Query: 308 HAIGGE----GVVSAEC 320
IG + G + EC
Sbjct: 319 AQIGAKKGWTGQYTLEC 335
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C +P+++FT+ F LSP +Y L+ ++ CIS D P P G +
Sbjct: 329 GQYTLECSTRAKLPDLTFTLDGHDFVLSPFEYTLE----VSGSCISVITPMDFPEPIGRM 384
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++VFD + AEA
Sbjct: 385 AILGDAFLRRYYSVFDLDANVVSLAEA 411
>gi|118344578|ref|NP_001072054.1| renin precursor [Takifugu rubripes]
gi|39540664|tpg|DAA01803.1| TPA: pro-renin [Takifugu rubripes]
gi|55771086|dbj|BAD69803.1| renin [Takifugu rubripes]
Length = 396
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 177/310 (57%), Gaps = 21/310 (6%)
Query: 13 WV-LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD-S 70
W+ LA+ L SS LRRI LH + + R T E G V V + + S
Sbjct: 8 WMSLAALSLALTSSQALRRI-------TLHKMPSIRETLGEM---GVSVEQVLSEMAEKS 57
Query: 71 DEDIL------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCY 123
D+ PL N++D QYFGEI IGSP Q F+V+FDTGS+NLWVPS C FS +C+
Sbjct: 58 AGDVFNKTVPTPLTNYLDTQYFGEISIGSPAQMFNVVFDTGSANLWVPSQSCSPFSTACF 117
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+RY + KS T+ E G I Y SG++ GF S+D V V + QVF EAT ++
Sbjct: 118 THNRYDASKSQTHVENGTGFSIQYASGNVRGFLSEDVVVV-GGIPVIQVFAEATSLSAMP 176
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDG++G+G+ +A+ PV+D ++ Q ++ EEVFS + +RDP GGE+V GG
Sbjct: 177 FVFAKFDGVLGMGYPNMAIDGITPVFDRIMSQHVLKEEVFSIYYSRDPKHSPGGELVLGG 236
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP ++ G Y+ + G W+ + + +G + C GC A++D+G+S + GP V
Sbjct: 237 TDPNYYTGSFNYMGTRETGKWEITMKGVSVGMEMM-FCTEGCTAVIDTGSSYITGPASSV 295
Query: 304 TEINHAIGGE 313
+ + IG +
Sbjct: 296 SLLMKTIGAQ 305
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 53/83 (63%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
++CD + T+P+V+F +G + + L+ E YIL + +VCI F D+PPP GP+WILG
Sbjct: 313 VNCDAVKTLPSVTFHLGGQEYPLTQEDYILWQSQIEGDVCIVTFRGLDIPPPVGPIWILG 372
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
F+ Y+T FD RIGFA A
Sbjct: 373 ANFIARYYTEFDRHNNRIGFATA 395
>gi|225681688|gb|EEH19972.1| cathepsin D [Paracoccidioides brasiliensis Pb03]
Length = 349
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 110/238 (46%), Positives = 155/238 (65%), Gaps = 5/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H++Y S S+T+
Sbjct: 27 VDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSSTH 85
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD V +GD+ V +Q F EAT E L F RFDGI+GLG
Sbjct: 86 RKNGTEFTIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILGLG 145
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V VP++ M+ Q L+ VF F+L N D D ++ E FGG+D HF G+ T
Sbjct: 146 YDSISVNHIVPLFYQMINQKLLDTPVFGFYLGNSDVDGDD-SEATFGGIDESHFTGELTT 204
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + ++ YW+ +L I+ GN+ + G I+D+GTSLLA P+ + +N IG +
Sbjct: 205 ISLRRRAYWEVDLDAIIFGNEMAELENTGV--ILDTGTSLLALPSTIAELLNKQIGAK 260
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + T P+++FT+ F + YIL+ + CIS FM D P P GPL
Sbjct: 265 GQYTVDCTKRSTFPDITFTLAGHNFTIGSYDYILE----VQGSCISSFMGMDFPEPVGPL 320
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ +++V+D G +IG A+A
Sbjct: 321 AILGDAFLRRWYSVYDLGNHQIGLAKA 347
>gi|195144214|ref|XP_002013091.1| GL23572 [Drosophila persimilis]
gi|194102034|gb|EDW24077.1| GL23572 [Drosophila persimilis]
Length = 393
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 174/295 (58%), Gaps = 10/295 (3%)
Query: 43 LNAARITRKERYMGGAGVSGVRHRLGDSDEDIL------PLKNFMDAQYFGEIGIGSPPQ 96
L A+ + + ++ G + ++ L +S+ PL N ++ +Y G I IG+P Q
Sbjct: 31 LQASFMATRRQHRAGKQLLYAKYNLANSEASQSSGGASEPLDNRLNLEYAGPISIGTPRQ 90
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+++FDTGS+NLWVPS++C +++C H RY + S+++ G+ I YG+GS+SG
Sbjct: 91 PFNMLFDTGSANLWVPSAECSARNVACQHHHRYNASASSSHVPDGRRFAIAYGTGSLSGR 150
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
+QD V VG +VV++Q F A E TF+ F GI+GL FR IA A P++ NM +Q
Sbjct: 151 LAQDTVSVGRLVVQNQTFGMAIHEPGSTFVDTNFAGIVGLAFRSIAEQHATPLFQNMCDQ 210
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ + +
Sbjct: 211 GLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVG 270
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 271 KTI---SQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 56/101 (55%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTG-EGIAEVCISG 465
IN L LP GE ++ C I +P V F IG + F L P Y+++ + + VC+S
Sbjct: 298 INSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGLQPRDYVMQVANDDGSSVCLSA 357
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD+ + RIGFA AA
Sbjct: 358 FTLMD-----ADFWILGDVFIGRYYTAFDAAQRRIGFAPAA 393
>gi|255713834|ref|XP_002553199.1| KLTH0D11264p [Lachancea thermotolerans]
gi|238934579|emb|CAR22761.1| KLTH0D11264p [Lachancea thermotolerans CBS 6340]
Length = 417
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 180/309 (58%), Gaps = 16/309 (5%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--------PL 77
N +G+ +LH ++ ++ + A +G LG ++D+L PL
Sbjct: 36 NDGSELGVMMSVANLHQKYLSQFSKAYPEVDFASHAGSGIGLGAVEQDVLSAMGGHDVPL 95
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
N+++AQYF EI +G+PPQ+F VI DTGSSNLWVPS +C S++C+ HS+Y S++Y
Sbjct: 96 SNYLNAQYFTEITLGTPPQSFKVILDTGSSNLWVPSDEC-GSLACFLHSKYSHDASSSYK 154
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G + I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+GLG+
Sbjct: 155 ANGTNFAIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILGLGY 214
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG-GEIVFGGVDPKHFKGKHTYV 256
IAV VP + GL+ E F+F+LN D+EE GE+ FGG+D +KG T++
Sbjct: 215 DTIAVDKVVPPVYKAINDGLLDEPRFAFYLNNADDSEESTGEVTFGGIDSSKYKGNITWL 274
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
PV +K YW+ + I +G++ + G A +D+GTSL+A P+ + +N IG +
Sbjct: 275 PVRRKAYWEVKFDGIGLGDEYAEL--EGTGAAIDTGTSLIALPSGLAEVLNAEIGAKKGW 332
Query: 314 -GVVSAECK 321
G + +C+
Sbjct: 333 SGQYTVDCE 341
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 50/87 (57%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P+++FT K F +S Y L+ ++ CIS F D P P GPL
Sbjct: 334 GQYTVDCESRDQLPDLTFTFNGKNFTISAYDYTLE----VSGSCISAFTPMDFPEPVGPL 389
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ +++V+D G +G A+A
Sbjct: 390 AIIGDAFLRKFYSVYDLGNNAVGLAQA 416
>gi|156846613|ref|XP_001646193.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
gi|156116867|gb|EDO18335.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
Length = 402
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 154/239 (64%), Gaps = 3/239 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+P Q+F VI DTGSSNLWVPS C S++CY H++Y S+
Sbjct: 79 IPLSNYLNAQYYTDITLGTPAQSFKVILDTGSSNLWVPSVDCN-SLACYLHAKYDHSDSS 137
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + I YGSGS+ G+ SQD +++GD+V+ Q F EAT E L F +FDGI+G
Sbjct: 138 TYKKNGTTFSIQYGSGSMEGYISQDVLQIGDLVIPGQDFAEATSEPGLAFAFGKFDGILG 197
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + IAV VP + N + + LV E +FSF+L D +E+GG++ FGG D F G T
Sbjct: 198 LAYDTIAVNRVVPPFYNAINKKLVDEPIFSFYLGDDTKSEDGGQVTFGGYDSSLFTGDIT 257
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV +K YW+ + I +GN+ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 258 WLPVRRKAYWEVKFDAIALGNEVADLVNHGAA--IDTGTSLITLPSGLAEVINSQIGAK 314
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+DC T+P+++FT F ++P Y L+ ++ CIS D P P GPL
Sbjct: 319 GQWIVDCKTRDTLPDMTFTFDGYNFTITPYDYTLE----VSGSCISAITPMDFPAPVGPL 374
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A A
Sbjct: 375 AIVGDAFLRRYYSIYDVGNNAVGLAAA 401
>gi|313220508|emb|CBY31359.1| unnamed protein product [Oikopleura dioica]
gi|313229843|emb|CBY07548.1| unnamed protein product [Oikopleura dioica]
Length = 397
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 193/341 (56%), Gaps = 52/341 (15%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L S LA +L+P L++ + + +LHS A + E
Sbjct: 2 MLTSALLGMALADPILIP-----LKKTKMTRGIGNLHSKYRADVPTNE------------ 44
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI---- 120
L N+ DAQYFG + IG+P QNF+VIFDTGSSNLWVPSSKC I
Sbjct: 45 ------------LTNYFDAQYFGPLTIGTPAQNFTVIFDTGSSNLWVPSSKCDPHIGTGF 92
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV----GDVVVKDQVFIEA 176
+C H++Y S S+T+TE G EI YG+GS+ GF S D++++ G ++ K F EA
Sbjct: 93 ACLNHNKYDSDLSSTWTEDGTKFEIQYGTGSMVGFQSTDDIDIAPGSGGLIAKQATFAEA 152
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP----D 232
E +TFL A FDGI+GL + I+V A P+++ ++E+G V+ VF+F+++R+ +
Sbjct: 153 VEEPGITFLAAAFDGIMGLAYPSISVNGATPIYNQLMEEGQVN-GVFAFFVHRNSSKPGE 211
Query: 233 AEEGGEIVFGGVDPKHFKG----KHTYVPVTKKGYWQFEL------GDILIGNQSTGVCE 282
++ GGEI +GGV+P+ F+G + V+++ YWQ + GD + +Q +CE
Sbjct: 212 SDIGGEIAWGGVNPERFEGTFPDSFIWHEVSRQAYWQVNMGTVTVNGDGFVSDQPIVMCE 271
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
GGC IVDSGTSL+ GPT + +IN AIG ++ E ++
Sbjct: 272 GGCQGIVDSGTSLITGPTEITDQINKAIGAIEFIAGEWLVI 312
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 58/102 (56%)
Query: 402 KVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEV 461
++ IN+ ++ GE ++ C P MP + I D + ++P+ Y+L +
Sbjct: 290 EITDQINKAIGAIEFIAGEWLVICRNKPRMPTIDIYIDDVRYRMTPDDYVLTIEDQGQTQ 349
Query: 462 CISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
CIS FM D+P P GPLWILGD FMG+ +TVFD R+GFA
Sbjct: 350 CISAFMGLDIPEPAGPLWILGDAFMGMKYTVFDFDTNRVGFA 391
>gi|345802472|ref|XP_854465.2| PREDICTED: pepsin B-like [Canis lupus familiaris]
Length = 390
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 168/276 (60%), Gaps = 3/276 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDA 83
S G+ RI LKK + + + R + V L ++D P N++++
Sbjct: 14 SEGVERIILKKGK-SIRQVMEERGVLETFLRNHPKVDPAAKYLFNNDAVAYEPFTNYLNS 72
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY G++
Sbjct: 73 YYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNTFNPSSSSTYRNNGQTY 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+ + +AVG
Sbjct: 132 TLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYANFDGILGMAYPNLAVG 191
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
D+ V +MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ Y
Sbjct: 192 DSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVWAPVTREMY 251
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
WQ + + L+ NQ+TG+C GC AIVD+GT +LA P
Sbjct: 252 WQVAIDEFLVNNQATGLCSQGCQAIVDTGTYVLAVP 287
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/89 (35%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ +++C+ I +MP ++F I L P Y+ C G A LP P G P
Sbjct: 306 GDFVVNCNSIQSMPTITFVISGSPLPLPPSAYVFNNNG----YCTLGIEATYLPSPTGQP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LW LGDVF+ Y+T++D ++GFA +A
Sbjct: 362 LWTLGDVFLKEYYTIYDLANNKMGFAPSA 390
>gi|195134380|ref|XP_002011615.1| GI11125 [Drosophila mojavensis]
gi|193906738|gb|EDW05605.1| GI11125 [Drosophila mojavensis]
Length = 371
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 113/251 (45%), Positives = 157/251 (62%), Gaps = 8/251 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N ++ Y+G I IG+PPQNF V+FD+GSSNLWVPS C S +C H++Y S S+TY
Sbjct: 60 LSNSLNMAYYGAITIGTPPQNFKVLFDSGSSNLWVPSKNCP-SYACEVHNQYDSSASSTY 118
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SGF S D V+V + +K Q F EAT E F A FDGI+G+G
Sbjct: 119 EANGESFSIQYGTGSLSGFLSTDTVDVNGLSIKKQTFAEATNEPGTNFNNANFDGILGMG 178
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
++ I+ + VP + NMV Q L+ + VFSF+L RD + +GGE++FGG D + G TYV
Sbjct: 179 YQSISQDNVVPPFYNMVSQDLIDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDFTYV 238
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGEG 314
P++++GYWQF + + S +C+ C AI D+GTSLL P +N + EG
Sbjct: 239 PISQEGYWQFTMAGASVEGYS--LCD-NCQAIADTGTSLLVAPANAYELLNEILNVNDEG 295
Query: 315 VVSAECKLVVS 325
+V +C V S
Sbjct: 296 LV--DCSTVSS 304
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 58/99 (58%), Gaps = 11/99 (11%)
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFM 467
EL + + N E ++DC + ++P ++F IG F+LSP YI++T GE ++ V +M
Sbjct: 283 ELLNEILNVNDEGLVDCSTVSSLPVITFNIGGTNFDLSPSAYIIQTDGECMSSV---QYM 339
Query: 468 AFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
D WILGDVF+G Y+T FD G RIGFA A
Sbjct: 340 GTDF-------WILGDVFIGQYYTEFDLGNNRIGFAPVA 371
>gi|195159706|ref|XP_002020719.1| GL15694 [Drosophila persimilis]
gi|194117669|gb|EDW39712.1| GL15694 [Drosophila persimilis]
Length = 401
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 185/318 (58%), Gaps = 19/318 (5%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKK---RRLDLHSLNAARITRK------ERYM---- 55
+F L+VL C+L AS+ L+RI + K +R H R R E Y+
Sbjct: 1 MFKLFVLL-CVLALASAE-LQRIKIHKSEHKRSRHHVRQEVRSLRHKYQQLIENYVVYDY 58
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
G + D L N M+ Y+G+I IG+PPQ F+V+FDTGSSNLW+PS++
Sbjct: 59 GQPDYGNDYPSNSEPDYTTEELGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQ 118
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H++Y + S+TY ++ I YG+GS++G+ + D V + + + +Q F
Sbjct: 119 CLSTDVACQQHNQYNASASSTYVANSQNFSIQYGTGSVTGYLATDTVTINGLAIANQTFG 178
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA + +F FDGI+G+G++ IAV VP + N+ EQGL+ E F F+L R+ +E
Sbjct: 179 EAVSQPGSSFTDVAFDGILGMGYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSE 238
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGG+++ GGVD G TYVPV+++GYWQF + + I T +C+ GC AI D+GTS
Sbjct: 239 EGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN--ISWNGTVLCD-GCQAIADTGTS 295
Query: 295 LLAGPTPVVTEINHAIGG 312
LLA P V T+IN IG
Sbjct: 296 LLACPQAVYTQINQLIGA 313
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 63/134 (47%), Gaps = 9/134 (6%)
Query: 370 ENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIP 429
N+S + +C C+ + L Q V + IN+L ++ G + I C +
Sbjct: 273 NNISWNGTVLCDGCQAIADTGTSLLACPQ---AVYTQINQLIGAVL-IEGSNYIPCATLD 328
Query: 430 TMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVY 489
++P +SF IG F+L YI + C+S F W+LGDVF+G Y
Sbjct: 329 SLPVLSFNIGGTTFDLPASAYISVFHDEGYTSCMSTFTDIGTD-----FWVLGDVFLGQY 383
Query: 490 HTVFDSGKLRIGFA 503
+T FD G+ R+GFA
Sbjct: 384 YTQFDFGQNRVGFA 397
>gi|322700747|gb|EFY92500.1| vacuolar protease A [Metarhizium acridum CQMa 102]
Length = 395
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 120/264 (45%), Positives = 170/264 (64%), Gaps = 11/264 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IGSPPQ+F V+ DTGSSNLWVPS C SI+CY HS Y S S+
Sbjct: 75 VPVSNFMNAQYFSEITIGSPPQSFKVVLDTGSSNLWVPSQSCN-SIACYLHSTYDSSSSS 133
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSGS+SGF SQD V +GD+ ++ Q F EAT E L F +FDGI+G
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVSIGDLKIEHQDFAEATSEPGLAFAFGKFDGILG 193
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ ++V VP + M++Q L+ E VF+F+L EEG E VFGG+D H+ G+
Sbjct: 194 LGYDTLSVNKIVPPFYQMIDQKLLDEPVFAFYLG---SKEEGSEAVFGGIDKNHYTGELE 250
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+P+ +K YW+ ++ I +G++ + G AI+D+GTSL P+ + +N IG +
Sbjct: 251 YLPLRRKAYWEVDINSIALGDEIAELDHTG--AILDTGTSLNVLPSTLAELLNKEIGAKK 308
Query: 314 ---GVVSAECKLVVSQYGDLIWDL 334
G + +C + S D++++L
Sbjct: 309 SWNGQYTVDCDKIKS-LPDIVFNL 331
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD+I ++P++ F + + ++L YIL+ + C+S F D+P P GPL
Sbjct: 312 GQYTVDCDKIKSLPDIVFNLSNSNYSLPASDYILE----LQGTCLSTFQGMDIPEPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D + +G A A
Sbjct: 368 VILGDAFLRRYYSVYDLERNAVGLARA 394
>gi|224458280|ref|NP_001138943.1| gastricsin precursor [Pongo abelii]
gi|222425206|dbj|BAH20552.1| pepsinogen C [Pongo abelii]
Length = 388
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 116/283 (40%), Positives = 169/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + +H R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKHDPAWKYRFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVELTYLPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|441648777|ref|XP_003266334.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Nomascus leucogenys]
Length = 388
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 115/283 (40%), Positives = 169/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
+ L+ + R T KE+ + G + ++ GD P+ +MDA YFGE+
Sbjct: 20 KXPLNEFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEVS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ KS+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ ARFDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFIYARFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFSF+L+ + GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSFYLSNQ-EGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G A LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEATYLPSQSGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|126309851|ref|XP_001370482.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H ++ +S+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSQSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 390
>gi|22218078|dbj|BAC07516.1| pepsinogen III [Oryctolagus cuniculus]
Length = 387
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 171/296 (57%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|253762215|gb|ACT35559.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 163/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQGG----SDQLWILGDVFIREYYVVFD 364
Query: 495 SGKLRIGFAEAA 506
+ +G A +A
Sbjct: 365 AHAQYVGLASSA 376
>gi|291409613|ref|XP_002721073.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 125/304 (41%), Positives = 176/304 (57%), Gaps = 26/304 (8%)
Query: 28 LRRIGLKKRRLDLHSLN-AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYF 86
L GL K L H+ N A + KE + A VS L+N++D +YF
Sbjct: 32 LIEKGLLKDYLKTHTPNLATKYFPKETF---ASVSTES------------LENYLDTEYF 76
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
G I IG+PPQ+F+VIFDTGSSNLWVPS+ C S +C H+R+ S+T+ ++ I
Sbjct: 77 GTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SAACTVHNRFNPDDSSTFQATSETLSIT 135
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS++G D V VG + +Q+F + T GS + A FDGI+GL + I+ DA
Sbjct: 136 YGTGSMTGILGYDTVNVGSIEDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISASDA 194
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM +GLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 195 TPVFDNMWNEGLVSQDLFSVYLSSDD--ESGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQ 252
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECK 321
L I + + T C GC AIVD+GTSLLAGPT ++ I IG EG + C
Sbjct: 253 ITLDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENYEGEMIVSCS 311
Query: 322 LVVS 325
+ S
Sbjct: 312 SMYS 315
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADGCQAIVDTGTSLLAGPTS--AISNIQSYIGASENYEGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ VC SGF D+ G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDS----VCTSGFEGMDVDTSTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|256274192|gb|EEU09100.1| Pep4p [Saccharomyces cerevisiae JAY291]
Length = 405
Score = 217 bits (553), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 111/265 (41%), Positives = 166/265 (62%), Gaps = 9/265 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 314 ----GVVSAECKLVVSQYGDLIWDL 334
G + +C DLI++L
Sbjct: 318 KGWTGQYTLDCN-TRDNLPDLIFNL 341
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F + F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDNLPDLIFNLNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNNAVGLAKA 404
>gi|193499293|gb|ACF18589.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 163/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 62/132 (46%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVFQNSYG----CNTGFGQGG----SDQLWILGDVFIREYYVVFD 364
Query: 495 SGKLRIGFAEAA 506
+ +G A A
Sbjct: 365 AHAQYVGLASFA 376
>gi|283806594|ref|NP_001164550.1| pepsin-3 precursor [Oryctolagus cuniculus]
gi|129783|sp|P27822.1|PEPA3_RABIT RecName: Full=Pepsin-3; AltName: Full=Pepsin A; AltName:
Full=Pepsin III; Flags: Precursor
gi|165598|gb|AAA85370.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 171/296 (57%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 64/132 (48%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ C+ A+V L T +S I + N GE I+ C + ++PN+
Sbjct: 262 GETIACADSCQAIVDTGTSLLAGPTS--AISNIQSYIGASENSDGEMIVSCSSMYSLPNI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + + YIL+ + CISGF +L G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGVQYPVPASAYILEEDDA----CISGFEGMNLDTYTGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 376 RANNQLGLAAAA 387
>gi|12248414|dbj|BAB20092.1| pepsinogen A [Rana catesbeiana]
Length = 385
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 174/312 (55%), Gaps = 27/312 (8%)
Query: 9 VFCLWVLASCLLLPASSNG-------LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+F L VLA C ++ S L R+GL L H N A + + A S
Sbjct: 6 LFGLVVLAECGVVKVSLRKGESLRARLNRLGLLGDYLKKHHYNPA----TKYFPSLAQAS 61
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
G PL+N+MD +YFG I IG+PPQ+F+VIFDTGSSNLWVPS C S +
Sbjct: 62 GE------------PLQNYMDIEYFGTISIGTPPQSFTVIFDTGSSNLWVPSVYCS-SPA 108
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H + ++S+T+ I YG+GS+SGF D V+VG++ + +Q+F + E
Sbjct: 109 CTNHHMFNPQQSSTFQATNTPVSIQYGTGSMSGFLGYDTVQVGNIQITNQIFGLSQSEPG 168
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ FDGI+GL F +A A PV+DNM QGL+ +++FS +L+ + G ++F
Sbjct: 169 SFLYYSPFDGILGLAFPSLASSQATPVFDNMWNQGLIPQDLFSVYLSS--QGQSGSFVLF 226
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD ++ G +VP+T + YWQ + I IG Q C G C+AIVD+GTSLLAGP+
Sbjct: 227 GGVDTSYYTGNLNWVPLTAETYWQITVDSISIGGQVIA-CSGSCSAIVDTGTSLLAGPST 285
Query: 302 VVTEINHAIGGE 313
+ I + IG
Sbjct: 286 PIANIQYYIGAN 297
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +I+C+ I MP V FTI + L Y+ ++ + C SGF A +LP G L
Sbjct: 302 GQYVINCNNISNMPTVVFTINGVQYPLPASAYVRQSQQS----CTSGFQAMNLPTSSGDL 357
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y+ VFD + A A
Sbjct: 358 WILGDVFIREYYVVFDRANNYVAMAPVA 385
>gi|444316168|ref|XP_004178741.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
gi|387511781|emb|CCH59222.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
Length = 413
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 111/251 (44%), Positives = 160/251 (63%), Gaps = 8/251 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF +I IG+PPQ+F V+ DTGSSNLWVPS +C S++CY HS+Y +S+
Sbjct: 89 VPLSNYMNAQYFADIKIGTPPQSFKVVLDTGSSNLWVPSKEC-GSLACYLHSKYNHDESS 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G + I YGSGS+ G+ SQD +E+GD+ + Q F EAT E ++F +FDGI+G
Sbjct: 148 TYKANGSAFAIQYGSGSLEGYISQDVMEIGDLKITKQDFAEATSEPGISFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + IAV VP N + QGL+ E F+F+L + + GGE VFGG+D F+G
Sbjct: 208 LAYDTIAVNRVVPPVYNAINQGLLDEPKFAFYLGDASKSKDNGGEAVFGGIDETKFEGDI 267
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ +L + +G + T + G A +D+GTSL+ P+ + IN IG +
Sbjct: 268 TWLPVRRKAYWEVKLEGLGLGEEYTELENHGAA--IDTGTSLITLPSGLAEIINSEIGAK 325
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 326 KGWTGQYTIEC 336
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 50/87 (57%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ I+CD+ ++P+++FT F +SP Y L+ ++ CIS D P P GP+
Sbjct: 330 GQYTIECDKRASLPDMTFTFDGYNFTISPYDYTLE----VSGSCISAITPMDFPEPVGPM 385
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D G +G A A
Sbjct: 386 AIIGDAFLRKYYSVYDLGNDAVGLAPA 412
>gi|254583898|ref|XP_002497517.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
gi|238940410|emb|CAR28584.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
Length = 418
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 162/251 (64%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ E+ +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 95 VPLTNYLNAQYYTEVSLGTPPQNFKVILDTGSSNLWVPSTECS-SLACFLHSKYDHDSSS 153
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+G
Sbjct: 154 SYKPNGTEFAIRYGSGSLEGYISQDTLNLGDLSITKQDFAEATSEPGLQFAFGKFDGILG 213
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + N +QGL+ E F+F+L RD ++++GG FGGVD ++G+ T
Sbjct: 214 LGYDTISVDGVVPPFYNAWKQGLLDEPKFAFYLGRDGESQDGGVATFGGVDDSKYEGEIT 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++P+ +K YW+ + I +G + + G A +D+GTSL+A P+ + IN IG +
Sbjct: 274 WLPIRRKAYWEVKFDGIGLGEEYAELENHGAA--IDTGTSLIALPSGLAEIINAEIGAKK 331
Query: 314 ---GVVSAECK 321
G + EC+
Sbjct: 332 SWTGQYTVECE 342
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C+ ++PN++FT+G F L+ YIL+ ++ CIS D P P GPL
Sbjct: 335 GQYTVECEARSSLPNMTFTLGGHNFELTAYDYILE----VSGQCISAIFPMDFPEPVGPL 390
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 391 AIIGDSFLRKYYSIYDLGNNAVGLADA 417
>gi|344257339|gb|EGW13443.1| Napsin-A [Cricetulus griseus]
Length = 532
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/253 (43%), Positives = 155/253 (61%), Gaps = 4/253 (1%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS---SKCYFSISCYFHSRYKSRKSNTYT 137
M+ QYFG+IG+G+PPQNF+V+FDTGSSNL S S S FH R+ + S+++
Sbjct: 1 MNTQYFGDIGLGTPPQNFTVVFDTGSSNLCSVSHRLSDPILSPELGFHRRFNPKASSSFR 60
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G I YGSG ++G SQDN+ +G++ F EA E S+ F LA FDGI+GLGF
Sbjct: 61 PNGTKLAIQYGSGQLTGILSQDNLTIGEIRGVSVTFGEALWESSMVFTLAHFDGILGLGF 120
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+AV P D MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+ T++P
Sbjct: 121 PSLAVDGVQPPLDAMVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPPLTFIP 180
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT YWQ + + +G +C GC I+D+GTSL+ GP+ + +N AIGG ++
Sbjct: 181 VTIPAYWQVHMESVNVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGGLPFLA 239
Query: 318 AECKLVVSQYGDL 330
+ + S+ +L
Sbjct: 240 GQYFIQCSKTPEL 252
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 71/136 (52%), Gaps = 4/136 (2%)
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+E NV G S C + ++ L ++E + +N+ LP G+ I C
Sbjct: 191 MESVNVGTGLSLCAQGCGV-ILDTGTSLITGPSEE--IHALNKAIGGLPFLAGQYFIQCS 247
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
+ P +P VSF +G FNL+ + Y++K +C+ GF A D+P P GPLWILGDVF
Sbjct: 248 KTPELPTVSFRLGGVWFNLTGQDYVIKILNSDDVGLCLLGFQALDIPKPAGPLWILGDVF 307
Query: 486 MGVYHTVFDSGKLRIG 501
+G Y VFD G +G
Sbjct: 308 LGPYVAVFDRGVKTVG 323
>gi|354493821|ref|XP_003509038.1| PREDICTED: gastricsin-like [Cricetulus griseus]
gi|344238302|gb|EGV94405.1| Gastricsin [Cricetulus griseus]
Length = 391
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 178/304 (58%), Gaps = 7/304 (2%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD-SD 71
W++ + L LP L R+ LKK + ++ + K+ ++R G+ D
Sbjct: 3 WLVVALLCLPLLEAALVRVPLKKMKTIRQNMKEKGV-LKDFLKTHKYDPAQKYRFGNFGD 61
Query: 72 EDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
+L P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H RY
Sbjct: 62 FSVLYEPIA-YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SEACTTHPRYN 119
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A F
Sbjct: 120 PNKSSTYYTEGQTFSLQYGTGSLTGFFGYDTLTVQGIQVPNQEFGLSENEPGTNFVYADF 179
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + ++ G A ++++G +S+ +F +L GG+IVFGGVD +
Sbjct: 180 DGIMGLAYPGLSAGGATTAMQGLLQEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIG+Q +G C GCA IVD+GTSLL P+ ++++
Sbjct: 239 TGEITWIPVTQELYWQITIDDFLIGDQVSGWCSQGCAGIVDTGTSLLTMPSQYLSDLLQT 298
Query: 310 IGGE 313
IG +
Sbjct: 299 IGAQ 302
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
GE + CD + ++P +F + F LSP YIL+ VC+ G + L G
Sbjct: 307 GEYFVSCDSVSSLPTFNFVLNGVEFPLSPSFYILQE----DGVCMVGLESSPLTSESGQS 362
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+WILGDVF+ Y+ +FD G R+GFA A
Sbjct: 363 MWILGDVFLRSYYAIFDMGNNRVGFATA 390
>gi|194764262|ref|XP_001964249.1| GF20814 [Drosophila ananassae]
gi|190619174|gb|EDV34698.1| GF20814 [Drosophila ananassae]
Length = 405
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/231 (48%), Positives = 145/231 (62%), Gaps = 5/231 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N+ + QY+G I IG+P QNF V FDTGSSNLW+PSS+C S SC H+RY S +S+TY
Sbjct: 68 LSNYDNFQYYGSINIGTPGQNFQVQFDTGSSNLWIPSSQCT-SSSCMVHTRYSSYQSSTY 126
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+GS+SGF SQD V V +V+++Q F E T E FL A FDGI+GL
Sbjct: 127 KSNGSIFNITYGTGSVSGFMSQDVVSVAGLVIRNQTFGEVTSESGSNFLNASFDGILGLA 186
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F +AV P + N++ Q +V + VFSF+L N GGE++ GG DPK ++GK TY
Sbjct: 187 FPMLAVNLVTPFFQNLISQKVVQQPVFSFYLRNNGTTVTYGGELILGGSDPKLYRGKLTY 246
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
VPV+ YWQF I +GN + G AAI D+GTSLL P T+I
Sbjct: 247 VPVSYPAYWQFYTDSIQMGNT---LISTGDAAIADTGTSLLVAPQAEYTQI 294
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 37/132 (28%), Positives = 56/132 (42%), Gaps = 23/132 (17%)
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
+S GD+A+ +V Q + Q + N + + C +I
Sbjct: 269 ISTGDAAIADTGTSLLVAPQAEYTQ--------------IAKIFNADSDGVFACGKISKW 314
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
P + I F ++PE YI++ G + A + P WILGDVF+G Y+T
Sbjct: 315 PTMYIKINGVSFQITPEYYIIQEGY---------YCALAIQPASQDFWILGDVFLGRYYT 365
Query: 492 VFDSGKLRIGFA 503
FD G R+GFA
Sbjct: 366 EFDVGNQRLGFA 377
>gi|290974880|ref|XP_002670172.1| predicted protein [Naegleria gruberi]
gi|284083728|gb|EFC37428.1| predicted protein [Naegleria gruberi]
Length = 388
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 152/240 (63%), Gaps = 6/240 (2%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
I+PLK++ D +Y+GEI IG+P Q F V+FDTGSSNLWVPS C +SC H+RY KS
Sbjct: 66 IVPLKDYDDVEYYGEITIGTPAQTFKVVFDTGSSNLWVPSVACK-DLSCVRHARYNHTKS 124
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G+S I YG+G++ G S D V VG + +K QVF E T E + TFL A+ DGI
Sbjct: 125 STYVPNGQSFNITYGTGAVKGILSSDTVVVGGLAIKGQVFGETTNEYTDTFLNAKIDGIC 184
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G F IAV PV++N+++Q LV + +FSF++++ ++ GG++ K++ G
Sbjct: 185 GFAFPNIAVDGVTPVFNNLMKQRLVDKNIFSFYMSKKA-GSGASAMILGGINSKYYTGSF 243
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP----TPVVTEINHA 309
+YVP+ + YW L DI + Q +C GC AIVD+GTSL+AG P++ ++N A
Sbjct: 244 SYVPLIQHNYWSIALDDIAMNGQGQSLCGFGCMAIVDTGTSLIAGTPDVMQPIINQLNVA 303
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 4/83 (4%)
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISGFMAFDLPPPRGPLWILG 482
DC I + PNVSF IG K + L+P Y++K T +G + C GF D+ ILG
Sbjct: 305 DCSNIDSNPNVSFVIGGKQYLLTPRDYVIKITSQGQTQ-CFPGFQTMDM--GTNGFVILG 361
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
DVF+ Y+TVFD R+GFA++
Sbjct: 362 DVFISTYYTVFDYEGSRVGFAKS 384
>gi|410045159|ref|XP_001145764.3| PREDICTED: pepsin A-5 isoform 1 [Pan troglodytes]
Length = 434
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/291 (41%), Positives = 169/291 (58%), Gaps = 23/291 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+ N A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNFNPA-----SKYFPQWEAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY K+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSKTV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGF 502
G LWILGDVF+ Y TVFD ++G
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNKVGL 384
>gi|401881725|gb|EJT46014.1| endopeptidase [Trichosporon asahii var. asahii CBS 2479]
Length = 528
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/255 (45%), Positives = 161/255 (63%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 192 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 247
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +FDGI+G
Sbjct: 248 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 307
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 308 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 365
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+PV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 366 YIPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 423
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 424 SWNGQYTVDCSKVPS 438
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 43/67 (64%), Gaps = 4/67 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P++P+++F G K F + E Y+L G CIS FM D+PPP GP+
Sbjct: 427 GQYTVDCSKVPSLPDLTFNFGGKKFPIKGEDYVLNAGG----TCISAFMGMDIPPPMGPI 482
Query: 479 WILGDVF 485
WI+GD F
Sbjct: 483 WIIGDAF 489
>gi|340518711|gb|EGR48951.1| predicted protein [Trichoderma reesei QM6a]
Length = 395
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 180/300 (60%), Gaps = 15/300 (5%)
Query: 23 ASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI---- 74
++ G+ ++ L+K ++L+ S+ A ++YMG S D +
Sbjct: 14 SAQAGIHKMKLQKVSLEQQLEGSSIEAHVQQLGQKYMGVRPTSRAEVMFNDKPPKVQGGH 73
Query: 75 -LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS C SI+C+ HS Y S S
Sbjct: 74 PVPVTNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQSCN-SIACFLHSTYDSSSS 132
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F RFDGI+
Sbjct: 133 STYKPNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAFAFGRFDGIL 192
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GLG+ I+V VP + MV Q L+ E VF+F+L ++EG E VFGGVD H++GK
Sbjct: 193 GLGYDTISVNGIVPPFYQMVNQKLIDEPVFAFYLGS---SDEGSEAVFGGVDDAHYEGKI 249
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ + +N IG +
Sbjct: 250 EYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGLAELLNAEIGAK 307
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P+++F++ ++L YI++ ++ CIS F D P P GPL
Sbjct: 312 GQYTVDCSKRDSLPDITFSLAGSKYSLPASDYIIE----MSGNCISSFQGMDFPEPVGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D G+ +G A+A
Sbjct: 368 VILGDAFLRRYYSVYDLGRDAVGLAKA 394
>gi|89111566|dbj|BAE80442.1| pepsinogen B isozyme [Canis lupus familiaris]
Length = 374
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P N++++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+T
Sbjct: 49 PFTNYLNSYYFGEISIGTPPQNFLVVFDTGSSNLWVPSTYCQ-SQACSNHNTFNPSSSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ + YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+
Sbjct: 108 YRNNGQTYTLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYANFDGILGM 167
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AVGD+ V +MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ +
Sbjct: 168 AYPNLAVGDSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVW 227
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 228 APVTREMYWQVAIDEFLIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 285
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G +++C+ I +MP ++F I L P Y+L C G LP P G P
Sbjct: 290 GNFVVNCNSIQSMPTITFVISGSPLPLPPSTYVLNNNG----YCTLGIEVTYLPSPNGQP 345
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFA 503
LWILGDVF+ Y+TVFD R+GFA
Sbjct: 346 LWILGDVFLREYYTVFDMAANRVGFA 371
>gi|156843876|ref|XP_001645003.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
gi|156115658|gb|EDO17145.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
Length = 399
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 160/251 (63%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ ++ IG+PPQ F VI DTGSSNLWVPS C S++CY HS+Y S+
Sbjct: 76 VPLDNYLNAQYYTDVSIGTPPQKFKVILDTGSSNLWVPSVGCS-SLACYLHSKYDHSLSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ G+ SQD + +GD+++ Q F EAT E L F +FDGI+G
Sbjct: 135 TYRSNGSDFVIQYGSGSLKGYISQDTLTIGDLIIPQQDFAEATAEPGLAFAFGKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V AVP N + +GL+ + +F+F+L + ++ GGE FGG DP F+G+
Sbjct: 195 LAYDSISVNKAVPPLYNAIHRGLLDKPMFAFYLGDEKSSKNGGEATFGGYDPSRFEGEIK 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + +N+ IG +
Sbjct: 255 WLPVRRKAYWEVQFDGIKLGDKFMKL-EGHGAAI-DTGTSLITLPSQIADFLNNEIGAKK 312
Query: 314 ---GVVSAECK 321
G + +CK
Sbjct: 313 SWNGQYTIDCK 323
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC + ++P ++ + F + P Y L+ I+ CIS D P P GPL
Sbjct: 316 GQYTIDCKKRESLPKLTLNFYNHNFTIDPFDYTLE----ISGSCISAITPMDFPQPVGPL 371
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ +++++D +G A++
Sbjct: 372 SIIGDAFLRRFYSIYDLENNAVGLAKS 398
>gi|1246038|gb|AAB35842.1| pepsinogen A [turtles, Peptide, 361 aa]
Length = 361
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/238 (44%), Positives = 153/238 (64%), Gaps = 4/238 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MDA+YFG I IG+P Q+F+V+FDTGSSNLWVPS C S +C H+R+ S+T
Sbjct: 50 PLTNYMDAEYFGTISIGTPAQDFTVVFDTGSSNLWVPSVTCS-SAACTQHNRFNPSDSST 108
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y ++ I YG+GS++G DNV+VG +V +Q+F + E TF A DGI+GL
Sbjct: 109 YRATSQNLSIQYGTGSMTGILGYDNVQVGGLVDTNQIFGLSETEPGSTFYYAPMDGILGL 168
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ IA A PV+DNM+ +GLVS+++FS +L+ D + G ++FGG D ++ G +
Sbjct: 169 AYPSIASSGATPVFDNMMSEGLVSQDLFSVYLSS--DEQSGSFVMFGGNDTSYYSGSLNW 226
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P++ + YW+ + I + Q T C GGC AI+D+GTSLLAGP V+ IN IG
Sbjct: 227 IPLSAETYWEITMDSITMNGQ-TIACSGGCQAIIDTGTSLLAGPPSDVSNINSYIGAS 283
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 9/84 (10%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
+ C + ++P + F I F + YI+ + S LWILG
Sbjct: 287 VSCSSMSSLPEIVFNINGIAFPVPASAYIINDSSSCSSSFESMDQG---------LWILG 337
Query: 483 DVFMGVYHTVFDSGKLRIGFAEAA 506
DVF+ +Y+ VFD ++G A A
Sbjct: 338 DVFIRLYYVVFDRANNQVGLASLA 361
>gi|406701140|gb|EKD04292.1| endopeptidase [Trichosporon asahii var. asahii CBS 8904]
Length = 824
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 161/255 (63%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 226 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 281
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +FDGI+G
Sbjct: 282 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 341
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 342 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 399
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 400 YVPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 457
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 458 SWNGQYTVDCSKVPS 472
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/255 (44%), Positives = 157/255 (61%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 527 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 582
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +F G
Sbjct: 583 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFTVSFG 642
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 643 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 700
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 701 YVPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 758
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 759 SWNGQYTVDCSKVPS 773
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 42/67 (62%), Gaps = 4/67 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC ++P++P+++F G K F + E Y+L G CIS FM D+PPP GP+
Sbjct: 762 GQYTVDCSKVPSLPDLTFNFGGKKFPIKGEDYVLNAGG----TCISAFMGMDIPPPMGPI 817
Query: 479 WILGDVF 485
WI+GD
Sbjct: 818 WIIGDAL 824
>gi|50978822|ref|NP_001003117.1| pepsin A preproprotein [Canis lupus familiaris]
gi|73621384|sp|Q9GMY6.1|PEPA_CANFA RecName: Full=Pepsin A; Flags: Precursor
gi|9798660|dbj|BAB11752.1| pepsinogen A [Canis lupus familiaris]
Length = 386
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 108/238 (45%), Positives = 154/238 (64%), Gaps = 6/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MD +YFG IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY
Sbjct: 66 LKNYMDMEYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQESSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL
Sbjct: 125 QGTNRPVSIAYGTGSMTGILGYDTVQVGGIADTNQIFGLSETEPGSFLYY-APFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +I+ A PV+DNM +GLVS+++FS +L+ D + G ++FGG+D ++ G +
Sbjct: 184 AYPQISASGATPVFDNMWNEGLVSQDLFSVYLSS--DDQSGSVVMFGGIDSSYYSGNLNW 241
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VPV+ +GYWQ + + + Q+ C GC AIVD+GTSLLAGPT + I IG
Sbjct: 242 VPVSVEGYWQITVDSVTMNGQAIA-CSDGCQAIVDTGTSLLAGPTNAIANIQSYIGAS 298
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 65/132 (49%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A+V L T ++ I + N G+ +I C I ++P++
Sbjct: 261 GQAIACSDGCQAIVDTGTSLLAGPTN--AIANIQSYIGASQNSYGQMVISCSAINSLPDI 318
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L P YIL++ +G C+SGF +LP G LWILGDVF+ Y VFD
Sbjct: 319 VFTINGIQYPLPPSAYILQSQQG----CVSGFQGMNLPTASGELWILGDVFIRQYFAVFD 374
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 375 RANNQVGLAPVA 386
>gi|194900440|ref|XP_001979765.1| GG22202 [Drosophila erecta]
gi|190651468|gb|EDV48723.1| GG22202 [Drosophila erecta]
Length = 395
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 181/325 (55%), Gaps = 17/325 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-----KERYMGGAGVSGVRHR 66
LWVL CL L RI ++ + + S R R K +GG V+ R
Sbjct: 11 LWVL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKYNVVGGQEVTS---R 65
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFH 125
G + E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H
Sbjct: 66 NGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSLKSVACHHH 122
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY + S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 123 HRYNASASSTFVPDGRRFSIAYGTGSLSGILAQDTVAIGQLVVRNQTFAMATHEPGPTFV 182
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
F GI+GLGFR IA P++++M +Q LV E VFSF+L R+ GGE++FGGVD
Sbjct: 183 DTNFAGIVGLGFRPIAEQRIKPLFESMCDQQLVDECVFSFYLKRNGSERMGGELLFGGVD 242
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
F G TYVP+T GYWQF L I +G + AI D+GTSLLA P
Sbjct: 243 KTKFSGSLTYVPLTHAGYWQFPLDGIELGGTTISRHR---QAIADTGTSLLAAPPREYLI 299
Query: 306 INHAIGGEGVVSAECKLVVSQYGDL 330
IN +GG + E L S+ L
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSL 324
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 465
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGRRFGLQPRDYVMSVTNDDGSRICLSA 359
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD+G+ +IGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGQRQIGFAPAA 395
>gi|395535589|ref|XP_003769805.1| PREDICTED: chymosin-like [Sarcophilus harrisii]
Length = 382
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 172/312 (55%), Gaps = 19/312 (6%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRL--DLHSLNAARITRKERYMGGAGV 60
+ L + L+ C++ LP R LKK L D N ++ K R G A
Sbjct: 2 RCLLVFLAIIALSDCMIRLPLMKGNTLRHKLKKHGLLADFLEENKYSLSSKYRRYGEAAK 61
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
PL NF+D+QYFG+I IG+PPQ F+V+FDTGSSNLWVPS C S
Sbjct: 62 VASE-----------PLTNFLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN-ST 109
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H R+ +S+T+ + I YG+GS+ G D V V +V DQ+F +T+E
Sbjct: 110 ACENHHRFSPSESSTFNSTEEPLSIQYGTGSMEGVLGYDTVIVSSIVDPDQIFGLSTQEP 169
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
F + FDGI+GLG+ +AV A PV+DNM+ + LV++ +FS ++NR G +
Sbjct: 170 GNIFTYSEFDGILGLGYPSLAVDQATPVFDNMMNKHLVAQNLFSVYMNRH---GPGSMLT 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
G +D ++ G +VP+T +GYWQF + I + Q C+GGC AI+D+GTSLL GP+
Sbjct: 227 LGAIDSSYYTGSLHWVPITVQGYWQFSVDRITVNGQVVA-CDGGCQAILDTGTSLLVGPS 285
Query: 301 PVVTEINHAIGG 312
++ I IG
Sbjct: 286 YDISNIQSVIGA 297
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 50/102 (49%), Gaps = 8/102 (7%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
+S I + + GE IDC + +MP V I + + L P Y ++ + VC
Sbjct: 288 ISNIQSVIGATQGQYGEFDIDCSSLSSMPTVVIHINGRQYPLPPSAYTIQ----MESVCT 343
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
SGF LWILGDVF+ Y++VFD R+G A+A
Sbjct: 344 SGFQG----DGSSQLWILGDVFIREYYSVFDRANNRVGLAKA 381
>gi|401623301|gb|EJS41405.1| pep4p [Saccharomyces arboricola H-6]
Length = 405
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 157/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDSISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEFAELENHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P+++F + F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDGLPDLTFNLNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNDAVGLAKA 404
>gi|444725492|gb|ELW66056.1| Gastricsin [Tupaia chinensis]
Length = 389
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 168/286 (58%), Gaps = 18/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
++ GL K L H + A+ ++ D P+ +MDA YFG
Sbjct: 33 MKEKGLLKEFLRTHKYDPAQ----------------KYHFNDFSVAYEPMA-YMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C H R+ +S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTNHPRFNPSQSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ + +++G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPALSMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G+++ VFSF+L+ +E+GG ++FGGVD + G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQEGVLTSPVFSFYLSNQQGSEDGGAVIFGGVDNSLYSGQIYWAPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 IEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSTLLQATGAQ 300
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++CD I ++P +F I F L P YIL
Sbjct: 287 QQYMSTLLQATGAQEDEYGQFLVNCDNIQSLPTFTFIINGVQFPLPPSAYILSNNGA--- 343
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C+ G A LP G PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 344 -CMVGVEATYLPSQNGQPLWILGDVFLRSYYSVYDMSNNRVGFATAA 389
>gi|190576563|gb|ACE79054.1| gastricsin precursor (predicted) [Sorex araneus]
Length = 389
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/268 (40%), Positives = 162/268 (60%), Gaps = 12/268 (4%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++ GD P+ ++DA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 53 KYHFGDFSVAYEPMA-YLDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 110
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ KS+TY+ G++ + YGSGS++GFF D + + ++ V Q F + E
Sbjct: 111 GHARFNPSKSSTYSTNGQTFSLQYGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGDN 170
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +A+G A M++ G + VFSF+L+ +++GG +VFGG
Sbjct: 171 FVYAQFDGIMGMAYPTLAMGGATTALQGMLQAGALDSPVFSFYLSNQQSSQDGGAVVFGG 230
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + LIG Q+TG C GC AIVD+GTSLL P +
Sbjct: 231 VDNSLYTGQIFWTPVTQELYWQIGVEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYM 290
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLI 331
+ + A G + + QYG ++
Sbjct: 291 SALQQATGAQ----------LDQYGQMV 308
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P Y+L
Sbjct: 287 QQYMSALQQATGAQLDQYGQMVVNCNNIQNLPTLTFVINGVQFPLLPSAYVLNNNG---- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP P G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTLGVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|56269596|gb|AAH86835.1| Nots protein [Danio rerio]
Length = 443
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 163/246 (66%), Gaps = 6/246 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C S +C H+++K+ +S+TY
Sbjct: 105 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VSQACALHNKFKAFESSTY 163
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 164 THDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 223
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D+M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++
Sbjct: 224 FPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWI 282
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCE---GGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT+KGYWQ +L + + + C GC AIVD+GTSL+ GP + + IG
Sbjct: 283 PVTQKGYWQIKLDAVKV-QGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGAT 341
Query: 314 GVVSAE 319
+ E
Sbjct: 342 PTANGE 347
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 61/98 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ + + P GE ++DC R+ ++P VSF I ++LS EQYI + ++C SGF
Sbjct: 334 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYIRRETLNNKQICFSGF 393
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
+ ++P P GP+WILGDVF+ ++++D G+ R+G A
Sbjct: 394 QSIEVPSPAGPMWILGDVFLSQVYSIYDRGENRVGLAR 431
>gi|327270926|ref|XP_003220239.1| PREDICTED: embryonic pepsinogen-like [Anolis carolinensis]
Length = 382
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 118/305 (38%), Positives = 173/305 (56%), Gaps = 20/305 (6%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
LA + A S + RI L++ + R T KE + + + R+ +G +
Sbjct: 4 LAILFAIVALSESIIRIPLQRGK-------KGRNTLKENGLLDSFLKEHRYDIGSKYRPM 56
Query: 75 L--------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
L PL N++D +Y+G I IG+PPQ F+V+FDTGSSNLWVPS+ C C H
Sbjct: 57 LEAAEVAGEPLMNYLDTEYYGTINIGTPPQAFTVVFDTGSSNLWVPSTYCS-DAPCQNHP 115
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
R+ +S+T+ ++ I YG+GS+ G D + V + V Q F ++ E + F
Sbjct: 116 RFDPSQSSTFENTQQTMSIQYGTGSMQGILGYDTLTVTGITVPKQEFALSSSEPGVFFTY 175
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
FDGI+GLG+ IAV D PV+DNM+ +GLV E +FS +L R G I FGG+D
Sbjct: 176 VPFDGILGLGYPSIAVSDVTPVFDNMMNEGLVQENLFSVYLGR---GGTGSIITFGGIDE 232
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++ G ++PVT++GYWQ EL IL+ ++ C GC AIVD+GTSL+AGP ++ +
Sbjct: 233 SYYTGSINWIPVTEQGYWQIELDSILVNGEAI-ACSDGCQAIVDTGTSLVAGPPSDISNL 291
Query: 307 NHAIG 311
+AIG
Sbjct: 292 QNAIG 296
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 62/131 (47%), Gaps = 10/131 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L + +S + + P G+ I+C + MP+V
Sbjct: 261 GEAIACSDGCQAIVDTGTSLVAGPPSD--ISNLQNAIGATPGQYGQYDINCGNLGNMPDV 318
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
F I F L+P Y L+ + E C SGF G LWILGDVF+ Y+++FD
Sbjct: 319 VFVINGIQFPLTPTAYTLEESQ---EECHSGFQNM-----SGYLWILGDVFIREYYSIFD 370
Query: 495 SGKLRIGFAEA 505
++G A+A
Sbjct: 371 RANNQVGLAKA 381
>gi|6325103|ref|NP_015171.1| Pep4p [Saccharomyces cerevisiae S288c]
gi|115643|sp|P07267.1|CARP_YEAST RecName: Full=Saccharopepsin; AltName: Full=Aspartate protease;
Short=PrA; Short=Proteinase A; AltName:
Full=Carboxypeptidase Y-deficient protein 4; AltName:
Full=Proteinase YSCA; Flags: Precursor
gi|172122|gb|AAB63975.1| vacuolar proteinase A precursor [Saccharomyces cerevisiae]
gi|1370328|emb|CAA97859.1| PEP4 [Saccharomyces cerevisiae]
gi|1403555|emb|CAA65567.1| P2585 protein [Saccharomyces cerevisiae]
gi|151942645|gb|EDN60991.1| vacuolar proteinase A [Saccharomyces cerevisiae YJM789]
gi|190407806|gb|EDV11071.1| vacuolar proteinase A [Saccharomyces cerevisiae RM11-1a]
gi|259150002|emb|CAY86805.1| Pep4p [Saccharomyces cerevisiae EC1118]
gi|285815388|tpg|DAA11280.1| TPA: Pep4p [Saccharomyces cerevisiae S288c]
gi|323302701|gb|EGA56507.1| Pep4p [Saccharomyces cerevisiae FostersB]
gi|323331178|gb|EGA72596.1| Pep4p [Saccharomyces cerevisiae AWRI796]
gi|323346153|gb|EGA80443.1| Pep4p [Saccharomyces cerevisiae Lalvin QA23]
gi|323351977|gb|EGA84516.1| Pep4p [Saccharomyces cerevisiae VL3]
gi|365762755|gb|EHN04288.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295854|gb|EIW06957.1| Pep4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 405
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNNAVGLAKA 404
>gi|344246136|gb|EGW02240.1| Renin [Cricetulus griseus]
Length = 720
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 175/285 (61%), Gaps = 19/285 (6%)
Query: 33 LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
LK+R +D+ L+A +R+ G G S V L N++D QY+GEIGIG
Sbjct: 8 LKERGVDMTKLSAEWGKFTKRFSFGNGTSPVI------------LTNYLDTQYYGEIGIG 55
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQ F VIFDTGS+NLWVPS+KC +C HS Y S +S++Y E G I+YGSG
Sbjct: 56 TPPQTFKVIFDTGSANLWVPSTKCSPLYSACEIHSLYDSSESSSYMENGTEFTIHYGSGK 115
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF SQD V VG ++V Q F E T + F+LA+FDG++G+GF AVG PV+D+
Sbjct: 116 VKGFLSQDIVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDH 174
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE---L 268
++ Q ++ EEVFS + +RD GGE+V GG DP+H++G YV V++ G W+ L
Sbjct: 175 ILSQRVLKEEVFSVYYSRDSHL-LGGEVVLGGSDPQHYQGNFHYVSVSRTGSWEIAMKGL 233
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ +G+ +T +CE GC +VD+G S ++GPT + I +G +
Sbjct: 234 RRVSVGS-ATLLCEEGCVVVVDTGASYISGPTSSLKLIMQTLGAK 277
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 52/87 (59%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ ++DC ++P++P++SF +G + + L+ Y+L+ + C D+PPP GP+W
Sbjct: 283 DYVVDCSQVPSLPDISFHLGGRAYTLTSADYVLQNPYRNDDQCTLALHGLDIPPPTGPVW 342
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+LG F+ ++T FD RIG +AA
Sbjct: 343 VLGASFIRKFYTEFDRHNNRIGEEKAA 369
>gi|349581664|dbj|GAA26821.1| K7_Pep4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 405
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNNAVGLAKA 404
>gi|301030231|gb|ADK47877.1| cathepsin D [Triatoma infestans]
Length = 390
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/239 (45%), Positives = 154/239 (64%), Gaps = 3/239 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N + QY+G I +G+PPQ F+VIFDTGSSNLW+PS+ C S++C H+ Y +S+TY
Sbjct: 63 LRNSFNTQYYGNITLGTPPQEFTVIFDTGSSNLWIPSAVCS-SVACRVHNTYDHDRSSTY 121
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+GSI+G S D +++GD+ VK+Q+F EA + F A+ DGI+GL
Sbjct: 122 QPDGRILRLTYGTGSIAGIMSSDVLQIGDLQVKNQLFGEALQVSDSPFARAKPDGILGLA 181
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTY 255
F IA AVP + NM++Q L+ + VFS +LNR+PD E GGEI+FGGVD + + K T
Sbjct: 182 FPSIAQDHAVPPFFNMIKQELLDKPVFSVYLNRNPDEEVGGEIIFGGVDEELYNKESMTT 241
Query: 256 VPVTKKGYWQFELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VP+T YW F++ I + T C+ GC I D+GTS + GP+ V EI +G E
Sbjct: 242 VPLTSTSYWMFQMDGISTSAEDGTSWCQNGCPGIADTGTSFIVGPSSDVDEIMELVGAE 300
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 45/85 (52%), Gaps = 2/85 (2%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G + CD + +P+++F I K + + E YILK + CI GF LP P
Sbjct: 304 GIGFVSCDDLDKLPDITFHINGKGYTIKAEDYILKVTQAGETACIVGFTT--LPSAPQPF 361
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G +TVF+ + FA
Sbjct: 362 WILGDVFLGKVYTVFNVEDRTVSFA 386
>gi|365986877|ref|XP_003670270.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
gi|343769040|emb|CCD25027.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
Length = 408
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/251 (43%), Positives = 161/251 (64%), Gaps = 8/251 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF +I +G+PPQ+F VI DTGSSNLWVPS +C S++CY HS+Y KS+
Sbjct: 84 IPLSNYLNAQYFADITLGTPPQSFKVILDTGSSNLWVPSVECG-SLACYLHSKYDHDKSS 142
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 143 SYKPNGTDFAIRYGTGSLEGYISQDTLNIGDLNIPKQDFAEATSEPGLTFAFGKFDGILG 202
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V VP + N +EQ L+ E+ F+F+L + + +E+GGEI GG+D FKG
Sbjct: 203 LAYDSISVNKVVPPFYNAIEQELLDEKKFAFYLGDANKKSEDGGEITIGGIDKTKFKGDI 262
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV +K YW+ + I +G+Q + G A +D+GTSL+A P+ + IN IG +
Sbjct: 263 DWLPVRRKAYWEVKFEGIGLGDQFAELENHGAA--IDTGTSLIALPSGLAEIINTEIGAK 320
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 321 KGWTGQYTVEC 331
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++CD P +P+++F K F + P Y L+ ++ CIS M D P P GP+
Sbjct: 325 GQYTVECDARPNLPDLTFNFNGKNFTIGPYDYTLE----VSGSCISAIMPMDFPEPVGPM 380
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D +G AEA
Sbjct: 381 AIIGDAFLRKYYSIYDLENNAVGLAEA 407
>gi|197247086|gb|AAI65335.1| Nots protein [Danio rerio]
Length = 416
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 163/246 (66%), Gaps = 6/246 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C S +C H+++K+ +S+TY
Sbjct: 78 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VSQACALHNKFKAFESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 137 THDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D+M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++
Sbjct: 197 FPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCE---GGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT+KGYWQ +L + + + C GC AIVD+GTSL+ GP + + IG
Sbjct: 256 PVTQKGYWQIKLDAVKV-QGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGAT 314
Query: 314 GVVSAE 319
+ E
Sbjct: 315 PTANGE 320
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 61/98 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ + + P GE ++DC R+ ++P VSF I ++LS EQYI + ++C SGF
Sbjct: 307 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYIRRETLNNKQICFSGF 366
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
+ ++P P GP+WILGDVF+ ++++D G+ R+G A
Sbjct: 367 QSIEVPSPAGPMWILGDVFLSQVYSIYDRGENRVGLAR 404
>gi|301784222|ref|XP_002927531.1| PREDICTED: pepsin B-like [Ailuropoda melanoleuca]
Length = 390
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/297 (37%), Positives = 173/297 (58%), Gaps = 6/297 (2%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILP 76
CL L S G+ RI LKK + + + R + V G ++ + P
Sbjct: 10 CLHL---SEGVERIVLKKGK-SIRQVMEERGVLETFLKNHPKVDPGAKYLYSNDAVAYEP 65
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
N++++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY
Sbjct: 66 FTNYLNSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACTNHNMFNPSSSSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YGSGS++ D V V ++++ +Q F + E + F A FDGI+G+
Sbjct: 125 RNNGQTYTLYYGSGSLTVLLGYDTVNVQNIIINNQEFGLSEIEPNNPFYYANFDGILGMA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A V +MV+Q +++ +FSF+ +R P E GGE++ GGVD + + G+ +
Sbjct: 185 YPNLAVGNAPTVTQSMVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWT 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + L+ NQ+TG+C GC AIVD+GT +LA P + G +
Sbjct: 245 PVTREMYWQIAIDEFLVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGAQ 301
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 49/86 (56%), Gaps = 5/86 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++DCD I +MP ++F I L P Y+L C G A LP P G P
Sbjct: 306 GDFVVDCDSIQSMPTITFVISWTALPLPPSAYVLNNNG----YCTLGIEATYLPSPTGQP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFA 503
LWILGDVF+ Y+T++D G R+GFA
Sbjct: 362 LWILGDVFLKEYYTIYDIGNNRMGFA 387
>gi|207340638|gb|EDZ68928.1| YPL154Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 385
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 34/68 (50%), Gaps = 4/68 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFM 486
I+GD F+
Sbjct: 378 AIVGDAFL 385
>gi|73621391|sp|Q9GMY4.1|PEPC_SORUN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798664|dbj|BAB11754.1| pepsinogen C [Sorex unguiculatus]
Length = 389
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 113/286 (39%), Positives = 165/286 (57%), Gaps = 18/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H + A+ ++ GD P+ ++DA YFG
Sbjct: 33 LREQGLLGEFLRTHPYDPAQ----------------KYHFGDFSVAYEPMA-YLDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ KS+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPSKSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + + ++ V Q F + E F+ A+FDGI+G+ + +A+G A
Sbjct: 135 GSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGENFVYAQFDGIMGMAYPTLAMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M++ G + VFSF+L+ +++GG +VFGGVD + G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQAGALDSPVFSFYLSNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ LIG Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 VEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYLSALQQATGAQ 300
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 58/107 (54%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ LS + + + + G+ +++C+ I +P ++F I F L P Y+L
Sbjct: 287 QQYLSALQQATGAQLDQDGQMVVNCNNIQNLPTLTFVINGVQFPLLPSAYVLNNNG---- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP P G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTLGVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|73621385|sp|Q9GMY7.1|PEPA_RHIFE RecName: Full=Pepsin A; Flags: Precursor
gi|9798658|dbj|BAB11751.1| pepsinogen A [Rhinolophus ferrumequinum]
Length = 386
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 176/300 (58%), Gaps = 25/300 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL + L HS+N A KE A + + PL+N+MD +YFG
Sbjct: 32 LMEQGLLQDYLKTHSINPASKYLKE----AASMMATQ-----------PLENYMDMEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY + + Y
Sbjct: 77 TIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYQGTNQKLSVAY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + IA A
Sbjct: 136 GTGSMTGILGYDTVQVGGITDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-FGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DN+ QGLVS+++FS +L+ + ++GG +V FGG+D +F G +VP++ + YWQ
Sbjct: 195 PVFDNIWNQGLVSQDLFSVYLSSN---DQGGSVVMFGGIDSSYFTGNLNWVPLSSETYWQ 251
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
+ I + Q C G C AIVD+GTSLL+GPT + I IG +A ++VVS
Sbjct: 252 ITVDSITMNGQVI-ACSGSCQAIVDTGTSLLSGPTNAIASIQGYIGASQ--NANGEMVVS 308
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 40/91 (43%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++ C I T+PN+ FTI + L P Y+L++ +G C SGF D+P
Sbjct: 300 NANGEMVVSCSAINTLPNIVFTINGVQYPLPPSAYVLQSQQG----CTSGFQGMDIPTSS 355
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD G ++G A A
Sbjct: 356 GELWILGDVFIRQYFTVFDRGNNQVGLAPVA 386
>gi|343425806|emb|CBQ69339.1| probable PEP4-aspartyl protease [Sporisorium reilianum SRZ2]
Length = 419
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 158/253 (62%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 98 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 156
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G SQD +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 157 SYKKNGTEFKIQYGSGSMEGIVSQDTLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 216
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + M++QGL+ SF+L E+GGE VFGG+D H+ GK
Sbjct: 217 LAYDTISVNGIVPPFYQMIDQGLLDSPQVSFYLGS--SEEDGGEAVFGGIDESHYSGKIH 274
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + E G AAI D+GTSL+A T +N IG
Sbjct: 275 WAPVKRKGYWEVALDKLALGDEELEL-ENGSAAI-DTGTSLIAMATDTAEILNAEIGATK 332
Query: 313 --EGVVSAECKLV 323
G S +C V
Sbjct: 333 SWNGQYSVDCDKV 345
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD++ +P ++F I + F L + Y+L+ + CIS F +LP P +
Sbjct: 336 GQYSVDCDKVKDLPPLTFYIDGQPFKLEGKDYVLE----VQGSCISSFSGINLPGPLADM 391
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GDVF+ Y++V+D GK +G A A
Sbjct: 392 LIVGDVFLRKYYSVYDLGKNAVGLATA 418
>gi|222425180|dbj|BAH20539.1| pepsinogen A-43 [Pongo abelii]
Length = 388
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 170/291 (58%), Gaps = 23/291 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++
Sbjct: 75 EYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|355706340|gb|AES02605.1| napsin A aspartic peptidase [Mustela putorius furo]
Length = 258
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 136/196 (69%), Gaps = 1/196 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRK 132
+PL N+++AQY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+
Sbjct: 63 FVPLSNYLNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKA 122
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G + G S+D + +G + +F EA E SL F A FDG+
Sbjct: 123 SSSFQPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGV 182
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AVG P D +V++GL+ + +FSF+LNRDP A +GGE+V GG DP H+
Sbjct: 183 LGLGFPILAVGGVRPPLDTLVDEGLLDKPIFSFYLNRDPKAADGGELVLGGSDPAHYIPP 242
Query: 253 HTYVPVTKKGYWQFEL 268
T++PVT YWQ +
Sbjct: 243 LTFLPVTIPAYWQIHM 258
>gi|432943847|ref|XP_004083297.1| PREDICTED: cathepsin E-A-like [Oryzias latipes]
Length = 412
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/333 (37%), Positives = 184/333 (55%), Gaps = 17/333 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKR-RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
+W ++ L +P N R ++ LD + T RY RLG S
Sbjct: 12 IWTASALLRVPLRRNPTIRTQMRAEGLLDQFLKDNQPDTFNRRYAQCFPPGTQSLRLGRS 71
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
E I NFMDAQY+GEI +G+P QNFSVIFDTGSS+LWVPSS C S +C FH +K+
Sbjct: 72 SEKIY---NFMDAQYYGEIRLGTPEQNFSVIFDTGSSDLWVPSSYC-VSQACAFHRHFKA 127
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+++ G++ I+YGSG + G +D + +G++ V +Q F E+ E TF+ A+FD
Sbjct: 128 FKSSSFHHDGRTFGIHYGSGHLLGVMGKDTLRIGNLTVLNQEFGESVYEPGSTFVTAKFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE-EGGEIVFGGVDPKHF 249
G++GL + +A PV+DNM+ Q ++ E +FSF+L+R G+++ GG D +
Sbjct: 188 GVLGLAYPSLAEIIGKPVFDNMLAQKILDEPIFSFYLSRSKSKSVPEGQLLLGGTDESLY 247
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G +VPVT KGYWQ + + + S+ +C GC AIVD+GTSL+AGP + ++
Sbjct: 248 SGPINWVPVTIKGYWQIRMDSVSVQGVSS-LCRRGCEAIVDTGTSLIAGPPREILRLHQL 306
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPE 342
IG + +GD + D LP
Sbjct: 307 IGA----------TPTHFGDFVVDCARLSSLPH 329
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/99 (41%), Positives = 66/99 (66%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++L + P G+ ++DC R+ ++P+V+F +G+ + L+ E YI K E+C +GF
Sbjct: 303 LHQLIGATPTHFGDFVVDCARLSSLPHVTFVLGEVEYTLTSEHYIRKETFSSRELCFTGF 362
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
MA ++ GPLWILGDVF+ Y+T+FD G+ R+GFA A
Sbjct: 363 MAAEMFSADGPLWILGDVFLTQYYTIFDKGQDRVGFARA 401
>gi|401838744|gb|EJT42213.1| PEP4-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 405
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 159/240 (66%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + IN +G +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAEL-EGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ ++P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDSLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNDAVGLAKA 404
>gi|126309845|ref|XP_001370435.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H ++ KS+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASIYCQ-SQACTNHPQFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+++ S + + + + G ++ C +MP ++F I F L P Y+L + E
Sbjct: 287 QQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPLPPSAYVLPSNSNYCE 346
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
V G M LP G PLWILGDVF+ Y++++D G R+GFA A
Sbjct: 347 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSIYDLGNNRVGFANLA 390
>gi|222425184|dbj|BAH20541.1| pepsinogen A-14 [Pongo abelii]
Length = 388
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 170/291 (58%), Gaps = 23/291 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSNGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|2687645|gb|AAB88862.1| cathepsin D [Sparus aurata]
Length = 399
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/271 (42%), Positives = 161/271 (59%), Gaps = 14/271 (5%)
Query: 77 LKNFMDAQYFGEIGIGSP-PQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSN 134
L NFMDAQY+G I IG+P ++F+V+FDTGSSNLWVPS C F I+C Y S+KS
Sbjct: 69 LTNFMDAQYYGVISIGTPVHRDFTVLFDTGSSNLWVPSIHCSFLDIACCASPSYNSKKST 128
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I YG GS+SGF S +V V + V Q F EA ++ +TF +ARFDG +G
Sbjct: 129 TYVQNGTEFSIRYGRGSLSGFISGSDVSVAGLPVPRQQFGEAVKQPGITFAVARFDGSLG 188
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK-GKH 253
+ + + + VPV+D + L+ + +FSF+L RDP A GGE+ GG DP G
Sbjct: 189 MAYPFHIIANVVPVFDTAMAAKLLPQNIFSFYLTRDPKAAVGGELTLGGTDPHVLTLGDL 248
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YV VT+K YW + + +GNQ + +C+ GC AIVD+GTSL+ GP V ++ AIG
Sbjct: 249 HYVNVTRKAYWHIGMDGLQVGNQLS-LCKAGCEAIVDTGTSLIVGPVEEVRALHKAIGAL 307
Query: 314 GVVSAE----------CKLVVSQYGDLIWDL 334
++ E C L +S G +++L
Sbjct: 308 PLIDGEYGLDCSGSHRCLLSLSTLGGRMFNL 338
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/99 (45%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+++ +LP GE +DC T+G ++FNL+ E Y++K + +C+SGF
Sbjct: 300 LHKAIGALPLIDGEYGLDCSGSHRCLLSLSTLGGRMFNLTGEDYVMKESQMGMSICVSGF 359
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
MA D+PPP GPLWILGDVF+G Y+TVFD R+GFA A
Sbjct: 360 MAMDIPPPAGPLWILGDVFIGKYYTVFDRNADRVGFAPA 398
>gi|224458278|ref|NP_001138942.1| pepsinogen A precursor [Pongo abelii]
gi|222425178|dbj|BAH20538.1| pepsinogen A-75 [Pongo abelii]
Length = 388
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|253762219|gb|ACT35561.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 113/251 (45%), Positives = 162/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++ T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQPTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + N GE+++ C I +MP+V
Sbjct: 255 GQTVACSGGCQAIIDTGTSLIVGPTSD--INNMNAWVGASTNQYGEAVVSCQNIQSMPDV 312
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ + F + Y+ + G C +GF LWILGDVF+ Y+ VFD
Sbjct: 313 TFTLNGQAFTIPASAYVSQNSYG----CNTGFGQGG----SDQLWILGDVFIREYYVVFD 364
Query: 495 SGKLRIGFAEAA 506
+ +G A +A
Sbjct: 365 AHAQYVGLASSA 376
>gi|365758066|gb|EHM99929.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 405
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 159/240 (66%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLTIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + IN +G +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAEL-EGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ ++P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 322 GQYTLDCNTRDSLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 377
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 378 AIVGDAFLRKYYSIYDLGNDAVGLAKA 404
>gi|323335315|gb|EGA76604.1| Pep4p [Saccharomyces cerevisiae Vin13]
Length = 368
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 44 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 102
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 103 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 162
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 163 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 222
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 223 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 280
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ + CIS D P P GPL
Sbjct: 285 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLEX----SGSCISAITPMDFPEPVGPL 340
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 341 AIVGDAFLRKYYSIYDLGNNAVGLAKA 367
>gi|14278413|pdb|1G0V|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant,
Mvv
Length = 329
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 246 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 301
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 302 AIVGDAFLRKYYSIYDLGNNAVGLAKA 328
>gi|363743175|ref|XP_003642787.1| PREDICTED: renin-like [Gallus gallus]
Length = 451
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 155/252 (61%), Gaps = 7/252 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N++D QY+GEI IG+PPQ F V+FDTGS+NLWVPS KC +C HSRY S KS T
Sbjct: 124 LTNYLDTQYYGEISIGTPPQTFKVVFDTGSANLWVPSCKCSPLYSACISHSRYDSSKSRT 183
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YG+GS+ GF SQD V V D+ + QVF EAT + F+ ARFDG++G+
Sbjct: 184 YIANGTGFAIRYGTGSVKGFLSQDVVMVSDIPII-QVFAEATVLPAFPFIFARFDGVLGM 242
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ A+ PV+D ++ Q ++ E+VFS + +R+ + GGEI+ GG DP ++ G Y
Sbjct: 243 GYPSQAIDGITPVFDRILSQQILKEDVFSVYYSRNSPLKPGGEIILGGTDPAYYTGDFHY 302
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+ +++ GYWQ + + +G + C+ GC+ +D+G S + GP V+ + AIG
Sbjct: 303 LSISRSGYWQISMKGVSVGAEML-FCKEGCSVAIDTGASYITGPAGPVSVLMKAIGAAEM 361
Query: 313 -EGVVSAECKLV 323
EG +C+ V
Sbjct: 362 TEGEYVVDCEKV 373
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 57/87 (65%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++DC+++P +PN+SF +G K + LS Y+L+ + ++C+ D+PPP GPL
Sbjct: 364 GEYVVDCEKVPQLPNISFHLGGKAYTLSGSAYVLRQTQYGEDICVVALSGLDIPPPAGPL 423
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILG F+G Y+T FD RIGFA A
Sbjct: 424 WILGASFIGHYYTKFDRRNNRIGFATA 450
>gi|114607413|ref|XP_518465.2| PREDICTED: gastricsin isoform 2 [Pan troglodytes]
Length = 388
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 178/307 (57%), Gaps = 16/307 (5%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS------GVRHR 66
W++ + L S + ++ LKK + + R T KE+ + G + ++R
Sbjct: 3 WMVVVLVCLQLSEAAVVKVPLKKFK-------SIRETMKEKGLLGEFLRTHKYDPAWKYR 55
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
GD P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C HS
Sbjct: 56 FGDLSVTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHS 113
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
R+ +S+TY+ G++ + YGSGS++GFF D + V + V +Q F + E F+
Sbjct: 114 RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVY 173
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+GL + ++V +A MV++G ++ VFS +L+ GG +VFGGVD
Sbjct: 174 AQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDS 232
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ G+ + PVT++ YWQ + + LIG Q++G C GC AIVD+GTSLL P ++ +
Sbjct: 233 SLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAL 292
Query: 307 NHAIGGE 313
A G +
Sbjct: 293 LEATGAQ 299
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 59/107 (55%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + E + + G+ +++C+ I +P ++F I F L P YIL + +G
Sbjct: 286 QQYMSALLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYIL-SNDGY-- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 -CTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|73621390|sp|Q9GMY3.1|PEPC_RHIFE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798666|dbj|BAB11755.1| pepsinogen C [Rhinolophus ferrumequinum]
Length = 389
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 111/286 (38%), Positives = 172/286 (60%), Gaps = 8/286 (2%)
Query: 34 KKRRLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFG 87
K ++ L L + R T KE+ + + ++R D P+ +MDA YFG
Sbjct: 17 KVVKVPLKKLKSLRETMKEKGLLEEFLKNHKYDPAQKYRYTDFSVAYEPMA-YMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C + +C H+R+ +S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-TQACTGHTRFNPSQSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ + +A+G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPSLAMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G ++ VFSF+L+ ++ GG ++FGGVD ++G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYQGQIYWAPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 IEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 300
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ ++C+ I +P +F I F L P YIL
Sbjct: 287 QQYMSALLQATGAQEDQYGQFFVNCNYIQNLPTFTFIINGVQFPLPPSSYILNNNG---- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 YCTVGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|194210206|ref|XP_001488754.2| PREDICTED: renin-like [Equus caballus]
Length = 391
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 176/291 (60%), Gaps = 10/291 (3%)
Query: 37 RLDLHSLNAARITRKERYMG----GAGVSGVRHRLG-DSDEDILPLKNFMDAQYFGEIGI 91
R+ L + + R + +ER + GA S RL D+ + L N++D QY+GEIGI
Sbjct: 17 RIFLRKMPSVRESLRERGVDVSRIGAEWSQFTKRLSRDNSTSPVVLTNYLDTQYYGEIGI 76
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
G+PPQ F VIFDTGS+NLWVPS+KC +C HS Y S +S++Y E G I YGSG
Sbjct: 77 GTPPQTFKVIFDTGSANLWVPSTKCSPLYAACEIHSLYDSSESSSYMENGTEFTIRYGSG 136
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
+ GF SQD V VG + V Q F E T + F+LA+FDG++G+GF AVG PV+D
Sbjct: 137 KVKGFLSQDMVTVGGITVT-QTFAEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFD 195
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+++ Q ++ E+VFS + +R+ GGEIV GG DP++++G YV V+K WQ ++
Sbjct: 196 HILSQRVLKEDVFSVYYSRNSKNSHLLGGEIVLGGSDPQYYQGNFHYVSVSKTDSWQIKM 255
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + +T +CE GC +VD+G S ++GPT + + +G + + S E
Sbjct: 256 KGVSV-RSATLLCEEGCMVVVDTGASYISGPTSSLRLLMETLGAKELSSDE 305
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C+++PT+P++SF +G + + L+ Y+L+ ++C D+PPP GP+W
Sbjct: 305 EYVVNCNQVPTLPDISFHLGGRAYTLTSADYVLQDPYSNDDLCTLALHGLDVPPPTGPVW 364
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
+LG F+ ++T FD RIGFA A
Sbjct: 365 VLGASFIRKFYTEFDRHNNRIGFALA 390
>gi|7766834|pdb|1DP5|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant
Inhibitor
gi|7766836|pdb|1DPJ|A Chain A, The Structure Of Proteinase A Complexed With Ia3 Peptide
Inhibitor
gi|22218637|pdb|1FMU|A Chain A, Structure Of Native Proteinase A In P3221 Space Group.
gi|22218638|pdb|1FMX|A Chain A, Structure Of Native Proteinase A In The Space Group P21
gi|22218639|pdb|1FMX|B Chain B, Structure Of Native Proteinase A In The Space Group P21
gi|225346|prf||1301217A proteinase A,Asp
Length = 329
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 246 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 301
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 302 AIVGDAFLRKYYSIYDLGNNAVGLAKA 328
>gi|195046637|ref|XP_001992191.1| GH24623 [Drosophila grimshawi]
gi|193893032|gb|EDV91898.1| GH24623 [Drosophila grimshawi]
Length = 374
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 156/252 (61%), Gaps = 9/252 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N M+ Y+G I IG+PPQ+F V+FD+GSSNLWVPS+ C S +C H++Y S S+TY
Sbjct: 62 LSNSMNMAYYGAITIGTPPQSFEVLFDSGSSNLWVPSNTCT-STACEVHNQYDSSASSTY 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SGF S D V++ + V Q F EAT E F A FDGI+G+G
Sbjct: 121 QSNGESFSIQYGTGSLSGFLSTDTVDINGLSVTSQTFAEATDEPGTNFNNANFDGILGMG 180
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTY 255
++ I+ D VPV+ NMV QGLV + VFSF+L R +GGE++FGG D + G TY
Sbjct: 181 YQTISQDDVVPVFYNMVSQGLVDQSVFSFYLARAGTSTTDGGELIFGGSDSSLYSGDLTY 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGE 313
VPV+++GYWQF + S +C+ C AI D+GTSL+ P +N + E
Sbjct: 241 VPVSQEGYWQFTMDSATADGNS--LCD-DCQAIADTGTSLIVAPANAYELLNEILNVDDE 297
Query: 314 GVVSAECKLVVS 325
G+V +C + S
Sbjct: 298 GLV--DCSTISS 307
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 15/101 (14%)
Query: 409 ELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGF- 466
EL + + N E ++DC I ++P ++F IG F+LSP YI+++ GE C S F
Sbjct: 286 ELLNEILNVDDEGLVDCSTISSLPVITFNIGGTNFDLSPSAYIIQSDGE-----CQSSFQ 340
Query: 467 -MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
M D WILGDVF+G Y+T FD G R+GFA A
Sbjct: 341 YMGTDF-------WILGDVFIGQYYTEFDLGNNRVGFAPVA 374
>gi|129786|sp|P27678.1|PEPA4_MACFU RecName: Full=Pepsin A-4; AltName: Full=Pepsin I/II; Flags:
Precursor
gi|38071|emb|CAA42425.1| prepropepsin A [Macaca fuscata]
Length = 388
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 177/313 (56%), Gaps = 26/313 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPA-----SKYFPQAEAPTLI--------DEQPLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P QNF+V+FDTGSSNLWVPS CY S++C H+ + + S+TY K+ I Y
Sbjct: 79 TIGIGTPAQNFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYRATSKTVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V+VG + +Q+F + E A FDGI+GL + I+ A P
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSETEPGFFLYFAPFDGILGLAYPSISSSGATP 197
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ Q LVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 198 VFDNIWNQRLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQIS 255
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECK 321
+ I + N T C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 256 VDSITM-NGKTIACAKGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSA- 313
Query: 322 LVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 314 --ISSLPDIVFTI 324
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++ C I ++P++ FTI + L P YIL++ C SGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQSQGS----CTSGFQGMDVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|2624629|pdb|2JXR|A Chain A, Structure Of Yeast Proteinase A
gi|10835733|pdb|1FQ4|A Chain A, Crystal Structure Of A Complex Between Hydroxyethylene
Inhibitor Cp- 108,420 And Yeast Aspartic Proteinase A
gi|10835734|pdb|1FQ5|A Chain A, X-Ray Struture Of A Cyclic Statine Inhibitor Pd-129,541
Bound To Yeast Proteinase A
gi|10835735|pdb|1FQ6|A Chain A, X-Ray Structure Of Glycol Inhibitor Pd-133,450 Bound To
Saccharopepsin
gi|10835736|pdb|1FQ7|A Chain A, X-Ray Structure Of Inhibitor Cp-72,647 Bound To
Saccharopepsin
gi|10835737|pdb|1FQ8|A Chain A, X-Ray Structure Of Difluorostatine Inhibitor Cp81,198
Bound To Saccharopepsin
Length = 329
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ +P++ F F + P Y L+ ++ CIS D P P GPL
Sbjct: 246 GQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 301
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D G +G A+A
Sbjct: 302 AIVGDAFLRKYYSIYDIGNNAVGLAKA 328
>gi|158257160|dbj|BAF84553.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 177/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT +T I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPITNIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|130484814|ref|NP_001076103.1| gastricsin precursor [Oryctolagus cuniculus]
gi|73621389|sp|Q9GMY2.1|PEPC_RABIT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798668|dbj|BAB11756.1| pepsinogen C [Oryctolagus cuniculus]
Length = 388
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 118/315 (37%), Positives = 178/315 (56%), Gaps = 25/315 (7%)
Query: 5 LLRSVFCLWVLASCLL------LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
LL ++ CL +L + ++ + L+ GL K L+ H + A
Sbjct: 4 LLVALVCLHLLEAAVIKVPLRKFKSIRETLKEKGLLKEFLNTHKYDPA------------ 51
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+++R GD P+ +++DA YFGEI IG+P QNF V+FDTGSSNLWVPS C
Sbjct: 52 ----LKYRFGDFSVTYEPM-DYLDAAYFGEISIGTPSQNFLVLFDTGSSNLWVPSVYCQ- 105
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+R+ KS+T+ ++ + YGSGS++GFF D + ++ V +Q F +
Sbjct: 106 SEACTTHNRFNPSKSSTFYTYDQTFSLEYGSGSLTGFFGYDTFTIQNIEVPNQEFGLSET 165
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E FL A FDGI+GL + ++VGDA P MV+ G +S VFSF+L+ +GG
Sbjct: 166 EPGTNFLYAEFDGIMGLAYPSLSVGDATPALQGMVQDGTISSSVFSFYLSSQ-QGTDGGA 224
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVD + G + PVT++ YWQ + + LI ++++G C GC AIVD+GTSLL
Sbjct: 225 LVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEFLISSEASGWCSQGCQAIVDTGTSLLTV 284
Query: 299 PTPVVTEINHAIGGE 313
P ++++ A G +
Sbjct: 285 PQEYMSDLLEATGAQ 299
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 69/137 (50%), Gaps = 9/137 (6%)
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
+S+ S CS A+V L ++ +S + E + N GE ++DCD ++
Sbjct: 259 ISSEASGWCSQGCQAIVDTGTSLLT--VPQEYMSDLLEATGAQENEYGEFLVDCDSTESL 316
Query: 432 PNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVY 489
P +F I F LSP YIL T G+ C+ G A L G PLWILGDVF+ Y
Sbjct: 317 PTFTFVINGVEFPLSPSAYILNTDGQ-----CMVGVEATYLSSQDGEPLWILGDVFLRAY 371
Query: 490 HTVFDSGKLRIGFAEAA 506
++VFD R+GFA A
Sbjct: 372 YSVFDMANNRVGFAALA 388
>gi|254596794|gb|ACT75642.1| pepsinogen A [Channa argus]
Length = 361
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 110/254 (43%), Positives = 160/254 (62%), Gaps = 14/254 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IG+PPQ+FSVIFD+GSSNLWVPS C S +C H+++ ++S++
Sbjct: 44 PMTNDADMSYYGVISIGTPPQSFSVIFDSGSSNLWVPSVYCSSSQACQNHNKFNPQQSSS 103
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G+S I YG+GS++G+ D V VG V V +QVF + E + + DGI+GL
Sbjct: 104 FQWNGESLSIQYGTGSMTGYLGADTVGVGGVSVANQVFGLSQSEAPFMAHM-QADGILGL 162
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++NMV QGLVS+ +FS +L+ ++ +G E+VFGGVD H+ G+ +
Sbjct: 163 AFQSIASDNVVPVFNNMVSQGLVSQPMFSVYLSS--NSAQGSEVVFGGVDSNHYTGQIAW 220
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT ++ IN +G
Sbjct: 221 IPLTSATYWQIKMDSVSINGQ-TVACSGGCQAIIDTGTSLIVGPTSDISNINSWVGAS-- 277
Query: 316 VSAECKLVVSQYGD 329
QYGD
Sbjct: 278 --------TDQYGD 283
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + +S IN + + G++ ++C I +MP V
Sbjct: 240 GQTVACSGGCQAIIDTGTSLIVGPTSD--ISNINSWVGASTDQYGDATVNCQNIQSMPEV 297
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ F + Y+ ++ G C +GF LWILGDVF+ Y+ VFD
Sbjct: 298 TFTLNGNAFTIPATAYVSQSYYG----CTTGFGQGG----SDQLWILGDVFIRQYYAVFD 349
Query: 495 SGKLRIGFAEAA 506
+ IG A++A
Sbjct: 350 TQGPYIGLAKSA 361
>gi|344295434|ref|XP_003419417.1| PREDICTED: pepsin A-2/A-3-like [Loxodonta africana]
Length = 384
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 119/286 (41%), Positives = 171/286 (59%), Gaps = 18/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LRR LK+ L L R+ +Y S + D L+N++D +YFG
Sbjct: 27 LRR-NLKEHGLLDDFLKTHRLNPASKYFPKEASSLL---------DTQTLENYLDVEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY ++ I Y
Sbjct: 77 TIGIGTPAQEFTVIFDTGSSNLWVPSTYCS-SLACTNHNRFNPDDSSTYRSTSETVSITY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + + FDGI+GL + I+ DA
Sbjct: 136 GTGSMTGILGYDTVKVGGISDTNQIFGLSETEPGSFLYY-SPFDGILGLAYPSISSSDAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-FGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DN+ +QGLVS+++FS +L+ D EEGG +V FGG+D ++ G +VPV+ +GYWQ
Sbjct: 195 PVFDNIWDQGLVSQDLFSVYLSSD---EEGGSVVIFGGIDSSYYTGSLNWVPVSYEGYWQ 251
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
L + I +S C C AI+D+GTSLLAGPT + I +G
Sbjct: 252 ITLDSVSIDGESVA-CSDTCQAIIDTGTSLLAGPTTAIANIQEYLG 296
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 62/134 (46%), Gaps = 12/134 (8%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKE--KVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A++ L T + Y+ L DS E + C ++P
Sbjct: 261 GESVACSDTCQAIIDTGTSLLAGPTTAIANIQEYLG-LGDS-----SEEEVSCSTADSLP 314
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
N+ FTI + +SP YI++ + C+ G DL G LWILGDVF+ Y+TV
Sbjct: 315 NIVFTINGVQYPVSPSSYIVEEDQS----CVVGLEGMDLDTYSGELWILGDVFIRQYYTV 370
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 371 FDRANNQVGLASVA 384
>gi|332267172|ref|XP_003282561.1| PREDICTED: pepsin A-5 [Nomascus leucogenys]
Length = 372
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 125/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 12 LRRTLSEHGLLKDFLKKHNLNPAR-----KYF--------PQLEAPTLVDEQPLENYLDM 58
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 59 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 117
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 118 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 176
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 177 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYSGSLNWVPVTVEG 234
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 235 YWQITVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 293
Query: 319 ECKLVVS 325
C + S
Sbjct: 294 SCSAISS 300
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 286 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 341
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 342 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 372
>gi|426368715|ref|XP_004051348.1| PREDICTED: pepsin A-5-like isoform 1 [Gorilla gorilla gorilla]
Length = 388
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/290 (42%), Positives = 170/290 (58%), Gaps = 23/290 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAAVA 388
>gi|73620984|sp|P81497.2|PEPA_SUNMU RecName: Full=Pepsin A; Flags: Precursor
gi|9798654|dbj|BAB11749.1| pepsinogen A [Suncus murinus]
Length = 387
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 119/295 (40%), Positives = 174/295 (58%), Gaps = 22/295 (7%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL K L H++N A +Y + L D PL N+MD +YFG IGI
Sbjct: 36 GLLKDFLAKHNVNPA-----SKYFPTEAAT----ELADQ-----PLVNYMDMEYFGTIGI 81
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ +KS+T+ ++ I YG+GS
Sbjct: 82 GTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQKSSTFQSTSQTLSIAYGTGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
++G D V+V + +Q+F + T GS + + FDGI+GL + IA A PV+D
Sbjct: 141 MTGVLGYDTVQVAGIADTNQIFGLSQTEPGSFLYY-SPFDGILGLAYPNIASSGATPVFD 199
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NM QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ +
Sbjct: 200 NMWNQGLVSQDLFSVYLSS--NDQSGSVVIFGGIDSSYYTGNLNWVPLSSEGYWQITVDS 257
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
I + Q+ C G C AIVD+GTSLL+GP + I +IG +A ++VVS
Sbjct: 258 ITMNGQAIA-CSGSCQAIVDTGTSLLSGPNNAIANIQKSIGASQ--NANGQMVVS 309
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A+V L ++ I + + N G+ ++ C I ++P++
Sbjct: 262 GQAIACSGSCQAIVDTGTSLLSG--PNNAIANIQKSIGASQNANGQMVVSCSSIQSLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YIL+ + C SGF D+P P G LWILGDVF+ Y VFD
Sbjct: 320 VFTINGIQYPLPASAYILQNQQD----CTSGFQGMDIPTPSGELWILGDVFIRQYFAVFD 375
Query: 495 SGKLRIGFAEAA 506
G R+G A A
Sbjct: 376 RGNNRVGLAPVA 387
>gi|195433875|ref|XP_002064932.1| GK15196 [Drosophila willistoni]
gi|194161017|gb|EDW75918.1| GK15196 [Drosophila willistoni]
Length = 415
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 189/351 (53%), Gaps = 32/351 (9%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRL---------------DLHSLNAARITRKERYMG 56
LW L + +L+ SS R+ + R+ DL SL+ + + +
Sbjct: 8 LWFLLALVLIAFSSAEARKRKHNRVRVGRHNNPSSSHYNVKHDLKSLSIKHKLKLSKAIV 67
Query: 57 GAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC 116
VS + + L N + +Y+ + IG+PPQ F ++ DTGS+NLWVPSSKC
Sbjct: 68 DNAVSASTKTGTTAATNAASLGNAYNTEYYITVHIGTPPQEFRLLIDTGSANLWVPSSKC 127
Query: 117 YFSI-SCYFHSRYKSRKSNTYTEIGKSCEINYGSGS-----ISGFFSQDNVEVGDVVVKD 170
++ +C H RY S S+TY + +I Y S + + GF SQD V +GD+ +K+
Sbjct: 128 PSTVKACAAHQRYNSSASSTYKANNTAFQIEYASNTAGGVALDGFLSQDTVAIGDLAIKN 187
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
QVF E T E TFL + FDG+IGL + I++ +P N++ QGL+ E +FS +LNR+
Sbjct: 188 QVFAEMTNEPDGTFLTSPFDGMIGLAYASISINGVIPPLYNLISQGLIPEPIFSIYLNRN 247
Query: 231 -PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+A GGE++ GG+DP + G TYVPV+++GYWQFE+ + +Q C+ C AI+
Sbjct: 248 GTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFEMTSATLNDQE--FCD-NCQAIL 304
Query: 290 DSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVVSQYGDLIWDL 334
D GTSL+ P + EIN +G G +C +S+ D+I+ +
Sbjct: 305 DVGTSLIVVPNSEIKEINQILGVTNPNATSGAFLVDCA-TISKLPDIIFTI 354
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 90/200 (45%), Gaps = 37/200 (18%)
Query: 331 IWDLLVSGLLPEKV----CQQIGLCAFNGAEYVSTGIKTVV------------------E 368
+++L+ GL+PE + + G A NG E + GI + E
Sbjct: 226 LYNLISQGLIPEPIFSIYLNRNGTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFE 285
Query: 369 KENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES---IIDC 425
+ + D C C+ A++ V L E + IN++ + NP S ++DC
Sbjct: 286 MTSATLNDQEFCDNCQ-AILDVGTSLIVVPNSE--IKEINQIL-GVTNPNATSGAFLVDC 341
Query: 426 DRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVF 485
I +P++ FTI K F L YILK G C+SGF D WILG+VF
Sbjct: 342 ATISKLPDIIFTIARKEFALKSTDYILKYGN----TCVSGFSTLD----GIDFWILGEVF 393
Query: 486 MGVYHTVFDSGKLRIGFAEA 505
MG Y+TVFD G +IG A A
Sbjct: 394 MGAYYTVFDIGYNQIGIATA 413
>gi|51534964|dbj|BAD36915.1| pepsinogen C [Myocastor coypus]
Length = 393
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 176/305 (57%), Gaps = 7/305 (2%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR---KERYMGGAGVSGVRHRLGD 69
W + + L LP + RI L+K + ++ + + K+ A +H GD
Sbjct: 3 WAIVALLCLPLLEAAVLRIPLRKSKSIREAMKENGLLKQYLKDHKQDPAQKFFGKH-FGD 61
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
+ P+ +MDA YFGEI +G+PPQ+F V+FDTGSSNLWV S C S++C HSR+
Sbjct: 62 YSVLLEPM-TYMDASYFGEISLGTPPQSFQVLFDTGSSNLWVASIYCK-SLACTTHSRFN 119
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TYT G++ + YGSGS++G F D + + D V Q F + +E +FL A F
Sbjct: 120 PNKSSTYTSAGQTFSLQYGSGSLTGLFGYDTLTIQDTQVPKQEFGLSEQEPGGSFLYAAF 179
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA-EEGGEIVFGGVDPKH 248
DGI+GL + ++ GDA ++ +G +S+ +FS +L DA EGG ++ GGVD
Sbjct: 180 DGIMGLAYPGLSAGDATTAMQGLLREGALSQSLFSVYLGSQQDATNEGGALILGGVDESL 239
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G ++ PVT++ YWQ + D L+ +++G C GC AIVD+GTSLL P ++ +
Sbjct: 240 YSGAISWTPVTQELYWQIGIEDFLLDGEASGWCSEGCQAIVDTGTSLLTVPQQYLSTLIE 299
Query: 309 AIGGE 313
AIG E
Sbjct: 300 AIGAE 304
>gi|426368717|ref|XP_004051349.1| PREDICTED: pepsin A-5-like isoform 2 [Gorilla gorilla gorilla]
Length = 388
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 122/290 (42%), Positives = 170/290 (58%), Gaps = 23/290 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFEGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAAVA 388
>gi|407726061|dbj|BAM46128.1| pepsinogen C [Cynops pyrrhogaster]
Length = 383
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 173/321 (53%), Gaps = 24/321 (7%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
+ L+ ++ CL + +P L + ++ + H + A R+ +Y
Sbjct: 2 KNLILALVCLQFAEGLVRIP-----LHKFKPMRQVMAEHGVKAPRVDPATKY-------- 48
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
R + PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C
Sbjct: 49 ---RFNNFAVGYEPLSNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS-SSAC 104
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+TYT + I YG+GS++G D V + + + Q F + E
Sbjct: 105 TNHATFNPSQSSTYTSNNQKFSIQYGTGSLTGILGYDTVSIQGITITQQEFALSVNEPGT 164
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F+ A+FDGI+GL + IA A V + M+ QGL+S+ +F F+L + ++ GGE+VFG
Sbjct: 165 NFVYAQFDGILGLAYPSIAADGATTVMEGMMNQGLLSQNIFGFYLGQQ-GSQSGGELVFG 223
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD ++ G+ T+ PVT++ YWQ + + Q TG C GC IVD+GTSLL P
Sbjct: 224 GVDSNYYTGQITWTPVTQQMYWQIGISGFGVNGQPTGWCGQGCQGIVDTGTSLLTAPGQY 283
Query: 303 VTEINHAIG------GEGVVS 317
+ + IG GE VVS
Sbjct: 284 IAALMQEIGATQDSNGEYVVS 304
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 52/90 (57%), Gaps = 7/90 (7%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRG- 476
GE ++ C I ++P +SFTIG L P YIL+ GE C G M LP G
Sbjct: 299 GEYVVSCSNIDSLPTLSFTIGGTSLPLPPSAYILQNNGE-----CSVGIMPTYLPSQNGQ 353
Query: 477 PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
PLWILGDVF+ Y++++D ++GFA AA
Sbjct: 354 PLWILGDVFLRQYYSIYDVTNNQVGFATAA 383
>gi|222425186|dbj|BAH20542.1| pepsinogen A-35 [Pongo abelii]
Length = 388
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 169/291 (58%), Gaps = 23/291 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S+ C H+ + + S+TY ++
Sbjct: 75 EYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLVCMDHNLFNPQDSSTYKSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|222425182|dbj|BAH20540.1| pepsinogen A-15 [Pongo abelii]
Length = 388
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 118/291 (40%), Positives = 170/291 (58%), Gaps = 23/291 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + H PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YILK+ EG CISGF ++P
Sbjct: 302 NSNGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILKS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|410974069|ref|XP_003993470.1| PREDICTED: pepsin A-like [Felis catus]
Length = 387
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 153/239 (64%), Gaps = 6/239 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H R+ ++S+T
Sbjct: 66 PLENYMDMEYFGTIGIGTPPQQFTVIFDTGSSNLWVPSVYCK-SPACTNHKRFNPQESST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y I YG+GS++G D V+VG V +Q+F + T GS + A FDGI+G
Sbjct: 125 YQATNNPVSIAYGTGSMTGILGYDTVQVGGVSDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + +I+ A PV+DNM +GLVS+++FS +L+ + + G ++FGG+D ++ G
Sbjct: 184 LAYPQISASGATPVFDNMWNEGLVSQDLFSVYLSG--NDQSGSVVMFGGIDSSYYTGNLN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV+ +GYWQ + I + QS C GGC AIVD+GTSLL GP+ + I IG
Sbjct: 242 WIPVSVEGYWQISVDSITMNGQSI-ACNGGCQAIVDTGTSLLTGPSNAIANIQSDIGAS 299
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ I C I +P++ FTI + L P YIL++ +G CISG +LP
Sbjct: 301 NSYGQMGISCSAINNLPDIVFTINGNEYPLPPSAYILQSQQG----CISGLQGMNLPTAS 356
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y VFD ++G A A
Sbjct: 357 GELWILGDVFIRQYFAVFDRANNQVGLAPVA 387
>gi|357167304|ref|XP_003581098.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like
[Brachypodium distachyon]
Length = 225
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 101/200 (50%), Positives = 141/200 (70%), Gaps = 9/200 (4%)
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
+M EQ L+++++F+FWLNR+ DA GGE+VF D H+KG HTYVPV ++G WQF +GD
Sbjct: 22 SMQEQKLLADDIFTFWLNREADASSGGELVF--XDSNHYKGNHTYVPVRRRGXWQFNMGD 79
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+LI +QSTG C GCA IV SGTSLLAGP + ++NHAIG E +++ ECK VSQYG++
Sbjct: 80 LLIDDQSTGFCAKGCADIVYSGTSLLAGPICIFAQVNHAIGAERIINTECKEEVSQYGEM 139
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWV 390
LL+ P+KVC F+G GI++VV K+NV G +C+ACEMA+VW+
Sbjct: 140 TLHLLLVQTKPQKVCS-----XFDGTLSDYNGIESVVGKKNV--GSVVICTACEMAIVWI 192
Query: 391 QNQLKQKQTKEKVLSYINEL 410
+NQL+ +TKE +L Y+N++
Sbjct: 193 ENQLRXNKTKELILQYVNQV 212
>gi|221048011|gb|ACL98113.1| pepsinogen [Epinephelus coioides]
Length = 311
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 116/267 (43%), Positives = 166/267 (62%), Gaps = 17/267 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G + DNVEVG + V++QVF I T + + A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILGL 177
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+
Sbjct: 178 AFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSS--HGEQGSEVVFGGIDSSHYTGQVTW 235
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 236 VPLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS-- 292
Query: 316 VSAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 --------TNQYGESTVNCQNVGSMPE 311
>gi|403217759|emb|CCK72252.1| hypothetical protein KNAG_0J01710 [Kazachstania naganishii CBS
8797]
Length = 415
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 110/265 (41%), Positives = 164/265 (61%), Gaps = 9/265 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQ F VI DTGSSNLWVPSS+C S++C+ H +Y S+
Sbjct: 91 VPLSNYLNAQYYTDITLGTPPQQFKVILDTGSSNLWVPSSEC-GSLACFLHEKYDHSASS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+G
Sbjct: 150 SYKANGTDFSIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V VP + N +EQ L+ E F+F+L + + DAE+GGE +FGGVD + G
Sbjct: 210 LAYDTISVDKVVPPFYNALEQDLLDEAKFAFYLGDTNKDAEDGGEAIFGGVDKSKYTGDV 269
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ +L + +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 TWLPVRRKAYWEVKLEGLGLGDEYAELESHGAA--IDTGTSLITLPSGLAEIINSEIGAK 327
Query: 314 ----GVVSAECKLVVSQYGDLIWDL 334
G + EC Q DL ++
Sbjct: 328 KGWTGQYTLECN-TRDQLPDLTFNF 351
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ ++C+ +P+++F F + P Y L+ ++ CIS D P P GPL
Sbjct: 332 GQYTLECNTRDQLPDLTFNFNGYNFTIGPYDYTLE----VSGSCISAITPMDFPEPVGPL 387
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D +G A+A
Sbjct: 388 AIVGDAFLRKYYSIYDLEHNAVGLAKA 414
>gi|126309841|ref|XP_001370380.1| PREDICTED: gastricsin-like isoform 1 [Monodelphis domestica]
Length = 388
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 152/234 (64%), Gaps = 1/234 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD+ Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ +S+TY+
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++DC+ I +P +SF I F LSP YIL C G L G P
Sbjct: 304 GQYVVDCNSIQNLPTISFLINGVQFPLSPSAYILNNNG----YCTVGIEPTYLASQNGQP 359
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 360 LWILGDVFLRSYYSVYDMGNNRVGFATAA 388
>gi|229576947|ref|NP_001153272.1| pepsinogen A precursor [Pongo abelii]
gi|222425188|dbj|BAH20543.1| pepsinogen A-19 [Pongo abelii]
gi|222425190|dbj|BAH20544.1| pepsinogen A-13 [Pongo abelii]
gi|222425204|dbj|BAH20551.1| pepsinogen A-41 [Pongo abelii]
Length = 388
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKTHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|126309843|ref|XP_001370404.1| PREDICTED: gastricsin-like isoform 2 [Monodelphis domestica]
Length = 389
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 152/234 (64%), Gaps = 1/234 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD+ Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ +S+TY+
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/89 (43%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++DC+ I +P +SF I F LSP YIL G C G L G P
Sbjct: 304 GQYVVDCNSIQNLPTISFLINGVQFPLSPSAYILNQNNG---YCTVGIEPTYLASQNGQP 360
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 361 LWILGDVFLRSYYSVYDMGNNRVGFATAA 389
>gi|45382395|ref|NP_990208.1| gastricsin precursor [Gallus gallus]
gi|4589840|dbj|BAA76893.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 173/312 (55%), Gaps = 13/312 (4%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
++L+ +V CL + L +P R +K+ + LH A Y + +
Sbjct: 2 KRLILTVLCLHLCEGILRVPLKKGKSIREAMKESGV-LHDYLANHRHYDPAYKFFSNFAT 60
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C
Sbjct: 61 AYE----------PLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQAC 109
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+T++ + + YGSGS++G F D V + + + +Q F + E
Sbjct: 110 ANHNEFDPNESSTFSTQDEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGT 169
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
+FL + FDGI+GL F I+ G A V M+++ L+ VFSF+L+ + +GGE+VFG
Sbjct: 170 SFLYSPFDGILGLAFPSISAGGATTVMQKMLQENLLDFPVFSFYLSGQ-EGSQGGELVFG 228
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP + G+ T+ PVT+ YWQ + D +G QS+G C GC IVD+GTSLL P V
Sbjct: 229 GVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQV 288
Query: 303 VTEINHAIGGEG 314
TE+ IG +
Sbjct: 289 FTELMQYIGAQA 300
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 77/176 (43%), Gaps = 14/176 (7%)
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 392
+L+ G+ P QI Y GI E V S CS +V
Sbjct: 224 ELVFGGVDPNLYTGQITWTPVTQTTYWQIGI----EDFAVGGQSSGWCSQGCQGIVDTGT 279
Query: 393 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 450
L Q +++ YI D G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFTELMQYIGAQADD----SGQYVASCSNIEYMPTITFVISGTSFPLPPSAY 335
Query: 451 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+L++ ++ C G + LP G PLWILGDVF+ VY++++D G ++GFA A
Sbjct: 336 MLQSN---SDYCTVGIESTYLPSQTGQPLWILGDVFLRVYYSIYDMGNNQVGFATA 388
>gi|255644659|gb|ACU22832.1| unknown [Glycine max]
Length = 144
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 90/138 (65%), Positives = 120/138 (86%)
Query: 368 EKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
E+E ++A D+ +CS+C+M V+W+QNQLKQK TK++V +Y+N+LC+SLP+P GES+I C+
Sbjct: 6 EQEELAARDTPLCSSCQMLVLWIQNQLKQKATKDRVFNYVNQLCESLPSPSGESVISCNS 65
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMG 487
+ MPN++FTIG+K F L+PEQYIL+TGEGI EVC+SGF+AFD+PPP+GPLWILGDVFM
Sbjct: 66 LSKMPNITFTIGNKPFVLTPEQYILRTGEGITEVCLSGFIAFDVPPPKGPLWILGDVFMR 125
Query: 488 VYHTVFDSGKLRIGFAEA 505
YHTVFD G L++GFAEA
Sbjct: 126 AYHTVFDYGNLQVGFAEA 143
>gi|23943854|ref|NP_055039.1| pepsin A-5 preproprotein [Homo sapiens]
gi|378522017|sp|P0DJD9.1|PEPA5_HUMAN RecName: Full=Pepsin A-5; AltName: Full=Pepsinogen-5; Flags:
Precursor
gi|20810074|gb|AAH29055.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|119594334|gb|EAW73928.1| pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|219520836|gb|AAI71889.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|223461673|gb|AAI47000.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 72.0 bits (175), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|149725191|ref|XP_001501954.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H N A + A G L+N+MD +YFG
Sbjct: 32 LRENGLLADFLKQHPRNPASKYFPKEAATLAATEG--------------LENYMDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPAQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESVSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISASGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + +S C GGC AIVD+GTSLLAGPT + I IG GE V+S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDSSGEAVIS 309
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + + SYI DS GE++I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDS----SGEAVISCSSIYSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI F LSP YIL+ + CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINGVEFPLSPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD +IG A A
Sbjct: 374 FDRANNQIGLAPVA 387
>gi|4505757|ref|NP_002621.1| gastricsin isoform 1 preproprotein [Homo sapiens]
gi|129796|sp|P20142.1|PEPC_HUMAN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|387015|gb|AAA60063.1| pepsinogen C [Homo sapiens]
gi|551176|gb|AAA60074.1| pepsinogen [Homo sapiens]
gi|1658286|gb|AAB18273.1| gastricsin [Homo sapiens]
gi|49522219|gb|AAH73740.1| Progastricsin (pepsinogen C) [Homo sapiens]
gi|119624464|gb|EAX04059.1| progastricsin (pepsinogen C) [Homo sapiens]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P+++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|148691635|gb|EDL23582.1| progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 174/304 (57%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ LKK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPLKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L +GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSDGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 8/104 (7%)
Query: 406 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 463 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ G + L G PLWILGDVF+ Y+ VFD G R+G A +
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLAPS 391
>gi|219521036|gb|AAI71897.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + N T C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRKYFTVFDRANNQVGLAPVA 388
>gi|387014|gb|AAA60062.1| pepsinogen [Homo sapiens]
Length = 385
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 17 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 75
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 76 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 134
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 135 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 194
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 195 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 253
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 254 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 296
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P+++F I F L P YIL
Sbjct: 283 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNG---- 338
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 339 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 385
>gi|540097|gb|AAB08492.1| preprochymosin, partial [Sus scrofa]
Length = 380
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/258 (43%), Positives = 157/258 (60%), Gaps = 15/258 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D QYFG+I IG+PPQ F+V+FDTGSS LWVPS C S +C H R+ KS+T
Sbjct: 64 PLTNYLDTQYFGKIYIGTPPQEFTVVFDTGSSELWVPSVYCK-SDACQNHHRFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + K I YG+GSI GF D V V +V Q +T+E S F + FDGI+GL
Sbjct: 123 FQNLDKPLSIQYGTGSIQGFLGYDTVMVAGIVDAHQTVGLSTQEPSDIFTYSEFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ E+A VPV+DNM+ + LV++++F+ +++R+ +EG + G +DP ++ G +
Sbjct: 183 GYPELASEYTVPVFDNMMHRHLVAQDLFAVYMSRN---DEGSMLTLGAIDPSYYTGSLHW 239
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT + YWQF + + I N C GGC AI+D+GTS+LAGP+ + I AIG
Sbjct: 240 VPVTMQLYWQFTVDSVTI-NGVVVACNGGCQAILDTGTSMLAGPSSDILNIQMAIGA--- 295
Query: 316 VSAECKLVVSQYGDLIWD 333
SQYG+ D
Sbjct: 296 -------TESQYGEFDID 306
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/87 (36%), Positives = 44/87 (50%), Gaps = 8/87 (9%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE IDC + +MP V F I +++ L P Y +G C SGF +
Sbjct: 301 GEFDIDCGSLSSMPTVVFEISGRMYPLPPSAYT-NQDQGF---CTSGFQG----DSKSQH 352
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILG VF+ Y++VFD R+G A+A
Sbjct: 353 WILGVVFIQEYYSVFDRANNRVGLAKA 379
>gi|195471992|ref|XP_002088286.1| GE18491 [Drosophila yakuba]
gi|194174387|gb|EDW87998.1| GE18491 [Drosophila yakuba]
Length = 392
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 151/253 (59%), Gaps = 10/253 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPSS C S I+C H++Y S S+T
Sbjct: 68 LHNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSSSCPASNIACKKHNKYNSAASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SG S D V + + ++DQ F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGILSTDTVTIAGISIQDQTFGEALNEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNMV QGL+ E V SF+L R A GGE++ GG+D +KG TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYKGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCE-GGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
VPV+ YWQF + I ++ G+ GC AI D+GTSL+ P +IN +G
Sbjct: 248 VPVSVPAYWQFAVNTI----KTNGIVLCNGCQAIADTGTSLIVAPLAAYRKINRQLGATD 303
Query: 312 -GEGVVSAECKLV 323
G+G C V
Sbjct: 304 NGDGEAFVSCSRV 316
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 55/100 (55%), Gaps = 4/100 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN + N GE+ + C R+ T+P V+ IG IF L+P YI++ + C+S F
Sbjct: 295 INRQLGATDNGDGEAFVSCSRVSTLPKVNLNIGGTIFTLAPRDYIVRLTQNGRTYCMSAF 354
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ WILGDVF+G ++TVFD G RIGFA A
Sbjct: 355 TYME----GLSFWILGDVFIGKFYTVFDKGNERIGFARVA 390
>gi|426250269|ref|XP_004018860.1| PREDICTED: gastricsin [Ovis aries]
Length = 431
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/283 (38%), Positives = 168/283 (59%), Gaps = 8/283 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + + +H GD P+ ++MDA YFGEI
Sbjct: 21 KIPLKKFKSIRETMKEKGLLEDFLRTYKHDPAQKYHFGDFSVATEPM-DYMDAAYFGEIS 79
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C H R+ S+TY+ ++ + YGSG
Sbjct: 80 IGTPPQNFLVLFDTGSSNLWVPSLYCQ-SQACTSHPRFNPSLSSTYSSNEQTFSLQYGSG 138
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++G D + V + V +Q F + E FL A+FDGI+G+ + ++V A V
Sbjct: 139 SLTGLLGYDTLTVQGIQVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYPSLSVDGATTVLQ 198
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FSF+L+ +++GG ++FGGVD + + G+ + PVT++ YWQ + +
Sbjct: 199 GMVQEGALTSPIFSFYLSSQQGSQDGGAVIFGGVDSRLYTGQIYWAPVTQELYWQIGIEE 258
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 259 FLIGDQATGWCSAGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 8/140 (5%)
Query: 370 ENVSAGDSAV--CSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDR 427
E GD A CSA A+V L ++ LS + + + + G+ +DC+
Sbjct: 257 EEFLIGDQATGWCSAGCQAIVDTGTSLLT--VPQQFLSALLQATGAQKDQYGQFPVDCNN 314
Query: 428 IPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFM 486
I +P ++F I F L P YIL G+ C+ G +P G PLWILGDVF+
Sbjct: 315 IQNLPTLTFVINGMQFPLPPASYILSNGD---SYCVLGVEVTYIPSQNGQPLWILGDVFL 371
Query: 487 GVYHTVFDSGKLRIGFAEAA 506
Y++V+D G R+GFA AA
Sbjct: 372 RSYYSVYDLGNNRVGFATAA 391
>gi|119372298|ref|NP_001073275.1| pepsin A preproprotein [Homo sapiens]
gi|378521956|sp|P0DJD8.1|PEPA3_HUMAN RecName: Full=Pepsin A-3; AltName: Full=Pepsinogen-3; Flags:
Precursor
gi|182887917|gb|AAI60184.1| Pepsinogen 3, group I (pepsinogen A) [synthetic construct]
Length = 388
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|449542760|gb|EMD33738.1| hypothetical protein CERSUDRAFT_56642 [Ceriporiopsis subvermispora
B]
Length = 395
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/271 (45%), Positives = 165/271 (60%), Gaps = 11/271 (4%)
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
GG G + H G ++L L N+ +AQYF E+ +G+PPQNF VI DTGSSNLWVPS
Sbjct: 57 GGLGRNTEVHHSGPG-HNVL-LSNYANAQYFTEVSLGTPPQNFKVILDTGSSNLWVPSVH 114
Query: 116 CYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIE 175
C SI+C+ HS+Y S KS++Y G S EI YGSGS+ G SQD + +GD+ + +Q F E
Sbjct: 115 C-MSIACFMHSKYDSSKSSSYNANGSSFEIQYGSGSMQGIVSQDTLSIGDLNITNQDFAE 173
Query: 176 ATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE 235
AT+E L+F +FDGI+GL + I+V P + NMVEQGL+ +FSF L DA
Sbjct: 174 ATKEPGLSFTFGKFDGILGLAYNSISVNYITPPFYNMVEQGLLDNPIFSFKLG---DAPL 230
Query: 236 GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSL 295
GGE +FGG D + G+ Y PV ++ YW+ EL + +G+Q + G A +D+GTSL
Sbjct: 231 GGEAIFGGTDESAYTGEIIYAPVRRQAYWEVELDKVTLGDQVFEFQDTGAA--IDTGTSL 288
Query: 296 LAGPTPVVTEINHAIGG---EGVVSAECKLV 323
+A PT T IN IG G EC +
Sbjct: 289 IAVPTAQATAINKLIGATSKSGTYVVECSTI 319
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 5/87 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G +++C IP +P +FTI + + L+ Y+L I C+S F D+P PL
Sbjct: 310 GTYVVECSTIPNLPVFTFTINGQDYPLNATDYVLS----IDGTCMSAFTPMDMPD-SAPL 364
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WI+GDVF+ Y+TVFD + +GFA A
Sbjct: 365 WIVGDVFLRRYYTVFDLEQDAVGFATA 391
>gi|222425194|dbj|BAH20546.1| pepsinogen A-28 [Pongo abelii]
gi|222425196|dbj|BAH20547.1| pepsinogen A-17 [Pongo abelii]
gi|222425202|dbj|BAH20550.1| pepsinogen A-71 [Pongo abelii]
Length = 388
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|426251840|ref|XP_004019629.1| PREDICTED: pepsin A-like [Ovis aries]
Length = 386
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 110/245 (44%), Positives = 158/245 (64%), Gaps = 6/245 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 65 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D VEVG + +Q+F + T GS + A FDGI+G
Sbjct: 124 YEATSETLSITYGTGSMTGILGYDTVEVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FGG+D ++ G
Sbjct: 183 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSSN--EESGSVVMFGGIDSSYYSGSLN 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 241 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 299
Query: 315 VVSAE 319
S E
Sbjct: 300 DSSGE 304
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 261 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEEVISCSSIDSLP 316
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL+ + VC SGF D+P G LWILGDVF+ Y TV
Sbjct: 317 DIVFTINGVQYPVPPSAYILQNDD----VCSSGFEGMDIPTSSGDLWILGDVFIRQYFTV 372
Query: 493 FDSGKLRIGFAEAA 506
FD +IG A A
Sbjct: 373 FDRANNQIGLAPVA 386
>gi|194762104|ref|XP_001963198.1| GF19727 [Drosophila ananassae]
gi|190616895|gb|EDV32419.1| GF19727 [Drosophila ananassae]
Length = 449
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 173/298 (58%), Gaps = 19/298 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+RI ++ R L H+L + + K +Y+ A S ++ E ++ NF Y+G
Sbjct: 20 LKRIEIRPRNL-THNLQSEILLLKAKYLSSADESV------EAKEILVNAANFA---YYG 69
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EI IG+PPQNFSV+FDTGSSN WVPSS C S ++C H++YKS S+TY +G + I
Sbjct: 70 EISIGTPPQNFSVLFDTGSSNTWVPSSLCPASDVACQSHNQYKSSASSTYVPVGTNISIV 129
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+ GF S D V + + V +Q F EAT E F FDGI+GLGF ++ G
Sbjct: 130 YGTGSMEGFLSNDTVRIAGLNVTNQTFAEATAEPDGFFDSQPFDGILGLGFNTLSNGINT 189
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV DNM+ QGL+ + FS +L R+ + GGEI++GG DP + G TYVPV+ YWQF
Sbjct: 190 PV-DNMIAQGLLDKPEFSVYLRRNGSSLIGGEIIWGGTDPSIYHGSITYVPVSVPQYWQF 248
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI----GGEGVVSAEC 320
+ I Q +C GC AI D+GTSL+ P T IN + G+G S C
Sbjct: 249 TVDTGTINGQI--LCR-GCQAIADTGTSLIIVPKRAFTAINKQLNATDNGDGTASIPC 303
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/105 (40%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIA 459
++ + IN+ ++ N G + I C I +P + IG F+L+P+ YI+K GE +
Sbjct: 279 KRAFTAINKQLNATDNGDGTASIPCWEICKLPTLYLNIGGTRFSLAPKDYIIKIVGENGS 338
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
C+SGF + LWILGDVF+G Y+TVFD G RIGFA+
Sbjct: 339 SQCLSGFEYLE----GNLLWILGDVFIGKYYTVFDLGNERIGFAK 379
>gi|346322842|gb|EGX92440.1| vacuolar protease A precursor [Cordyceps militaris CM01]
Length = 395
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 183/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGV----RH 65
++A+ +L ++ G+ ++ L+K +L S A ++Y+G S
Sbjct: 5 LIAAAVLAGSAHAGIHKMKLQKIPLAEQLVGASFEAQAQQLGQKYLGARPASRADIIFNA 64
Query: 66 RLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
++ +S ++P+ NF +AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 KVAESKNGHLVPVSNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S+TY + G EI+YGSGS++G+ S D V +GD+ +K+ F EAT E L F
Sbjct: 124 HSTYDSSSSSTYKKNGSDFEIHYGSGSLTGYVSNDVVRIGDLTIKNTDFAEATNEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + M++Q L+ E VF+F+L + EEG E VFGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNHMVPPFYQMIKQKLLDEPVFAFYLGSE---EEGSEAVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H++GK Y+P+ +K YW+ + I G + + G I+D+GTSL P+ +
Sbjct: 241 DKNHYEGKIEYLPLRRKAYWEVDFDAIAFGKEVAELENTGV--ILDTGTSLNTLPSDLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNKEIGAK 307
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC +P+++FT+ + L YIL+ G C+S F D+PPP GPL
Sbjct: 312 GQYTIDCAARDKLPDITFTLAGSNYTLPATDYILELGGS----CVSTFTPLDMPPPAGPL 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D K +G A A
Sbjct: 368 AILGDAFLRRYYSVYDLNKNAVGLARA 394
>gi|119372302|ref|NP_001073276.1| pepsin A preproprotein [Homo sapiens]
gi|378521995|sp|P0DJD7.1|PEPA4_HUMAN RecName: Full=Pepsin A-4; AltName: Full=Pepsinogen-4; Flags:
Precursor
gi|387012|gb|AAA98529.1| pepsinogen [Homo sapiens]
gi|157170280|gb|AAI52845.1| Pepsinogen 4, group I (pepsinogen A) [synthetic construct]
gi|219520853|gb|AAI71920.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|219521176|gb|AAI71910.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|223462201|gb|AAI50660.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|261860840|dbj|BAI46942.1| pepsinogen 4, group I [synthetic construct]
Length = 388
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|222425192|dbj|BAH20545.1| pepsinogen A-59 [Pongo abelii]
Length = 388
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|73621386|sp|Q9GMY8.1|PEPA_SORUN RecName: Full=Pepsin A; Flags: Precursor
gi|9798656|dbj|BAB11750.1| pepsinogen A [Sorex unguiculatus]
Length = 387
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/300 (38%), Positives = 171/300 (57%), Gaps = 22/300 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL + L HSLN A +Y + ++ PL N+MD +YFG
Sbjct: 32 LWENGLLEDFLKTHSLNPA-----SKYFPTEATTLSANQ---------PLVNYMDMEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ +KS+T+ ++ I Y
Sbjct: 78 TISIGTPPQEFTVIFDTGSSNLWVPSIYCS-SPACSNHNRFDPQKSSTFKPTSQTVSIAY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V+V + +Q+F + E + FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVQVAGIADTNQIFGLSQSEPGSFLYYSPFDGILGLAYPSISSSGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ
Sbjct: 197 VFDNMWNQGLVSQDLFSVYLSS--NDQSGSVVMFGGIDSSYYTGSLNWVPLSSEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 323
+ I + QS C GGC AIVD+GTSLL+GPT + I IG +G ++ C +
Sbjct: 255 VDSITMNGQSI-ACNGGCQAIVDTGTSLLSGPTNAIANIQSKIGASQNSQGQMAVSCSSI 313
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 62/132 (46%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G S C+ A+V L T ++ I + N G+ + C I +P++
Sbjct: 262 GQSIACNGGCQAIVDTGTSLLSGPTN--AIANIQSKIGASQNSQGQMAVSCSSIKNLPDI 319
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YIL++ EG C SGF D+P G LWILGDVF+ Y TVFD
Sbjct: 320 VFTINGIQYPLPASAYILQSQEG----CSSGFQGMDIPTSSGELWILGDVFIRQYFTVFD 375
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 376 RANNQVGLAPVA 387
>gi|189066533|dbj|BAG35783.1| unnamed protein product [Homo sapiens]
gi|193785072|dbj|BAG54225.1| unnamed protein product [Homo sapiens]
gi|219521010|gb|AAI71815.1| Pepsinogen 3, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|395534115|ref|XP_003769093.1| PREDICTED: gastricsin-like [Sarcophilus harrisii]
Length = 392
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H ++ +S+T
Sbjct: 66 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVSSIYCQ-SQACTNHPQFNPNQSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E +F+ A+FDGI+GL
Sbjct: 125 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTSFVYAQFDGILGL 184
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V ++++ L++ VF+F+L+ + ++ GGE+ FGGVD F G +
Sbjct: 185 AYPSISSGGATTVMQGLLQENLINAPVFAFYLSGNENSNNGGEVTFGGVDTSMFTGDIYW 244
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GC A+VD+GTSLL P + +E+ IG +
Sbjct: 245 APVTQEAYWQIAINGFSIGGQATGWCSEGCQAVVDTGTSLLTAPQQIFSELMQYIGAQ 302
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/107 (34%), Positives = 55/107 (51%), Gaps = 4/107 (3%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
+++ S + + + N G ++ C + M ++F I F L P Y+L + E
Sbjct: 289 QQIFSELMQYIGAQQNENGAYLVSCSNVQNMSTITFNINGVNFPLPPSAYVLPSNSNYCE 348
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
V G M LP G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 349 V---GIMPTYLPSQNGQPLWILGDVFLRNYYSVYDLGNNRVGFANLA 392
>gi|397526910|ref|XP_003833357.1| PREDICTED: gastricsin [Pan paniscus]
Length = 388
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLEATGAQ 299
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 59/107 (55%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + E + + G+ +++C+ I +P ++F I F L P YIL + +G
Sbjct: 286 QQYMSALLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYIL-SNDGY-- 342
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 343 -CTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|219521691|gb|AAI71808.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRTNNQVGLAPVA 388
>gi|380865655|gb|AFF19538.1| pepsin F, partial [Camelus dromedarius]
Length = 354
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 171/287 (59%), Gaps = 9/287 (3%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED-----ILPLKNFMDAQYFGEIGIG 92
+ L + R +E+ M + +RL D+ PL+N++D Y +I IG
Sbjct: 16 IPLTKVKPMRENLREKNMLKDFLEQYTYRLSDNTAPAKRVYTQPLRNYLDLVYIADISIG 75
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
+PPQNF V+FDTGS+NLWVPS C S +C HS + +S T++ G+S EI YG+G I
Sbjct: 76 TPPQNFKVVFDTGSANLWVPSIYCD-SKACANHSVFNPPRSTTFSLEGRSFEITYGTGKI 134
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF D V +G++V+ Q F + +E + A FDGI+GLG+ +++ PV+DN+
Sbjct: 135 AGFLGYDTVRIGNLVIGSQAFGMSQKEPGIFLEHAVFDGILGLGYPALSIVGTTPVFDNL 194
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
+Q L+ E +F+F+L+ E G ++FGG+D ++KG+ +VPV+++ YWQ + I
Sbjct: 195 KKQRLLKEPIFAFYLST--KKENGSVVMFGGLDHSYYKGELKWVPVSQRLYWQISMDSIT 252
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + G C+GGC AIVD+GT++L GPT VVT I AI + E
Sbjct: 253 MNGKILG-CKGGCQAIVDTGTAVLVGPTNVVTNIQKAINARPLTGYE 298
>gi|29244579|ref|NP_080249.2| gastricsin precursor [Mus musculus]
gi|73921722|sp|Q9D7R7.1|PEPC_MOUSE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|12843461|dbj|BAB25990.1| unnamed protein product [Mus musculus]
gi|68534888|gb|AAH99409.1| Progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 173/304 (56%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ LKK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPLKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 8/104 (7%)
Query: 406 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 463 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ G + L G PLWILGDVF+ Y+ VFD G R+G A +
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYYAVFDMGNNRVGLAPS 391
>gi|217038345|gb|ACJ76637.1| pepsinogen C (predicted) [Oryctolagus cuniculus]
Length = 391
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 113/286 (39%), Positives = 166/286 (58%), Gaps = 19/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL K L+ H + A +++R GD P+ +++DA YFG
Sbjct: 36 LKEKGLLKEFLNTHKYDPA----------------LKYRFGDFSVTYEPM-DYLDAAYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+P QNF V+FDTGSSNLWVPS C S +C H+R+ KS+T+ ++ + Y
Sbjct: 79 EISIGTPSQNFLVLFDTGSSNLWVPSVYCQ-SEACTTHNRFNPSKSSTFYTYDQTFSLEY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + ++ V +Q F + E FL A FDGI+GL + ++VGDA P
Sbjct: 138 GSGSLTGFFGYDTFTIQNIEVPNQEFGLSETEPGTNFLYAEFDGIMGLAYPSLSVGDATP 197
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
MV+ G +S VFSF+L+ +GG +V GGVD + G + PVT++ YWQ
Sbjct: 198 ALQGMVQDGTISSSVFSFYLSSQ-QGTDGGALVLGGVDSSLYTGDIYWAPVTRELYWQIG 256
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LI ++++G C GC AIVD+GTSLL P ++++ A G +
Sbjct: 257 IDEFLISSEASGWCSQGCQAIVDTGTSLLTVPQEYMSDLLEATGAQ 302
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/262 (29%), Positives = 113/262 (43%), Gaps = 34/262 (12%)
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL----------AGPTPVVTEINHAIG 311
GY F + +I + NQ G+ E + GT+ L A P+ V + A+
Sbjct: 147 GYDTFTIQNIEVPNQEFGLSE------TEPGTNFLYAEFDGIMGLAYPSLSVGDATPALQ 200
Query: 312 G---EGVVSAE--CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV 366
G +G +S+ + SQ G L++ G+ I Y GI
Sbjct: 201 GMVQDGTISSSVFSFYLSSQQGTDGGALVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEF 260
Query: 367 VEKENVSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCD 426
+ +S+ S CS A+V L ++ +S + E + N GE ++DCD
Sbjct: 261 L----ISSEASGWCSQGCQAIVDTGTSLLT--VPQEYMSDLLEATGAQENEYGEFLVDCD 314
Query: 427 RIPTMPNVSFTIGDKIFNLSPEQYILKT-GEGIAEVCISGFMAFDLPPPRG-PLWILGDV 484
++P +F I F LSP YIL T G+ C+ G A L G PLWILGDV
Sbjct: 315 STESLPTFTFVINGVEFPLSPSAYILNTDGQ-----CMVGVEATYLSSQDGEPLWILGDV 369
Query: 485 FMGVYHTVFDSGKLRIGFAEAA 506
F+ Y++VFD R+GFA A
Sbjct: 370 FLRAYYSVFDMANNRVGFAALA 391
>gi|195034430|ref|XP_001988894.1| GH11416 [Drosophila grimshawi]
gi|193904894|gb|EDW03761.1| GH11416 [Drosophila grimshawi]
Length = 400
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 178/314 (56%), Gaps = 19/314 (6%)
Query: 28 LRRIGLKK---RRLDLHSLNAARITRKERYMG-------GAGVSGVRHRLGDSDEDILP- 76
L RI + K ++ H +AAR R++ + GA + + + DS+ D
Sbjct: 19 LHRIPIHKHQQKKTRQHMKSAARHLRQKYHKQSELYVDYGAPNNDLSGSVEDSNADYTTE 78
Query: 77 -LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
L N + Y+GEI IG+PPQ F V+FDTGSSNLWVPS C + ++C H++Y S S+
Sbjct: 79 ELSNNQNMDYYGEIAIGTPPQYFKVVFDTGSSNLWVPSVNCLPTDLACQTHNQYNSSASS 138
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G+S I YG+GS++G+ S D V + + + +Q F EAT + + +F FDGI+G
Sbjct: 139 TYVANGESFSIQYGTGSLTGYLSSDTVSISGLSIVNQSFAEATSQPNSSFTGVPFDGILG 198
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + IA VP + N+ QGL+ + F F+L + AE GGE++ GGVD F+G T
Sbjct: 199 MAYSSIAEDSVVPPFYNLWNQGLIDKPTFGFYLTHNGSAELGGELILGGVDNTLFEGNLT 258
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
VPV++ GYWQF + + + N V C AI D+GTSLLA P +T IN+ IG
Sbjct: 259 SVPVSQMGYWQFAMAVVAMDNN---VICSDCQAIADTGTSLLAVPANQLTYINNIIGAYQ 315
Query: 313 -EGVVSAECKLVVS 325
+G +C LV S
Sbjct: 316 MDGDYFVDCSLVNS 329
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 68/132 (51%), Gaps = 9/132 (6%)
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
V+ ++ +CS C+ A+ L + L+YIN + + G+ +DC + ++
Sbjct: 275 VAMDNNVICSDCQ-AIADTGTSLLAVPANQ--LTYINNIIGAYQMD-GDYFVDCSLVNSL 330
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
P ++F IG+ +F+L+ +YI E + C+S F + D WILGD F+G Y+T
Sbjct: 331 PTLNFLIGESVFSLTSAEYITVIQESDTKYCMSSFTSIDT-----NFWILGDTFIGHYYT 385
Query: 492 VFDSGKLRIGFA 503
FD G + FA
Sbjct: 386 QFDFGHNSVSFA 397
>gi|385301236|gb|EIF45441.1| proteinase a [Dekkera bruxellensis AWRI1499]
Length = 429
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/238 (44%), Positives = 148/238 (62%), Gaps = 3/238 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+M+AQYF EI +G+P Q F VI DTGSSNLWVPSS C S++CY H++Y +S+T
Sbjct: 107 PLTNYMNAQYFSEIELGTPGQKFKVILDTGSSNLWVPSSDCA-SLACYLHTKYDHEQSST 165
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YGSGS+ G+ SQD +++ D+ + +Q F EAT E L F +FDGI+GL
Sbjct: 166 YKKNGSEFSIQYGSGSMKGYISQDTLKISDLEITNQDFAEATEEPGLAFAFGKFDGILGL 225
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ I+V VP N + GL+ FSF+L E+GG FGG+D F GK T+
Sbjct: 226 GYDTISVNHIVPPVYNAINSGLLDNPQFSFYLGDTSKTEDGGVCTFGGIDDSKFTGKITW 285
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + +N IG E
Sbjct: 286 LPVRRKAYWEVKFEGIGLGDEYAELQSHGAA--IDTGTSLIVLPSQLAEILNSEIGAE 341
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 54/87 (62%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC++ ++P+++ T G F LSP Y L+ ++ C+S F D+P P GPL
Sbjct: 346 GQYTVDCNKRDSLPDLTLTFGGYNFTLSPYDYTLE----VSGSCMSAFTGMDMPEPIGPL 401
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D GK +G A+A
Sbjct: 402 AIIGDAFLRRYYSVYDLGKDAVGLAKA 428
>gi|50978660|ref|NP_001003028.1| pepsin B precursor [Canis lupus familiaris]
gi|73621387|sp|Q8SQ41.1|PEPB_CANFA RecName: Full=Pepsin B; Flags: Precursor
gi|19911571|dbj|BAB86888.1| pepsinogen B [Canis lupus familiaris]
Length = 390
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/297 (37%), Positives = 171/297 (57%), Gaps = 6/297 (2%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-P 76
CL L S G+ RI LKK + + + R + V L ++D P
Sbjct: 10 CLHL---SEGVERIILKKGK-SIRQVMEERGVLETFLRNHPKVDPAAKYLFNNDAVAYEP 65
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
N++D+ YFGEI IG+PPQNF ++FDTGSSNLWVPS+ C S +C H+R+ +S+TY
Sbjct: 66 FTNYLDSYYFGEISIGTPPQNFLILFDTGSSNLWVPSTYCQ-SQACSNHNRFNPSRSSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ + YG GS++ D V V ++V+ +Q+F + E + F + FDGI+G+
Sbjct: 125 QSSEQTYTLAYGFGSLTVLLGYDTVTVQNIVIHNQLFGMSENEPNYPFYYSYFDGILGMA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AV + V NM++QG +++ +FSF+ + P E GGE++ GGVD + + G+ +
Sbjct: 185 YSNLAVDNGPTVLQNMMQQGQLTQPIFSFYFSPQPTYEYGGELILGGVDTQFYSGEIVWA 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 245 PVTREMYWQVAIDEFLIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 301
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 46/86 (53%), Gaps = 5/86 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G +++C+ I +MP ++F I L P Y+L C G LP P G P
Sbjct: 306 GNFVVNCNSIQSMPTITFVISGSPLPLPPSTYVLNNNG----YCTLGIEVTYLPSPNGQP 361
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFA 503
LWILGDVF+ Y+TVFD R+GFA
Sbjct: 362 LWILGDVFLREYYTVFDMAANRVGFA 387
>gi|335287195|ref|XP_003355296.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 391
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 182/319 (57%), Gaps = 9/319 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L ++ CL L S G+ RI L+K + ++ + K ++ +
Sbjct: 4 LVLVLMCLYL---SEGMERIILRKGKSIREAMEEQGVLEKFLKNRPKIDPAAKYHFNNDA 60
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
P N++D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C + +C H R+
Sbjct: 61 VAYEPFTNYLDSFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-TQACSDHRRFNPD 119
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S+T+ G++ ++YGSGS+S D V V ++V+ +Q F + E S F + FDG
Sbjct: 120 QSSTFRINGQTYTLSYGSGSLSVVLGYDTVTVQNIVIDNQEFGLSESEPSDPFYYSYFDG 179
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+ + +AVG++ V +M++Q +++ +FSF+ +R P E GGE++ GGVD + + G
Sbjct: 180 ILGMAYPNMAVGNSPTVMQSMLQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDTQLYSG 239
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
+ + PVT++ YWQ + + IG+Q+TG C GC AIVD+GT LLA P + A
Sbjct: 240 QIVWTPVTRELYWQIAIQEFAIGDQATGWCFSQGCQAIVDTGTFLLAVPQQYLASFLQAT 299
Query: 311 GGE----GVVSAECKLVVS 325
G + G +C LV S
Sbjct: 300 GAQEAQNGDFVVDCDLVQS 318
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 51/89 (57%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++DCD + +MP ++F IG F L P Y+ + C G A LP G P
Sbjct: 307 GDFVVDCDLVQSMPTITFIIGGSQFPLPPSAYVFSNNDS----CRLGIEASYLPSSSGEP 362
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 363 LWILGDVFLKEYYSVYDMANNRVGFALSA 391
>gi|219520803|gb|AAI71814.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 176/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|387013|gb|AAA60061.1| pepsinogen A [Homo sapiens]
Length = 388
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVF+ ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFERANNQVGLAPVA 388
>gi|166361871|gb|ABY87034.1| pepsinogen A1 [Epinephelus coioides]
gi|166361875|gb|ABY87036.1| pepsinogen A1 [Epinephelus coioides]
Length = 376
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/267 (43%), Positives = 166/267 (62%), Gaps = 17/267 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G + DNVEVG + V++QVF I T + + A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILGL 177
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+
Sbjct: 178 AFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSS--HGEQGSEVVFGGIDSSHYTGQVTW 235
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 236 VPLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS-- 292
Query: 316 VSAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 --------TNQYGESTVNCQNVGSMPE 311
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 51/102 (50%), Gaps = 8/102 (7%)
Query: 404 LSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCI 463
++ +N + N GES ++C + +MP V+FT+ F + Y+ + G C
Sbjct: 282 INNMNSWVGASTNQYGESTVNCQNVGSMPEVTFTLNGHDFTIPASAYVSQNYYG----CN 337
Query: 464 SGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+GF LWILGDVF+ Y+ +FD+ IG A++
Sbjct: 338 TGFGQ----GGSDQLWILGDVFIREYYVIFDAQARYIGLAQS 375
>gi|4589842|dbj|BAA76892.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 173/312 (55%), Gaps = 13/312 (4%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
++L+ ++ CL + L +P R +K+ + LH A Y + +
Sbjct: 2 KRLILTMLCLHLCEGILRVPLKKGKSIREAMKESGV-LHDYLANHRHYDPAYKFFSNFAT 60
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C
Sbjct: 61 AYE----------PLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQAC 109
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+T++ + + YGSGS++G F D V + + + +Q F + E
Sbjct: 110 ANHNEFDPNESSTFSTQDEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGT 169
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
+FL + FDGI+GL F I+ G A V M+++ L+ VFSF+L+ + +GGE+VFG
Sbjct: 170 SFLYSPFDGILGLAFPSISAGGATTVMQKMLQENLLDFPVFSFYLSGQ-EGSQGGELVFG 228
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP + G+ T+ PVT+ YWQ + D +G QS+G C GC IVD+GTSLL P V
Sbjct: 229 GVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQV 288
Query: 303 VTEINHAIGGEG 314
TE+ IG +
Sbjct: 289 FTELMQYIGAQA 300
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 77/176 (43%), Gaps = 14/176 (7%)
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 392
+L+ G+ P QI Y GI E V S CS +V
Sbjct: 224 ELVFGGVDPNLYTGQITWTPVTQTTYWQIGI----EDFAVGGQSSGWCSQGCQGIVDTGT 279
Query: 393 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 450
L Q +++ YI D G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFTELMQYIGAQADD----SGQYVASCSNIEYMPTITFVISGTSFPLPPSAY 335
Query: 451 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+L++ ++ C G + LP G PLWILGDVF+ VY++++D G ++GFA A
Sbjct: 336 MLQSN---SDYCTVGIESTYLPSQTGQPLWILGDVFLRVYYSIYDMGNNQVGFATA 388
>gi|444724642|gb|ELW65241.1| Chymosin [Tupaia chinensis]
Length = 381
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 103/237 (43%), Positives = 152/237 (64%), Gaps = 5/237 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D QYFG+I IG+PPQ F+V+FDTGSS+LWVPS C S +C H R+ KS+T
Sbjct: 65 PLTNYLDTQYFGKITIGTPPQEFTVVFDTGSSDLWVPSVYCD-SAACQNHQRFDPSKSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + K I YG+GS+ GF D V V D+V Q +T+E F A FDGI+GL
Sbjct: 124 FQNLDKPLSIQYGTGSMQGFLGYDTVTVSDIVDTHQTVGLSTQEPGNVFTYAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +A +VPV+DNM+++ LV++++FS +++R+ ++G + G +D ++ G +
Sbjct: 184 AYPSLAAEYSVPVFDNMMQKHLVAKDLFSVYMSRN---DQGSMLTLGAIDSSYYTGSLHW 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
VPVT + YWQF + + I N C+GGC AI+D+GTSL+AGP+ + I AIG
Sbjct: 241 VPVTMQDYWQFTMDSVTI-NGVVVACDGGCQAILDTGTSLVAGPSSDILNIQQAIGA 296
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
GE IDCD + +MP V F I + + L P Y + + C SGF D
Sbjct: 301 FGEFDIDCDSLSSMPTVVFEINGRKYPLPPSAYTNQN----QDFCTSGFQGDD----DSQ 352
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ Y++VFD R+G A+A
Sbjct: 353 QWILGDVFIREYYSVFDRANNRLGLAKA 380
>gi|402893203|ref|XP_003909790.1| PREDICTED: pepsin A-2/A-3-like [Papio anubis]
Length = 388
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 180/316 (56%), Gaps = 28/316 (8%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
+ L GL K L H+ N AR +Y A + D PL+N++D +Y
Sbjct: 30 HNLSEHGLLKDFLKKHNFNPAR-----KYFPQAEAPTLI--------DEQPLENYLDMEY 76
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H R+ + S+TY + I
Sbjct: 77 FGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHKRFNPQDSSTYQSTSGTLSI 135
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGD 204
YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 136 TYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSG 194
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYW
Sbjct: 195 ATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYW 252
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSA 318
Q + I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 253 QISVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSC 311
Query: 319 ECKLVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 312 SA---ISSLPDIVFTI 324
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 51/91 (56%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++ C I ++P++ FTI + + P YIL++ CISGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGIQYPVPPSAYILQS----QGSCISGFQGMDVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|112950081|gb|ABI26643.1| aspartic proteinase [Cucumis sativus]
Length = 399
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 183/319 (57%), Gaps = 19/319 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKR---RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
V+A ++ +++ + RI L+++ +L +++ AA++ + +Y + + G R G +
Sbjct: 4 VIAFLAIVALAASEMHRIPLQRQENFKLTKNNIQAAKVHLRNKYNVKSNLLG---RSGTT 60
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYK 129
++ + + ++Y+G IGIG+P Q F+V+FD+GSSNLWVPS+KC S +C H
Sbjct: 61 EQ---LTQGQLTSEYYGTIGIGTPAQEFTVVFDSGSSNLWVPSAKCSSSDQACKNH---N 114
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S S+TY G+ I YG+GS++GF S D V V + ++ Q F EAT E TF+ + F
Sbjct: 115 SAASSTYVPNGEQFSIQYGTGSLTGFLSTDTVTVNGLTIQSQTFAEATNEPGSTFVDSTF 174
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + I+ + VP + NMV Q LVS VFS + R A GE++FGG D +
Sbjct: 175 DGILGLAYETISQDNVVPPFYNMVSQSLVSNPVFSVYFGRSKAANNNGEVIFGGSDSTVY 234
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
+G YVPVT++GYWQF + + + Q AI D+GTSLLA PT +N A
Sbjct: 235 QGPINYVPVTQQGYWQFTMDGVYVNGQQ---VISSAQAIADTGTSLLAAPTSAFYTLNEA 291
Query: 310 IGG---EGVVSAECKLVVS 325
IG EG +C V S
Sbjct: 292 IGATYQEGDYFVDCSSVSS 310
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 49/85 (57%), Gaps = 9/85 (10%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++PN+ F+IG ++L P YI++ I C+S A D
Sbjct: 299 GDYFVDCSSVSSLPNIQFSIGGINYSLPPSAYIVE----IEGECMSATTAMDQEQ----- 349
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFA 503
WILGDVF+G Y+T FD G R+GFA
Sbjct: 350 WILGDVFLGSYYTEFDLGNNRVGFA 374
>gi|448115983|ref|XP_004202951.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359383819|emb|CCE79735.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 107/250 (42%), Positives = 157/250 (62%), Gaps = 8/250 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL ++++AQY+ IG+GSP Q F VI DTGSSNLWVPS+ C S++C+ HS+Y +S++
Sbjct: 91 PLVDYLNAQYYTTIGLGSPAQEFKVILDTGSSNLWVPSTDCS-SLACFLHSKYYHDESSS 149
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG+GS+ G+ SQD + + + ++ Q F EAT E LTF A+FDGI+GL
Sbjct: 150 YKQNGSDFSIQYGTGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLTFAFAKFDGILGL 209
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V + VP N ++QGL+ E F+F+L ++D D EGG FGGVD KH+KG
Sbjct: 210 AYDSISVDNIVPPIYNAIDQGLLDEPKFAFYLGDKDKDENEGGVATFGGVDTKHYKGDII 269
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+PV +K YW+ I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 ELPVRRKAYWEVSFDGIGLGDEYAELTSTGAA--IDTGTSLITLPSSLAEIINAKIGAKK 327
Query: 314 ---GVVSAEC 320
G S +C
Sbjct: 328 SWSGQYSVDC 337
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD ++P ++ T F LSP +Y L+ G CIS F D P P G L
Sbjct: 331 GQYSVDCDSRDSLPELTMTFHGHNFTLSPYEYTLEVGGS----CISAFTPMDFPKPIGDL 386
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D GK +G AE+
Sbjct: 387 AIVGDSFLRKYYSVYDIGKNVVGLAES 413
>gi|348578169|ref|XP_003474856.1| PREDICTED: renin-like [Cavia porcellus]
Length = 404
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 184/312 (58%), Gaps = 19/312 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+ LW SC LP + RRI LKK + + R + KER + A +S +
Sbjct: 13 LLVLW--GSCTFSLPMDTAAFRRIILKK-------MPSIRDSLKERGVDMARLSAKWGQF 63
Query: 68 GDS---DEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSIS 121
S D P L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +
Sbjct: 64 SKSLSLDNSTFPVVLTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTA 123
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C HS Y S +S++Y E G I YGSG + GF SQD V VG + V Q F E T
Sbjct: 124 CEIHSLYDSSESSSYMENGTEFTIRYGSGKVKGFLSQDVVTVGGITVT-QTFGEVTELPL 182
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+LA+FDG++G+GF AVG PV+D+++ Q ++ E+VFS + +R+ G ++
Sbjct: 183 IPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQRVLKEDVFSVYYSRNSHLLGGELLLG 242
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
G DP+H++G YV ++K G WQ + + +G+ +T +CE GC A+VD+G S ++GPT
Sbjct: 243 GN-DPQHYQGNFHYVRISKTGSWQIMMKGVSVGS-ATLLCEEGCMAVVDTGASYISGPTS 300
Query: 302 VVTEINHAIGGE 313
+ I A+G +
Sbjct: 301 SLRLIMEALGAK 312
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 54/86 (62%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
E +++C+++PT+P++SF +GD+ + L+ Y+L+ +VC D+PPP GPLW
Sbjct: 318 EYVVNCNQVPTLPDISFHLGDRAYTLTSADYVLQDPYSDDDVCTLALQGLDIPPPTGPLW 377
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
LG F+ ++T FD RIGFA A
Sbjct: 378 ALGASFIRKFYTEFDRRNNRIGFALA 403
>gi|50306705|ref|XP_453326.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642460|emb|CAH00422.1| KLLA0D05929p [Kluyveromyces lactis]
Length = 409
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 109/251 (43%), Positives = 161/251 (64%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF EI +GSPPQ+F VI DTGSSNLWVPS++C S++C+ H++Y S+
Sbjct: 86 VPLTNYLNAQYFTEITLGSPPQSFKVILDTGSSNLWVPSAEC-GSLACFLHTKYDHEASS 144
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ G+ S+D + +GD+V+ DQ F EAT E L F +FDGI+G
Sbjct: 145 TYKANGSEFAIQYGSGSLEGYVSRDLLTIGDLVIPDQDFAEATSEPGLAFAFGKFDGILG 204
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP N ++ L+ + VF+F+L +E+GGE FGG+D + + G+ T
Sbjct: 205 LAYDSISVNRIVPPVYNAIKNKLLDDPVFAFYLGDSDKSEDGGEASFGGIDEEKYTGEIT 264
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++PV +K YW+ + I +G + EG AAI D+GTSL+A P+ + +N IG +
Sbjct: 265 WLPVRRKAYWEVKFEGIGLGEE-YATLEGHGAAI-DTGTSLIALPSGLAEILNAEIGAKK 322
Query: 314 ---GVVSAECK 321
G S +C+
Sbjct: 323 GWSGQYSVDCE 333
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 49/88 (55%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+ ++P+++ F ++ Y L+ ++ CIS F D P P GPL
Sbjct: 326 GQYSVDCESRDSLPDLTLNFNGYNFTITAYDYTLE----VSGSCISAFTPMDFPEPVGPL 381
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
I+GD F+ Y++++D G +G A+AA
Sbjct: 382 AIIGDAFLRKYYSIYDIGHDAVGLAKAA 409
>gi|195339961|ref|XP_002036585.1| GM18746 [Drosophila sechellia]
gi|194130465|gb|EDW52508.1| GM18746 [Drosophila sechellia]
Length = 392
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 149/238 (62%), Gaps = 8/238 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L+N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+T
Sbjct: 68 LQNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSASCPASNTACQRHNKYNSAASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SGF S D V + + ++DQ F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSTDTVTIAGISIQDQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNMV QGL+ E V SF+L R A GGE++ GG+D ++G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VPV+ YWQF + I ++ G+ C GC AI D+GTSL+A P +IN +G
Sbjct: 248 VPVSVPAYWQFTVNTI----KTNGILLCN-GCQAIADTGTSLIAVPLAAYRKINRQLG 300
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 55/100 (55%), Gaps = 4/100 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN + N GE+ + C R+ ++P V+ IG +F L+P YI+K + C+S F
Sbjct: 295 INRQLGATDNGGGEAFVRCGRVSSLPKVNLNIGGTVFTLAPRDYIVKVTQYGQTYCMSAF 354
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
+ WILGDVF+G ++TVFD G RIGFA A
Sbjct: 355 TYMEGL----SFWILGDVFIGKFYTVFDKGNERIGFARVA 390
>gi|348514690|ref|XP_003444873.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 377
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 106/238 (44%), Positives = 154/238 (64%), Gaps = 7/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H+++ +S+T+
Sbjct: 62 MTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSSTF 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+S I YG+GS++GF D VEVG + V +QVF + T +T++ A DGI+GL
Sbjct: 121 QWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILGL 178
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++ M+ +GLVSE +FS +L+ ++E+G E+VFGG D H+ G T+
Sbjct: 179 AFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSG--NSEQGSEVVFGGTDSTHYTGTITW 236
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 237 IPLSSATYWQINMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS 293
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 69/135 (51%), Gaps = 16/135 (11%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + + G++I++C IP+MP+V
Sbjct: 256 GQTVACSGGCQAIIDTGTSLIVGPTTD--INNLNSWVGASTDQSGDAIVNCQNIPSMPDV 313
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG---PLWILGDVFMGVYHT 491
+FT+ F + Y+ ++ G C++GF +G LWILGDVF+ Y+
Sbjct: 314 TFTLNGNAFTVPASAYVSQSSSG----CMTGF-------GQGGTMQLWILGDVFIREYYA 362
Query: 492 VFDSGKLRIGFAEAA 506
VF++ IG A++A
Sbjct: 363 VFNAQTQNIGLAKSA 377
>gi|296198131|ref|XP_002746573.1| PREDICTED: gastricsin [Callithrix jacchus]
gi|18203304|sp|Q9N2D3.1|PEPC_CALJA RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|7008023|dbj|BAA90872.1| pepsinogen C [Callithrix jacchus]
Length = 388
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 165/286 (57%), Gaps = 19/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
++ GL L H + AR ++R+ D P+ ++MDA YFG
Sbjct: 33 MKEKGLLWEFLKTHKHDPAR----------------KYRVSDLSVSYEPM-DYMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSASSTYSSNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+GL + +++G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G ++ VFSF+L+ GG ++FGGVD + G+ + PVT++ YWQ
Sbjct: 195 AMQGMLQEGALTSPVFSFYLSNQ-QGSSGGAVIFGGVDSSLYTGQIYWAPVTQELYWQIG 253
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ A G +
Sbjct: 254 IEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGAQ 299
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S E + + G+ +++CD I +P ++F I F L P YIL
Sbjct: 286 QQYMSAFLEATGAQEDEYGQFLVNCDSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L PLWILGDVF+ Y++VFD G R+GFA AA
Sbjct: 342 YCTVGVEPTYLSSQNSQPLWILGDVFLRSYYSVFDLGNNRVGFATAA 388
>gi|195114666|ref|XP_002001888.1| GI14567 [Drosophila mojavensis]
gi|193912463|gb|EDW11330.1| GI14567 [Drosophila mojavensis]
Length = 402
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 161/262 (61%), Gaps = 8/262 (3%)
Query: 69 DSDEDIL-PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHS 126
DS+E ++ L N + Y+G IGIG+PPQ F+V+FDTGSSNLWVPS +C + ++C H+
Sbjct: 73 DSNEYVIETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSSNLWVPSVQCLSTDVACQNHN 132
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+Y S S+TY G+S I YG+GS++GF S D V + + + Q F EA + + +F
Sbjct: 133 QYNSSASSTYVPNGESFSIQYGTGSLTGFLSTDTVTINGLSIASQTFGEAISQPNGSFTG 192
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
FDGI+G+G+ IAV + VP + N+ EQ L+ E F F+L RD A+ GG++V GG+D
Sbjct: 193 VPFDGILGMGYMSIAVDNVVPPFYNLYEQRLIDEPTFGFYLARDGSAQAGGQLVLGGIDS 252
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ F G TYV V ++GYWQF + +G VC C AI D+GTSLLA P T +
Sbjct: 253 QLFSGNLTYVSVVQQGYWQFVVNSAEMGGYV--VCY-NCQAIADTGTSLLACPGSAYTML 309
Query: 307 NHAIGG---EGVVSAECKLVVS 325
N IGG +G +C V S
Sbjct: 310 NQLIGGYLMDGDYYVDCSTVSS 331
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P + F IG IF+L P YI E C+S F +
Sbjct: 320 GDYYVDCSTVSSLPALKFNIGGTIFSLPPSAYISSFTEYNTTYCMSSFTYINTD-----F 374
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+G ++T FD G+ R+GFA A
Sbjct: 375 WILGDVFIGQFYTQFDFGENRVGFAPVA 402
>gi|444513055|gb|ELV10247.1| Pepsin A [Tupaia chinensis]
Length = 396
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 174/313 (55%), Gaps = 31/313 (9%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL + L H+LN A +Y + V + PL+N++D +YFG IGI
Sbjct: 30 GLLEEYLKKHTLNPAS-----KYFPKEAATMVSTQ---------PLENYLDMEYFGTIGI 75
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+P Q F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY ++ I YG+GS
Sbjct: 76 GTPAQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYQATSQTVSIAYGTGS 134
Query: 152 ISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
++G D V+VG + +Q+F + T GS + + FDGI+GL + IA A PV+D
Sbjct: 135 MTGILGYDTVQVGGIADTNQIFGLSETEPGSFLYY-SPFDGILGLAYPNIASSGATPVFD 193
Query: 211 NMVEQGLVSEEVFSFWLNR--DPDA-----------EEGGEIVFGGVDPKHFKGKHTYVP 257
NM QGLVS+++FS +L+ PD E G ++FGG+D ++ G +VP
Sbjct: 194 NMWNQGLVSQDLFSVYLSSMGTPDILTSCITFHSNDESGSVVIFGGIDSSYYTGSLNWVP 253
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
++ +GYWQ + I + Q C G C AIVD+GTSLL+GPT + I IG +
Sbjct: 254 LSAEGYWQITVDSITMNGQPIA-CSGSCQAIVDTGTSLLSGPTNAIANIQSYIGASQNSN 312
Query: 318 AECKLVVSQYGDL 330
E + S +L
Sbjct: 313 GEMVISCSAINNL 325
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 63/132 (47%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G CS A+V L T ++ I + N GE +I C I +P++
Sbjct: 271 GQPIACSGSCQAIVDTGTSLLSGPTN--AIANIQSYIGASQNSNGEMVISCSAINNLPDI 328
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L P YIL++ EG C SGF ++P G LWILGDVF+ Y+ VFD
Sbjct: 329 VFTINGVQYPLPPSAYILQSQEG----CTSGFQGMNIPTASGELWILGDVFIRQYYAVFD 384
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 385 RANNQVGLAPVA 396
>gi|301622166|ref|XP_002940408.1| PREDICTED: renin-like [Xenopus (Silurana) tropicalis]
Length = 371
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 112/266 (42%), Positives = 159/266 (59%), Gaps = 11/266 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD QYFGEI IGSPPQ F V+FDTGS+NLWVPS +C +C H+RY S KS T
Sbjct: 41 LTNYMDTQYFGEISIGSPPQTFKVVFDTGSANLWVPSQRCSPLYSACVSHNRYDSTKSQT 100
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I YGSG + GF SQD V V + V QVF EAT + F+ ARFDG++G+
Sbjct: 101 YMENGAGFSIQYGSGGVKGFLSQDVVVVAGIPVI-QVFAEATALPAFPFIFARFDGVLGM 159
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN---RDPDAEEGGEIVFGGVDPKHFKGK 252
GF A+ PV+D ++ + ++ E+VFS + + RD + GGEI+ GG DP ++ G
Sbjct: 160 GFPGQAIDGITPVFDRIISEQVLQEDVFSVYYSRSYRDSHLKPGGEIILGGSDPSYYTGS 219
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
Y+ + K+GYW + + IG + C+ GC+ +D+G + + GP V+ + AIG
Sbjct: 220 FQYLNLEKEGYWHIRMKGVSIGAEIL-FCKDGCSVAIDTGAAYITGPASSVSVLMKAIGA 278
Query: 312 ---GEGVVSAECKLVVSQYGDLIWDL 334
EG + +C +SQ D+ + +
Sbjct: 279 TELAEGEYTVDCDK-ISQLPDVSFHM 303
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 40/87 (45%), Positives = 53/87 (60%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +DCD+I +P+VSF +G + L YIL+ + E+C F D+PPP GPL
Sbjct: 284 GEYTVDCDKISQLPDVSFHMGGNEYTLKGPAYILQQSQFGEEICSVAFTPLDIPPPVGPL 343
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILG F+G Y+T FD RIGFA +
Sbjct: 344 WILGASFIGQYYTEFDRRNNRIGFATS 370
>gi|129797|sp|P03955.2|PEPC_MACFU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|38073|emb|CAA42426.1| pepsinogen C [Macaca fuscata]
Length = 377
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 9 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 67
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 68 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 126
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 127 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 186
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 187 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 245
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 246 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 288
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 275 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 330
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 331 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 377
>gi|194218273|ref|XP_001501915.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 164/296 (55%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H N A A G L+N+MD +YFG
Sbjct: 32 LRENGLLADFLKQHPRNPASKYFPREAATLAATEG--------------LENYMDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F+VIFDTGSSNLWVPS C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPAQEFTVIFDTGSSNLWVPSVYCS-SLACSDHNRFNPEDSSTYEATSESVSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISASGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSSDD--ESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + +S C GGC AIVD+GTSLLAGP + I IG GEG +S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDSSGEGAIS 309
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 64/134 (47%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQL--KQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L + + SYI DS GE I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDS----SGEGAISCSSIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI F L+P YIL+ CISGF DL G LWILGDVF+ Y TV
Sbjct: 318 DIVFTINGVEFPLTPSAYILEEDGS----CISGFEGMDLDTSSGELWILGDVFIRQYFTV 373
Query: 493 FDSGKLRIGFAEAA 506
FD +IG A A
Sbjct: 374 FDRANNQIGLAPVA 387
>gi|2832610|emb|CAA11580.1| cathepsin [Chionodraco hamatus]
Length = 402
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 139/398 (34%), Positives = 209/398 (52%), Gaps = 37/398 (9%)
Query: 6 LRSVF---CLWVLASCLLL-------PASSNGLRRIGLKKRRLDLHSLNAARITRKERYM 55
+RSV C+W S L+ P + LR GL + L + + +R+
Sbjct: 1 MRSVLLLLCIWTCRSSALIRVPLRKVPTIRSQLRSEGLLQDFLVENRPDM--FSRRYAQC 58
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
AG +R LG S E I NFMDAQY+G+I +G+P QNFSV+FDTGSS+LWVPS+
Sbjct: 59 FPAGTPSLR--LGRSSEKIY---NFMDAQYYGDIALGTPEQNFSVVFDTGSSDLWVPSAY 113
Query: 116 CYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIE 175
C + +C R+K+ KS ++ G+ INYGSG + G +D + V ++VK Q F E
Sbjct: 114 C-VTEACALPKRFKAFKSTSFLHDGRQFGINYGSGHLLGVMGRDYLMVAGMMVKRQEFRE 172
Query: 176 ATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE 235
+ E FL ARFDG++GLG+ +A PV+DNM+ Q L+ + +FSF+L+R +
Sbjct: 173 SVYEPGTAFLKARFDGVLGLGYPALAEILGNPVFDNMLAQNLLDKPIFSFYLSRKLNGSP 232
Query: 236 GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSL 295
GE++ GG D + + ++PVT K YWQ ++ +++ + C GC AIVD+GTSL
Sbjct: 233 EGELLLGGTDERLYDLPINWLPVTAKAYWQIKIDSVVVQGVNP-FCPHGCQAIVDTGTSL 291
Query: 296 LAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNG 355
+ GPT + +I IG + +G+ I D P+ +G G
Sbjct: 292 ITGPTDDILDIQQLIGA----------TPTNFGEFIVDCARLSNFPQHQHFVLG-----G 336
Query: 356 AEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQNQ 393
EY T + KE + GD +C + AV + ++
Sbjct: 337 KEYTLTS-DQYIRKEML--GDRKLCFSGFQAVDMISSE 371
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 62/100 (62%), Gaps = 1/100 (1%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMP-NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 465
I +L + P GE I+DC R+ P + F +G K + L+ +QYI K G ++C SG
Sbjct: 302 IQQLIGATPTNFGEFIVDCARLSNFPQHQHFVLGGKEYTLTSDQYIRKEMLGDRKLCFSG 361
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
F A D+ GPLWILGDVF+ Y+++FD G+ R+GFA A
Sbjct: 362 FQAVDMISSEGPLWILGDVFLTQYYSIFDRGQDRVGFAIA 401
>gi|374431137|gb|AEZ51819.1| pepsin, partial [Oreochromis niloticus]
Length = 339
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 155/236 (65%), Gaps = 7/236 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H+++ +S+T+
Sbjct: 24 MTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSSTF 82
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS-LTFLLARFDGIIGL 195
+S I YG+GS++GF D VEVG + V +QVF + E S +T++ A DGI+GL
Sbjct: 83 QWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILGL 140
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++ M+ +GLVSE +FS +L+ ++E+G E+VFGG D H+ G T+
Sbjct: 141 AFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSG--NSEQGSEVVFGGTDSTHYTGTITW 198
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 199 IPLSSATYWQINMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVG 253
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 67/132 (50%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + + G++I++C IP+MP+V
Sbjct: 218 GQTVACSGGCQAIIDTGTSLIVGPTTD--INNLNSWVGASTDQSGDAIVNCQNIPSMPDV 275
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ F + Y+ ++ G C++GF LWILGDVF+ Y+ VF+
Sbjct: 276 TFTLNGNAFTVPASAYVSQSSSG----CMTGFGQGGTM----QLWILGDVFIREYYAVFN 327
Query: 495 SGKLRIGFAEAA 506
+ IG A++A
Sbjct: 328 AQTQNIGLAKSA 339
>gi|281183192|ref|NP_001162218.1| gastricsin precursor [Papio anubis]
gi|157939796|gb|ABW05535.1| progastricsin (predicted) [Papio anubis]
Length = 388
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSALLQATGAQ 299
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ LS + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYLSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 388
>gi|355561685|gb|EHH18317.1| hypothetical protein EGK_14890 [Macaca mulatta]
gi|355748551|gb|EHH53034.1| hypothetical protein EGM_13592 [Macaca fascicularis]
Length = 388
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILNNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATAA 388
>gi|194862073|ref|XP_001969914.1| GG23678 [Drosophila erecta]
gi|190661781|gb|EDV58973.1| GG23678 [Drosophila erecta]
Length = 392
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 171/306 (55%), Gaps = 21/306 (6%)
Query: 24 SSNGLRRIGLKKRR--LDLH-SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
S+ L R+ L + R H S+ A + +Y + S GD++ L+N
Sbjct: 16 SAGKLNRVQLHRNRNFKKTHGSVKAEKTVLASKYSVVSETSFSTSSAGDTES----LQNS 71
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+TY
Sbjct: 72 MNNEYYGVITIGTPQQRFNILFDTGSANLWVPSASCPASNTACQRHNKYNSTASSTYVAN 131
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G+ I YG+GS+SGF S D V + V ++DQ F EA E TF+ A F GI+GL F
Sbjct: 132 GEEFAIEYGTGSLSGFLSTDTVAIAGVTIRDQTFGEALSEPGTTFVDAPFAGILGLAFST 191
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IA P +DNM+ QG++ E V SF+L R A GGE++ GG+D +KG TYVPV+
Sbjct: 192 IADDGVTPPFDNMISQGVLDEPVISFYLKRQGTAVLGGELILGGIDSSLYKGSLTYVPVS 251
Query: 260 KKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAI------G 311
YWQF + I ++ GV C GC AI D+GTSL+ P IN + G
Sbjct: 252 VPAYWQFTVNTI----KTNGVLLCS-GCQAIADTGTSLIVAPLAAYKRINRQLGATDNGG 306
Query: 312 GEGVVS 317
GE VS
Sbjct: 307 GEAFVS 312
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 53/100 (53%), Gaps = 4/100 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN + N GE+ + C R+ +P V+ IG F L+P YI+K + C+S F
Sbjct: 295 INRQLGATDNGGGEAFVSCSRVSALPKVNLNIGGTAFTLAPRDYIVKLTQNGQTYCMSAF 354
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
D WILGDVF+G ++TVFD G RIGFA A
Sbjct: 355 TYMDGL----SFWILGDVFIGKFYTVFDKGSERIGFARVA 390
>gi|297688536|ref|XP_002821738.1| PREDICTED: pepsin A-4 [Pongo abelii]
Length = 388
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 175/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKTHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDV 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|335955136|gb|AEH76574.1| pepsinogen [Epinephelus bruneus]
Length = 375
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 114/266 (42%), Positives = 165/266 (62%), Gaps = 15/266 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFTVIFDTGSSNLWVPSVYCN-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ I YG+GS++G + DNVEVG + V++QVF + E +A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGRLAIDNVEVGGITVQNQVFGISQTEAPFMAHMAA-DGILGLA 178
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ D +G E+VFGG+D H+ G+ T+V
Sbjct: 179 FQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSSHGD--QGSEVVFGGIDNSHYTGQVTWV 236
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 237 PLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS--- 292
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 -------TNQYGESTVNCQNVGSMPE 311
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 85/194 (43%), Gaps = 17/194 (8%)
Query: 313 EGVVSAEC-KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKEN 371
+G+VS + +S +GD +++ G+ Q+ A Y + V K N
Sbjct: 197 QGLVSQPLFSVYLSSHGDQGSEVVFGGIDNSHYTGQVTWVPLTSATYWQIKMDGV--KIN 254
Query: 372 VSAGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTM 431
G + C+ A++ L T + ++ +N + N GES ++C + +M
Sbjct: 255 ---GQTVACAGGCQAIIDTGTSLIVGPTND--INNMNSWVGASTNQYGESTVNCQNVGSM 309
Query: 432 PNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHT 491
P V+FT+ F L Y+ + G C +GF LWILGDVF+ Y+
Sbjct: 310 PEVTFTLNGHDFTLPASAYVSQNYYG----CNTGF-----GQGGSELWILGDVFIREYYA 360
Query: 492 VFDSGKLRIGFAEA 505
+FD+ IG A++
Sbjct: 361 IFDAQARYIGLAQS 374
>gi|296474377|tpg|DAA16492.1| TPA: progastricsin (pepsinogen C) [Bos taurus]
Length = 421
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 105/250 (42%), Positives = 157/250 (62%), Gaps = 2/250 (0%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++R GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 54 KYRFGDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 111
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E
Sbjct: 112 SHTRFNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTN 171
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGG
Sbjct: 172 FLYAKFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGG 231
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P +
Sbjct: 232 VDNCLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFL 291
Query: 304 TEINHAIGGE 313
+ + A G +
Sbjct: 292 SALLQATGAQ 301
>gi|431910409|gb|ELK13482.1| Pepsin A [Pteropus alecto]
Length = 386
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 168/290 (57%), Gaps = 25/290 (8%)
Query: 28 LRR----IGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL L H LN A KE S D L+N++D
Sbjct: 28 LRRNLIEHGLLADYLKTHKLNPASKYLKE---------------AASFTDTETLENYLDM 72
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q F+VIFDTGSSNLWVPS C S++CY H+ + S+T+ ++
Sbjct: 73 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS-SLACYNHNVFNPEDSSTFEATSETV 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 132 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISA 190
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ +QGLVS+++FS +L+ D D+ G ++FGG+D ++ G +VP++ +
Sbjct: 191 SGATPVFDNLWDQGLVSQDLFSVYLSSDDDS--GSVVIFGGIDSSYYSGSLNWVPLSSET 248
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + +++ ++ C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 249 YWQITVDSVILDGEAIA-CSATCQAIVDTGTSLLAGPTTAISSIQKYIGA 297
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CSA A+V L T +S I + + N G+ ++ C +PN+
Sbjct: 261 GEAIACSATCQAIVDTGTSLLAGPTT--AISSIQKYIGASENSDGDMVVSCSAASELPNI 318
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + L YIL++ + VCISGF DLP G LWILGDVF+ Y TVFD
Sbjct: 319 IFTINGVQYPLPSSAYILESDD----VCISGFQGMDLPTSSGDLWILGDVFIRQYFTVFD 374
Query: 495 SGKLRIGFAEAA 506
++G A AA
Sbjct: 375 RANNQVGLASAA 386
>gi|124514108|gb|ABN13683.1| preprochymosin [Capra hircus]
Length = 381
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSHGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVANV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDTQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R+ +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRHLVAQDLFSVYMDRN---GQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 51/99 (51%), Gaps = 8/99 (8%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + N GE +DCD + +MP V F I K++ L+P Y + EG C SGF
Sbjct: 290 IQQAIGATQNQYGEFDVDCDSLSSMPTVVFEINGKMYPLTPYAYTSQE-EGF---CTSGF 345
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ WILGDVF+ Y++VFD +G A+A
Sbjct: 346 QGEN----HSHQWILGDVFIREYYSVFDRANNLVGLAKA 380
>gi|38640718|gb|AAR25994.1| prochymosin [Capra hircus]
Length = 381
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSHGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVASV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDTQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R+ +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRRLVAQDLFSVYMDRN---GQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 51/99 (51%), Gaps = 8/99 (8%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + N GE IDCD + +MP V F I K++ L+P Y + EG C SGF
Sbjct: 290 IQQAIGATQNQYGEFDIDCDSLSSMPTVVFEINGKMYPLTPYAYTSQE-EGF---CTSGF 345
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ WILGDVF+ Y++VFD +G A+A
Sbjct: 346 QGEN----HSHQWILGDVFIREYYSVFDRANNLVGLAKA 380
>gi|56971217|gb|AAH88066.1| pga5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 382
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 161/288 (55%), Gaps = 20/288 (6%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N L+R+GL L + N A S L S ++L +N+MD +Y
Sbjct: 27 NRLQRLGLLGDYLKKYPYNPA--------------SKYFPTLAQSSAEVL--QNYMDIEY 70
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
+G I IG+PPQ F+VIFDTGS+NLWVPS C S +C H+R+ ++S T+ I
Sbjct: 71 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 129
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS+SGF D ++VG++ + +Q+F + E + FDGI+GL F IA A
Sbjct: 130 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 189
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM QGL+ + +FS +L+ D + G ++FGGVD ++ G +VP+T + YWQ
Sbjct: 190 TPVFDNMWSQGLIPQNLFSVYLSS--DGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 247
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I I Q C C AIVD+GTSL+ GPT + I + IG
Sbjct: 248 ITLDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 294
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +I+C+ I MP + FTI + L P Y+ + +G C SGF A LP G L
Sbjct: 299 GQYVINCNNISNMPTIVFTINGVQYPLPPTAYVRQNQQG----CSSGFQAMTLPTNSGDL 354
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y VFD + A A
Sbjct: 355 WILGDVFIRQYFVVFDRTNNYVAMAPVA 382
>gi|195391510|ref|XP_002054403.1| GJ24430 [Drosophila virilis]
gi|194152489|gb|EDW67923.1| GJ24430 [Drosophila virilis]
Length = 376
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 110/274 (40%), Positives = 160/274 (58%), Gaps = 4/274 (1%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFS 99
L++ AR+ R + S + + L L N + +Y+G I +G+PPQ F
Sbjct: 16 LYTFAKARMLRVPLEVQRKPASQLSQSFLATQSLQLMLDNRDNVEYYGRIAMGTPPQLFR 75
Query: 100 VIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQ 158
VIFDTGS+N W+PSS C S I+C HSRYK+ KS +Y + G++ + YG+G +SG+ SQ
Sbjct: 76 VIFDTGSANTWLPSSNCPDSNIACQQHSRYKAHKSKSYVKNGRNFSLAYGNGHVSGYLSQ 135
Query: 159 DNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
D + + DVVV D +F E TF+ FDGI+GLGFR+IA ++ P + +Q LV
Sbjct: 136 DTLRIADVVVPDLIFGETLSHHQATFIPTSFDGIVGLGFRQIAWKNSTPFLELFCQQHLV 195
Query: 219 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 278
+FS +L R GGEI FGG+D +KG YVP++K GYWQF + + +GN+
Sbjct: 196 KRCLFSVYLRRMAGELYGGEITFGGIDHSRYKGALDYVPLSKVGYWQFVMSGVSVGNKK- 254
Query: 279 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+G AI+D+GTSL+ P + ++ AIG
Sbjct: 255 --IDGRVNAILDTGTSLVLMPRRIFEQLQQAIGA 286
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 431 MPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYH 490
+ +V F IGD+ + L+ Y++ +C SGF+ P W+LGD+F+ +
Sbjct: 302 LQDVQFHIGDRKYALTAADYVVSLETANETICASGFV-----PIESDFWVLGDIFLTRVY 356
Query: 491 TVFDSGKLRIGFAEA 505
+V+D RIGFAEA
Sbjct: 357 SVYDVEAERIGFAEA 371
>gi|73620983|sp|P00792.2|PEPA_BOVIN RecName: Full=Pepsin A; Flags: Precursor
gi|24415088|emb|CAD55693.1| pepsinogen A [synthetic construct]
gi|37622272|gb|AAQ95219.1| pepsinogen A [Bos taurus]
Length = 372
Score = 211 bits (536), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 51 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 109
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 110 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 168
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 169 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 226
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 227 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 285
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 286 DSSGEVVISCSSIDSL 301
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 247 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 302
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 303 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 358
Query: 493 FDSGKLRIGFAEAA 506
FD G +IG A A
Sbjct: 359 FDRGNNQIGLAPVA 372
>gi|351707910|gb|EHB10829.1| Gastricsin [Heterocephalus glaber]
Length = 391
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 178/308 (57%), Gaps = 15/308 (4%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH----RLG 68
W++ + L LP +I LKK + + R T +++ + G + + +L
Sbjct: 3 WMVVALLCLPLLEATKLKIPLKKFK-------SIRETMRDKGLLGDFLKTHKQDHIRKLS 55
Query: 69 DSDEDILPL---KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
++ + L +++DA YFGEI +G+PPQ+F V+FDTGSSNLWVPS C S++C H
Sbjct: 56 NNFDHFSVLFEPMSYLDAAYFGEISLGTPPQSFQVLFDTGSSNLWVPSVYCQ-SLACTTH 114
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
R+ KS+TYT G+S + YGSGS++G F D + + V Q F + +E TF+
Sbjct: 115 PRFNPSKSSTYTSTGQSFSLQYGSGSLTGVFGYDTMTIQGTQVPKQEFGLSEQEPGTTFV 174
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+GLG+ +A G A ++ +G +S+ +FS +L + +GG ++ GGVD
Sbjct: 175 YAQFDGIMGLGYPGLAAGGATTALQGLIREGALSQPLFSVYLGSQQGSSDGGALILGGVD 234
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
+ G+ ++ PVT++ YWQ + D+ + NQ+ G C GC IVD+GTSLL P +T
Sbjct: 235 ESLYNGQISWTPVTQELYWQIGIEDVQLDNQALGWCSQGCQGIVDTGTSLLTLPQQYLTT 294
Query: 306 INHAIGGE 313
+ AIG +
Sbjct: 295 LIQAIGAQ 302
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 5/106 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ L+ + + + N GE ++DC+ I ++P ++ + F L P YIL+ +
Sbjct: 289 QQYLTTLIQAIGAQENEFGEYVVDCNSIQSLPTLTVILSGVKFPLLPSAYILQEDQ---- 344
Query: 461 VCISGFMAFDL-PPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
C+ G A L PLWILGDVF+ Y++VFD G R+GFA A
Sbjct: 345 YCMVGLSATYLYSESSQPLWILGDVFLRSYYSVFDLGNNRVGFAPA 390
>gi|335281744|ref|XP_003122705.2| PREDICTED: pregnancy-associated glycoprotein 2-like [Sus scrofa]
Length = 388
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 175/310 (56%), Gaps = 22/310 (7%)
Query: 9 VFCLWVLASCLL------LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
+ L L+ CL+ + + LR G K LD H + R E
Sbjct: 6 ILGLVTLSECLVTIPLRKVKSIRENLREKGFLKNFLDEHPHDMIRSRLTE---------- 55
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ LPL+N++D Y G I IG+PPQ FSV+FDTGSS+ WVPS C S++C
Sbjct: 56 --NSAPQKKNTTLPLRNYLDVIYVGNISIGTPPQQFSVVFDTGSSDTWVPSIYCQ-SMAC 112
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S T+ G E+ Y +G+++GF D ++VGD+++KDQ F + E +
Sbjct: 113 VTHNTFDPFQSTTFRFPGFIVELQYATGAVTGFLGYDTIQVGDLIIKDQAFAISQSEDDV 172
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F A FDGI+GL F +A+ P++D+++ Q L+++ VF+F+L+ +A+EG ++FG
Sbjct: 173 VFENAAFDGIVGLSFPSMAIEGTTPIFDSLMNQSLIAQTVFAFYLSS--NAQEGSVVMFG 230
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD K++KG +VP+++ YWQ L I I S+ C+ GC I+D+GTSLL GP
Sbjct: 231 GVDKKYYKGDLKWVPLSQPHYWQIPLDKITI-RGSSAACKNGCQGILDTGTSLLMGPKNQ 289
Query: 303 VTEINHAIGG 312
V +++ + G
Sbjct: 290 VYKLHKRLPG 299
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 35/86 (40%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 422 IIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMA-FDLPPPRGPLWI 480
+I C I ++P+++FTI + + Y+ K+ G C+SG A D PP+ WI
Sbjct: 307 LIQCQDINSLPDITFTINGTDYPVPARVYVQKSFNGF---CLSGLRARTDTFPPKTA-WI 362
Query: 481 LGDVFMGVYHTVFDSGKLRIGFAEAA 506
LGDVF+ +Y TVFD G+ RIG A A
Sbjct: 363 LGDVFLRMYFTVFDRGQNRIGLAPAV 388
>gi|440905526|gb|ELR55898.1| Gastricsin [Bos grunniens mutus]
Length = 391
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 176/306 (57%), Gaps = 15/306 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH------RL 67
VLA L + L +I LKK + + R KE+ + + +H R
Sbjct: 5 VLALVCLQALEAAALVKIPLKKFK-------SIREIMKEKGLLEDFLRTYKHDPAQKYRF 57
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R
Sbjct: 58 GDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTR 115
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E FL A
Sbjct: 116 FNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYA 175
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGGVD
Sbjct: 176 KFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDSC 235
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+ G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P ++ +
Sbjct: 236 LYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALL 295
Query: 308 HAIGGE 313
A G +
Sbjct: 296 QATGAQ 301
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 75/161 (46%), Gaps = 11/161 (6%)
Query: 351 CAFNGAEY---VSTGIKTVVEKENVSAGDSAV--CSACEMAVVWVQNQLKQKQTKEKVLS 405
C + G Y V+ + + E GD A CS A+V L ++ LS
Sbjct: 235 CLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLT--VPQQFLS 292
Query: 406 YINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 465
+ + + + G+ +DC+ I +P ++F I F L P YIL + CI G
Sbjct: 293 ALLQATGAQEDQYGQFPVDCNNIQNLPTLTFVINGVQFPLPPASYILNNDD---SYCILG 349
Query: 466 FMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+P G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 350 VEVTYVPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 390
>gi|18859121|ref|NP_571879.1| nothepsin [Danio rerio]
gi|12053847|emb|CAC20112.1| nothepsin [Danio rerio]
Length = 416
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 110/249 (44%), Positives = 161/249 (64%), Gaps = 12/249 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C + +C H+++K+ +S+TY
Sbjct: 78 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VTQACALHNKFKAFESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 137 THDGRVFGIHYGSGHLLGVMARDELKVGSVRVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D M+EQ ++ + VFSF+L + + GGE+VFG D F ++
Sbjct: 197 FPQLAEEKGSPVFDTMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGANDESRFLPPINWI 255
Query: 257 PVTKKGYWQFEL------GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
PVT+KGYWQ +L G + ++S GC AIVD+GTSL+ GP + + I
Sbjct: 256 PVTQKGYWQIKLDAVKVQGALSFSDRSV----QGCQAIVDTGTSLIGGPARDILILQQFI 311
Query: 311 GGEGVVSAE 319
G + E
Sbjct: 312 GATPTANGE 320
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 61/98 (62%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+ + + P GE ++DC R+ ++P VSF I ++LS EQY+ + ++C SGF
Sbjct: 307 LQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLSGEQYVRRETLNNKQICFSGF 366
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
+ ++P P GP+WILGDVF+ ++++D G+ R+G A
Sbjct: 367 QSIEVPSPAGPVWILGDVFLSQVYSIYDRGENRVGLAR 404
>gi|426353119|ref|XP_004044046.1| PREDICTED: gastricsin [Gorilla gorilla gorilla]
Length = 388
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS------GVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDNSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 56/107 (52%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G L G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|395534129|ref|XP_003769100.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Sarcophilus
harrisii]
Length = 391
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 170/294 (57%), Gaps = 13/294 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDILPLKN 79
S G RI LKK + + R T KE+ ++ ++ L L +
Sbjct: 14 SEGFFRIPLKKGK-------SIRDTMKEKGVLEDFLKTHKYDPAKNYHFKDFSVALHLPS 66
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
++DA Y+GEI IG+PPQNF V+FDTG SNLWVPS C S +C H+++ +S+TY+
Sbjct: 67 YLDAAYYGEISIGTPPQNFLVLFDTGFSNLWVPSIYCQ-SQACSGHAQFSPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +QVF + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTITVQGIKVPNQVFGLSENEPGTNFVHAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L + GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNILTNPIFSFYLGNQQSSXNGGEVIFGGVDNNLYTGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A G +
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGAQ 299
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 57/107 (53%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S + + + G+ ++DC+ I ++P +SF I F LSP YIL
Sbjct: 286 QQYMSAFLQATGAQQDQYGQYVVDCNNIQSLPTISFLINGVQFPLSPSAYILNNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP G PLWILGDVF+ Y++V+D R+GFA AA
Sbjct: 342 YCTVGTEPTYLPFQNGQPLWILGDVFLRSYYSVYDMNNNRVGFATAA 388
>gi|147905812|ref|NP_001079036.1| gastricsin precursor [Xenopus laevis]
gi|12082174|dbj|BAB20797.1| pepsinogen C [Xenopus laevis]
gi|213625030|gb|AAI69665.1| Pepsinogen C [Xenopus laevis]
gi|213626584|gb|AAI69663.1| Pepsinogen C [Xenopus laevis]
Length = 383
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/296 (38%), Positives = 165/296 (55%), Gaps = 12/296 (4%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
CL L S G+ R+ LKK + + R +E + V PL
Sbjct: 10 CLQL---SEGIIRVPLKKFK-------SMREVMRENGIKAPLVDPATKYYNQYATAYEPL 59
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C H + +S+TY+
Sbjct: 60 SNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCQ-SQACTNHPLFNPSQSSTYS 118
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ + YG+GS++G D V + +V + Q F + E F+ A+FDGI+GL +
Sbjct: 119 SNQQQFSLQYGTGSLTGILGYDTVTIQNVAISQQEFGLSETEPGTNFVYAQFDGILGLAY 178
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
IAVG A V M++Q L+++ +F F+L+ ++ GGE+ FGGVD ++ G+ + P
Sbjct: 179 PSIAVGGATTVMQGMMQQNLLNQPIFGFYLSGQ-SSQNGGEVAFGGVDQNYYTGQIYWTP 237
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VT + YWQ + I Q+TG C GC AIVD+GTSLL P V + + +IG +
Sbjct: 238 VTSETYWQIGIQGFSINGQATGWCSQGCQAIVDTGTSLLTAPQSVFSSLIQSIGAQ 293
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 51/89 (57%), Gaps = 4/89 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++ C I +P +SFTI F L P Y+L+ G C G M LP G P
Sbjct: 298 GQYVVSCSNIQNLPTISFTISGVSFPLPPSAYVLQQSSG---YCTIGIMPTYLPSQNGQP 354
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D G ++GFA AA
Sbjct: 355 LWILGDVFLREYYSVYDLGNNQVGFATAA 383
>gi|151553998|gb|AAI49645.1| PGA5 protein [Bos taurus]
Length = 381
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 60 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 119 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 177
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 178 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 235
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 236 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 294
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 295 DSSGEVVISCSSIDSL 310
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 256 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 311
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 312 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 367
Query: 493 FDSGKLRIGFAEAA 506
FD G +IG A A
Sbjct: 368 FDRGNNQIGLAPVA 381
>gi|340506705|gb|EGR32788.1| hypothetical protein IMG5_070700 [Ichthyophthirius multifiliis]
Length = 389
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 102/259 (39%), Positives = 168/259 (64%), Gaps = 7/259 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
+ NFMDAQY+GE+ IG+PPQ+F VIFDTGSSNLWVPSS+C SI+C H+RY KS+T
Sbjct: 68 INNFMDAQYYGEVQIGTPPQSFQVIFDTGSSNLWVPSSECGILSIACRLHTRYDKTKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G +I YGSG +SG ++Q+ + +G + ++ EAT L+FL+++FDGI+GL
Sbjct: 128 YGKNGTHFDIKYGSGGVSGHWTQETIILGGLTAQNVTIGEATSMKGLSFLVSKFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +I+V +A PV+ ++EQG V + F+F+L + +EG ++ GG DP++ Y
Sbjct: 188 AYPKISVDNATPVFMKLIEQGKVQDGSFAFFLT-NKAGQEGSRLILGGFDPQYAATPFKY 246
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PV+ + +W ++ + +GN + + + AIVD+GTS++ GP V+ E+ + +G
Sbjct: 247 YPVSLEAWWVIDVDRVALGNTTYQIQK----AIVDTGTSVMVGPKSVIEEMKKQLPNQGK 302
Query: 316 VSAECKLVVSQYGDLIWDL 334
+C +S++ +L +++
Sbjct: 303 QKVDCS-TISEFPNLTFNI 320
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 53/101 (52%), Gaps = 1/101 (0%)
Query: 405 SYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCIS 464
S I E+ LPN G+ +DC I PN++F IG + L P YI++ G C+
Sbjct: 288 SVIEEMKKQLPN-QGKQKVDCSTISEFPNLTFNIGGDDYILEPADYIIQITSGSQSQCVL 346
Query: 465 GFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
G D+P P +ILGD F+ Y+T FD R+GFA A
Sbjct: 347 GLQGLDMPGPLAQAFILGDSFIHKYYTHFDQANKRVGFALA 387
>gi|292658855|ref|NP_001001600.2| pepsin A preproprotein [Bos taurus]
Length = 386
Score = 211 bits (536), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 65 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 124 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 183 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 241 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 299
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 300 DSSGEVVISCSSIDSL 315
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 66/134 (49%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 261 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 316
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 317 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 372
Query: 493 FDSGKLRIGFAEAA 506
FD G +IG A A
Sbjct: 373 FDRGNNQIGLAPVA 386
>gi|296471634|tpg|DAA13749.1| TPA: pepsin A precursor [Bos taurus]
Length = 367
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 51 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 109
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 110 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 168
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 169 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 226
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 227 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 285
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 286 DSSGEVVISCSSIDSL 301
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 64/129 (49%), Gaps = 10/129 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 247 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 302
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL++ +C SGF D+ G LWILGDVF+ Y TV
Sbjct: 303 DIVFTINGVQYPVPPSAYILQSNG----ICSSGFEGMDISTSSGDLWILGDVFIRQYFTV 358
Query: 493 FDSGKLRIG 501
FD G +IG
Sbjct: 359 FDRGNNQIG 367
>gi|57526769|ref|NP_001009804.1| chymosin precursor [Ovis aries]
gi|116405|sp|P18276.1|CHYM_SHEEP RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|1374|emb|CAA37209.1| preprochymosin [Ovis aries]
gi|229045|prf||1817165A prepro-chymosin
Length = 381
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSQGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVASV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R + +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRRLVAQDLFSVYMDR---SGQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 51/99 (51%), Gaps = 8/99 (8%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + + N GE IDCD + +MP V F I K++ L+P Y + EG C SGF
Sbjct: 290 IQQAIGATQNQYGEFDIDCDSLSSMPTVVFEINGKMYPLTPYAYTSQE-EGF---CTSGF 345
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ WILGDVF+ Y++VFD +G A+A
Sbjct: 346 QGEN----HSHQWILGDVFIREYYSVFDRANNLVGLAKA 380
>gi|169731523|gb|ACA64894.1| progastricsin (predicted) [Callicebus moloch]
Length = 388
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 164/283 (57%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + + +H D P+ ++MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLREFLKTHKHDPAWKYHFSDLRVSYEPM-DYMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ KS+TY+ ++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSSNEQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V Q F + E F+ A+FDGI+GL + ++VG A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPKQEFGLSENEPGTNFIYAKFDGIMGLAYPALSVGGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
M+++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMLQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ A G E
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSAFLEATGAE 299
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ LS E + + G+ +++CD I ++P ++F I F L P YIL + +G
Sbjct: 286 QQYLSAFLEATGAEEDEYGQFLVNCDSIQSLPTLTFIINGVEFPLPPSSYIL-SNDGYCT 344
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
V + + PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 345 VGVEP--TYLSSQNSQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|440893605|gb|ELR46308.1| Pepsin A, partial [Bos grunniens mutus]
Length = 388
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 67 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 126 YEATSETLSITYGTGSMTGVLGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 185 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 242
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 243 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 301
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 302 DSSGEVVISCSSIDSL 317
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 68/134 (50%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L T + SYI DS GE +I C I ++P
Sbjct: 263 GESIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDS----SGEVVISCSSIDSLP 318
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI + + P YIL++ +GI C SGF D+ G LWILGDVF+ Y TV
Sbjct: 319 DIVFTINGVQYPVPPSAYILQS-DGI---CSSGFEGMDISTSSGDLWILGDVFIRQYFTV 374
Query: 493 FDSGKLRIGFAEAA 506
FD G +IG A A
Sbjct: 375 FDRGNNQIGLAPVA 388
>gi|327271277|ref|XP_003220414.1| PREDICTED: renin-like [Anolis carolinensis]
Length = 398
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 178/320 (55%), Gaps = 13/320 (4%)
Query: 13 WVLA---SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-GAGVSGVRHRLG 68
WV A SC L SS+ +RI LKK +L I + + G+ +
Sbjct: 5 WVFAVVTSCFL-SFSSDAFQRIPLKKMPSIRETLQKMGIKVADFFPSLKHGIYFLNDGFY 63
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSR 127
+ + L N++D QY+GEI IG+P Q F V+FDTGS+NLWVPS +C +C H+R
Sbjct: 64 NGTAPTI-LTNYLDMQYYGEISIGTPAQIFKVVFDTGSANLWVPSQQCSPLYSACVSHNR 122
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S +S+TY G I YG G + GF SQD V V D+ V Q+F EA + F+ A
Sbjct: 123 YDSSRSSTYKPNGTEIAIQYGQGYVKGFLSQDIVRVADIPVV-QLFAEAIALPNKPFIYA 181
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
RFDG++G+G+ A+ +PV+D ++ + ++SEEVFS + +R+ + GGEI+ GG DP
Sbjct: 182 RFDGVLGMGYPSQAIDGVIPVFDKIISERVLSEEVFSVYYSRNSEMNTGGEIILGGSDPS 241
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++ G YV ++ GYW +L + +G++ C GC A VD+G+S + GP V+ +
Sbjct: 242 YYTGDFHYVSISTPGYWHIDLKGVSLGSEML-FCHEGCTAAVDTGSSFITGPASAVSILM 300
Query: 308 HAIGG----EGVVSAECKLV 323
+IG E ECK +
Sbjct: 301 KSIGATLLEERDYVVECKKI 320
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 57/86 (66%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ +++C +I +P++SF +GD+ + LS Y+L+ + E+C F AFD+PPP GP+W
Sbjct: 312 DYVVECKKIHLLPDISFHLGDRSYTLSGYAYVLQYSDYGKELCAVAFSAFDIPPPLGPIW 371
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILG F+G Y+T FD RIGFA +
Sbjct: 372 ILGATFIGQYYTEFDRQNNRIGFARS 397
>gi|149725185|ref|XP_001501907.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 167/303 (55%), Gaps = 18/303 (5%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL + L H N A + A G L+N+ D +YFG
Sbjct: 32 LRENGLLEDFLKQHPRNPASKYFPKEAATLAATEG--------------LENYKDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESISITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G + V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLRYNTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISSSGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++F G+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSS--DDESGSMVIFSGIDSSYYSGSLCWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+ I + +S C GGC AIVD+GTSLLAGP + I IG S+E + S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDYSSEAVISCSSI 313
Query: 328 GDL 330
L
Sbjct: 314 DSL 316
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 64/134 (47%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+S CS A+V L + + SYI D E++I C I ++P
Sbjct: 262 GESIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDY----SSEAVISCSSIDSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
++ FTI F LSP YIL+ + CISGF DL G LWILGDVF+ Y T+
Sbjct: 318 DIVFTINGVEFPLSPSAYILEEDDS----CISGFEGMDLDTSSGELWILGDVFIRQYFTI 373
Query: 493 FDSGKLRIGFAEAA 506
FD +I A A
Sbjct: 374 FDRANNQICLAPVA 387
>gi|326933881|ref|XP_003213026.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 389
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 174/316 (55%), Gaps = 32/316 (10%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV------RHR 66
W++ + L L GL R+ LKK + + R KE SGV HR
Sbjct: 3 WLIFTVLCLHLC-EGLLRVPLKKGK-------SIREVMKE--------SGVLHDYLANHR 46
Query: 67 LGDSDEDIL--------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
D PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C
Sbjct: 47 YYDPAYKFFSNFATAYEPLANSMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ- 105
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+ + +S+T++ + + YGSGS++G F D V + + + +Q F +
Sbjct: 106 SQACANHNEFNPNESSTFSTQNEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSET 165
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E FL + FDGI+GL F I+ G A V M+++ L+ +FSF+L+ + +GGE
Sbjct: 166 EPGTNFLYSPFDGILGLAFPAISAGGATTVMQQMLQENLLDSPIFSFYLSGQ-EGSQGGE 224
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++FGGV+P + G+ ++ PVT+ YWQ + D +G QS+G C GC AIVD+GTSLL
Sbjct: 225 LIFGGVNPNLYTGQISWTPVTQTTYWQIGIEDFTVGGQSSGWCSQGCQAIVDTGTSLLTV 284
Query: 299 PTPVVTEINHAIGGEG 314
P V +E+ IG +
Sbjct: 285 PNQVFSELMQYIGAQA 300
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 78/176 (44%), Gaps = 14/176 (7%)
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMAVVWVQN 392
+L+ G+ P QI Y GI E V S CS A+V
Sbjct: 224 ELIFGGVNPNLYTGQISWTPVTQTTYWQIGI----EDFTVGGQSSGWCSQGCQAIVDTGT 279
Query: 393 QLKQ--KQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQY 450
L Q +++ YI DS G+ + C I MP ++F I F L P Y
Sbjct: 280 SLLTVPNQVFSELMQYIGAQADS----NGQYVASCSNIEYMPTLTFVISGTSFPLPPSAY 335
Query: 451 ILKTGEGIAEVCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+L++ G C G + LP G PLWILGDVF+ VY++++D G R+GFA A
Sbjct: 336 MLQSNSG---YCTVGIESTYLPSETGQPLWILGDVFLRVYYSIYDMGNNRVGFATA 388
>gi|329665035|ref|NP_001192720.1| gastricsin precursor [Bos taurus]
Length = 391
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 176/306 (57%), Gaps = 15/306 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH------RL 67
VLA L + L +I LKK + + R KE+ + + +H R
Sbjct: 5 VLALVCLQALEAAALVKIPLKKFK-------SIREIMKEKGLLEDFLRTYKHDPAQKYRF 57
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R
Sbjct: 58 GDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTR 115
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E FL A
Sbjct: 116 FNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYA 175
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGGVD
Sbjct: 176 KFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNC 235
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+ G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P ++ +
Sbjct: 236 LYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALL 295
Query: 308 HAIGGE 313
A G +
Sbjct: 296 QATGAQ 301
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 74/161 (45%), Gaps = 11/161 (6%)
Query: 351 CAFNGAEY---VSTGIKTVVEKENVSAGDSAV--CSACEMAVVWVQNQLKQKQTKEKVLS 405
C + G Y V+ + + E GD A CS A+V L ++ LS
Sbjct: 235 CLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLT--VPQQFLS 292
Query: 406 YINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISG 465
+ + + + G+ +DC+ I +P ++ I F L P YIL + CI G
Sbjct: 293 ALLQATGAQEDQYGQFPVDCNNIQNLPTLTLVINGVQFPLPPASYILNNDD---SYCILG 349
Query: 466 FMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+P G PLWILGDVF+ Y++V+D G R+GFA A
Sbjct: 350 VEVTYVPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 390
>gi|222425198|dbj|BAH20548.1| pepsinogen A-36 [Pongo abelii]
Length = 388
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 175/307 (57%), Gaps = 27/307 (8%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSA 318
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG +G +
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVV 309
Query: 319 ECKLVVS 325
C + S
Sbjct: 310 SCSAISS 316
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + L P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPLPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|129780|sp|P27677.1|PEPA2_MACFU RecName: Full=Pepsin A-2/A-3; AltName: Full=Pepsin III-2/III-1;
Flags: Precursor
gi|38069|emb|CAA42427.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 179/314 (57%), Gaps = 28/314 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNFNPA-----SKYFPQAEAPTLI--------DEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ + S+TY + I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPQDSSTYQSTSGTVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAEC 320
+ I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 255 SVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSA 313
Query: 321 KLVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 314 ---ISSLPDIVFTI 324
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 51/91 (56%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++ C I ++P++ FTI + + P YIL++ CISGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGIQYPVPPSAYILQS----QGSCISGFQGMDVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|301625941|ref|XP_002942158.1| PREDICTED: pepsin A [Xenopus (Silurana) tropicalis]
Length = 384
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 161/288 (55%), Gaps = 20/288 (6%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N L+R+GL L + N A S L S ++L +N+MD +Y
Sbjct: 29 NRLQRLGLLGDYLKKYPYNPA--------------SKYFPTLAQSSAEVL--QNYMDIEY 72
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
+G I IG+PPQ F+VIFDTGS+NLWVPS C S +C H+R+ ++S T+ I
Sbjct: 73 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 131
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS+SGF D ++VG++ + +Q+F + E + FDGI+GL F IA A
Sbjct: 132 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 191
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM QGL+ + +FS +L+ D + G ++FGGVD ++ G +VP+T + YWQ
Sbjct: 192 TPVFDNMWSQGLIPQNLFSVYLSS--DGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 249
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I I Q C C AIVD+GTSL+ GPT + I + IG
Sbjct: 250 IILDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 296
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +I+C+ I MP + FTI + L P Y+ + +G C SGF A LP G L
Sbjct: 301 GQYVINCNNISNMPTIVFTINGVQYPLPPTAYVRQNQQG----CSSGFQAMTLPTNSGDL 356
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y VFD + A A
Sbjct: 357 WILGDVFIRQYFVVFDRTNNYVAMAPVA 384
>gi|12843350|dbj|BAB25952.1| unnamed protein product [Mus musculus]
Length = 396
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 172/304 (56%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ KK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPPKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 8/89 (8%)
Query: 406 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
Y+NEL ++ G+ + CD + ++P ++F + F LSP YI++ EG C
Sbjct: 292 YLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSSYIIQE-EG---SC 347
Query: 463 ISGFMAFDLPPPRG-PLWILGDVFMGVYH 490
+ G + L G PLWILGDVF+ Y+
Sbjct: 348 MVGLESLSLNAESGQPLWILGDVFLRSYY 376
>gi|395860891|ref|XP_003802735.1| PREDICTED: pepsin F-like [Otolemur garnettii]
Length = 470
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 148/481 (30%), Positives = 235/481 (48%), Gaps = 41/481 (8%)
Query: 37 RLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEI 89
R+ L + + R +E Y+ S V+ D +++ L NF+D Y G I
Sbjct: 18 RVPLMKVKSMRENLQENGMLKEYLEKYPYSPVKFLSKDQKKNVTYESLSNFLDLAYVGLI 77
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG---KSCEIN 146
IG+PPQ F V+FDTGS++LWVPS CY S SC H R+ + S+T+ ++ ++N
Sbjct: 78 SIGTPPQKFKVVFDTGSADLWVPSIFCY-SESCDKHRRFNPQNSSTFKLPPGNLRTVKLN 136
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YGSG I G D V++GD+ Q F+ +T+E S+ FDGI+GL + ++
Sbjct: 137 YGSGDIMGIVVSDTVKIGDLEDISQTFVLSTQEDSVFRFFTEFDGILGLAYPDLGQAGGT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ ++G +SE +F+F+L+ + ++ GGVD ++ G+ +VP+TK+ YWQ
Sbjct: 197 PVFDNIWKKGRISENLFAFYLSNGGKGDS--MLMLGGVDHSYYSGELRWVPLTKQQYWQV 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
L I + N + C GC AI+D+G+S++ GP V I + I S
Sbjct: 255 ALDSISM-NGTIIACHDGCQAILDTGSSVVNGPNACVLNIQNVIHAHQ----------SF 303
Query: 327 YGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDSAVCSACEMA 386
G + D + LP+ V G+ N + I+ V VS DS + +
Sbjct: 304 NGKYVIDCNTTTHLPDIVFVIGGV---NYPVPARSYIRKVAFNTCVSTFDSFPDTMFN-S 359
Query: 387 VVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGES--IIDCDRIPTMPNVSFTIGDKIFN 444
W+ + L + D N +G + +IDC+ +P++ F IG +
Sbjct: 360 NTWILGDV--------FLRLYFSVYDRANNRVGLASFVIDCNTTTHLPDIVFVIGGVSYP 411
Query: 445 LSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAE 504
+ YI K G C+S F + +WILGDVF+ +Y +V+D R+G A
Sbjct: 412 VPARSYIQKVAFG---TCVSTFKSLPNNVFSSKIWILGDVFLRLYFSVYDRANNRVGLAP 468
Query: 505 A 505
A
Sbjct: 469 A 469
>gi|444706401|gb|ELW47743.1| Cathepsin E [Tupaia chinensis]
Length = 396
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/248 (43%), Positives = 155/248 (62%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N + QY+G + IGSP QNFSV+FDTGSS+ WV S C S +C H+++ S +SNT
Sbjct: 67 PLTNSFNMQYYGTVSIGSPLQNFSVLFDTGSSDFWVTSVYC-ISPACEKHTKFFSSRSNT 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G + I YGSGS+SG D V VG + V DQ F E+ E F+ A FDGI+GL
Sbjct: 126 YSKKGSNFFIEYGSGSLSGITGVDRVSVGGLTVVDQEFGESVTEPGQHFVYAAFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ ++V A PV+DNM+ +V++ +FS +++ D + G E++FGG D HF G +
Sbjct: 186 GYPSLSVTGATPVFDNMIVHNMVAQPMFSVYMSSDIENGTGSELIFGGYDCSHFSGSLNW 245
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+G+WQ L + +G+ + C GC AIVD+GTS + GP + ++ AIG
Sbjct: 246 IPVTKQGFWQIALDGVQVGD-TMMFCSKGCQAIVDTGTSRIIGPLNKIERLHRAIGATLV 304
Query: 313 EGVVSAEC 320
G+ EC
Sbjct: 305 NGIYFVEC 312
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 8/92 (8%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDK----IFNLSPEQYILKT-GEGIAEVCISGFMAFDLPP 473
G ++C + MPNV+F I + LSP Y+L+ G+G+ +C SGF
Sbjct: 306 GIYFVECVNLTVMPNVTFIISGVPYFFFYTLSPTAYVLQALGDGM-RLCSSGFEGLHFLT 364
Query: 474 PRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
P WILGDVF+ +++VFD G R+G A A
Sbjct: 365 E--PSWILGDVFLRQFYSVFDRGNNRVGLAPA 394
>gi|71021685|ref|XP_761073.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
gi|46100637|gb|EAK85870.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
Length = 418
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/253 (43%), Positives = 155/253 (61%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q+F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQDFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M+ QGL+ SF+L E+GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGS--SEEDGGEAVFGGIDDSHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WSPVKRKGYWEVALDKLALGDEELELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 51/87 (58%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+++ +P ++F I + F L + Y+L+ + CIS F +LP P +
Sbjct: 335 GQYSVDCEKVKDLPPLTFYIDGQPFKLEGKDYVLE----VQGSCISSFSGINLPGPLADM 390
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GDVF+ Y++V+D GK +G A A
Sbjct: 391 LIVGDVFLRKYYSVYDLGKNAVGLATA 417
>gi|400598686|gb|EJP66395.1| vacuolar protease A [Beauveria bassiana ARSEF 2860]
Length = 395
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/308 (40%), Positives = 174/308 (56%), Gaps = 33/308 (10%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGD--------SDEDIL--------------PL 77
+H + +I E+ +G A H+LG S DI+ P+
Sbjct: 19 IHKMKLQKIPLAEQLVG-ASFEAQAHQLGQKYLGARPASRADIMFNNQVAESKDGHPVPV 77
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
NF +AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+ HS Y S S+TY
Sbjct: 78 TNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFLHSTYDSSSSSTYK 136
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G EI+YGSGS++GF S D V +GD+ +K+ F EAT E L F RFDGI+GLG+
Sbjct: 137 KNGSDFEIHYGSGSLTGFVSNDVVSIGDLTIKNTDFAEATSEPGLAFAFGRFDGILGLGY 196
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V VP + M+ Q L+ E VF+F+L + + G E +FGGVD H++GK Y+P
Sbjct: 197 DTISVNKMVPPFYQMINQKLIDEPVFAFYLGSE---DSGSEAIFGGVDKDHYEGKIEYIP 253
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---- 313
+ +K YW+ + I G++ + G I+D+GTSL PT + +N IG +
Sbjct: 254 LRRKAYWEVDFDAIAFGDEVAELENTGV--ILDTGTSLNTLPTDLAELLNKEIGAKKGFG 311
Query: 314 GVVSAECK 321
G S +CK
Sbjct: 312 GQYSIDCK 319
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 50/87 (57%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDC ++P+++FT+ + L YIL+ G C+S F D+P P GP+
Sbjct: 312 GQYSIDCKARDSLPDITFTLAGSNYTLPASDYILELGGS----CVSTFTPLDMPEPVGPI 367
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILGD F+ Y++V+D GK +G A A
Sbjct: 368 AILGDAFLRRYYSVYDLGKGAVGLARA 394
>gi|114637856|ref|XP_001145457.1| PREDICTED: pepsin A-5 isoform 6 [Pan troglodytes]
Length = 388
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 119/290 (41%), Positives = 169/290 (58%), Gaps = 23/290 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN A +Y + D PL+N++D
Sbjct: 28 LRRTLSERGLLKDFLKKHNLNPA-----SKYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|448113357|ref|XP_004202330.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359465319|emb|CCE89024.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/250 (41%), Positives = 156/250 (62%), Gaps = 8/250 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+++AQY+ IG+GSP Q F V+ DTGSSNLWVPS+ C S++C+ H++Y +S++
Sbjct: 91 PLENYLNAQYYTTIGLGSPVQEFKVVLDTGSSNLWVPSTDCS-SLACFLHTKYDHSESSS 149
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YGSGS+ G+ SQD + + + ++ Q F EAT E L F A+FDGI+GL
Sbjct: 150 YKQNGSEFAIRYGSGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLAFAFAKFDGILGL 209
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V + VP N + QGL+ E F+F+L ++D D +GG FGGVD KH+KG
Sbjct: 210 AYDTISVNNIVPPIYNAINQGLLDEPKFAFYLGDKDKDENDGGVATFGGVDTKHYKGDIV 269
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 ELPIRRKAYWEVSFDGIGLGDEYAELTSTGAA--IDTGTSLITLPSSLAEIINAKIGAKK 327
Query: 314 ---GVVSAEC 320
G S +C
Sbjct: 328 SWSGQYSVDC 337
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DCD ++P ++ T F LSP +Y L+ G CIS F D P P G +
Sbjct: 331 GQYSVDCDSRDSLPELTMTFHGHNFTLSPYEYTLEVGGS----CISAFTPMDFPKPIGDM 386
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D GK +G AE+
Sbjct: 387 AIVGDSFLRKYYSVYDLGKNVVGLAES 413
>gi|1065259|pdb|1PSO|E Chain E, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|5542461|pdb|1QRP|E Chain E, Human Pepsin 3a In Complex With A Phosphonate Inhibitor
Iva-Val-Val- Leu(P)-(O)phe-Ala-Ala-Ome
gi|157833570|pdb|1PSN|A Chain A, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|361132440|pdb|3UTL|A Chain A, Human Pepsin 3b
Length = 326
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 160/258 (62%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|19921120|ref|NP_609458.1| CG17134 [Drosophila melanogaster]
gi|7297766|gb|AAF53016.1| CG17134 [Drosophila melanogaster]
gi|17944939|gb|AAL48533.1| RE02351p [Drosophila melanogaster]
gi|220947772|gb|ACL86429.1| CG17134-PA [synthetic construct]
gi|220957078|gb|ACL91082.1| CG17134-PA [synthetic construct]
Length = 391
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 147/236 (62%), Gaps = 4/236 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+T
Sbjct: 68 LHNSMNNEYYGVIAIGTPEQRFNILFDTGSANLWVPSASCPASNTACQRHNKYDSSASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SGF S D V + + +++Q F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSNDIVTIAGISIQNQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNM+ QGL+ E V SF+L R A GGE++ GG+D ++G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMISQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VPV+ YWQF++ I T +C GC AI D+GTSL+A P +IN +G
Sbjct: 248 VPVSVPAYWQFKVNT--IKTNGTLLCN-GCQAIADTGTSLIAVPLAAYRKINRQLG 300
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 36/88 (40%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE+ + C R+ ++P V+ IG +F L+P YI+K + C+S F +
Sbjct: 306 GEAFVRCGRVSSLPKVNLNIGGTVFTLAPRDYIVKVTQNGQTYCMSAFTYME----GLSF 361
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+G ++TVFD G RIGFA A
Sbjct: 362 WILGDVFIGKFYTVFDKGNERIGFARVA 389
>gi|395838792|ref|XP_003792290.1| PREDICTED: renin [Otolemur garnettii]
Length = 404
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 184/305 (60%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG----VRHRL--GD 69
SC + L ++ RI LKK + + R KER + A +S RL G+
Sbjct: 19 SCTISLSTDTSAFSRIFLKK-------MPSVREKLKERGVDMARLSAEWSQFTRRLSSGN 71
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
S ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +C HS Y
Sbjct: 72 STSSVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLY 130
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S S++Y E G I YG+G + GF SQD V VG + V Q F E T + F+LA+
Sbjct: 131 DSSDSSSYMENGTEFTIQYGTGKVKGFLSQDVVTVGGLTVT-QGFGEVTELPLMPFMLAK 189
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF AVG PV+DN++ Q ++ E+VFS + +R+ GGEIV GG DP++
Sbjct: 190 FDGVLGMGFPAQAVGGITPVFDNILSQRVLKEDVFSVYYSRNSHL-LGGEIVLGGSDPQY 248
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++G YV ++K G WQ ++ + + +T +CE GC A+VD+G S ++GPT + +
Sbjct: 249 YQGNFHYVSISKTGSWQIKMKGVSV-RSTTLLCEDGCMAVVDTGASYISGPTSSLRLLMK 307
Query: 309 AIGGE 313
A+G +
Sbjct: 308 ALGAQ 312
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 54/88 (61%)
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
M E +++C+++P +P++SF +G + + L+ Y+L+ ++C F D+ PP GP
Sbjct: 316 MNEYVVNCNQVPALPDISFHLGGRAYTLTSVDYVLQDPYSSNDLCTLAFHGLDVSPPTGP 375
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LW+LG FM ++T FD RIGFA A
Sbjct: 376 LWVLGASFMRKFYTEFDRHNNRIGFALA 403
>gi|16974928|pdb|1FLH|A Chain A, Crystal Structure Of Human Uropepsin At 2.45 A Resolution
Length = 326
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 154/240 (64%), Gaps = 6/240 (2%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 295
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|195583376|ref|XP_002081498.1| GD11051 [Drosophila simulans]
gi|194193507|gb|EDX07083.1| GD11051 [Drosophila simulans]
Length = 399
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 177/319 (55%), Gaps = 32/319 (10%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W+L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWLLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGDWSSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F
Sbjct: 116 CASTMVACRVHNRYFAKRSKSHQARGDRFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSF-WLNRDPDA 233
EAT FL A+FDGI GL + I++ P + M+EQGL+++ +F+ + +P
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYHSISMQRIKPPFYAMMEQGLLTKPIFNMARMMVEP-- 233
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GT
Sbjct: 234 -----IFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGT 286
Query: 294 SLLAGPTPVVTEINHAIGG 312
S LA P IN +IGG
Sbjct: 287 SFLALPYDQAILINESIGG 305
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 63/99 (63%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
INE P+ G+ ++ CD +P +P ++FT+G + F L +Y+ + +C S F
Sbjct: 299 INESIGGTPSSFGQFLVACDSVPALPRITFTLGGRTFFLESHEYVFQDIYQDRRICSSAF 358
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+A DLP P GPLWILGDVF+G Y+T FD + RIGFA+A
Sbjct: 359 IAVDLPSPSGPLWILGDVFLGKYYTEFDMERHRIGFADA 397
>gi|129776|sp|P03954.2|PEPA1_MACFU RecName: Full=Pepsin A-1; AltName: Full=Pepsin III-3; Flags:
Precursor
gi|38075|emb|CAA42424.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/297 (40%), Positives = 173/297 (58%), Gaps = 25/297 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAS-----KYFPQAEAPTLI--------DEQPLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + + S+TY + I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPQDSSTYQSTSGTLSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ +QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 PVFDNIWDQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 255 SVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVS 310
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++ C I ++P++ FTI + + P YIL++ C SGF D+P
Sbjct: 302 NSDGEMVVSCSAISSLPDIVFTINGIQYPVPPSAYILQS----QGSCTSGFQGMDVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|18959216|ref|NP_579818.1| gastricsin precursor [Rattus norvegicus]
gi|129798|sp|P04073.1|PEPC_RAT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|56881|emb|CAA28305.1| unnamed protein product [Rattus norvegicus]
gi|206083|gb|AAA41827.1| pepsinogen [Rattus norvegicus]
gi|149069457|gb|EDM18898.1| progastricsin (pepsinogen C) [Rattus norvegicus]
Length = 392
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 180/316 (56%), Gaps = 30/316 (9%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-----------S 61
W++ + L LP L R+ L+K + + R T KE+ GV
Sbjct: 3 WMVVALLCLPLLEASLLRVPLRKMK-------SIRETMKEQ-----GVLKDFLKTHKYDP 50
Query: 62 GVRHRLGD-SDEDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
G ++ G+ D +L P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWV S C
Sbjct: 51 GQKYHFGNFGDYSVLYEPMA-YMDASYFGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ- 108
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+R+ KS+TY G++ + YG+GS++GFF D + V + V +Q F +
Sbjct: 109 SEACTTHARFNPSKSSTYYTEGQTFSLQYGTGSLTGFFGYDTLTVQSIQVPNQEFGLSEN 168
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E F+ A+FDGI+GL + ++ G A M+ +G +S+ +F +L GG+
Sbjct: 169 EPGTNFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQ 227
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLA 297
IVFGGVD + G+ T+VPVT++ YWQ + D LIG+Q++G C GC IVD+GTSLL
Sbjct: 228 IVFGGVDKNLYTGEITWVPVTQELYWQITIDDFLIGDQASGWCSSQGCQGIVDTGTSLLV 287
Query: 298 GPTPVVTEINHAIGGE 313
P ++E+ IG +
Sbjct: 288 MPAQYLSELLQTIGAQ 303
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 8/104 (7%)
Query: 406 YINELCDSL---PNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVC 462
Y++EL ++ GE + CD + ++P +SF + F LSP YI++ C
Sbjct: 292 YLSELLQTIGAQEGEYGEYFVSCDSVSSLPTLSFVLNGVQFPLSPSSYIIQE----DNFC 347
Query: 463 ISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ G + L G PLWILGDVF+ Y+ +FD G ++G A +
Sbjct: 348 MVGLESISLTSESGQPLWILGDVFLRSYYAIFDMGNNKVGLATS 391
>gi|388856266|emb|CCF50075.1| probable PEP4-aspartyl protease [Ustilago hordei]
Length = 418
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/253 (42%), Positives = 155/253 (61%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSNKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M+ QGL+ SF+L ++GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGS--SEQDGGEAVFGGIDESHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G+++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WAPVKRKGYWEVALDKLALGDEALELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
Score = 62.4 bits (150), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+++ +P ++F I K F L + Y+L + CIS F +LP P +
Sbjct: 335 GQYSVDCEKVKDLPPLTFYIDGKPFKLEGKDYVLD----VQGSCISSFSGINLPGPLANM 390
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GDVF+ Y++V+D K +G A A
Sbjct: 391 LIVGDVFLRKYYSVYDLAKNAVGLAAA 417
>gi|354497176|ref|XP_003510697.1| PREDICTED: chymosin-like [Cricetulus griseus]
gi|344243543|gb|EGV99646.1| Chymosin [Cricetulus griseus]
Length = 379
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 175/313 (55%), Gaps = 20/313 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE + +S + + D + PL N++D++YFG I I
Sbjct: 21 RIPLHKGTSLRNTLKEHGLLEDFLSRHQSEFSEKDSNTGMVANEPLTNYLDSEYFGTIYI 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F+V+FDTGSS LWVPS C S C H R+ KS T+ + K + YG+G
Sbjct: 81 GTPPQEFTVVFDTGSSELWVPSVYCS-SRVCQNHHRFDPSKSFTFQNLSKPLFVQYGTGR 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF D V + D+VV Q +T+E F+ + FDGI+GL + +A +VP++DN
Sbjct: 140 MQGFLGYDTVTISDIVVPHQTVGLSTQEPGEIFIYSPFDGILGLSYPSLASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFVGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 331
I N C+GGC A++D+GT+LLAGP + I AIG V QYG
Sbjct: 257 TI-NDEVVACQGGCTAVLDTGTALLAGPGRDILNIQQAIGA----------VQGQYGQFK 305
Query: 332 WDLLVSGLLPEKV 344
+ G++P V
Sbjct: 306 INCWRLGIMPTIV 318
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 50/99 (50%), Gaps = 10/99 (10%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
I + ++ G+ I+C R+ MP + F I + F L P Y T + + + C SGF
Sbjct: 290 IQQAIGAVQGQYGQFKINCWRLGIMPTIVFEIHGRKFPLPPSAY---TNQEL-DSCSSGF 345
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+WILGDVF+ +++VFD R+G A+A
Sbjct: 346 KL------GSHIWILGDVFIREFYSVFDRANNRVGLAKA 378
>gi|222425200|dbj|BAH20549.1| pepsinogen A-50 [Pongo abelii]
Length = 388
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/290 (41%), Positives = 168/290 (57%), Gaps = 23/290 (7%)
Query: 28 LRRI----GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
LRR GL K L H+LN AR +Y + D PL+N++D
Sbjct: 28 LRRTLSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDM 74
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETV 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V+VG + Q+F + T GS + A FDGI+GL + I+
Sbjct: 134 SIAYGTGSMTGILGYDTVQVGGISDTSQIFGLSETEPGSFLYY-APFDGILGLAYPSISS 192
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +G
Sbjct: 193 SGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEG 250
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 251 YWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/91 (40%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF ++P
Sbjct: 302 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNVPTES 357
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 358 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 388
>gi|114572170|ref|XP_001163076.1| PREDICTED: cathepsin E isoform 1 [Pan troglodytes]
Length = 363
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 1/229 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
VPVTK+ YWQ L ++L + C + + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPLTESPIPSAQLPTPYWT 296
>gi|23110952|ref|NP_683865.1| cathepsin E isoform b preproprotein [Homo sapiens]
gi|7339518|emb|CAB82849.1| cathepsin E, alternative [Homo sapiens]
gi|119611999|gb|EAW91593.1| cathepsin E, isoform CRA_b [Homo sapiens]
Length = 363
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 1/229 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
VPVTK+ YWQ L ++L + C + + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPLTESPIPSAQLPTPYWT 296
>gi|431896476|gb|ELK05888.1| Chymosin [Pteropus alecto]
Length = 348
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 171/281 (60%), Gaps = 10/281 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R KER + + R+ + + + PL N++D+QYFG+I I
Sbjct: 21 RVPLHKGKSLRKALKERGLLEDFLRTHRYAISKENSGVGKVAREPLVNYLDSQYFGKISI 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ+F+V+FDTGSS+LWVPS C S +C H R+ S +S+T+ ++G+ I YG+GS
Sbjct: 81 GTPPQDFTVVFDTGSSDLWVPSVYCK-SDACKNHRRFNSSESSTFQKLGQPLSIQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ G D V V ++V Q +T+E F FDGI+GL + +A D+VPV+DN
Sbjct: 140 MEGILGSDTVTVSNIVDSRQTVGLSTQEPGDVFTYFEFDGILGLAYPSLAAKDSVPVFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M++ LV++++FS +++R+ ++G + G +D +++G +VPVT + YWQF + +
Sbjct: 200 MMKHHLVAQDLFSVYMSRN---DQGSMLTLGAIDSSYYRGSLHWVPVTVREYWQFTVDSV 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ + C+GGC AI+D+GTS+L GP+ + I AIG
Sbjct: 257 TV-DGVVVACDGGCQAILDTGTSMLVGPSSDILNIQQAIGA 296
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 26/46 (56%), Gaps = 4/46 (8%)
Query: 460 EVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ C SGF D LWILGDVF+ Y +VFD R+G A+A
Sbjct: 306 DFCTSGFQGED----DSQLWILGDVFIREYFSVFDRANNRVGLAKA 347
>gi|355329699|dbj|BAL14143.1| pepsinogen 2 [Pagrus major]
Length = 377
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/253 (41%), Positives = 162/253 (64%), Gaps = 9/253 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G + IG+PPQ+F+VIFDTGSSNLW+PS C S +C H ++ ++S+T+
Sbjct: 62 MTNDADLSYYGVVSIGTPPQSFTVIFDTGSSNLWIPSVYCN-SQACQNHKKFNPQQSSTF 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G+ + D VEVG + V +QVF + E + +A DGI+GL
Sbjct: 121 KWGNEALSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISQTEAAFMASMAA-DGILGLA 179
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+DNM++QGLVS+ +FS +L+ ++E+G E+VFGG D H+ G+ T++
Sbjct: 180 FQSIASDNVVPVFDNMIKQGLVSQPMFSVYLSG--NSEQGSEVVFGGTDSNHYTGQITWI 237
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 238 PLSSATYWQISMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQ 296
Query: 314 -GVVSAECKLVVS 325
G + C+ + S
Sbjct: 297 YGEATVNCQNIQS 309
Score = 58.9 bits (141), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 10/132 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G + CS A++ L T + ++ +N + N GE+ ++C I +MP+V
Sbjct: 256 GQTVACSGGCQAIIDTGTSLIVGPTND--INNMNSWVGASTNQYGEATVNCQNIQSMPDV 313
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
+FT+ F + Y+ ++ G C +GF LWILGDVF+ Y+ VF+
Sbjct: 314 TFTLNGHAFTVPASAYVSQSYYG----CSTGFGQ----GGSQQLWILGDVFIREYYAVFN 365
Query: 495 SGKLRIGFAEAA 506
+ IG A++A
Sbjct: 366 AQSQYIGLAKSA 377
>gi|290993274|ref|XP_002679258.1| predicted protein [Naegleria gruberi]
gi|284092874|gb|EFC46514.1| predicted protein [Naegleria gruberi]
Length = 316
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 7/231 (3%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+G + +G+P QNF VIFDTGSSN+WVPS C+ SI+C H+RY KS+TY G+
Sbjct: 1 QYYGFVSLGTPQQNFKVIFDTGSSNVWVPSESCW-SITCLLHNRYDHTKSSTYVANGQKF 59
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSG ++GF SQD + G + VK QVF E E L FL + DGI+G+ F I+V
Sbjct: 60 NITYGSGGVNGFLSQDALSCGGIPVKGQVFGEVMSEQGLAFLFGKSDGIVGMAFPSISVD 119
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P+++NM+ Q LV + +FSF+L++ ++ GG+D K++ G TYVP+ + Y
Sbjct: 120 GVTPMFNNMMNQKLVDKNLFSFYLSKT-SGSTASAMILGGIDTKYYTGPLTYVPLANRTY 178
Query: 264 WQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPT----PVVTEINHA 309
W + D+ +G GVC GGC A VD+GTSL+AGP P++ +N A
Sbjct: 179 WAIRINDVGVGGDYKGVCPPGGCLAAVDTGTSLIAGPALKIGPIIESLNIA 229
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 50/82 (60%)
Query: 424 DCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGD 483
DC I + P+V+F IG + L P Y+LK + CI+GFM LPP G WILGD
Sbjct: 231 DCSNIDSNPDVTFKIGGVEYTLKPRDYVLKMTQFGQSECIAGFMPLALPPQFGDFWILGD 290
Query: 484 VFMGVYHTVFDSGKLRIGFAEA 505
VF+ Y+TVFD R+GFA+A
Sbjct: 291 VFISTYYTVFDYDGSRVGFAKA 312
>gi|157836875|pdb|3PSG|A Chain A, The High Resolution Crystal Structure Of Porcine
Pepsinogen
Length = 370
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 49 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 108 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 166
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 167 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 224
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 225 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 283
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 284 NSDGEMVIS 292
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 245 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 302
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 303 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 358
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 359 RANNKVGLAPVA 370
>gi|164604|gb|AAA31096.1| pepsinogen A precursor [Sus scrofa]
Length = 385
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 64 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 123 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 182 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 240 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 298
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 299 NSYGEMVIS 307
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 260 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSYGEMVISCSSIDSLPDI 317
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 318 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 373
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 374 RANNKVGLAPVA 385
>gi|157837066|pdb|5PEP|A Chain A, X-Ray Analyses Of Aspartic Proteases. Ii.
Three-Dimensional Structure Of The Hexagonal Crystal
Form Of Porcine Pepsin At 2.3 Angstroms Resolution
Length = 326
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 162/257 (63%), Gaps = 14/257 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVSAECKLVVS 325
GE V+S C + S
Sbjct: 240 NSDGEMVIS--CSSIAS 254
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIASLPDI 258
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|253723303|pdb|2PSG|A Chain A, Refined Structure Of Porcine Pepsinogen At 1.8 Angstroms
Resolution
Length = 370
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 49 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 108 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 166
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 167 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 224
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 225 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 283
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 284 NSDGEMVIS 292
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 245 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 302
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 303 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 358
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 359 RANNKVGLAPVA 370
>gi|327271205|ref|XP_003220378.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 167/290 (57%), Gaps = 4/290 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFMDA 83
S GL R+ LK+ + ++ + E ++ V +++ + + P+ N++++
Sbjct: 14 SEGLERVILKRGKSIRENMKEKGVL--EEFLKKNHVDPALKYHFNEYNVAYEPITNYLNS 71
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+ D+GSSNLWVPS C + +C H+R+ S+TY+ G++
Sbjct: 72 YYFGEISIGTPPQNFLVVMDSGSSNLWVPSVYC-DTAACAKHNRFSPSASSTYSNSGQTY 130
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YG+G ++ D V V ++VV +Q F + E F A FDGI+G+ + +AVG
Sbjct: 131 TLYYGAGDLTVMLGYDTVMVQNIVVTNQEFGLSENEPMTPFYYASFDGIMGMAYPSLAVG 190
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
V M+ QG +SE +FSF+ +R P + GGE++ GGVD + F G ++ PVT++ Y
Sbjct: 191 GTATVMQQMLNQGQLSEPIFSFYFSRQPTVQYGGELILGGVDTQLFSGDVSWAPVTREVY 250
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
WQ + + IGN++TG C GC AIVD+GT L P A+G E
Sbjct: 251 WQIGVEEFAIGNEATGWCSEGCQAIVDTGTCQLTIPRQYFDTFLQAVGAE 300
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE +++C+ + MP ++F I F L P Y+ V ++A PL
Sbjct: 304 GELLVNCNNVQNMPTITFVINGAQFPLPPSAYVANNDGYCTVVVEPTYLASQ---SGEPL 360
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+ Y++VFD R+GFA +A
Sbjct: 361 WILGDVFLKEYYSVFDMANNRVGFALSA 388
>gi|118572685|sp|P00791.3|PEPA_PIG RecName: Full=Pepsin A; Flags: Precursor
Length = 385
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 64 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 123 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 182 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 240 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 298
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 299 NSDGEMVIS 307
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 260 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 317
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 318 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 373
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 374 RANNKVGLAPVA 385
>gi|195501958|ref|XP_002098019.1| GE10129 [Drosophila yakuba]
gi|194184120|gb|EDW97731.1| GE10129 [Drosophila yakuba]
Length = 396
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 164/287 (57%), Gaps = 7/287 (2%)
Query: 45 AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDT 104
A R + +Y G R G + E L N ++ +Y G I IGSP Q F+++FDT
Sbjct: 45 AGRSSLLAKYNVAGGQEAATLRNGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDT 101
Query: 105 GSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV 163
GS+NLWVPS++C S++C+ H RY + S+T+ G+ I YG+GS+SG +QD V +
Sbjct: 102 GSANLWVPSAECSPKSVACHRHHRYNASASSTFVPDGRRFSIAYGTGSLSGILAQDMVTI 161
Query: 164 GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVF 223
G +VV++Q F AT E TF+ F GI+GLGFR +A P++++M EQ LV E VF
Sbjct: 162 GQLVVRNQTFAMATHEPGPTFVDTNFAGIVGLGFRPLAEQRIKPLFESMCEQQLVDECVF 221
Query: 224 SFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG 283
SF+L R+ GGE++FGG+D F G TYVP+T YWQF L I +G +
Sbjct: 222 SFYLKRNGSERMGGELLFGGLDKTKFSGTLTYVPLTHAAYWQFPLDAIEVGGTAISHHR- 280
Query: 284 GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 281 --QAIADTGTSLLAAPPREYLIINSLLGGLPTANNEYLLNCSEIDSL 325
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 465
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 301 INSLLGGLPTANNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSRICLSA 360
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F + WILGDVF+G Y+T FD+G+ RIGFA AA
Sbjct: 361 FTLME-----AEFWILGDVFIGRYYTAFDAGQRRIGFAPAA 396
>gi|24647683|ref|NP_650623.1| CG5863 [Drosophila melanogaster]
gi|7300255|gb|AAF55418.1| CG5863 [Drosophila melanogaster]
Length = 395
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 180/325 (55%), Gaps = 17/325 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-----KERYMGGAGVSGVRHR 66
LW+L CL L RI ++ + + S R R K +GG V+ R
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKYNVVGGQEVTS---R 65
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
G + E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H
Sbjct: 66 NGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHH 122
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY + S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 123 HRYNASASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVQNQTFGMATHEPGPTFV 182
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
F GI+GLGFR IA P++++M +Q LV E VFSF+L R+ +GGE++FGGVD
Sbjct: 183 DTNFAGIVGLGFRPIAELGIKPLFESMCDQQLVDECVFSFYLKRNGSERKGGELLFGGVD 242
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
F G TYVP+T GYWQF L I + AI D+GTSLLA P
Sbjct: 243 KTKFSGSLTYVPLTHAGYWQFPLDVIEVAGTRINQNR---QAIADTGTSLLAAPPREYLI 299
Query: 306 INHAIGGEGVVSAECKLVVSQYGDL 330
IN +GG + E L S+ L
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSL 324
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 58/101 (57%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 465
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSSICLSA 359
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD+G+ RIGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGQRRIGFAPAA 395
>gi|407726059|dbj|BAM46127.1| pepsinogen C [Cynops pyrrhogaster]
Length = 385
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/276 (38%), Positives = 158/276 (57%), Gaps = 3/276 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH + R E + V G ++RL + PL N+MD Y+GEI IG+PP
Sbjct: 19 RVPLHKFKSMRQVMIEHGLKVPWVDPGTKYRLNNFAVASEPLTNYMDMSYYGEISIGTPP 78
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
QNF V+FDTGSSNLWV S+ C S +C H + +S+TY+ + I YG+GS++G
Sbjct: 79 QNFLVLFDTGSSNLWVASTYCS-SSACTNHPLFNPSQSSTYSTENQQFSIQYGTGSLTGI 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V + + + Q F + E F+ A+FDGI+GL + IA A V + M+ Q
Sbjct: 138 LGYDTVSIQGLSITQQEFALSINEPGSNFVYAQFDGILGLAYPSIAADGATTVMEGMMNQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S+ +F F+++ + + GGE++FGGVD ++ G+ T+ PVT++ YWQ + +
Sbjct: 198 GLLSQNIFGFYMSEE-GTQPGGELIFGGVDSNYYTGEITWTPVTQQMYWQIGIQGFAVNG 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q TG C GC IVD+GTSLL P + + IG
Sbjct: 257 QETGWCSQGCQGIVDTGTSLLTAPGQYMAALMQDIG 292
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++ C + ++P +SFTIG L P YI++ + C G MA LP G P
Sbjct: 299 GQYVVTCSSVTSLPTLSFTIGGTSLPLPPSAYIVQG----SAACTVGIMATYLPSQDGQP 354
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ Y++++D R+GFA +
Sbjct: 355 LWILGDVFLRQYYSIYDVTNNRVGFATS 382
>gi|345318884|ref|XP_001520972.2| PREDICTED: renin-like [Ornithorhynchus anatinus]
Length = 388
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 108/266 (40%), Positives = 168/266 (63%), Gaps = 11/266 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N++DAQYFGEIGIGSP Q F VIFDTGS+NLWVPS C +C H+ Y + +S T
Sbjct: 58 LTNYLDAQYFGEIGIGSPAQTFKVIFDTGSANLWVPSINCKPIHSACETHNLYDASQSQT 117
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+Y SG++ GF SQD V +G + V Q+F E T + +F+ A+FDG++G+
Sbjct: 118 YMENGTQIAISYVSGTVKGFLSQDLVTIGGIPVI-QMFAEITTLPTSSFMYAKFDGVLGM 176
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE---GGEIVFGGVDPKHFKGK 252
G+ A+G PV+D+++ Q ++ E+VFS + +R+ + GGEI+ GG DP +++G
Sbjct: 177 GYPAQAIGGITPVFDHILTQHVLKEDVFSVYYSRNSKNDHMVPGGEIILGGRDPTYYQGD 236
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+ V+KKG+WQ + + + +++ C+ GCAA+VD+G +L+ GP V + +G
Sbjct: 237 FYYLDVSKKGFWQVNMKGVSV-DRTLQFCQEGCAAMVDTGATLITGPVKDVKHMMDILGA 295
Query: 313 EGV----VSAECKLVVSQYGDLIWDL 334
+ + + +CK V+Q D+ + L
Sbjct: 296 QKIGGNMYAVDCK-EVAQLPDISFHL 320
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 48/83 (57%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
+DC + +P++SF +G ++F LS Y+L+ + +C F D+ PP GPLW+LG
Sbjct: 305 VDCKEVAQLPDISFHLGGRVFPLSSSDYVLQDSDFDDVLCPLAFKGVDVHPPLGPLWVLG 364
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
F+ Y+ FD RIGFA A
Sbjct: 365 ASFIRRYYIEFDRQNNRIGFAMA 387
>gi|298706992|emb|CBJ29800.1| aspartyl protease [Ectocarpus siliculosus]
Length = 410
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 187/337 (55%), Gaps = 31/337 (9%)
Query: 5 LLRSVFCLWVLASCLLLPASSNG---LRRIGLKKR--------RLD----LHSLNAARIT 49
+ R+ L VL S LL N + R+ L KR +LD H A +
Sbjct: 1 MARASSVLTVLGSLLLASTCHNASAAVHRVKLSKRPDKEFVNSKLDKAHHRHHEGADEPS 60
Query: 50 RKERYMGGAGVSG-----VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDT 104
R + + A + G + L S E + +K++ +AQY+G++ IG+PPQ+F VIFDT
Sbjct: 61 RHDEGVLQANLRGAVEQVLMSELEASGEGKVIVKDYQNAQYYGQVEIGTPPQSFEVIFDT 120
Query: 105 GSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVG 164
GS+NLWV SKC +SC HSRY + KS+T+ E G+ EI Y SG +SG S D V G
Sbjct: 121 GSANLWVAGSKC--GLSCGLHSRYAASKSSTHAEDGRDFEITYASGPVSGSLSADTVTWG 178
Query: 165 DVVVKDQVFIEA--TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
+ +KDQ F E + L F+L +FDGI+GL F EI+V + +VE+G + + V
Sbjct: 179 GIQLKDQTFAEVQDAKGLGLAFILGKFDGIMGLAFDEISVEGVPTPFGRLVEEGELDDAV 238
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
F+F+L ++ GE++ GG DP H+ + YVPVTKKGYWQ ++ ++ + S +
Sbjct: 239 FAFYLGN----QKEGELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDNVDVSGSSVTSVK 294
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+AI+DSGTSLL GP V +I +G ++ E
Sbjct: 295 ---SAILDSGTSLLVGPKEDVKKIASKVGAISFMNGE 328
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 87/172 (50%), Gaps = 15/172 (8%)
Query: 333 DLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTV-VEKENVSAGDSAVCSACEMAVVWVQ 391
+L++ G P+ +I Y + V V +V++ SA+ + +V +
Sbjct: 250 ELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDNVDVSGSSVTSVKSAILDSGTSLLVGPK 309
Query: 392 NQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYI 451
+K+ +K +S++N GE ++ C +P ++FTIG K + L ++Y+
Sbjct: 310 EDVKKIASKVGAISFMN----------GEYLMPCSS--DLPPLTFTIGGKEYTLEGDEYV 357
Query: 452 LKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFA 503
+ G +VCI M D+P P GPLWILGDVFM Y+TVFD G +IG A
Sbjct: 358 ISAGND--KVCILAIMGMDIPEPMGPLWILGDVFMRKYYTVFDYGNAQIGLA 407
>gi|443894057|dbj|GAC71407.1| aspartyl protease [Pseudozyma antarctica T-34]
Length = 418
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 108/253 (42%), Positives = 154/253 (60%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M++QGL+ SF+L +GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMIDQGLLDAPQVSFYLGS--SEADGGEAVFGGIDDSHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WAPVKRKGYWEVALDKLALGDEELELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 52/87 (59%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC+++ +P ++F I K F L + Y+L+ + CIS F +LP P +
Sbjct: 335 GQYSVDCEKVKDLPPLTFYIDGKPFKLEGKDYVLE----VQGSCISSFSGINLPGPLADM 390
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GDVF+ Y++V+D G+ +G AEA
Sbjct: 391 LIVGDVFLRKYYSVYDLGRNAVGLAEA 417
>gi|157836865|pdb|3PEP|A Chain A, Revised 2.3 Angstroms Structure Of Porcine Pepsin.
Evidence For A Flexible Subdomain
Length = 326
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|410082415|ref|XP_003958786.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
gi|372465375|emb|CCF59651.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
Length = 416
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/256 (42%), Positives = 150/256 (58%), Gaps = 3/256 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF +I IGSP Q F VI DTGSSNLWVPS C S++C+ H++Y R S+
Sbjct: 89 VPLNNYLNAQYFADISIGSPGQTFRVIMDTGSSNLWVPSVDCN-SLACFLHNKYDHRVSS 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSG++ G+ S D V VGD+ + Q F EAT E L F +FDGI G
Sbjct: 148 TYVRNGTRFAIRYGSGALEGYMSNDTVTVGDLQIPKQDFAEATSEPGLAFAFGKFDGIFG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F I+V AVP + N V +GL+ F+F+L +EGGE+ FGG D F G T
Sbjct: 208 LAFDTISVNRAVPPFYNAVNRGLLDAPQFAFYLGDKRLRKEGGEVTFGGYDETRFTGNIT 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
++PV ++ YW+ + I G+Q + G A +D+GTSL+ P+ + +N IG
Sbjct: 268 WLPVRREAYWEVDFNGISFGSQYAPLTATGAA--IDTGTSLITLPSGLAEILNAQIGARK 325
Query: 315 VVSAECKLVVSQYGDL 330
S + L S+ L
Sbjct: 326 NWSGQYVLDCSRRSTL 341
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 8/103 (7%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N + N G+ ++DC R T+P+++F +G F++ P Y L+ + CIS
Sbjct: 317 LNAQIGARKNWSGQYVLDCSRRSTLPDITFNLGGSNFSIGPYDYTLEA----SGTCISAI 372
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSG----KLRIGFAEA 505
+ D P P GPL I+GD F+ +++V+D G +G AEA
Sbjct: 373 VPMDFPEPVGPLAIIGDAFLRRWYSVYDLGNSTTNSTVGLAEA 415
>gi|494476|pdb|1PSA|A Chain A, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|494478|pdb|1PSA|B Chain B, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|67463919|pdb|1YX9|A Chain A, Effect Of Dimethyl Sulphoxide On The Crystal Structure Of
Porcine Pepsin
Length = 326
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|326933879|ref|XP_003213025.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 390
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 175/304 (57%), Gaps = 8/304 (2%)
Query: 11 CLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR-HRLGD 69
CL + CL L + G+ RI LKK + + A + E Y+ V+ +
Sbjct: 3 CLVLAVLCLQL---TEGMVRIKLKKGKSIREKMREAGVL--EEYLKKIKHDPVKKYNFSK 57
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
++ P+ + +D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C +C H+++K
Sbjct: 58 NNVVYEPMASHLDSSYFGEISIGTPPQNFLVLFDTGSSNLWVPSTLCNMP-ACGNHAKFK 116
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
R S+T+ G+ ++YGSG+++ D + + + V++Q F + E + F A+F
Sbjct: 117 PRASSTFINNGQKVTLSYGSGTLTVVLGYDTLRIQTISVRNQEFGLSRDEPTQPFYYAQF 176
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + +AVG A P+ M++Q + + +FSF+ +R+P GGE+V GGVD + F
Sbjct: 177 DGIMGMAYPALAVGGATPL-QGMLQQNQLKQPIFSFYFSRNPTYNYGGELVLGGVDSRLF 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G + PVT++ YWQ + + IG G C GC AIVD+GT LL P ++ + A
Sbjct: 236 TGDIVWAPVTQELYWQVAIDEFAIGQSVMGWCSQGCQAIVDTGTFLLTVPQQYLSRLLKA 295
Query: 310 IGGE 313
+G +
Sbjct: 296 VGAQ 299
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 48/87 (55%), Gaps = 5/87 (5%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-PL 478
E +DC+ + ++P +SF I L+P Y+LK C G LP G PL
Sbjct: 307 EYAVDCNVVHSLPTISFIINGVQLPLTPSAYVLKNNG----YCTVGIEVTYLPSQNGQPL 362
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
WILGDVF+ Y+++FD RIGFA++
Sbjct: 363 WILGDVFLKEYYSIFDMAYNRIGFAKS 389
>gi|253723333|pdb|4PEP|A Chain A, The Molecular And Crystal Structures Of Monoclinic Porcine
Pepsin Refined At 1.8 Angstroms Resolution
Length = 326
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTIDGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|386371114|gb|AFJ11376.1| pregnancy-associated glycoprotein 1, partial [Bison bison]
Length = 367
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 163/278 (58%), Gaps = 8/278 (2%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPP 95
L L + R T +E+ + + +RL +D I PL+N++D Y G I IG+PP
Sbjct: 16 LPLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITVHPLRNYLDTAYVGNITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
Q F V+FDTGS+NLWVP C S +CY H + + S+++ E+G I YGSG I GF
Sbjct: 76 QEFRVVFDTGSANLWVPCITCT-SPACYTHKTFNPQNSSSFREVGSPITIFYGSGIIQGF 134
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V +G++V +Q F + E L FDGI+GL F + + D +P++DN+
Sbjct: 135 LGSDTVRIGNLVSPEQSFGLSLEEYGFDSL--PFDGILGLAFPAMGIEDTIPIFDNLWSH 192
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G SE VF+F+LN + EG ++FGGVD +++KG+ ++PV++ +WQ + +I + N
Sbjct: 193 GAFSEPVFAFYLNT--NKPEGSVVMFGGVDHRYYKGELNWIPVSQTSHWQISMNNISM-N 249
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ C GC A+VD+GTSL+ GPT +VT I+ +
Sbjct: 250 GTVTACSCGCEALVDTGTSLIYGPTKLVTNIHKLMNAR 287
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 50/100 (50%), Gaps = 8/100 (8%)
Query: 402 KVLSYINELCDS-LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
K+++ I++L ++ L N E ++ CD + T+P V F I + L P+ YI+K
Sbjct: 275 KLVTNIHKLMNARLEN--SEYVVSCDAVKTLPPVIFNINGIDYPLRPQAYIIKIQNNCRS 332
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRI 500
V G L WILGD+F+ Y +VFD RI
Sbjct: 333 VFQGGTENSSL-----NTWILGDIFLRQYFSVFDRKNRRI 367
>gi|195386060|ref|XP_002051722.1| GJ17077 [Drosophila virilis]
gi|194148179|gb|EDW63877.1| GJ17077 [Drosophila virilis]
Length = 404
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/256 (44%), Positives = 159/256 (62%), Gaps = 7/256 (2%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I L N + Y+G IGIG+PPQ F+V+FDTGS+NLWVPS +C + ++C H++Y S
Sbjct: 81 IETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSANLWVPSVQCLPTDVACQNHNQYNSSA 140
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G+S I YG+GS++GF S D V + + + Q F EA + + +F FDGI
Sbjct: 141 SSTYVANGQSFSIQYGTGSLTGFLSTDTVTINGLSIACQTFGEAISQPNGSFTGVPFDGI 200
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+G+ IAV VP + N+ EQGL+ E F F+L R A++GG++V GGVD + F G
Sbjct: 201 LGMGYSTIAVDQVVPPFYNLYEQGLIDEPSFGFYLARTGSAQDGGQLVLGGVDYQLFSGN 260
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TYVPV+++GYWQF + ++ VC C AI D+GTSLLA P T++N IGG
Sbjct: 261 LTYVPVSQEGYWQFVVTSAVM--NGFVVCS-NCQAIADTGTSLLACPGSSYTQLNQLIGG 317
Query: 313 ---EGVVSAECKLVVS 325
+G +C V S
Sbjct: 318 YLMDGDYYVDCSTVDS 333
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 48/88 (54%), Gaps = 5/88 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ +DC + ++P +SF IG IFNL YI E C+S F +
Sbjct: 322 GDYYVDCSTVDSLPVLSFNIGGTIFNLPASAYISSFTENNTTFCMSSFTYINTD-----F 376
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
WILGDVF+G ++T FD G+ R+GFA A
Sbjct: 377 WILGDVFIGQFYTQFDFGENRVGFAPVA 404
>gi|360431|prf||1403354A pepsinogen
Length = 383
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 155/254 (61%), Gaps = 9/254 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F+V+FDTGSSNLWVPS C S +C H + +S+T
Sbjct: 67 PLLNTLDMEYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT-SPACQSHQMFNPSQSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I+YG+G + G D V V ++ +Q+F +T E F+ +FDGI+GL
Sbjct: 126 YKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMDTNQLFGLSTSEPGQFFVYVKFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A PV+DNMV + L+ + +FS +L+R+P G +VFGG+D +F G +
Sbjct: 186 GYPSLAADGITPVFDNMVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
+PV+ +GYWQ + I++ Q C GC AI+D+GTSL+AGP + +I A+G
Sbjct: 243 IPVSYQGYWQISMDSIIVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQN 301
Query: 314 --GVVSAECKLVVS 325
G S C +++
Sbjct: 302 TYGEYSVNCSHILA 315
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++C I MP+V F IG + + Y + G+G C+S F
Sbjct: 301 NTYGEYSVNCSHILAMPDVVFVIGGIQYPVPALAYTQQNGQG---TCMSSFQN-----SS 352
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ VY+++FD R+G A+A
Sbjct: 353 ADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>gi|13096225|pdb|1F34|A Chain A, Crystal Structure Of Ascaris Pepsin Inhibitor-3 Bound To
Porcine Pepsin
Length = 326
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 6/132 (4%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNV 434
G++ CS A+V L T ++ I + N GE +I C I ++P++
Sbjct: 201 GETIACSGGCQAIVDTGTSLLTGPTS--AIANIQSDIGASENSDGEMVISCSSIDSLPDI 258
Query: 435 SFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFD 494
FTI + LSP YIL+ + C SGF D+P G LWILGDVF+ Y+TVFD
Sbjct: 259 VFTINGVQYPLSPSAYILQDDDS----CTSGFEGMDVPTSSGELWILGDVFIRQYYTVFD 314
Query: 495 SGKLRIGFAEAA 506
++G A A
Sbjct: 315 RANNKVGLAPVA 326
>gi|45384244|ref|NP_990385.1| embryonic pepsinogen precursor [Gallus gallus]
gi|129801|sp|P16476.1|PEPE_CHICK RecName: Full=Embryonic pepsinogen; Flags: Precursor
gi|222853|dbj|BAA00153.1| pepsinogen [Gallus gallus]
Length = 383
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 155/254 (61%), Gaps = 9/254 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F+V+FDTGSSNLWVPS C S +C H + +S+T
Sbjct: 67 PLLNTLDMEYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT-SPACQSHQMFNPSQSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I+YG+G + G D V V ++ +Q+F +T E F+ +FDGI+GL
Sbjct: 126 YKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMDTNQLFGLSTSEPGQFFVYVKFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A PV+DNMV + L+ + +FS +L+R+P G +VFGG+D +F G +
Sbjct: 186 GYPSLAADGITPVFDNMVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
+PV+ +GYWQ + I++ Q C GC AI+D+GTSL+AGP + +I A+G
Sbjct: 243 IPVSYQGYWQISMDSIIVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQN 301
Query: 314 --GVVSAECKLVVS 325
G S C +++
Sbjct: 302 TYGEYSVNCSHILA 315
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 47/90 (52%), Gaps = 8/90 (8%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N GE ++C I MP+V F IG + + Y + G+G C+S F
Sbjct: 301 NTYGEYSVNCSHILAMPDVVFVIGGIQYPVPALAYTEQNGQG---TCMSSFQN-----SS 352
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
LWILGDVF+ VY+++FD R+G A+A
Sbjct: 353 ADLWILGDVFIRVYYSIFDRANNRVGLAKA 382
>gi|395537495|ref|XP_003770734.1| PREDICTED: renin [Sarcophilus harrisii]
Length = 413
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 174/311 (55%), Gaps = 26/311 (8%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERYMGGAGVS 61
L V+ S S+ L+RI LKK + DL N ++ ++ +S
Sbjct: 7 LLVVWSTCFFSLPSDALQRIVLKKMPSIQENMKLKGKDLGKFNMEWLSYTKQLTLFNVMS 66
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
VR L NF D QY+GEI IG+P Q F V+FDTGS++ WVPSSKC
Sbjct: 67 PVR------------LTNFEDTQYYGEISIGNPSQTFQVVFDTGSADFWVPSSKCSPLYT 114
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FH +Y S KS+TY E G +I Y SG + GF S+D V VG + + Q F E T
Sbjct: 115 ACVFHHQYDSTKSSTYKENGTEFKIQYASGQVMGFLSEDTVTVGGIKMT-QSFGEVTVLP 173
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F LA+FDG++GLGF +++ VP +DN++ QG++ +EVFS + +R+ GGEI+
Sbjct: 174 LLPFGLAKFDGVLGLGFPALSMSKIVPFFDNIISQGMLKKEVFSVYYSRNSHV-PGGEII 232
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK+++G Y+ ++ G+WQ ++ + + + C+ GC A VD+G S + GPT
Sbjct: 233 LGGSDPKYYRGTFHYINISHPGFWQIQMNGVSV-ESNVLACQDGCIASVDTGASFITGPT 291
Query: 301 PVVTEINHAIG 311
+ ++ +G
Sbjct: 292 SSMRKVMKMLG 302
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 49/86 (56%)
Query: 420 ESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLW 479
+ ++ CD +P++SF++ K F L Y+L+ + C+ F D+PPP GPLW
Sbjct: 309 QYLVQCDLASMLPDISFSLDGKPFTLHSSDYVLEDLKSDDNFCLLAFRGLDIPPPTGPLW 368
Query: 480 ILGDVFMGVYHTVFDSGKLRIGFAEA 505
ILG F+ ++T FD RIGFA A
Sbjct: 369 ILGATFIRKFYTEFDRHNNRIGFAVA 394
>gi|367000932|ref|XP_003685201.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
gi|357523499|emb|CCE62767.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
Length = 419
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 104/241 (43%), Positives = 153/241 (63%), Gaps = 5/241 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+P QNF VI DTGSSNLWVPS C S++CY HS+Y +S
Sbjct: 94 VPLSNYLNAQYYTDISLGTPKQNFKVILDTGSSNLWVPSKDCT-SLACYLHSKYDHDEST 152
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVG-DVVVKDQVFIEATREGSLTFLLARFDGII 193
TY + G I YGSGS+ G+ S+D + +G D+V+ +Q F EAT E L F +FDGI+
Sbjct: 153 TYEKNGTKFTIQYGSGSMDGYISRDTLIIGDDLVIPEQDFAEATSEPGLAFAFGKFDGIL 212
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGK 252
GL + IAV VP + N ++QG++ E F+F+L + + D + GGE FGG D F G
Sbjct: 213 GLAYDTIAVNKVVPPFYNAIKQGILDENKFAFYLGDTNKDNKSGGEATFGGYDKSKFTGD 272
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG
Sbjct: 273 ITWLPVRRKAYWEVKFDSIALGDEVASL--DGYGAAIDTGTSLITLPSGLAEVINTQIGA 330
Query: 313 E 313
+
Sbjct: 331 K 331
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDCD +P+++F F +SP Y L+ ++ CIS D P P GPL
Sbjct: 336 GQYTIDCDTRDALPDMTFNFNGYNFTVSPYDYTLE----MSGSCISAITPMDFPEPVGPL 391
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++++D +G A++
Sbjct: 392 AIIGDAFLRKYYSIYDLDNNAVGLAKS 418
>gi|195570151|ref|XP_002103072.1| GD19155 [Drosophila simulans]
gi|194198999|gb|EDX12575.1| GD19155 [Drosophila simulans]
Length = 395
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 178/323 (55%), Gaps = 13/323 (4%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE---RYMGGAGVSGVRHRLG 68
LW+L CL L RI ++ + + S R R +Y G V R G
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKY-NVVGEQEVTSRNG 67
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSR 127
+ E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H R
Sbjct: 68 GATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHHHR 124
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 125 YNSSASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVRNQTFGMATHEPGPTFVDT 184
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
F GI+GLGFR IA P++++M +Q LV + VFSF+L R+ +GGE++FGGVD
Sbjct: 185 NFAGIVGLGFRPIAELGIKPLFESMCDQQLVDDCVFSFYLKRNGSERKGGELLFGGVDKT 244
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
F G TYVP+T GYWQF L I + AI D+GTSLLA P IN
Sbjct: 245 KFSGSLTYVPLTHAGYWQFPLDAIEVAGTRITQHR---QAIADTGTSLLAAPPREYLIIN 301
Query: 308 HAIGGEGVVSAECKLVVSQYGDL 330
+GG + E L S+ L
Sbjct: 302 SLLGGLPTSNNEYLLNCSEIDSL 324
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 465
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSSICLSA 359
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD+G RIGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGHRRIGFAPAA 395
>gi|391867010|gb|EIT76268.1| aspartyl protease [Aspergillus oryzae 3.042]
Length = 390
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 178/303 (58%), Gaps = 13/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
+LA LLL ++ + R+ L+K L S + T +RY+G + + L D D
Sbjct: 5 LLAVPLLLSYTAAEIHRVPLEKELLVFGSDDDDTRTSSQRYIGS---NTHQKALQDHGPD 61
Query: 74 IL----PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
IL P+KN + QYF I IG+PPQ F V+ DTGS+NLWVPSSKC +ISC H +YK
Sbjct: 62 ILGHDIPVKNHRNTQYFSTIRIGTPPQKFKVVLDTGSANLWVPSSKCK-TISCKKHKKYK 120
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S S+TY G EI YGSG ++G S+D +GD+ V++Q+F EAT+ + + A
Sbjct: 121 SALSDTYHNNGSEFEIYYGSGGMTGHVSEDIFTIGDLKVQEQLFGEATKVSGFSNVKA-- 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF I+V P + NM++Q L+ E VF+F+L+ D EI FGGVD +H+
Sbjct: 179 DGILGLGFASISVNSIPPPFYNMLDQNLLDEPVFAFYLS-DTYKGRTSEITFGGVDEQHY 237
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ +P+ +K YW+ E + G+ V + G AI+D+G+SL+ P+ + +N
Sbjct: 238 SGEIVKIPLRRKAYWEVEFSGLFFGDHFADVEDTG--AILDTGSSLIGLPSGLFETVNKE 295
Query: 310 IGG 312
IG
Sbjct: 296 IGA 298
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 56/99 (56%), Gaps = 4/99 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
+N+ + + G I+DCD+ MP+++F +G+ F + P+ Y L+ C+S
Sbjct: 292 VNKEIGATRDYQGRYILDCDKRSFMPSLTFVLGEYNFTIDPKDYSLQE----QNFCMSAL 347
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
+ D P P GPL +LGD F+ +++V+D G IG A+A
Sbjct: 348 VPMDFPGPTGPLVVLGDAFLRRWYSVYDFGNGAIGLAQA 386
>gi|403261257|ref|XP_003923041.1| PREDICTED: gastricsin [Saimiri boliviensis boliviensis]
Length = 388
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 112/301 (37%), Positives = 173/301 (57%), Gaps = 4/301 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
W++ + + L + ++ LKK + ++ + R+ +G ++ D
Sbjct: 3 WMVVAFVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLREFLKTHKRDPAG-KYHFSDLSV 61
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+
Sbjct: 62 SYEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSA 119
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY+ G++ + YGSGS++G F D + V + V +Q F + E F+ A+FDGI
Sbjct: 120 SSTYSSNGQTFSLQYGSGSLTGLFGYDTLTVQSIQVPNQEFGLSENEPGTNFIYAQFDGI 179
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + ++VG A M+++ +++ VFSF+L+ GG +VFGGVD + G+
Sbjct: 180 MGLAYPALSVGGATTAMQGMLQEDVLTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQ 238
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ PVT++ YWQ + + LIG Q++G C GC AIVD+GTSLL P ++ A G
Sbjct: 239 IYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGA 298
Query: 313 E 313
+
Sbjct: 299 Q 299
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 56/107 (52%), Gaps = 5/107 (4%)
Query: 401 EKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
++ +S E + + G+ +++C+ I +P ++F I F L P YIL
Sbjct: 286 QQYMSAFLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPLPPSSYILSNNG---- 341
Query: 461 VCISGFMAFDLPPPRG-PLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
C G LP G PLWILGDVF+ Y++V+D G R+GFA AA
Sbjct: 342 YCTVGVEPTYLPSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA 388
>gi|18152941|gb|AAB68519.2| proteinase A [Ogataea angusta]
gi|320580237|gb|EFW94460.1| proteinase A [Ogataea parapolymorpha DL-1]
Length = 413
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/248 (42%), Positives = 153/248 (61%), Gaps = 4/248 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+++AQYF EI +G+P Q+F VI DTGSSNLWVPSS C S++CY H++Y +S+T
Sbjct: 90 PLTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNLWVPSSDC-TSLACYLHTKYDHDESST 148
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G S I YGSGS+ G+ SQD + +GD+V+ Q F EAT E L F +FDGI+GL
Sbjct: 149 YQKNGSSFAIQYGSGSLEGYVSQDTLTIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGL 208
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHT 254
+ I+V VP N + GL+ F F+L +E+ GGE FGG D + G T
Sbjct: 209 AYDTISVNRIVPPIYNAINLGLLDTPQFGFYLGDTSKSEQDGGEATFGGYDVSKYTGDIT 268
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
++PV +K YW+ + I +G++ + G A +D+GTSL+A P+ + +N IG E
Sbjct: 269 WLPVRRKAYWEVKFSGIALGDEYAPLENTGAA--IDTGTSLIALPSQLAEILNSQIGAEK 326
Query: 315 VVSAECKL 322
S + ++
Sbjct: 327 SWSGQYQI 334
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 4/87 (4%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
G+ IDCD+ ++P+++F F +SP Y L+ ++ CIS F DLP P GP+
Sbjct: 330 GQYQIDCDKRDSLPDLTFNFDGYNFTISPYDYTLE----VSGSCISAFTPMDLPAPIGPM 385
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEA 505
I+GD F+ Y++V+D G+ +G A+A
Sbjct: 386 AIIGDAFLRRYYSVYDLGRDAVGLAKA 412
>gi|9910338|ref|NP_064476.1| embryonic pepsinogen precursor [Rattus norvegicus]
gi|7106000|emb|CAB75983.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 166/281 (59%), Gaps = 10/281 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE+ + + ++ + + +I PL N++D++YFG I +
Sbjct: 21 RIPLHKGKSLRNTLKEQGLLEDFLRRHQYEFSEKNSNIGMVASEPLTNYLDSEYFGLIYV 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F V+FDTGSS LWVPS C S C H+R+ KS T+ + K + YG+GS
Sbjct: 81 GTPPQEFKVVFDTGSSELWVPSVYCS-SKVCRNHNRFDPSKSFTFQNLSKPLFVQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF + D V V D+VV Q +T E F + FDGI+GL + A +VP++DN
Sbjct: 140 VEGFLAYDTVTVSDIVVPHQTVGLSTEEPGDIFTYSPFDGILGLAYPTFASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 10/83 (12%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
IDC R+ MP V F I + F L P Y C SGF +WILG
Sbjct: 306 IDCWRLNFMPTVVFEINGREFPLPPSAYT----NQFQGSCSSGFRH------GSQMWILG 355
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
DVF+ +++VFD R+G A+A
Sbjct: 356 DVFIREFYSVFDRANNRVGLAKA 378
>gi|395852554|ref|XP_003798803.1| PREDICTED: pepsin A-like [Otolemur garnettii]
Length = 387
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 153/239 (64%), Gaps = 6/239 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG IGIG+P Q F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 66 PLENYMDTEYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQSSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 125 YQATSQTVSIAYGTGSMTGILGYDTVQVGGITDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DNM QGLVS+++FS +L+ + + G ++FGG+D ++ G+
Sbjct: 184 LAYPSISSSGATPVFDNMWNQGLVSQDLFSVFLSS--NDQSGSVVMFGGIDSSYYTGELN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++P++ +GYWQ + I + + C GC AIVD+GTSLL+GPT + I IG
Sbjct: 242 WIPLSSEGYWQITVDSITMNGEPIA-CSQGCQAIVDTGTSLLSGPTSPIANIQSYIGAS 299
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 63/134 (47%), Gaps = 10/134 (7%)
Query: 375 GDSAVCSACEMAVVWVQNQLKQKQTK--EKVLSYINELCDSLPNPMGESIIDCDRIPTMP 432
G+ CS A+V L T + SYI DS G+ +I C I ++P
Sbjct: 262 GEPIACSQGCQAIVDTGTSLLSGPTSPIANIQSYIGASEDSY----GQMVISCSAINSLP 317
Query: 433 NVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTV 492
N+ FTI + + P YIL+ G C SGF +LP G LWILGDVF+ Y V
Sbjct: 318 NIVFTINGVQYPVPPSAYILQQNGG----CTSGFQGMNLPTASGELWILGDVFIRQYFAV 373
Query: 493 FDSGKLRIGFAEAA 506
FD ++G A A
Sbjct: 374 FDRANNQVGLAPVA 387
>gi|149025623|gb|EDL81866.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/281 (38%), Positives = 166/281 (59%), Gaps = 10/281 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE+ + + ++ + + +I PL N++D++YFG I +
Sbjct: 21 RIPLHKGKSLRNTLKEQGLLEDFLRRHQYEFSEKNSNIGVVASEPLTNYLDSEYFGLIYV 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F V+FDTGSS LWVPS C S C H+R+ KS T+ + K + YG+GS
Sbjct: 81 GTPPQEFKVVFDTGSSELWVPSVYCS-SKVCRNHNRFDPSKSFTFQNLSKPLFVQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF + D V V D+VV Q +T E F + FDGI+GL + A +VP++DN
Sbjct: 140 VEGFLAYDTVTVSDIVVPHQTVGLSTEEPGDIFTYSPFDGILGLAYPTFASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 40/83 (48%), Gaps = 10/83 (12%)
Query: 423 IDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILG 482
IDC R+ MP V F I + F L P Y C SGF +WILG
Sbjct: 306 IDCWRLNFMPTVVFEINGREFPLPPSAYT----NQFQGSCSSGFRH------GSQMWILG 355
Query: 483 DVFMGVYHTVFDSGKLRIGFAEA 505
DVF+ +++VFD R+G A+A
Sbjct: 356 DVFIREFYSVFDRANNRVGLAKA 378
>gi|118344566|ref|NP_001072055.1| nothepsin precursor [Takifugu rubripes]
gi|55771088|dbj|BAD69804.1| nothepsin [Takifugu rubripes]
Length = 414
Score = 208 bits (530), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 117/300 (39%), Positives = 175/300 (58%), Gaps = 21/300 (7%)
Query: 21 LPASSNGLRRIG-----LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL 75
+P+ + LR G L++RR DL + RY +G R+ E
Sbjct: 30 MPSMRSQLRADGQLSAFLQERRPDLF---------QRRYFQCFPATGPSLRVERFSET-- 78
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
L N+MD Q++GEI +G+P QNFSV+FDTGSS+LWVPS C H R+K+ +S +
Sbjct: 79 -LYNYMDVQFYGEIELGTPGQNFSVVFDTGSSDLWVPSVYCVSQTCGTVHRRFKAFESTS 137
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ EI+YGSG + G ++D ++V +V V++Q F E+ E + F++A FDGI+G+
Sbjct: 138 YRHDGRVFEIHYGSGHMLGIMARDTLKVNNVTVQNQEFGESVYEPGVAFVMAHFDGILGM 197
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN---RDPDAEEGGEIVFGGVDPKHFKGK 252
G+ +A PV+DNM+ Q +V E +FSF+L+ R ++ GE++ GG+D F G
Sbjct: 198 GYPSLAQILGNPVFDNMLAQQMVEEPIFSFYLSKYERFSGSKLQGELLLGGMDQDLFTGP 257
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
++PVT KGYWQ ++ + + T C GC AIVD+GTSL+AGPT + + IG
Sbjct: 258 INWLPVTTKGYWQIKVDSVAVQGVDT-FCPEGCQAIVDTGTSLIAGPTRDILRLQQLIGA 316
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 114/249 (45%), Gaps = 12/249 (4%)
Query: 266 FELGDILIGNQSTG--VCEGGCAAIVDSGTSLLAGPTPVVTEI------NHAIGGEGVVS 317
++ ++ + NQ G V E G A ++ +L P + +I ++ + + V
Sbjct: 163 LKVNNVTVQNQEFGESVYEPGVAFVMAHFDGILGMGYPSLAQILGNPVFDNMLAQQMVEE 222
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVSAGDS 377
+S+Y L LL + Q + N + G + G
Sbjct: 223 PIFSFYLSKYERFSGSKLQGELLLGGMDQDLFTGPINWLPVTTKGYWQIKVDSVAVQGVD 282
Query: 378 AVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNPMGESIIDCDRIPTMPNVSFT 437
C A+V L T++ + + +L + P +G + DC R+ ++P V+F
Sbjct: 283 TFCPEGCQAIVDTGTSLIAGPTRD--ILRLQQLIGATPTNIG-VVTDCVRLSSLPRVTFV 339
Query: 438 IGDKIFNLSPEQYILKTGE-GIAEVCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSG 496
+G + + L+PE+YI + G E C SGF A D+ P+GPLWILGDVF+ Y++VFD G
Sbjct: 340 LGGEEYTLTPERYIRRVEMLGDKEFCFSGFQAADILSPKGPLWILGDVFLTQYYSVFDRG 399
Query: 497 KLRIGFAEA 505
RIGFA A
Sbjct: 400 HDRIGFALA 408
>gi|344234771|gb|EGV66639.1| Asp-domain-containing protein [Candida tenuis ATCC 10573]
Length = 425
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 110/264 (41%), Positives = 158/264 (59%), Gaps = 9/264 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+ +AQYF EI +G+P Q F VI DTGSSNLW+PS C S++CY HS+Y S+T
Sbjct: 102 PLSNYANAQYFTEIEVGTPGQPFKVILDTGSSNLWIPSQDCS-SLACYLHSKYDHDASST 160
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSG++ G+ S D + +GD+++K+Q F EAT E L F +FDGI+GL
Sbjct: 161 YKANGSEFAIQYGSGAMEGYVSTDALRIGDLLIKNQDFAEATSEPGLAFAFGKFDGILGL 220
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V VP N + QGL+ E+ F+F+L + + D E+GG FGG D F GK T
Sbjct: 221 AYDTISVNKIVPPVYNAINQGLLDEKSFAFYLGDTNKDEEDGGVATFGGYDESKFTGKIT 280
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
++PV +K YW+ L + +G++ + G A +D+GTSL+ P+ + IN IG
Sbjct: 281 WLPVRRKAYWEVSLEGLGLGDEFAELKSTGAA--IDTGTSLITLPSSLAEIINAKIGAVK 338
Query: 313 --EGVVSAECKLVVSQYGDLIWDL 334
G + EC + DL ++L
Sbjct: 339 SWSGQYTVECD-ARANLPDLTFNL 361
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 4/99 (4%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGF 466
IN ++ + G+ ++CD +P+++F + F LS +Y L+ I+ CIS
Sbjct: 330 INAKIGAVKSWSGQYTVECDARANLPDLTFNLNGYNFTLSAYEYTLE----ISGSCISAI 385
Query: 467 MAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
D P P G + I+GD F+ Y++++D K +G A A
Sbjct: 386 TPMDFPKPIGDMAIIGDAFLRKYYSIYDLKKDAVGLATA 424
>gi|440894789|gb|ELR47149.1| Pregnancy-associated glycoprotein 2, partial [Bos grunniens mutus]
Length = 397
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/278 (38%), Positives = 163/278 (58%), Gaps = 8/278 (2%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPP 95
L L + R T +E+ + + +RL +D I PL+N++D Y G I IG+PP
Sbjct: 40 LPLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITIHPLRNYLDTAYVGNITIGTPP 99
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
Q F V+FDTGS+NLWVP C S +CY H + + S+++ E+G I YGSG I GF
Sbjct: 100 QEFRVVFDTGSANLWVPCITCT-SPACYTHKTFNPQNSSSFREVGSPITIFYGSGIIQGF 158
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V +G++V +Q F + E L FDGI+GL F + + D +P++DN+
Sbjct: 159 LGSDTVRIGNLVSPEQSFGLSLEEYGFDSL--PFDGILGLAFPPMGIEDTIPIFDNLWSH 216
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G SE VF+F+LN + EG ++FGGVD +++KG+ ++PV++ +WQ + +I + N
Sbjct: 217 GAFSEPVFAFYLNT--NKPEGSVVMFGGVDHRYYKGELNWIPVSQTSHWQISMNNISM-N 273
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ C GC A++D+GTSL+ GPT +VT I+ +
Sbjct: 274 GTVTACSCGCEALLDTGTSLIYGPTKLVTNIHKLMNAR 311
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 53/105 (50%), Gaps = 8/105 (7%)
Query: 402 KVLSYINELCDS-LPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAE 460
K+++ I++L ++ L N E ++ CD + T+P V F I + L P+ YI+K
Sbjct: 299 KLVTNIHKLMNARLEN--SEYVVSCDAVKTLPPVIFNINGIDYPLRPQAYIIKIQNNCRS 356
Query: 461 VCISGFMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEA 505
V G L WILGD+F+ Y +VFD RIG A A
Sbjct: 357 VFQGGTENSSLN-----TWILGDIFLRQYFSVFDRKNRRIGLAPA 396
>gi|426333518|ref|XP_004028323.1| PREDICTED: cathepsin E isoform 2 [Gorilla gorilla gorilla]
Length = 363
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 142/229 (62%), Gaps = 1/229 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
VPVTK+ YWQ L ++L + C + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPSTESPIPSAQLPTPYWT 296
>gi|1585064|prf||2124254A pepsin:ISOTYPE=3a
gi|1585065|prf||2124254B pepsin:ISOTYPE=3b
Length = 326
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 159/258 (61%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLATPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
>gi|402855684|ref|XP_003892446.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Papio anubis]
Length = 377
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 101/250 (40%), Positives = 154/250 (61%), Gaps = 1/250 (0%)
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
++R + P+ N+M + YFGEI IG+PPQNF ++FDTGSSNLWVPS C S +C
Sbjct: 39 AKYRFNNDAVAYEPITNYMXSFYFGEISIGTPPQNFLLLFDTGSSNLWVPSIYCQ-SQAC 97
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+R+ S+T+ G++ ++YGSG++S F D V V +++V +Q F + E S
Sbjct: 98 SNHNRFNPSLSSTFRNNGQTYTLSYGSGNLSVFLGYDTVTVQNIIVNNQEFGLSENELSD 157
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F + FDGI+G+ + +AVG++ V M++QG +++ FSF+ P + GGE++ G
Sbjct: 158 PFYYSDFDGILGMAYPSMAVGNSPTVMQGMLQQGQITQPDFSFYFTHQPTRQYGGELILG 217
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP+ + G+ PVT++ YWQ + + +GNQ+TG+C GC AIV +GT LLA P
Sbjct: 218 GVDPQLYSGQIIXTPVTRELYWQIPIEEFAVGNQATGLCSEGCQAIVVTGTFLLAVPQQY 277
Query: 303 VTEINHAIGG 312
+ A G
Sbjct: 278 MGSFLQATGA 287
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 50/89 (56%), Gaps = 5/89 (5%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRG-P 477
G+ ++ C I +MP ++F IG F L P Y+ G + I A LP P G P
Sbjct: 293 GDFVVHCSYIQSMPTITFIIGGAQFPLPPSAYVFNN-NGYCRLRIE---ATXLPLPSGQP 348
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVF+ Y++V+D R+GFA +A
Sbjct: 349 LWILGDVFLKEYYSVYDMANNRLGFAFSA 377
>gi|126306831|ref|XP_001370729.1| PREDICTED: renin-like [Monodelphis domestica]
Length = 389
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 127/353 (35%), Positives = 190/353 (53%), Gaps = 35/353 (9%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S+GL+RI LKK S+ + M GV + L N+ D Q
Sbjct: 20 SDGLQRIALKKMISVKESMKMRGKHLENLNMAENSWHGVVSPI--------ILTNYEDTQ 71
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEI IGSPPQ F V+FDTGSS+ WVPSS+C +C FH+RY + KS+TY G +
Sbjct: 72 YYGEINIGSPPQTFKVVFDTGSSDFWVPSSQCDPLYTACEFHNRYDASKSSTYKMNGSNF 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+Y SG + GF SQD + +G++ V QVF E T + F LA FDGI+GLG+ + ++
Sbjct: 132 IIHYASGRVKGFLSQDILTIGEIKVT-QVFGEVTALPLIPFGLAWFDGILGLGYPKRSMS 190
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
PV+DN++ +G++ E+VFS + +R + GGE++ GG DP +++G Y+ ++ +
Sbjct: 191 GITPVFDNIMAEGVLKEDVFSIYYSRS-SGKNGGELILGGSDPNYYQGTFHYINTSRPHF 249
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 320
WQ ++ + + + CE GC A+VD+GTS + GPT + + AIG E G +C
Sbjct: 250 WQIQMQGVAVKSYVLS-CEDGCPAVVDTGTSFITGPTDSIRGLMTAIGAEEDGGEYLVKC 308
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVSTGIKTVVEKENVS 373
L + LP+ F+G ++ G V+E EN S
Sbjct: 309 DLAST--------------LPDISFN------FDGKDFTLQGSDYVLEDENQS 341
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 48/88 (54%)
Query: 419 GESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPL 478
GE ++ CD T+P++SF K F L Y+L+ ++C+ D+ PP GPL
Sbjct: 302 GEYLVKCDLASTLPDISFNFDGKDFTLQGSDYVLEDENQSDQMCLVAINGLDVSPPTGPL 361
Query: 479 WILGDVFMGVYHTVFDSGKLRIGFAEAA 506
W+LG F+ ++ FD RIGFA AA
Sbjct: 362 WVLGATFIRKFYVEFDRHNNRIGFALAA 389
>gi|195349117|ref|XP_002041093.1| GM15229 [Drosophila sechellia]
gi|194122698|gb|EDW44741.1| GM15229 [Drosophila sechellia]
Length = 395
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 175/320 (54%), Gaps = 7/320 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
LW+L CL L RI ++ + + S R R + V G + +
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSS-LLAKYNVVGEQELTSRNG 67
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKS 130
L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H RY +
Sbjct: 68 GATETLDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHHHRYNA 127
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+ F
Sbjct: 128 SASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVRNQTFGMATHEPGPTFVDTNFA 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLGFR IA P++++M +Q LV + VFSF+L R+ +GGE++FGGVD F
Sbjct: 188 GIVGLGFRPIAEQGIKPLFESMCDQKLVDDCVFSFYLKRNGSDRKGGELLFGGVDKTKFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G TYVP+T GYWQF L I + AI D+GTSLLA P IN +
Sbjct: 248 GSLTYVPLTHAGYWQFPLDAIEVAGTRISQHR---QAIADTGTSLLAAPPREYLIINSLL 304
Query: 311 GGEGVVSAECKLVVSQYGDL 330
GG + E L S+ L
Sbjct: 305 GGLPTSNNEYLLNCSEIDSL 324
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 57/101 (56%), Gaps = 6/101 (5%)
Query: 407 INELCDSLPNPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILK-TGEGIAEVCISG 465
IN L LP E +++C I ++P + F IG + F L P Y++ T + + +C+S
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGLQPRDYVMSATNDDGSSICLSA 359
Query: 466 FMAFDLPPPRGPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
F D WILGDVF+G Y+T FD+G RIGFA AA
Sbjct: 360 FTLMD-----AEFWILGDVFIGRYYTAFDAGHRRIGFAPAA 395
>gi|123431419|ref|XP_001308165.1| Clan AA, family A1, cathepsin D-like aspartic peptidase
[Trichomonas vaginalis G3]
gi|121889831|gb|EAX95235.1| Clan AA, family A1, cathepsin D-like aspartic peptidase
[Trichomonas vaginalis G3]
Length = 370
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 122/307 (39%), Positives = 169/307 (55%), Gaps = 27/307 (8%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHRLGDSDEDILP 76
L ++S+ + LKK + + R + R GG+ V P
Sbjct: 4 FFLSSASSKAITMPLKKHDVSFEQVRRTIDRYRKLNRVDGGSSV---------------P 48
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L +F DAQY+ EI IG+P Q F V DTGSSNLWVPS KC SI+C+ H+RY S KS+TY
Sbjct: 49 LHDFSDAQYYTEITIGTPAQKFKVCPDTGSSNLWVPSKKCN-SIACWLHTRYDSSKSSTY 107
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ +I YGSGS GF SQD V++ + K F E EGS++F+ A+FDGI+GL
Sbjct: 108 TADGREVDIQYGSGSCKGFASQDEVQIAGITDK-MTFAEMKEEGSISFIAAKFDGILGLA 166
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ I+V P + E G + + +F L R + E GE+ GG +P F G+ T+
Sbjct: 167 FQNISVQGIPPPLQILYEHGEIEDYTVAFKLGR--TSGEDGEMTIGGYNPDAFSGEITWF 224
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGG--CAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
V K+ +W FE D+L+ + S GVC G CAAI+D+GTS+L GP + I I
Sbjct: 225 NVAKELWWYFEFDDVLVNDVSAGVCPAGGKCAAILDTGTSMLIGPVSAMDVIMKNID--- 281
Query: 315 VVSAECK 321
+ A C+
Sbjct: 282 -IDARCQ 287
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 10/87 (11%)
Query: 425 CDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGPLWILGDV 484
C + P V+F I F L+PE Y+++ G + C+ G M DL P +ILGD
Sbjct: 286 CQNLDQNPTVTFVINGVKFPLTPEDYVMRVNAGSYDQCLPGMMGADLV----PFFILGDT 341
Query: 485 FMGVYHTVFDSGKL------RIGFAEA 505
F+ Y++++D + R+G A A
Sbjct: 342 FLRKYYSIYDMNYVNGVANPRLGLALA 368
>gi|62319547|dbj|BAD94980.1| putative aspartic proteinase [Arabidopsis thaliana]
Length = 149
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 95/149 (63%), Positives = 121/149 (81%), Gaps = 5/149 (3%)
Query: 362 GIKTVVEKENVS----AGDSAVCSACEMAVVWVQNQLKQKQTKEKVLSYINELCDSLPNP 417
GI++VV+KEN GD+A CSACEMAVVW+Q+QL+Q T+E++L+Y+NELC+ LP+P
Sbjct: 2 GIESVVDKENAKLSNGVGDAA-CSACEMAVVWIQSQLRQNMTQERILNYVNELCERLPSP 60
Query: 418 MGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPRGP 477
MGES +DC ++ TMP VS TIG K+F+L+PE+Y+LK GEG CISGF+A D+ PPRGP
Sbjct: 61 MGESAVDCAQLSTMPTVSLTIGGKVFDLAPEEYVLKVGEGPVAQCISGFIALDVAPPRGP 120
Query: 478 LWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
LWILGDVFMG YHTVFD G ++GFAEAA
Sbjct: 121 LWILGDVFMGKYHTVFDFGNEQVGFAEAA 149
>gi|1585066|prf||2124254C pepsin:ISOTYPE=3c
Length = 326
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 154/240 (64%), Gaps = 6/240 (2%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLATPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + +I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIAKIQSDIG 236
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/91 (41%), Positives = 53/91 (58%), Gaps = 4/91 (4%)
Query: 416 NPMGESIIDCDRIPTMPNVSFTIGDKIFNLSPEQYILKTGEGIAEVCISGFMAFDLPPPR 475
N G+ ++ C I ++P++ FTI + + P YIL++ EG CISGF +LP
Sbjct: 240 NSDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQS-EG---SCISGFQGMNLPTES 295
Query: 476 GPLWILGDVFMGVYHTVFDSGKLRIGFAEAA 506
G LWILGDVF+ Y TVFD ++G A A
Sbjct: 296 GELWILGDVFIRQYFTVFDRANNQVGLAPVA 326
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.139 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,716,107,659
Number of Sequences: 23463169
Number of extensions: 397517336
Number of successful extensions: 800492
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3247
Number of HSP's successfully gapped in prelim test: 2340
Number of HSP's that attempted gapping in prelim test: 780664
Number of HSP's gapped (non-prelim): 10896
length of query: 506
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 359
effective length of database: 8,910,109,524
effective search space: 3198729319116
effective search space used: 3198729319116
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)