BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016295
(392 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224124910|ref|XP_002319454.1| predicted protein [Populus trichocarpa]
gi|222857830|gb|EEE95377.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 262/363 (72%), Positives = 310/363 (85%), Gaps = 2/363 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K+L FCLW L +C LLPASSNGL RIGLKKR LDL ++ A I R+E G G
Sbjct: 1 MGNKILLKAFCLWAL-TCFLLPASSNGLVRIGLKKRHLDLQTIKDAIIARQEG-KAGVGA 58
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
S H LG SD DI+PLKN++DAQY GEIGIGSPPQNF+V+FDTGSSNLWVPSSKCYFSI
Sbjct: 59 SSRVHDLGSSDGDIIPLKNYLDAQYLGEIGIGSPPQNFTVVFDTGSSNLWVPSSKCYFSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS +S+TYT+ G CEI+YGSGS+SGFFSQDNV+VGD+VVKDQVF+EAT+EG
Sbjct: 119 ACYFHSKYKSSRSSTYTKNGNFCEIHYGSGSVSGFFSQDNVQVGDLVVKDQVFVEATKEG 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL+F+L +FDGI+GLGF+EI+VG+ VP+W NM++Q LV +EVFSFWLNR+P+A+EGGE+V
Sbjct: 179 SLSFILGKFDGILGLGFQEISVGNVVPLWYNMIQQDLVDDEVFSFWLNRNPEAKEGGELV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDPKHFKGKHTYVPVT+KGYWQ +GD LIG STG+CEGGCAAIVDSGTSLLAGPT
Sbjct: 239 FGGVDPKHFKGKHTYVPVTQKGYWQINMGDFLIGKHSTGLCEGGCAAIVDSGTSLLAGPT 298
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
P++TEINHAIG EG+VSAECK VVS YGDLIW+L++SG+ P KVC Q+GLC FN A+ R
Sbjct: 299 PIITEINHAIGAEGLVSAECKEVVSHYGDLIWELIISGVQPSKVCTQLGLCIFNEAKSAR 358
Query: 361 LGI 363
GI
Sbjct: 359 TGI 361
>gi|294440430|gb|ADE74632.1| aspartic protease 1 [Nicotiana tabacum]
Length = 506
Score = 532 bits (1371), Expect = e-149, Method: Compositional matrix adjust.
Identities = 250/364 (68%), Positives = 306/364 (84%), Gaps = 4/364 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
ME+K L + LW + +LP SS+ L R+GLKK+ LD++S+NAAR+ R ++RY G
Sbjct: 1 MERKHLCAALLLWAIVY-FVLPVSSDNLLRVGLKKQSLDVNSINAARVARLQDRY--GKN 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ +LGDSD DI+ LKN++DAQY+GEIG+GSPPQ F VIFDTGSSNLWVPSS+CYFS
Sbjct: 58 VNGIEKKLGDSDLDIVSLKNYLDAQYYGEIGVGSPPQKFKVIFDTGSSNLWVPSSRCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+FHS+YK+ KS TYT G+SC I YG+GSISG FSQDNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWFHSKYKASKSTTYTRNGESCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+A PVW NMV QGLV E+VFSFW+NRD A+EGGE+
Sbjct: 178 PSITFIIAKFDGILGLGFQEISVGNATPVWYNMVGQGLVKEQVFSFWINRDATAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVD HFKG HTYVP+T+KGYWQF +GD LIGN STGVC GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDSNHFKGNHTYVPLTQKGYWQFNMGDFLIGNASTGVCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVT+INHAIG EG+VS ECK +VSQYG++IW+LLVSG+ P++VC Q GLC FNGA++V
Sbjct: 298 TTVVTQINHAIGAEGIVSMECKTIVSQYGEMIWNLLVSGVKPDQVCSQAGLCYFNGAQHV 357
Query: 360 RLGI 363
I
Sbjct: 358 SSNI 361
>gi|82623417|gb|ABB87123.1| aspartic protease precursor-like [Solanum tuberosum]
Length = 506
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 249/364 (68%), Positives = 302/364 (82%), Gaps = 4/364 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
ME+K L + LW + +C LPASS L RIGLKK RLD++S+ AAR+ + ++RY G
Sbjct: 1 MEKKHLCAALLLWAI-TCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD DI+PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ HS+YK+ KS+TYT G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHSKYKASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 178 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +VT+INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++V
Sbjct: 298 TTIVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHV 357
Query: 360 RLGI 363
I
Sbjct: 358 SSNI 361
>gi|359487701|ref|XP_002276363.2| PREDICTED: aspartic proteinase oryzasin-1-like [Vitis vinifera]
gi|296089851|emb|CBI39670.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 245/351 (69%), Positives = 297/351 (84%), Gaps = 2/351 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q ++ + FCLW L C LLP S+G RIGLKKR LD +++ ARI + + +GG GV
Sbjct: 1 MRQGVVWAAFCLWALI-CPLLPVYSHGSVRIGLKKRPLDFNNMRTARIAQMQGKIGG-GV 58
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
H D D + + LKN++DAQYFGEIGIG+PPQNF+V+FDTGSSNLWVPSSKCYFSI
Sbjct: 59 MSKYHGFDDPDGEFVSLKNYLDAQYFGEIGIGTPPQNFTVVFDTGSSNLWVPSSKCYFSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+FH++YK+R S+TYT+IG+ EI+YGSGSISGFFSQDNVEVG +VVKDQVFIEATREG
Sbjct: 119 ACFFHNKYKARLSSTYTKIGRPGEIHYGSGSISGFFSQDNVEVGSLVVKDQVFIEATREG 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+ I+VG+A PVW M++QGL+ EE+FSFWLNR+P+A EGGEIV
Sbjct: 179 SLTFALAKFDGIMGLGFQGISVGNATPVWSTMLQQGLLHEELFSFWLNRNPNANEGGEIV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HF+GKHT+VPVT+ GYWQF +GD LI NQ+TGVCEGGC+AIVDSGTSL+AGPT
Sbjct: 239 FGGVDKRHFRGKHTFVPVTQAGYWQFRMGDFLISNQTTGVCEGGCSAIVDSGTSLIAGPT 298
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
VVT+INHAIG EG+VS ECK VVSQYG+++WDLLVSG+LP KVC QIGLC
Sbjct: 299 LVVTQINHAIGAEGIVSMECKEVVSQYGNMMWDLLVSGVLPSKVCSQIGLC 349
>gi|171854659|dbj|BAG16519.1| putative aspartic protease [Capsicum chinense]
Length = 506
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 243/345 (70%), Positives = 294/345 (85%), Gaps = 3/345 (0%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAGVSGVRHRLGDSDEDILPLK 78
+LPASS+ L RIGLKK +D++S+NAAR+ R ++RY G ++G+ + SD DI+PLK
Sbjct: 19 VLPASSDNLLRIGLKKHHVDVNSINAARVARLQDRY--GKHLNGLEKKSDGSDVDIVPLK 76
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSS+CYFSI+C+FH +YK+ KS+TYT
Sbjct: 77 NYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSRCYFSIACWFHHKYKAGKSSTYTR 136
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GKSC I YG+GSISG FSQDNV+VGD+VVKDQVFIEATRE S+TF++ +FDGI+GLGF+
Sbjct: 137 NGKSCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATREPSITFIIGKFDGILGLGFQ 196
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VG+A PVW NMV+QGLV E VFSFW NRD +EGGE+VFGGVDPKHFKG HTYVP+
Sbjct: 197 EISVGNATPVWYNMVDQGLVKEPVFSFWFNRDASTKEGGELVFGGVDPKHFKGNHTYVPL 256
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGPT +VT++NHAIG EGVVSA
Sbjct: 257 TQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQLNHAIGAEGVVSA 316
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK +VSQYG+++WDLLVSG+ P++VC Q GLC FNGAE+V I
Sbjct: 317 ECKTIVSQYGEVLWDLLVSGVRPDQVCSQAGLCFFNGAEHVSSNI 361
>gi|350535356|ref|NP_001234702.1| aspartic protease precursor [Solanum lycopersicum]
gi|951449|gb|AAB18280.1| aspartic protease precursor [Solanum lycopersicum]
Length = 506
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/364 (67%), Positives = 298/364 (81%), Gaps = 4/364 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
M++K L + LW +A C LPASS L RIGLKK RLD+ S+ AAR+ + ++RY G
Sbjct: 1 MDKKHLCAALLLWAIA-CSALPASSGDLFRIGLKKHRLDVDSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD +PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIYKVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ HS+Y++ KS+TYT G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHSKYQASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATRE 177
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 178 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGEL 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HT VP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 238 VFGGVDPKHFKGNHTCVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 297
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +VT+INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +G+++V
Sbjct: 298 TTIVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGIRPDQVCSQAGLCFLDGSQHV 357
Query: 360 RLGI 363
I
Sbjct: 358 SSNI 361
>gi|351725345|ref|NP_001237345.1| aspartic proteinase 2 [Glycine max]
gi|15425751|dbj|BAB64296.1| aspartic proteinase 2 [Glycine max]
Length = 508
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 247/360 (68%), Positives = 293/360 (81%), Gaps = 16/360 (4%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M QK L +VFCLW L +C LLP+ S G+ RIGLKKR LDL S+NAAR R+ G+
Sbjct: 1 MGQKHLVTVFCLWAL-TCSLLPSFSFGILRIGLKKRPLDLDSINAARKARE-------GL 52
Query: 61 SGVRHRLGDSD--------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVP 112
VR +G D EDI+PLKN++DAQYFGEIGIG PPQ F+V+FDTGSSNLWVP
Sbjct: 53 RSVRPMMGAHDQFIGKSKGEDIVPLKNYLDAQYFGEIGIGIPPQPFTVVFDTGSSNLWVP 112
Query: 113 SSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQV 172
SSKCYF+++CY H+ Y ++KS T+ + G SC+INYG+GSISGFFSQDNV+VG VVK Q
Sbjct: 113 SSKCYFTLACYTHNWYTAKKSKTHVKNGTSCKINYGTGSISGFFSQDNVKVGSAVVKHQD 172
Query: 173 FIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD 232
FIEAT EGSLTFL A+FDGI+GLGF+EI+V +AVPVW MVEQ L+SE+VFSFWLN DP+
Sbjct: 173 FIEATHEGSLTFLSAKFDGILGLGFQEISVENAVPVWFKMVEQKLISEKVFSFWLNGDPN 232
Query: 233 AEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
A++GGE+VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD +G STGVCEGGCAAIVDSG
Sbjct: 233 AKKGGELVFGGVDPKHFKGNHTYVPITEKGYWQIEMGDFFVGGVSTGVCEGGCAAIVDSG 292
Query: 293 TSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
TSLLAGPTPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P+ +C Q+GLC+
Sbjct: 293 TSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPDDICSQVGLCS 352
>gi|255543036|ref|XP_002512581.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223548542|gb|EEF50033.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 494
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 243/354 (68%), Positives = 284/354 (80%), Gaps = 16/354 (4%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L + FCLW L +C LPASSNGL +I LKKR LDL S+NAAR R+ER A S
Sbjct: 6 LWMAAFCLWAL-TCSFLPASSNGLMKISLKKRPLDLDSINAARTARQERKTRIAASS--- 61
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
L D D++PLKN++D QYFGEI IGSPPQ F+VIFDTGSSNLW+PS+KCYFS++CYF
Sbjct: 62 -MLHSPDPDMIPLKNYLDTQYFGEISIGSPPQTFTVIFDTGSSNLWIPSAKCYFSLACYF 120
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HSRYKS +S TY G +C+I YG+GSI GFFSQD VEVG++VV++QVFIEATREGSLTF
Sbjct: 121 HSRYKSSRSTTYIRNGTTCKIRYGTGSIVGFFSQDTVEVGNLVVRNQVFIEATREGSLTF 180
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDGI GLGF+EI+VGDAVPVW NMV+QGLV + VFSFWLN DPDA+EGGE+VFGGV
Sbjct: 181 VLAKFDGIFGLGFQEISVGDAVPVWYNMVQQGLVGDPVFSFWLNNDPDAKEGGELVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D KH++GKHTYVPVT+KGYWQF +GD +IGN ST DSGTSLLAGPTP+V
Sbjct: 241 DEKHYRGKHTYVPVTQKGYWQFNMGDFIIGNHST-----------DSGTSLLAGPTPIVA 289
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
EINHAIG EG+VSAECK VVSQYG+LIWDLL+SG+ P KVC Q+GLC F G Y
Sbjct: 290 EINHAIGAEGIVSAECKEVVSQYGNLIWDLLISGVQPGKVCSQLGLCTFRGDRY 343
>gi|356505735|ref|XP_003521645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 245/364 (67%), Positives = 295/364 (81%), Gaps = 2/364 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M QK L +V CLW L +C LLP+ S G+ RIGLKKR LD+ S+NAAR R+ G + +
Sbjct: 1 MGQKHLVTVLCLWAL-TCSLLPSFSFGILRIGLKKRPLDIDSINAARKAREGLRSGRSMM 59
Query: 61 SGVRHRLGDSD-EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+G S ED++PLKN+MDAQYFGEIGIG+PPQ F+V+FDTGSSNLWVPSSKCYF+
Sbjct: 60 GAHDQYIGKSKGEDLVPLKNYMDAQYFGEIGIGTPPQPFTVVFDTGSSNLWVPSSKCYFT 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
++CY H+ Y ++KS T+ + G SC+I+YG+GSISGFFSQDNV+VG VVK Q FIEAT E
Sbjct: 120 LACYTHNWYTAKKSKTHAKNGTSCKISYGTGSISGFFSQDNVKVGSAVVKHQDFIEATHE 179
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSLTFL A+FDGI+GLGF+EI+V ++VPVW MVEQ L+SE+VFSFWLN DP+A++GGE+
Sbjct: 180 GSLTFLSAKFDGILGLGFQEISVENSVPVWYKMVEQKLISEKVFSFWLNGDPNAKKGGEL 239
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 240 VFGGVDPKHFKGNHTYVPITEKGYWQIEIGDFFIGGVSTGVCEGGCAAIVDSGTSLLAGP 299
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
TPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P+ +C Q+GLC+ E
Sbjct: 300 TPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPDDICSQVGLCSSKRHESK 359
Query: 360 RLGI 363
GI
Sbjct: 360 SAGI 363
>gi|114786427|gb|ABI78942.1| aspartic protease [Ipomoea batatas]
Length = 508
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 257/375 (68%), Positives = 301/375 (80%), Gaps = 13/375 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K L + LWV+A C +LPASS L R+GLKK LD +S+ AA+ R + G
Sbjct: 1 MAWKYLCASILLWVIA-CSVLPASSEKLLRVGLKKNPLDFNSIKAAKAARVQGKCG---- 55
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
G ++LGDSD I+ LKN++DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 56 KGANNKLGDSDTGIVSLKNYLDAQYYGEISIGSPPQKFTVIFDTGSSNLWVPSSKCYFSI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IG SC I YGSGSISGF SQDNV VGD+VVKDQVFIE T+E
Sbjct: 116 ACYFHSKYKSSKSSTYTKIGTSCSITYGSGSISGFLSQDNVGVGDLVVKDQVFIETTKEP 175
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDG++GLGF+EI+V D VPVW NMVEQGLV E VFSFWLNRD +AEEGGE++
Sbjct: 176 SLTFVLAKFDGLLGLGFQEISVEDVVPVWYNMVEQGLVDEPVFSFWLNRDTNAEEGGELI 235
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP HFKGKHTYVPVT+KGYWQFE+GD LIGN STG CEGGCAAIVDSGTSLL GPT
Sbjct: 236 FGGVDPNHFKGKHTYVPVTQKGYWQFEMGDFLIGNSSTGFCEGGCAAIVDSGTSLLTGPT 295
Query: 301 --------PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
+VTEINHAIG EGVVS ECK +VSQYG++IWDLLVSG+ P++VC Q+GLC
Sbjct: 296 TIVTEINHAIVTEINHAIGAEGVVSTECKEIVSQYGNMIWDLLVSGVKPDEVCSQVGLCF 355
Query: 353 FNGAEYVRLGIPITR 367
FNGA +G+ + +
Sbjct: 356 FNGAAGSNIGMVVEK 370
>gi|357511707|ref|XP_003626142.1| Aspartic proteinase [Medicago truncatula]
gi|355501157|gb|AES82360.1| Aspartic proteinase [Medicago truncatula]
Length = 504
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 241/363 (66%), Positives = 293/363 (80%), Gaps = 4/363 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q VFCL +C LLP+ S G+ RIGL+KR LDLH+++A ++ R+++ G +
Sbjct: 1 MVQTHFVVVFCLLAF-TCSLLPSFSFGMMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPM 59
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ H+ SD+ I+PLKN+MDAQYFGEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS+
Sbjct: 60 M-LAHK--SSDDAIVPLKNYMDAQYFGEIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSL 116
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H+ YK++KS TY + G SC+I+YG+GSISG+FSQDNV+VG VVK Q FIEATREG
Sbjct: 117 ACYTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEATREG 176
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL+FL +FDGI GLGF+EI+V A+PVW NM+EQ L+ E+VFSFWLN +P+A++GGE+V
Sbjct: 177 SLSFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELV 236
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDPKHFKGKHTYVPVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGPT
Sbjct: 237 FGGVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPT 296
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
PVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P VC Q+GLC+ G +
Sbjct: 297 PVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNS 356
Query: 361 LGI 363
GI
Sbjct: 357 AGI 359
>gi|356534977|ref|XP_003536026.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 250/352 (71%), Positives = 295/352 (83%), Gaps = 2/352 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K L VFCLW L +C LLP+ S GL RIGLKKR LDL S+ AAR+ R++ +G +
Sbjct: 2 MGHKYLWLVFCLWAL-TCSLLPSFSFGLMRIGLKKRDLDLDSIRAARMVREKPRLGRPVL 60
Query: 61 SGVRHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
H LG DE I+PLKN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 61 GAYDHDLGKPIDEGIVPLKNYLDAQYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFS 120
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H YKS+KS TYT+ G SC+I YGSGSISGFFS+D+V+VGDVVVK+Q FIEATRE
Sbjct: 121 IACYTHHWYKSKKSKTYTKNGTSCKIGYGSGSISGFFSKDHVKVGDVVVKNQDFIEATRE 180
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL+F+LA+FDG++GLGF+EI+V +AVPVW NMV+Q LVSE+VFSFWLN DP A++GGE+
Sbjct: 181 GSLSFVLAKFDGLLGLGFQEISVENAVPVWYNMVKQNLVSEQVFSFWLNGDPKAKDGGEL 240
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGG+DPKHFKG H YVPVTKKGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 241 IFGGIDPKHFKGDHIYVPVTKKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGP 300
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
T VVTEINHAIG EGV+S ECK VVS+YG+L+WDLLVSG+ P+ VC Q+GLC
Sbjct: 301 TTVVTEINHAIGAEGVLSVECKEVVSEYGELLWDLLVSGVRPDDVCSQVGLC 352
>gi|50540937|gb|AAT77954.1| Asp [Solanum tuberosum]
Length = 497
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/364 (65%), Positives = 289/364 (79%), Gaps = 16/364 (4%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-KERYMGGAG 59
M++K L + LW + +C LPASS L RIGLKK RLD++S+ AAR+ + ++RY G
Sbjct: 1 MDKKHLCAALLLWAI-TCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRY--GKH 57
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+G+ + DSD DI+PLKN++DAQY+GEIGIGSPPQ F VIFDTGSSNLWVPSSKCYFS
Sbjct: 58 VNGIEKKSSDSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFS 117
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C+ H G+SC I Y +GSISG FS DNV+VGD+VVKDQVFIEATRE
Sbjct: 118 IACWIHRD------------GESCSIRYETGSISGHFSMDNVQVGDLVVKDQVFIEATRE 165
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TF++A+FDGI+GLGF+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+
Sbjct: 166 PSITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGEL 225
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG HTYVP+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGP
Sbjct: 226 VFGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGP 285
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T +V +INHAIG EG+VS ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++V
Sbjct: 286 TTIVAQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHV 345
Query: 360 RLGI 363
I
Sbjct: 346 SSNI 349
>gi|357131833|ref|XP_003567538.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 503
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 229/359 (63%), Positives = 293/359 (81%), Gaps = 1/359 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW L+ LLL ASS+G+ RI L K+RLD +L AA++ R++R + +G
Sbjct: 1 MGPRHLLWVTCLWTLSCALLLGASSDGVLRINLSKKRLDKEALTAAKLARQQRNVLRSGD 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R+ LG SD+DI+PL N++D QY+GEIG+G+PPQNF+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 61 GSYRY-LGVSDDDIVPLDNYLDTQYYGEIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 119
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H +YKS KS+TY + G++C I+YGSGSI+GFFS+D+V VGD+VVK+Q FIE TRE
Sbjct: 120 ACYLHHKYKSTKSSTYKKNGETCTISYGSGSIAGFFSEDSVLVGDLVVKNQKFIETTREA 179
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S +F++ +FDGI+GLGF EI+VG A PVW +M EQ L+++++FSFWLNRDPDA GGE+V
Sbjct: 180 SPSFIIGKFDGILGLGFPEISVGSAPPVWQSMQEQKLIAKDIFSFWLNRDPDAPTGGELV 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD KH+KGKHTYVPVT+KGYWQF++GD+LIG QSTG C GGCAAIVDSGTSLLAGPT
Sbjct: 240 FGGVDQKHYKGKHTYVPVTRKGYWQFDMGDLLIGGQSTGFCAGGCAAIVDSGTSLLAGPT 299
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
+V ++NHAIG EG++S ECK VV +YG++I +LLV+ P+KVC QIGLC F+G + V
Sbjct: 300 TIVAQVNHAIGAEGIISMECKEVVREYGEMILELLVAQTRPQKVCSQIGLCVFDGTKSV 358
>gi|359487589|ref|XP_003633616.1| PREDICTED: aspartic proteinase-like [Vitis vinifera]
Length = 510
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 237/357 (66%), Positives = 292/357 (81%), Gaps = 2/357 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q L + FCLW L + LL ASS+GL RIGLKK RLD + + AAR+ R+ + +GG V
Sbjct: 3 MRQGYLWAAFCLWAL-TFPLLQASSDGLVRIGLKKWRLDYNRIRAARMARRAKSIGGV-V 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD + + L+N+MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI
Sbjct: 61 KSMYQGLGDSDGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSI 120
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FHS+Y SR S TY ++GK EI+YGSGSISG FSQDNV+VG + +K+QVFIEATRE
Sbjct: 121 ACLFHSKYNSRLSTTYIDLGKEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREA 180
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL F+L +FDGI+GLGF EI VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIV
Sbjct: 181 SLVFVLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIV 240
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HFKG+HTY +T+KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT
Sbjct: 241 FGGVDKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPT 300
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
+VTEINHAIG EG+VS ECK VVSQYG++IWDLL+S + P+ VC QIGLC FNG++
Sbjct: 301 AIVTEINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQ 357
>gi|13897888|gb|AAK48494.1|AF259982_1 putative aspartic protease [Ipomoea batatas]
Length = 504
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 252/358 (70%), Positives = 294/358 (82%), Gaps = 7/358 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPA--SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M +K L + F LW + C LPA S N L R+GLKKR LDL S+ AA+ R +GG
Sbjct: 1 MGRKYLCNAFLLWAVV-CTALPAAYSDNNLLRVGLKKRPLDLESIKAAKGAR----LGGK 55
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
GV +LGDSDE I+ L N++DAQY+GEI IGSPPQNF+VIFDTGSSNLWVPSSKCY
Sbjct: 56 YGKGVNKKLGDSDEGIVSLNNYLDAQYYGEISIGSPPQNFTVIFDTGSSNLWVPSSKCYL 115
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CYFHS+YKS KS+TYT+IGKSC I YGS SISGF SQD+V++GD++VKDQVFIE TR
Sbjct: 116 SIACYFHSKYKSSKSSTYTQIGKSCSITYGSVSISGFLSQDDVQLGDLLVKDQVFIETTR 175
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E SLTF++A+FDGI+GLGF+EI+V + VPVW +MVEQGLV E VFSFWLNRDP AE GGE
Sbjct: 176 EPSLTFIIAKFDGILGLGFQEISVENVVPVWYDMVEQGLVDEPVFSFWLNRDPKAEVGGE 235
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKHFKG+HTYVPVT+KGYWQ +LGD LIGN STG CEGGCA IVDSGTSLL G
Sbjct: 236 LVFGGVDPKHFKGEHTYVPVTQKGYWQIDLGDFLIGNSSTGYCEGGCAVIVDSGTSLLTG 295
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGA 356
PT VVTEIN+AIG EGVV AECK VVS+YG++IWDLLVSGL ++VC ++GLC NGA
Sbjct: 296 PTAVVTEINYAIGPEGVVCAECKEVVSEYGEMIWDLLVSGLRADQVCSELGLCFLNGA 353
>gi|12231178|dbj|BAB20972.1| aspartic proteinase 4 [Nepenthes alata]
Length = 505
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 248/363 (68%), Positives = 296/363 (81%), Gaps = 4/363 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L +FC L SC S++GL RIGLK++ D +S+ A RI RK G+
Sbjct: 1 MGHRNLWVIFCFCALISCFF-STSADGLVRIGLKRQFSDSNSIRAVRIARKAGM--NQGL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
++ GDSD DI+ LKN++DAQY+GEIGIGSPPQ FSVIFDTGSSNLWVPSSKCYFS+
Sbjct: 58 KRFQYSFGDSDTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSV 117
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IGKSCEI+YGSGSISGFFSQD VEVG++ VK+QVFIEA+RE
Sbjct: 118 ACYFHSKYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREK 177
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+EI+VGD VPVW NMVEQGLVSE+VFSFW NRDP A+ GGEIV
Sbjct: 178 SLTFALAKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D KHF G+H YVP+T+KGYWQFE+G+ LIGN STG C GGC AIVDSGTSLLAGP
Sbjct: 238 FGGIDEKHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPM 297
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
VVTE+NHAIG EG+ S ECK VV QYGD+IWDLLVSG+ P+K+C Q+ LC FN A+++
Sbjct: 298 HVVTEVNHAIGAEGIASMECKEVVYQYGDMIWDLLVSGVQPDKICSQLALC-FNDAQFLS 356
Query: 361 LGI 363
+GI
Sbjct: 357 IGI 359
>gi|413946823|gb|AFW79472.1| hypothetical protein ZEAMMB73_587615 [Zea mays]
Length = 488
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 230/361 (63%), Positives = 287/361 (79%), Gaps = 4/361 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 42 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 100
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 101 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 159
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 160 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 219
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 220 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 279
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 280 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 339
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 340 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 399
Query: 359 V 359
V
Sbjct: 400 V 400
>gi|226497182|ref|NP_001152501.1| retrotransposon protein SINE subclass precursor [Zea mays]
gi|195624058|gb|ACG33859.1| retrotransposon protein SINE subclass [Zea mays]
gi|195656921|gb|ACG47928.1| retrotransposon protein SINE subclass [Zea mays]
gi|413946824|gb|AFW79473.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946825|gb|AFW79474.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
gi|413946826|gb|AFW79475.1| retrotransposon protein SINE subclass isoform 3 [Zea mays]
Length = 504
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 230/361 (63%), Positives = 287/361 (79%), Gaps = 4/361 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 V 359
V
Sbjct: 359 V 359
>gi|356575293|ref|XP_003555776.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 487 bits (1253), Expect = e-135, Method: Compositional matrix adjust.
Identities = 248/352 (70%), Positives = 292/352 (82%), Gaps = 2/352 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M L VFCLW L +C LLP+ S GL RIGLKKR LDL S+ AAR+ R+ +G +
Sbjct: 1 MGHNYLWLVFCLWAL-TCSLLPSFSFGLLRIGLKKRDLDLDSIRAARMVRENLRLGRPVL 59
Query: 61 SGVRHRLGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+G +DE I+PLKN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 GANDQYIGKPTDEGIVPLKNYLDAQYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFS 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H YKS+KS TYT+ G SC+I YGSGSISGFFS+D+V+VGDVVVK+Q FIEATRE
Sbjct: 120 IACYTHHWYKSKKSKTYTKNGTSCKIRYGSGSISGFFSKDHVKVGDVVVKNQDFIEATRE 179
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL+F+LA+FDG++GLGF+EI+V +AVPVW NMV+Q LVSE+VFSFWLN DP + GGE+
Sbjct: 180 GSLSFVLAKFDGLLGLGFQEISVENAVPVWYNMVKQNLVSEQVFSFWLNGDPKVKNGGEL 239
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDPKHFKG+H YVPVTKKGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLAGP
Sbjct: 240 VFGGVDPKHFKGEHIYVPVTKKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGP 299
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
T VVTEINHAIG EGV+S ECK VVS+YG+L+WDLLVSG+ P+ VC Q+GLC
Sbjct: 300 TTVVTEINHAIGAEGVLSVECKEVVSEYGELLWDLLVSGVRPDDVCSQVGLC 351
>gi|413946821|gb|AFW79470.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946822|gb|AFW79471.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
Length = 545
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 230/361 (63%), Positives = 287/361 (79%), Gaps = 4/361 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 42 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 100
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 101 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 159
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 160 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 219
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 220 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 279
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 280 LVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 339
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 340 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 399
Query: 359 V 359
V
Sbjct: 400 V 400
>gi|194706186|gb|ACF87177.1| unknown [Zea mays]
Length = 504
Score = 486 bits (1250), Expect = e-135, Method: Compositional matrix adjust.
Identities = 229/361 (63%), Positives = 286/361 (79%), Gaps = 4/361 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGGVDPKH+KG HTYVP T+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 LVFGGVDPKHYKGDHTYVPATRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 V 359
V
Sbjct: 359 V 359
>gi|297848226|ref|XP_002891994.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
gi|297337836|gb|EFH68253.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/353 (65%), Positives = 285/353 (80%), Gaps = 3/353 (0%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
KL+ V C + LA LL P +S+ L +GLKKRRL+L + A+R+ RK ++
Sbjct: 8 NKLIHQVICFYFLA-ILLHPTTSSDLFHVGLKKRRLELDDIRASRVIRKLKHSQRLTNYP 66
Query: 63 VRHRLG--DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
LG S++D + LKN++DAQY+G IGIG+P Q F VIFDTGSSNLWVPSSKCY S+
Sbjct: 67 SFATLGGDSSNQDQVILKNYLDAQYYGVIGIGTPSQEFEVIFDTGSSNLWVPSSKCYLSL 126
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H +YKS KS TY + GK+C I YGSGSISGFFS+DNV+VGD+VVK+Q FIEATREG
Sbjct: 127 ACYLHPKYKSTKSKTYIKNGKTCTITYGSGSISGFFSEDNVKVGDLVVKNQEFIEATREG 186
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTFLLA+FDG++GLGF+EI+VG+AVPVW NMV+QGLV ++VFSFWLNRD +AE GGEIV
Sbjct: 187 SLTFLLAKFDGLLGLGFQEISVGNAVPVWYNMVDQGLVRDKVFSFWLNRDTEAEVGGEIV 246
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP HFKGKHTYVPVT+KGYWQF +GDI +G+ STG CE GC AI+DSGTSLLAGPT
Sbjct: 247 FGGVDPAHFKGKHTYVPVTRKGYWQFNMGDIFVGSNSTGFCEQGCDAIMDSGTSLLAGPT 306
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
V+ +INHAIG EG+VSAECK VVSQYG++IW+LLV +LP +VC+++GLC F
Sbjct: 307 TVIAQINHAIGAEGIVSAECKDVVSQYGEMIWNLLVKRVLPRQVCKELGLCVF 359
>gi|219887925|gb|ACL54337.1| unknown [Zea mays]
Length = 504
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 230/361 (63%), Positives = 286/361 (79%), Gaps = 4/361 (1%)
Query: 1 MEQKLLRSVFCLWVLASC-LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M Q L + C WVL++C LLL ASS+GL RI L K+RLD +L AA++ +KE + +
Sbjct: 1 MGQTHLLLLACFWVLSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKESNLRRS- 59
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
G L S +DI+PL N++D QYFG+I IG+PPQNF+VIFDTGSSNLWVPSSKCYFS
Sbjct: 60 -VGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 118
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H RYKS KS TYT+ G+SC I YGSG I+GFFS+DNV VG++VV++Q FIE TRE
Sbjct: 119 IACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIETTRE 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGE 238
S TF++ +FDGI+GLGF EI+VG A P+W +M +Q LV+++VFSFWLNRDPDA GGE
Sbjct: 179 TSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASSGGGE 238
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
VFGGVDPKH+KG HTYVPVT+KGYWQF++GD++IG STG C GGCAAIVDSGTSLLAG
Sbjct: 239 PVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTSLLAG 298
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVS+YG++I +LL+S P+KVC QIGLC F+GA
Sbjct: 299 PTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQKVCTQIGLCVFDGAHS 358
Query: 359 V 359
V
Sbjct: 359 V 359
>gi|357511711|ref|XP_003626144.1| Aspartic proteinase [Medicago truncatula]
gi|355501159|gb|AES82362.1| Aspartic proteinase [Medicago truncatula]
Length = 426
Score = 484 bits (1245), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/338 (67%), Positives = 280/338 (82%), Gaps = 3/338 (0%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+ RIGL+KR LDLH+++A ++ R+++ G + + H+ SD+ I+PLKN+MDAQYFG
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPMM-LAHK--SSDDAIVPLKNYMDAQYFG 57
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CY H+ YK++KS TY + G SC+I+Y
Sbjct: 58 EIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKNGTSCKISY 117
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GSISG+FSQDNV+VG VVK Q FIEATREGSL+FL +FDGI GLGF+EI+V A+P
Sbjct: 118 GTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQEISVERALP 177
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
VW NM+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E
Sbjct: 178 VWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIE 237
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQY
Sbjct: 238 MGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQY 297
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPI 365
G+LIWDLLVSG+ P VC Q+GLC+ G + GI +
Sbjct: 298 GELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGIEM 335
>gi|357511709|ref|XP_003626143.1| Aspartic proteinase [Medicago truncatula]
gi|355501158|gb|AES82361.1| Aspartic proteinase [Medicago truncatula]
Length = 478
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 229/336 (68%), Positives = 279/336 (83%), Gaps = 3/336 (0%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+ RIGL+KR LDLH+++A ++ R+++ G + + H+ SD+ I+PLKN+MDAQYFG
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKMVREQQLRSGRPMM-LAHK--SSDDAIVPLKNYMDAQYFG 57
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CY H+ YK++KS TY + G SC+I+Y
Sbjct: 58 EIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKNGTSCKISY 117
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GSISG+FSQDNV+VG VVK Q FIEATREGSL+FL +FDGI GLGF+EI+V A+P
Sbjct: 118 GTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQEISVERALP 177
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
VW NM+EQ L+ E+VFSFWLN +P+A++GGE+VFGGVDPKHFKGKHTYVPVT+KGYWQ E
Sbjct: 178 VWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVTEKGYWQIE 237
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+GD IG STGVCEGGCAAIVDSGTSLLAGPTPVV EINHAIG EGV+S ECK VVSQY
Sbjct: 238 MGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEVVSQY 297
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
G+LIWDLLVSG+ P VC Q+GLC+ G + GI
Sbjct: 298 GELIWDLLVSGVKPGDVCSQVGLCSIRGDQSNSAGI 333
>gi|255578112|ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 514
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/358 (63%), Positives = 292/358 (81%), Gaps = 4/358 (1%)
Query: 10 FCLWVLA-SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHR 66
FCL +L C +S++GL RIGLKKR+ D ++ AA+ KE A + +R
Sbjct: 11 FCLILLPLVCATASSSNDGLVRIGLKKRKFDQNNRVAAQFESKEGEAFRASIKKYHIRGN 70
Query: 67 LGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
LGD+++ DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFH
Sbjct: 71 LGDAEDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFH 130
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
S+YKS +S+TY + GKS +I+YG+G+ISGFFSQDNV+VG++V+K+Q FIEATRE S+TFL
Sbjct: 131 SKYKSGQSSTYKKNGKSADIHYGTGAISGFFSQDNVKVGELVIKNQEFIEATREPSITFL 190
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
+A+FDGI+GLGF+EI+VG+AVPVW NMV QGLV E VFSFW NR+ D +EGGEIVFGG+D
Sbjct: 191 VAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMD 250
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+KG+HTYVPVT+KGYWQF++GD+LI ++TG+C GCAAI DSGTSLLAGPT ++TE
Sbjct: 251 PNHYKGEHTYVPVTQKGYWQFDMGDVLIDGKTTGICSSGCAAIADSGTSLLAGPTTIITE 310
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
+NHAIG GVVS ECK VV+QYG+ I +L++ P+K+C QIGLC F+G+ V +GI
Sbjct: 311 VNHAIGATGVVSQECKAVVAQYGETIIAMLLAKDQPQKICSQIGLCTFDGSRGVSMGI 368
>gi|224115794|ref|XP_002317126.1| predicted protein [Populus trichocarpa]
gi|222860191|gb|EEE97738.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 230/356 (64%), Positives = 286/356 (80%), Gaps = 5/356 (1%)
Query: 13 WVLASCLL----LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+VL S LL L S++GL RIGLKK + D ++ AAR+ +E + LG
Sbjct: 11 FVLLSFLLFAVVLSESNDGLLRIGLKKVKFDKNNRIAARLDSQEALRASIRKYNLLGNLG 70
Query: 69 DS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
+S D DI+ LKN+ DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPSSKCY S++CYFHS+
Sbjct: 71 ESEDTDIVALKNYFDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACYFHSK 130
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S KS++Y + GKS EI YGSGSISGFFS D VEVG++VVKDQ FIEAT+E S+TFL+
Sbjct: 131 YNSGKSSSYKKNGKSAEIQYGSGSISGFFSIDAVEVGNLVVKDQEFIEATKEPSITFLVG 190
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EIAVG AVPVWDNM++QGL+ E VFSFWLNR+ D EEGGEIVFGG+DP
Sbjct: 191 KFDGILGLGFKEIAVGGAVPVWDNMIKQGLIKEPVFSFWLNRNADDEEGGEIVFGGMDPN 250
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KGKHTYVPVT+KGYWQF++GD+++G++STG C GGCAAI DSGTSLLAGPT ++T IN
Sbjct: 251 HYKGKHTYVPVTQKGYWQFDMGDVIVGDKSTGYCAGGCAAIADSGTSLLAGPTAIITMIN 310
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
HAIG GVVS +CK VVSQYG++I DLL+S + P+K+C QIGLC F+G + +GI
Sbjct: 311 HAIGASGVVSQQCKAVVSQYGEVIMDLLLSEVQPKKICSQIGLCTFDGTRGISMGI 366
>gi|359483345|ref|XP_003632941.1| PREDICTED: aspartic proteinase isoform 2 [Vitis vinifera]
gi|359483347|ref|XP_002262915.2| PREDICTED: aspartic proteinase isoform 1 [Vitis vinifera]
Length = 514
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 234/344 (68%), Positives = 283/344 (82%), Gaps = 3/344 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG-VRH-RLGDS-DEDILPLKN 79
A+++GL RIGLKK +LD + AAR+ KE A + RH LGDS D DI+ LKN
Sbjct: 25 ATTDGLFRIGLKKMKLDQNDQLAARLESKEGESLRASIRKYFRHGNLGDSQDTDIVGLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ CYFHS+YKS +S+TY +
Sbjct: 85 YMDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GKS +I+YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKSADIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++G++LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS E
Sbjct: 265 QKGYWQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
CK VV+QYG+ I DLL+S P+K+C QIGLC F+G V +GI
Sbjct: 325 CKTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGI 368
>gi|224056377|ref|XP_002298827.1| predicted protein [Populus trichocarpa]
gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 231/354 (65%), Positives = 288/354 (81%), Gaps = 10/354 (2%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG---VRHRLGDS 70
+++S L P ++GL RIGLKKR+ + ++ AA++ KE G + +R+ GD+
Sbjct: 1 MISSALSPP--NDGLIRIGLKKRKYERNNRLAAKLESKE----GESIKKYHLLRNLGGDA 54
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
D DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YK
Sbjct: 55 EDTDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYK 114
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S S TY E GKS EI+YG+G+ISGFFSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+F
Sbjct: 115 SSHSRTYKENGKSAEIHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKF 174
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF+EI+VG AVPVW NMVEQGLV E VFSFW NR+ D +EGGEIVFGGVDP H+
Sbjct: 175 DGILGLGFQEISVGKAVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHY 234
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG+HTYVPVT+KGYWQF++GD+LIG Q++G C GCAAI DSGTSLLAGPT ++TE+NHA
Sbjct: 235 KGEHTYVPVTQKGYWQFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHA 294
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IG GVVS ECK VV+QYGD I ++L++ P+K+C QIGLC F+G V +GI
Sbjct: 295 IGATGVVSQECKAVVAQYGDTIMEMLLAKDQPQKICAQIGLCTFDGTRGVSMGI 348
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 476 bits (1224), Expect = e-131, Method: Compositional matrix adjust.
Identities = 229/354 (64%), Positives = 282/354 (79%), Gaps = 8/354 (2%)
Query: 18 CLLL-----PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR--LGDS 70
CLLL S+ L RIGLKKR+ D + AA + KER A + R + L +S
Sbjct: 15 CLLLFPIVFSISNERLVRIGLKKRKFDQNYRLAAHLDSKEREAFRASLKKYRLQGNLQES 74
Query: 71 DE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
++ DI+ LKN++DAQYFGEIGIG+PPQNF+VIFDTGSSNLWVPSSKCYFSI+CY HSRYK
Sbjct: 75 EDIDIVALKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHSRYK 134
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S+TY GK +I YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+F
Sbjct: 135 SSRSSTYKANGKPADIQYGTGAISGFFSEDNVQVGDLVVKNQEFIEATREPSITFLVAKF 194
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF+EI+VG+AVPVW NMV QGLV E VFSFW NRDP+ + GGE+VFGG+DPKHF
Sbjct: 195 DGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRDPEDDIGGEVVFGGMDPKHF 254
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG HTYVP+T+KGYWQF++GD+LIGNQ+TG+C GGC+AI DSGTSL+ GPT ++ ++NHA
Sbjct: 255 KGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAIADSGTSLITGPTAIIAQVNHA 314
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IG GVVS ECK VVSQYG+ I D+L+S P K+C QIGLC F+G V GI
Sbjct: 315 IGASGVVSQECKTVVSQYGETIIDMLLSKDQPLKICSQIGLCTFDGTRGVSTGI 368
>gi|115461973|ref|NP_001054586.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|78099760|sp|P42211.2|ASPRX_ORYSJ RecName: Full=Aspartic proteinase; Flags: Precursor
gi|46485798|gb|AAS98423.1| aspartic proteinase [Oryza sativa Japonica Group]
gi|113578137|dbj|BAF16500.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|215694423|dbj|BAG89416.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 496
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 216/362 (59%), Positives = 282/362 (77%), Gaps = 9/362 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRL 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G V
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GI 363
GI
Sbjct: 354 GI 355
>gi|222630120|gb|EEE62252.1| hypothetical protein OsJ_17039 [Oryza sativa Japonica Group]
Length = 501
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 216/362 (59%), Positives = 282/362 (77%), Gaps = 9/362 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRL 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G V
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GI 363
GI
Sbjct: 354 GI 355
>gi|218143|dbj|BAA02242.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 496
Score = 469 bits (1207), Expect = e-130, Method: Compositional matrix adjust.
Identities = 216/362 (59%), Positives = 281/362 (77%), Gaps = 9/362 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ LL CLW L+ LLL ASS+G R+ L K+RLD L AA++ ++ +
Sbjct: 3 KRHLLLVTTCLWALSCALLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRL------ 56
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ G SD D +PL ++++ QY+G IG+GSPPQNF+VIFDTGSSNLWVPS+KCYFSI+
Sbjct: 57 ---LKTGSSDSDPVPLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HSRY S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD VVK+Q FIEATRE S
Sbjct: 114 CYLHSRYNSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDQVVKNQKFIEATRETS 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF++ +FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VF
Sbjct: 174 VTFIIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVF 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+DPKH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT
Sbjct: 234 GGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTA 293
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRL 361
+V ++NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G V
Sbjct: 294 IVAQVNHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSN 353
Query: 362 GI 363
GI
Sbjct: 354 GI 355
>gi|224118038|ref|XP_002331542.1| predicted protein [Populus trichocarpa]
gi|222873766|gb|EEF10897.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 222/342 (64%), Positives = 275/342 (80%), Gaps = 1/342 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFM 81
AS++GL RIGLKK +LD ++ AAR+ KE + LG+S D DI+ LKN++
Sbjct: 25 ASNDGLLRIGLKKVKLDKNNRIAARLDSKETLRASIRKYNLCGNLGESEDTDIVALKNYL 84
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D+QY+GEIG+GSPPQ F+VIFDTGSSNLWVPSSKCY S++CYFHS+Y S KS+TY + GK
Sbjct: 85 DSQYYGEIGVGSPPQKFTVIFDTGSSNLWVPSSKCYLSVACYFHSKYDSGKSSTYKKNGK 144
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
S EI YGSGSISGFFS D VEVG +VVKDQ FIEAT+E ++TFL+A+FDGI+GLGF+EI+
Sbjct: 145 SAEIRYGSGSISGFFSNDAVEVGGLVVKDQEFIEATKEPNITFLVAKFDGILGLGFKEIS 204
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
VGDAVPVWDNM++ GL+ E VFSFWLNR+ + EEGGEIVFGG+DP H+KGKHT+VPVT+K
Sbjct: 205 VGDAVPVWDNMIKHGLIKEPVFSFWLNRNAEDEEGGEIVFGGMDPNHYKGKHTFVPVTRK 264
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQF +GD+ IG++ TG C GCAAI DSGTSLLAGPT ++T IN AIG GVVS +CK
Sbjct: 265 GYWQFNMGDVHIGDKPTGYCASGCAAIADSGTSLLAGPTTIITMINQAIGASGVVSQQCK 324
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
VVSQYG+ I DLL+S P+++C QIGLC F+G + +GI
Sbjct: 325 AVVSQYGEAIMDLLLSQAQPKRICSQIGLCTFDGTRGISIGI 366
>gi|449454758|ref|XP_004145121.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449472326|ref|XP_004153558.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 514
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 228/360 (63%), Positives = 283/360 (78%), Gaps = 4/360 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
+ CL++L S ++ + SN GL R+GLKK LD + AAR+ K+ + A
Sbjct: 9 AFLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYNPN 68
Query: 67 --LGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACH 128
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S+TY + G S I YG+G++SGFFS DNV+VGD+VVK+Q+FIEATRE LT
Sbjct: 129 FHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLT 188
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 189 FLVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGG 248
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKH+KGKHTYVPVT+KGYWQF++GD+LI + TG CEGGC+AI DSGTSLLAGPT +V
Sbjct: 249 VDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGCSAIADSGTSLLAGPTTIV 308
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG +GV+S ECK VV QYG I DLL+S P+K+C QI LC F+G V +GI
Sbjct: 309 TMINHAIGAKGVMSQECKAVVQQYGQTIMDLLLSEADPKKICSQIKLCTFDGTRGVSMGI 368
>gi|359477267|ref|XP_002275241.2| PREDICTED: aspartic proteinase [Vitis vinifera]
Length = 502
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 225/363 (61%), Positives = 277/363 (76%), Gaps = 13/363 (3%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
LW A CL L SS+GL RIGLKK+ LDL L+AARITR + LG D
Sbjct: 13 LWAWACCLALDDSSDGLVRIGLKKKPLDLARLHAARITRGNGFHA--------QGLGKVD 64
Query: 72 ED-----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
++ + LKN+MDAQY+GEIGIGSPPQ FSV+FDTGSSNLWVPSSKCYFSI+CYFH+
Sbjct: 65 DNYPKANTVYLKNYMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSSKCYFSIACYFHA 124
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RY++ S TY++ G+ C+INYGSGSISGFFSQD+V++G++V+K+QVF EAT+EG F L
Sbjct: 125 RYRAVLSRTYSKNGRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFTEATKEGLFAFSL 184
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+GLGF+ +VG P+W NMV+Q LVS E+ SFWLNRDP A+ GGE++FGGVD
Sbjct: 185 AQFDGILGLGFQNASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAKIGGEVIFGGVDW 244
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+HF G HT+VP+T+K YWQ E+GDILI STG CEGGCAAIVD+GTS++AGPT VVT+I
Sbjct: 245 RHFMGDHTFVPITRKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTSMIAGPTTVVTQI 304
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPIT 366
NHAIG EG+VS CK VV++YG LIW LVSG PE VC IGLCA+NG + R G +
Sbjct: 305 NHAIGAEGIVSFNCKNVVNKYGRLIWQFLVSGFQPENVCSDIGLCAYNGTKNARQGAGME 364
Query: 367 RVL 369
V+
Sbjct: 365 TVI 367
>gi|297736824|emb|CBI26025.3| unnamed protein product [Vitis vinifera]
Length = 500
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 226/369 (61%), Positives = 278/369 (75%), Gaps = 14/369 (3%)
Query: 1 MEQKLLRSVFCL-WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M K + CL W A CL L SS+GL RIGLKK+ LDL L+AARITR +
Sbjct: 1 MRLKYILVANCLLWAWACCLALDDSSDGLVRIGLKKKPLDLARLHAARITRGNGFHA--- 57
Query: 60 VSGVRHRLGDSDED-----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
LG D++ + LKN+MDAQY+GEIGIGSPPQ FSV+FDTGSSNLWVPSS
Sbjct: 58 -----QGLGKVDDNYPKANTVYLKNYMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSS 112
Query: 115 KCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
KCYFSI+CYFH+RY++ S TY++ G+ C+INYGSGSISGFFSQD+V++G++V+K+QVF
Sbjct: 113 KCYFSIACYFHARYRAVLSRTYSKNGRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFT 172
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT+EG F LA+FDGI+GLGF+ +VG P+W NMV+Q LVS E+ SFWLNRDP A+
Sbjct: 173 EATKEGLFAFSLAQFDGILGLGFQNASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAK 232
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++FGGVD +HF G HT+VP+T+K YWQ E+GDILI STG CEGGCAAIVD+GTS
Sbjct: 233 IGGEVIFGGVDWRHFMGDHTFVPITRKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
++AGPT VVT+INHAIG EG+VS CK VV++YG LIW LVSG PE VC IGLCA+N
Sbjct: 293 MIAGPTTVVTQINHAIGAEGIVSFNCKNVVNKYGRLIWQFLVSGFQPENVCSDIGLCAYN 352
Query: 355 GAEYVRLGI 363
G + G+
Sbjct: 353 GTKNASAGM 361
>gi|12231180|dbj|BAB20973.1| aspartic proteinase 5 [Nepenthes alata]
Length = 358
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 237/339 (69%), Positives = 278/339 (82%), Gaps = 3/339 (0%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L +FC L SC S++GL RIGLK++ D +S+ A RI RK G+
Sbjct: 1 MGHRNLWVIFCFCALISCFF-STSADGLVRIGLKRQFSDSNSIRAVRIARKAGM--NQGL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
++ GDSD DI+ LKN++DAQY+GEIGIGSPPQ FSVIFDTGSSNLWVPSSKCYFS+
Sbjct: 58 KRFQYSFGDSDTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSV 117
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS KS+TYT+IGKSCEI+YGSGSISGFFSQD VEVG++ VK+QVFIEA+RE
Sbjct: 118 ACYFHSKYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREK 177
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF LA+FDGI+GLGF+EI+VGD VPVW NMVEQGLVSE+VFSFW NRDP A+ GGEIV
Sbjct: 178 SLTFALAKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D KHF G+H YVP+T+KGYWQFE+G+ LIGN STG C GGC AIVDSGTSLLAGP
Sbjct: 238 FGGIDEKHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPM 297
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGL 339
VVTE+NHAIG EG+ S ECK VV QYGD+IWDLLVSG+
Sbjct: 298 HVVTEVNHAIGAEGIASMECKEVVYQYGDMIWDLLVSGV 336
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 223/342 (65%), Positives = 273/342 (79%), Gaps = 3/342 (0%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR--LGDSDE-DILPLKNFM 81
++GL RIGLKK +LD ++ AAR+ K+ A + R R LGDS+E DI+ LKN+M
Sbjct: 27 NDGLVRIGLKKMKLDPNNRLAARLDSKDGEALRAFIKKYRFRNNLGDSEETDIVALKNYM 86
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
DAQY+GEIGIG+P Q F+VIFDTGSSNLWV S+KCYFS++CYFH +YK+ S+TY + GK
Sbjct: 87 DAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTKCYFSVACYFHEKYKASDSSTYKKDGK 146
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G+ISGFFS D+V+VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI+
Sbjct: 147 PASIQYGTGAISGFFSYDHVQVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFKEIS 206
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
VGDAVPVW NM++QGL+ E VFSFWLNR+ D E GGEIVFGGVDP H+KGKHTYVPVT+K
Sbjct: 207 VGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEAGGEIVFGGVDPNHYKGKHTYVPVTQK 266
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQF++GD+LI ++ TG C G CAAI DSGTSLLAGP+ V+T INHAIG GVVS ECK
Sbjct: 267 GYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSLLAGPSTVITMINHAIGATGVVSQECK 326
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
VV QYG I DLL++ P+K+C QIGLC FNGA V GI
Sbjct: 327 AVVQQYGRTIIDLLIAEAQPQKICSQIGLCTFNGAHGVSTGI 368
>gi|218188020|gb|EEC70447.1| hypothetical protein OsI_01478 [Oryza sativa Indica Group]
Length = 495
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 223/363 (61%), Positives = 280/363 (77%), Gaps = 9/363 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW+L+ +LL AS +GL RI L K+RLD +L+ A++ R+E +
Sbjct: 1 MGRNHLCLVTCLWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESH------ 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R R +DI+PL N++D QYFGEIGIG+PPQNF+VIFDTGSSNLWVPS KCYFSI
Sbjct: 55 ---RLRADGLGDDIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H RYKS+ S++Y + G+SC I+YGSGSI+GFFS+D+V VGD+ VK+Q+FIE TRE
Sbjct: 112 ACYLHHRYKSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF++ +FDGI+GLGF EI+VG A P+W M EQ L+ ++VFSFWLNRDPDA GGE++
Sbjct: 172 SLTFIIGKFDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELI 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG HTYVPVT+KGYWQFE+GD+LI + STG C GGCAAI DSGTSLL GPT
Sbjct: 232 FGGVDPNHYKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPT 291
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
+V +INHAIG EG+VS ECK VV YGD+I ++L++ P K+C QIGLCAF+G VR
Sbjct: 292 TIVAQINHAIGAEGIVSMECKQVVRDYGDMILEMLIAQASPMKLCSQIGLCAFDGTRSVR 351
Query: 361 LGI 363
I
Sbjct: 352 NNI 354
>gi|115436054|ref|NP_001042785.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|8467954|dbj|BAA96578.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113532316|dbj|BAF04699.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|215694819|dbj|BAG90010.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215701475|dbj|BAG92899.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222618242|gb|EEE54374.1| hypothetical protein OsJ_01384 [Oryza sativa Japonica Group]
Length = 495
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 223/363 (61%), Positives = 280/363 (77%), Gaps = 9/363 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L V CLW+L+ +LL AS +GL RI L K+RLD +L+ A++ R+E +
Sbjct: 1 MGRNHLCLVTCLWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESH------ 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
R R +DI+PL N++D QYFGEIGIG+PPQNF+VIFDTGSSNLWVPS KCYFSI
Sbjct: 55 ---RLRADGLGDDIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H RYKS+ S++Y + G+SC I+YGSGSI+GFFS+D+V VGD+ VK+Q+FIE TRE
Sbjct: 112 ACYLHHRYKSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF++ +FDGI+GLGF EI+VG A P+W M EQ L+ ++VFSFWLNRDPDA GGE++
Sbjct: 172 SLTFIIGKFDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELI 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG HTYVPVT+KGYWQFE+GD+LI + STG C GGCAAI DSGTSLL GPT
Sbjct: 232 FGGVDPNHYKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPT 291
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
+V +INHAIG EG+VS ECK VV YGD+I ++L++ P K+C QIGLCAF+G VR
Sbjct: 292 TIVAQINHAIGAEGIVSMECKQVVRDYGDMILEMLIAQASPMKLCSQIGLCAFDGTRSVR 351
Query: 361 LGI 363
I
Sbjct: 352 NNI 354
>gi|255554815|ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 511
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 220/341 (64%), Positives = 271/341 (79%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMD 82
A ++GL R+GLKK +LD +S AAR+ K A V R D DI+ LKN++D
Sbjct: 25 APNDGLVRLGLKKMKLDENSRLAARLESKNAEALRASVRKYGLRGDSKDTDIVALKNYLD 84
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEIGIG+PPQ F+V+FDTGSSNLWVPSSKC FS++C+FHSRYKS +S+TY + GKS
Sbjct: 85 AQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSSKCIFSVACFFHSRYKSGQSSTYKKNGKS 144
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
EI+YGSG+ISGFFS DNV VG++VVKDQ FIEAT+E +TF+ A+FDGI+GLGF+EI+V
Sbjct: 145 AEIHYGSGAISGFFSSDNVVVGNLVVKDQEFIEATKEPGVTFVAAKFDGILGLGFQEISV 204
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+AVPVW NM++QGL+ E VFSFWLNR+ EEGGEIVFGGVD H+KGKHTYVPVT+KG
Sbjct: 205 GNAVPVWYNMIKQGLIKEPVFSFWLNRNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQKG 264
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQFE+GD+LIG++ T C GGC+AI DSGTSLLAGPT VVT IN AIG GV S ECK
Sbjct: 265 YWQFEMGDVLIGHKPTEYCAGGCSAIADSGTSLLAGPTTVVTLINEAIGATGVASQECKT 324
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V++QYG+ I DLL++ P+K+C QIGLC F+G V +GI
Sbjct: 325 VIAQYGETIMDLLIAEAQPKKICSQIGLCTFDGTRGVSMGI 365
>gi|312282703|dbj|BAJ34217.1| unnamed protein product [Thellungiella halophila]
Length = 506
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/361 (61%), Positives = 278/361 (77%), Gaps = 10/361 (2%)
Query: 7 RSVFCLWVLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
R+V +++ L ASS +G R+GLKK +LD + AARI+ ++ A
Sbjct: 6 RTVAVSLIVSFLLFFSASSERNDGTVRVGLKKLKLDPKNRLAARISSEQEKPLRA----- 60
Query: 64 RHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
LGDS D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+C
Sbjct: 61 -FSLGDSGDADIVALKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIAC 119
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H +YKS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +
Sbjct: 120 LLHPKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGI 179
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+LA+FDGI+GLGF+EI+VG+A PVW NM++QGL+ E VFSFWLNR+ + +EGGE+VFG
Sbjct: 180 TFVLAKFDGILGLGFKEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDDEGGELVFG 239
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP HFKGKHTYVPVT+KGYWQF++GD+LIGN TG CE GC+AI DSGTSLLAGPT +
Sbjct: 240 GVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGNAPTGFCESGCSAIADSGTSLLAGPTTI 299
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLG 362
+T INHAIG GVVS +CK VV QYG I +LL+S P+K+C QIGLC FNG V +G
Sbjct: 300 ITMINHAIGAAGVVSQQCKTVVDQYGRTILELLLSETQPKKICSQIGLCTFNGKRGVSMG 359
Query: 363 I 363
I
Sbjct: 360 I 360
>gi|357132502|ref|XP_003567869.1| PREDICTED: phytepsin-like [Brachypodium distachyon]
Length = 505
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/361 (61%), Positives = 282/361 (78%), Gaps = 10/361 (2%)
Query: 7 RSVFCLW--VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSGV 63
R V L+ VL LL + + GL RI LKKR +D ++ A R++ +E +++GGA
Sbjct: 5 RVVLVLFAAVLLQALLPASEAEGLVRIALKKRPIDQNNRVATRLSGEEGQHLGGA----- 59
Query: 64 RHRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ LG DE DI+ L+N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C
Sbjct: 60 -NSLGSEDEGDIVALQNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIAC 118
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
YFHSRYK+ +S+TY + GK I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +
Sbjct: 119 YFHSRYKAGQSSTYKKNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGV 178
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF++A+FDGI+GLGF+EI+VG AVPVW M+EQGL+S+ VFSFW NR EGGEIVFG
Sbjct: 179 TFMVAKFDGILGLGFQEISVGKAVPVWYKMIEQGLISDPVFSFWFNRHAGEGEGGEIVFG 238
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G+DPKH+ G+HTYVPVT+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT +
Sbjct: 239 GMDPKHYIGEHTYVPVTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAI 298
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLG 362
+TEIN IG GVVS ECK VVSQYG I DLL++ P+K+C Q+GLC F+G V G
Sbjct: 299 ITEINEKIGAAGVVSQECKTVVSQYGQQILDLLLAETQPKKICSQVGLCTFDGTRGVSAG 358
Query: 363 I 363
I
Sbjct: 359 I 359
>gi|122890420|emb|CAM12780.1| aspartic proteinase [Fagopyrum esculentum]
Length = 506
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 228/347 (65%), Positives = 275/347 (79%), Gaps = 3/347 (0%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
S + L ++N L R+GLKKR+LD + A+R K+ M G+ + GD D I+
Sbjct: 17 SPIALSVANNDLVRVGLKKRKLDPTNRPASRFGCKKHLMQKYGLG---NGFGDDDTGIIS 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS KCY SI+C+FHS+YKS KS+TY
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSSTY 133
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GKS EI+YG+G+ISG+FSQDNV+VGD+VV++Q FIEATRE SLTF+ A+FDGI+GLG
Sbjct: 134 VKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILGLG 193
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG AVPVW NMV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+
Sbjct: 194 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEVGGEIVFGGIDPAHHKGEHTYL 253
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF+L D+L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVV
Sbjct: 254 PVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVV 313
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
S ECK VVSQYG I DLLVS P K+C QIGLC F+G V +GI
Sbjct: 314 SQECKTVVSQYGKQILDLLVSQTQPRKICSQIGLCTFDGTRGVSMGI 360
>gi|1030715|dbj|BAA06876.1| aspartic protease [Oryza sativa]
gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa]
Length = 509
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/345 (64%), Positives = 272/345 (78%), Gaps = 5/345 (1%)
Query: 22 PASSN-GLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDSDEDILPLK 78
PAS+ GL RI LKKR +D +S AAR++ +E R +G G + + + DI+ LK
Sbjct: 21 PASAEEGLVRIALKKRPIDENSRVAARLSGEEGARRLGLRGANSLGGGG--GEGDIVALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C+FHSRYKS +S+TY +
Sbjct: 79 NYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GK I YG+GSI+GFFS+D+V VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+
Sbjct: 139 NGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VGDAVPVW MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV
Sbjct: 199 EISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
++KGYWQFE+GD+LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 259 SQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK VVSQYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 319 ECKTVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGI 363
>gi|115465497|ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor
gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica
Group]
gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group]
gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group]
gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group]
Length = 509
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/345 (64%), Positives = 272/345 (78%), Gaps = 5/345 (1%)
Query: 22 PASS-NGLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDSDEDILPLK 78
PAS+ GL RI LKKR +D +S AAR++ +E R +G G + + + DI+ LK
Sbjct: 21 PASAAEGLVRIALKKRPIDENSRVAARLSGEEGARRLGLRGANSLGGGG--GEGDIVALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C+FHSRYKS +S+TY +
Sbjct: 79 NYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GK I YG+GSI+GFFS+D+V VGD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+
Sbjct: 139 NGKPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VGDAVPVW MVEQGLVSE VFSFW NR D EGGEIVFGG+DP H+KG HTYVPV
Sbjct: 199 EISVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
++KGYWQFE+GD+LIG ++TG C GC+AI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 259 SQKGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK VVSQYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 319 ECKTVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGI 363
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 219/346 (63%), Positives = 274/346 (79%), Gaps = 17/346 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGV---------SGVRHRLGDSDE-DILPL 77
L R+GLKKR+LD +I R + G G G+ + LG+SD+ DI+ L
Sbjct: 30 LLRVGLKKRKLD-------QINRLSSHYGCKGKGSTSPSIWKHGLGNGLGNSDDADIISL 82
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+MDAQYFGEIGIGSPPQ F+VIFDTGSSNLWVPS+KCYFSI+CY H +YKS KS+TY
Sbjct: 83 KNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHPKYKSFKSSTYA 142
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ GKS I+YG+G+ISGFFSQD+V++GD+VV++Q FIEAT+E S+TF+ A+FDGI+GLGF
Sbjct: 143 KNGKSAAIHYGTGAISGFFSQDHVKMGDLVVENQDFIEATKEPSITFVAAKFDGILGLGF 202
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VGDAVP W NM++QGLV+E VFSFWLNR + EEGGEIVFGGVDP H+KG+HTYVP
Sbjct: 203 QEISVGDAVPAWYNMIDQGLVNEPVFSFWLNRKSEEEEGGEIVFGGVDPNHYKGEHTYVP 262
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++ D+L+G ++TG C GGC+AI DSGTSLLAGPT ++ +INHAIG G+VS
Sbjct: 263 VTRKGYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIIVQINHAIGASGLVS 322
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK VVSQYG I D LV+ P+K+C QIGLC F+G V +GI
Sbjct: 323 QECKAVVSQYGKAILDALVAEAQPQKICSQIGLCTFDGKRGVSMGI 368
>gi|77808107|gb|AAV84085.2| aspartic proteinase 9 [Fagopyrum esculentum]
Length = 506
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 228/347 (65%), Positives = 275/347 (79%), Gaps = 3/347 (0%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
S + L ++N L R+GLKKR+LD + A+R K+ M G+ + GD D I+
Sbjct: 17 SPISLSVANNDLVRVGLKKRKLDPTNRPASRFGCKKHLMQKYGLG---NGFGDDDTGIIS 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS KCY SI+C+FHS+YKS KS+TY
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSSTY 133
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GKS EI+YG+G+ISG+FSQDNV+VGD+VV++Q FIEATRE SLTF+ A+FDGI+GLG
Sbjct: 134 VKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILGLG 193
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG AVPVW NMV QGLV+E VFSFWLNR+ D E GGEIVFGG+DP H KG+HTY+
Sbjct: 194 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEIGGEIVFGGIDPAHHKGEHTYL 253
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF+L D+L+G +STG C GGC+AI DSGTSLLAGPTPVV +INHAIG GVV
Sbjct: 254 PVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASGVV 313
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
S ECK VVSQYG I DLLVS P K+C QIGLC F+G V +GI
Sbjct: 314 SQECKTVVSQYGKQILDLLVSQTQPRKICSQIGLCTFDGTRGVSMGI 360
>gi|1326165|gb|AAB03108.1| aspartic protease [Brassica napus]
Length = 506
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 223/362 (61%), Positives = 281/362 (77%), Gaps = 12/362 (3%)
Query: 7 RSVFCLWVLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSG 62
++V +++ L L AS+ +G R+GLKK + D S AA + K+ + + G G
Sbjct: 6 KTVALSLIVSFLLFLSASAERNDGTFRVGLKKLKFDPRSRIAAPVGSKQLKPLRGYG--- 62
Query: 63 VRHRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
LGDS D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+
Sbjct: 63 ----LGDSGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIA 118
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C FHS+YKS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E
Sbjct: 119 CLFHSKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPG 178
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TF+LA+FDGI+GLGF+EI+VG+A PVW NM++QGL+ E VFSFWLNR+ + EEGGE+VF
Sbjct: 179 ITFVLAKFDGILGLGFQEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDEEGGELVF 238
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVDP HFKG+HTYVPVT+KGYWQF++GD+LIG TG CE GC+AI DSGTSLLAGPT
Sbjct: 239 GGVDPNHFKGEHTYVPVTQKGYWQFDMGDVLIGGAPTGYCESGCSAIADSGTSLLAGPTT 298
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRL 361
V+T INHAIG GVVS +CK+VV QYG I DLL+S P+K+C QIGLC F+G V +
Sbjct: 299 VITMINHAIGAAGVVSQQCKIVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGKRGVSM 358
Query: 362 GI 363
GI
Sbjct: 359 GI 360
>gi|357134751|ref|XP_003568979.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 498
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 216/363 (59%), Positives = 275/363 (75%), Gaps = 9/363 (2%)
Query: 2 EQKLLRSVFCLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
++ L CLW L+ LL AS +GL RI L K+ L+ +LNAA++ R++
Sbjct: 3 QRHLFLVTTCLWALSCAGLLHASPPDGLLRINLNKKSLNYEALNAAKLARQQ-------- 54
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
V ++ S+ DI+PL ++++ QYFG IG+G+PPQNF+VIFDTGSSNLWVPSSKCYFSI
Sbjct: 55 DSVHLKISSSNSDIVPLVDYLNTQYFGVIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 114
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY H +YKS KS+TY G+S +I YGSG+ISGFFS DNV VGD+VVK Q FIE TRE
Sbjct: 115 ACYLHHKYKSSKSSTYKADGESAKITYGSGAISGFFSNDNVLVGDLVVKKQKFIETTRET 174
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S TF++ +FDGI+GLGF EI+VG A PVW +M +Q L++++VFSFWLNR+ DA GGE+V
Sbjct: 175 SATFIIGKFDGILGLGFPEISVGKAPPVWMSMQKQKLLADDVFSFWLNRNADATSGGELV 234
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+KG HTYVPV++KGYWQF +GD+LI QSTG C GCAAIVDSGTSLLAGPT
Sbjct: 235 FGGVDSNHYKGNHTYVPVSRKGYWQFNMGDLLIDGQSTGFCAKGCAAIVDSGTSLLAGPT 294
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
+V ++NHAIG EG++S ECK VVSQYG++I DLL++ P+KVC Q+GLC F+G V
Sbjct: 295 AIVAQVNHAIGAEGIISTECKEVVSQYGEMILDLLLAQTEPQKVCSQVGLCLFDGTHSVS 354
Query: 361 LGI 363
GI
Sbjct: 355 KGI 357
>gi|326510801|dbj|BAJ91748.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 217/365 (59%), Positives = 280/365 (76%), Gaps = 10/365 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW ++ + ASS +GL RI L KR L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWAISCAVPHHASSRDGLLRINLNKRSLTHKSLAAAKAARQ------- 53
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPSSKCYF
Sbjct: 54 -YGALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +Y+S +S TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FIEATR
Sbjct: 113 SIACYLHPKYRSSRSTTYKADGENCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F+L +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFILGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I ++L++ P+KVC QIGLC F+G +
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILEMLIAQTQPQKVCSQIGLCLFDGTQS 352
Query: 359 VRLGI 363
V GI
Sbjct: 353 VSNGI 357
>gi|255567717|ref|XP_002524837.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223535897|gb|EEF37557.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 456
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 212/356 (59%), Positives = 272/356 (76%), Gaps = 5/356 (1%)
Query: 12 LWVLASCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
LW+ + LLLP ++ L R+GLKK++ D ++ A + KE A
Sbjct: 8 LWI-SFVLLLPVVFSLHNDALVRVGLKKKKFDQVNIPAGTVDFKEGEAMRAATKKYNLVE 66
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLW+PSSKCYFS++CYFHS+
Sbjct: 67 NSDDVDIVELKNYLDAQYYGEIAIGTPPQTFTVIFDTGSSNLWIPSSKCYFSVACYFHSK 126
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
YK+ +S+TY + G S I YG+GSISGFFSQDNV+VGD+V+++Q FIEAT+E +TFL A
Sbjct: 127 YKASESSTYQKNGTSAAIRYGTGSISGFFSQDNVKVGDLVIRNQDFIEATKEPGVTFLAA 186
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EI+VG A+PVW NMV +GLV E+VFSFWLNR+ AEEGGEIVFGG+DP
Sbjct: 187 KFDGILGLGFQEISVGKAIPVWYNMVNEGLVKEQVFSFWLNRNVQAEEGGEIVFGGMDPN 246
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KG+HTYVPVT+KGYWQF++G++LIGN+ TG+C GC AI DSGTSLLAGPT V+T+IN
Sbjct: 247 HYKGQHTYVPVTQKGYWQFDMGEVLIGNEITGLCADGCKAIADSGTSLLAGPTTVITQIN 306
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
HAIG G+VS ECK VV QYG I ++L + P+K+C QIG C F+G + V I
Sbjct: 307 HAIGASGIVSQECKTVVEQYGKFILEMLTAQAQPQKICSQIGFCTFDGTQGVSTNI 362
>gi|384040313|gb|AFH58568.1| aspartic acid protease [Ananas comosus]
Length = 514
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 223/366 (60%), Positives = 281/366 (76%), Gaps = 13/366 (3%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGG 57
L L VL +L AS++GL RIGLKKR +D ++ AAR+ KE RY
Sbjct: 8 LAVAILLSVLLHQSILLASADGLVRIGLKKRPIDENNRIAARLVEKEEGPLLAARRY--- 64
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
G+ G + G+ + DI+ LKN+M+AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCY
Sbjct: 65 -GLRGAPLKEGE-ETDIIALKNYMNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCY 122
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
FSI+C FH++YKS +S++Y + GKS I+YG+G+ISGFFS D+V+VGD+VVK Q FIEAT
Sbjct: 123 FSIACLFHTKYKSGRSSSYHKNGKSASIHYGTGAISGFFSTDHVKVGDLVVKTQDFIEAT 182
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
+E S+TF++A+FDGI+GLGF+EI+VG+AVPVW NMV+QGL+ E VFSFW NR+ + EGG
Sbjct: 183 KEPSVTFVVAKFDGILGLGFQEISVGNAVPVWYNMVDQGLIKEPVFSFWFNRNANDGEGG 242
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
EIVFGG DP H+KG HTYVPVT+KGYWQFE+GD+L+G QSTG C GGCAAI DSGTSLLA
Sbjct: 243 EIVFGGADPNHYKGNHTYVPVTQKGYWQFEMGDVLVGGQSTGFCNGGCAAIADSGTSLLA 302
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPT ++ EIN IG GVVS ECK VV++YG I +L++ + P K+C IGLC F+G +
Sbjct: 303 GPTTIIAEINQKIGASGVVSQECKAVVAEYGQQILQMLLAEVQPGKICSSIGLCTFDGKQ 362
Query: 358 YVRLGI 363
V GI
Sbjct: 363 GVSAGI 368
>gi|224068986|ref|XP_002302872.1| predicted protein [Populus trichocarpa]
gi|222844598|gb|EEE82145.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 216/356 (60%), Positives = 278/356 (78%), Gaps = 6/356 (1%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLN----AARITRKERYMGGAGVSGVR--HRL 67
+L+ ++L A +GL RIGLKK++LD + +E G + + + + +
Sbjct: 4 LLSFPVVLSARDDGLMRIGLKKKKLDHLGRRVVPGSVNFIPEEEGGGASKPAATKKYYNI 63
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
G+++ DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+
Sbjct: 64 GETEADIVALKNYLDAQYYGEITIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYFHSK 123
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
YKS S TY + G S I YG+GSISGFFSQD+VEVGD+VVK+Q FIEAT+E +TFL +
Sbjct: 124 YKSSASTTYVKNGTSAAIQYGTGSISGFFSQDSVEVGDLVVKNQGFIEATKEPGVTFLAS 183
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+GLGF+EI+VG+AVPVW NMV QGLV E+VFSFWLNR+ + EEGGEIVFGGVDP
Sbjct: 184 KFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEKVFSFWLNRNVEGEEGGEIVFGGVDPN 243
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
H+KG+HTYVPVT KGYWQF++GD+LIG ++TG+C GGC AI DSGTSLLAGPT V+T+IN
Sbjct: 244 HYKGEHTYVPVTHKGYWQFDMGDLLIGTETTGLCAGGCKAIADSGTSLLAGPTTVITQIN 303
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
+AIG G+VS ECK VV+QYG +I ++LV+ P KVC QI C F+G + V + I
Sbjct: 304 NAIGASGIVSEECKTVVAQYGKIILEMLVAQAQPRKVCSQISFCTFDGTQGVSMNI 359
>gi|15221141|ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana]
gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor
gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana]
Length = 506
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 214/340 (62%), Positives = 268/340 (78%), Gaps = 7/340 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFMDA 83
++G R+GLKK +LD + AAR+ K+ A +RLGDS D D++ LKN++DA
Sbjct: 27 NDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRA------YRLGDSGDADVVVLKNYLDA 80
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFS++C H +YKS +S+TY + GK+
Sbjct: 81 QYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYEKNGKAA 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF++A+FDGI+GLGF+EI+VG
Sbjct: 141 AIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGFQEISVG 200
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGY
Sbjct: 201 KAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGY 260
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF++GD+LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK V
Sbjct: 261 WQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTV 320
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V QYG I DLL+S P+K+C QIGLC F+G V +GI
Sbjct: 321 VDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGI 360
>gi|326494022|dbj|BAJ85473.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511208|dbj|BAJ87618.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 217/365 (59%), Positives = 280/365 (76%), Gaps = 10/365 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW ++ + ASS +GL RI L KR L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWAISCAVPHHASSRDGLLRINLNKRSLTHESLAAAKAARQ------- 53
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPSSKCYF
Sbjct: 54 -YGALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +Y+S +S TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FIEATR
Sbjct: 113 SIACYLHPKYRSSRSTTYKADGENCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F+L +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFILGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPHHYKGNHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I ++L++ P+KVC QIGLC F+G +
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILEMLIAQTQPQKVCSQIGLCLFDGTQS 352
Query: 359 VRLGI 363
V GI
Sbjct: 353 VSNGI 357
>gi|1354272|gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 214/340 (62%), Positives = 268/340 (78%), Gaps = 7/340 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFMDA 83
++G R+GLKK +LD + AAR+ K+ A +RLGDS D D++ LKN++DA
Sbjct: 7 NDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRA------YRLGDSGDADVVVLKNYLDA 60
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFS++C H +YKS +S+TY + GK+
Sbjct: 61 QYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYEKNGKAA 120
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF++A+FDGI+GLGF+EI+VG
Sbjct: 121 AIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGFQEISVG 180
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGY
Sbjct: 181 KAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGY 240
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF++GD+LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK V
Sbjct: 241 WQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTV 300
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V QYG I DLL+S P+K+C QIGLC F+G V +GI
Sbjct: 301 VDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGI 340
>gi|449466825|ref|XP_004151126.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 513
Score = 453 bits (1166), Expect = e-125, Method: Compositional matrix adjust.
Identities = 219/363 (60%), Positives = 283/363 (77%), Gaps = 4/363 (1%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
KL +V + ++ AS++G RIGLK+R+ ++ A++I KE V
Sbjct: 6 KLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKY 65
Query: 64 R--HRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD+ DI+ LKN+++AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKC FS+
Sbjct: 66 QPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC-FSV 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS+YKS++S+TY + GKS I YG+G+ISG+FS+DNV+VGD++VK Q FIEATRE
Sbjct: 125 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIEATREP 184
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GGEIV
Sbjct: 185 SLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIV 244
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLAGPT
Sbjct: 245 FGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPT 304
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
++T++NHAIG GVVS ECK VV++YG+ I +L++ P+K+C +GLCAF+G V
Sbjct: 305 TIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGERGVS 364
Query: 361 LGI 363
+GI
Sbjct: 365 MGI 367
>gi|3551952|gb|AAC34854.1| senescence-associated protein 4 [Hemerocallis hybrid cultivar]
Length = 517
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 221/360 (61%), Positives = 274/360 (76%), Gaps = 15/360 (4%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE------RYMGGAGVSGVRH 65
L +L L L AS+ GL RI LKK+ D S ++R++ E RY G+R
Sbjct: 14 LSMLVFQLALSASAEGLVRINLKKKPFDEKSRVSSRLSADEDEPLKARY-------GLRG 66
Query: 66 RLGDSDE--DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
L D + DI+ LKN+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+C
Sbjct: 67 GLNDGADSTDIISLKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACL 126
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++YKS +S+TY + GK I+YG+G+I+G+FS+D+VE+GD VVK Q FIEAT+E +T
Sbjct: 127 LHTKYKSGRSSTYHKNGKPAAIHYGTGAIAGYFSEDHVELGDFVVKGQEFIEATKEPGVT 186
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDGI+GLGF+EI+VG AVP+W NMVEQGLV E VFSFWLNR + EGGEIVFGG
Sbjct: 187 FLVAKFDGILGLGFKEISVGGAVPLWYNMVEQGLVKEAVFSFWLNRKSEDGEGGEIVFGG 246
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDP H KG+H YVPVT+KGYWQF++GD+L+G QSTG CEGGCAAI DSGTSL+AGPT V+
Sbjct: 247 VDPSHHKGEHVYVPVTQKGYWQFDMGDVLVGGQSTGFCEGGCAAIADSGTSLIAGPTTVI 306
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
TEINH IG GVVS ECK VV QYG I D+L++ P K+C QIGLC F+G V +GI
Sbjct: 307 TEINHKIGAAGVVSQECKAVVQQYGQQILDMLIAQTQPMKICSQIGLCTFDGTRGVSMGI 366
>gi|297837199|ref|XP_002886481.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332322|gb|EFH62740.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 214/354 (60%), Positives = 270/354 (76%), Gaps = 2/354 (0%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS- 70
+W L + ++G R+GLKK +LD ++ A R K+ + + + LG
Sbjct: 14 VWFLLFFTVSSQRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLPSYNNNLGSDS 73
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
D DI+PLKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SC+FH+++K
Sbjct: 74 GDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCFFHAKFK 133
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S+TY + GK I+YGSGSISGFFS D V VGD+VVKDQ FIEAT E LTFL+A+F
Sbjct: 134 SSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIEATSEPGLTFLVAKF 193
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DG++GLGF+EIAVG+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF
Sbjct: 194 DGLLGLGFQEIAVGNATPVWYNMLKQGLIERPVFSFWLNRDPKSEEGGEIVFGGVDPKHF 253
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
KG+HT+VPVT++GYWQF++G++LI STG C GC+AI DSGTSLLAGPT V+ IN A
Sbjct: 254 KGEHTFVPVTQRGYWQFDMGEVLIAGDSTGYCGSGCSAIADSGTSLLAGPTAVIAMINKA 313
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IG GVVS +CK VV QYG I DLL++ P+K+C QIGLCAF+G V +GI
Sbjct: 314 IGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAFDGTHGVSMGI 367
>gi|356555682|ref|XP_003546159.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 507
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/359 (60%), Positives = 269/359 (74%), Gaps = 14/359 (3%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-- 67
FCLW L L+ A ++GLRRIGLKK +LD + + R S +H L
Sbjct: 12 FCLWTLLFPLVFCAPNDGLRRIGLKKVKLDTDDVVGFKEFRS---------SIRKHHLQN 62
Query: 68 ---GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
G D D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C+
Sbjct: 63 ILGGAEDTDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACFM 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H+RY+S +S+TY E G S I YG+G+ISGFFS D+V+VGD+VVKDQ FIEATRE +TF
Sbjct: 123 HARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIEATREPGVTF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLGF+EI+VG AVPVW MVEQGLV + VFSFWLNR P+ E GGE+VFGG
Sbjct: 183 VAAKFDGILGLGFQEISVGYAVPVWYTMVEQGLVKDPVFSFWLNRKPEEENGGELVFGGA 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+KGKHTYVPVT+KGYWQF++GD+LI + TG C C+AI DSGTSLLAGPT V+T
Sbjct: 243 DPAHYKGKHTYVPVTRKGYWQFDMGDVLISGKPTGYCTNDCSAIADSGTSLLAGPTTVIT 302
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IN AIG GVVS EC+ VV+QYG I +LL++ P+K+C QIGLC F+G V +GI
Sbjct: 303 MINQAIGAAGVVSKECRSVVNQYGQTILELLLAEAKPKKICSQIGLCTFDGTHGVSMGI 361
>gi|297849560|ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 506
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/340 (62%), Positives = 267/340 (78%), Gaps = 7/340 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-DEDILPLKNFMDA 83
++G R+GLKK +LD + AAR+ K+ A + LG+S D D++ LKN++DA
Sbjct: 27 NDGTFRVGLKKLKLDSKNRLAARVESKQDKPLRA------YSLGNSEDADVVVLKNYLDA 80
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFS++C H +YKS +S+TY + GKS
Sbjct: 81 QYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYEKNGKSA 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF++A+FDGI+GLGF+EI+VG
Sbjct: 141 AIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGFQEISVG 200
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
+A PVW NM++QGL+ E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGY
Sbjct: 201 NATPVWYNMLKQGLIKEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGY 260
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF++GD+LIG TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK V
Sbjct: 261 WQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTV 320
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V QYG I DLL+S P+K+C QIGLC F+G V +GI
Sbjct: 321 VDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVSMGI 360
>gi|225460913|ref|XP_002279049.1| PREDICTED: aspartic proteinase [Vitis vinifera]
gi|297737462|emb|CBI26663.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 215/363 (59%), Positives = 273/363 (75%), Gaps = 6/363 (1%)
Query: 7 RSVFCLWVLASCLLLP---ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
R+V L+ + P AS GL RIGLKKR D + AARI K+ G +
Sbjct: 6 RTVAVALFLSILMFSPEFSASDGGLVRIGLKKRAFDQTNRLAARIESKQGEALGTSIRKY 65
Query: 64 R---HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ G ++ L N+MDAQYFGEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFS+
Sbjct: 66 NLHGNAAGSKHTYVVALHNYMDAQYFGEISIGTPPQKFTVIFDTGSSNLWVPSSKCYFSV 125
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+YKS +S+TY + G S +I+YG+G+ISGFFS+D+V+VGD+ V +Q FIEAT+E
Sbjct: 126 ACYFHSKYKSSQSSTYKKNGTSADIHYGTGAISGFFSKDDVKVGDLAVINQEFIEATKEP 185
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
S+TF LA+FDGI+GLGF+EI+VG+AVPVW NM+ Q L+ E +FSFW NR+ + E GGEIV
Sbjct: 186 SITFALAKFDGILGLGFQEISVGNAVPVWYNMINQELIKEPIFSFWFNRNSNEEVGGEIV 245
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG+D H+KGKHTYVPVTKKGYWQF+LGD++IG ++TG C GC+AI DSGTSLLAGPT
Sbjct: 246 FGGIDSDHYKGKHTYVPVTKKGYWQFDLGDVMIGGKTTGFCASGCSAIADSGTSLLAGPT 305
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
++TE+NHAIG G VS EC+ VV QYG +I D+L++ P+K+C QIGLCAFNG V
Sbjct: 306 TIITEVNHAIGASGFVSQECRAVVQQYGQIIIDMLLTKEQPQKICSQIGLCAFNGIRGVS 365
Query: 361 LGI 363
+GI
Sbjct: 366 MGI 368
>gi|73912435|dbj|BAE20414.1| aspartic proteinase [Triticum aestivum]
Length = 498
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 215/365 (58%), Positives = 278/365 (76%), Gaps = 10/365 (2%)
Query: 1 MEQKLLRSVF-CLWVLASCLLLPASS-NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
M Q+LL V CLW L+ + ASS +GL RI L K+ L SL AA+ R+
Sbjct: 1 MGQRLLLLVTTCLWALSCAVPHHASSRDGLLRINLNKKSLTHESLAAAKAARQH------ 54
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+R + G+SD DI+PL ++++ QY+G IG+G+PPQNF+VIFDTGSSNLWVPS+KCYF
Sbjct: 55 --DALRLKSGNSDSDIVPLVDYLNTQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSAKCYF 112
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY H +YKS KS+TY G++C+I YGSG+ISGFFS DNV VGD+VVK+Q FI TR
Sbjct: 113 SIACYLHPKYKSSKSSTYKADGETCKITYGSGAISGFFSNDNVLVGDLVVKNQKFIGTTR 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E S++F++ +FDGI+GLG+ +I+VG A PVW +M EQ L++++VFSFWLNRD DA GGE
Sbjct: 173 ETSVSFIVGKFDGILGLGYPDISVGKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGE 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG+DP H+KG HTYVPV+++GYWQF +GD+LI STG C GCAAIVDSGTSLLAG
Sbjct: 233 LVFGGMDPDHYKGNHTYVPVSRRGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAG 292
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
PT +V ++NHAIG EG++S ECK VVSQYG++I +LL++ P+KVC QIGLC F+G
Sbjct: 293 PTAIVAQVNHAIGAEGIISTECKEVVSQYGEMILELLIAQTQPQKVCSQIGLCLFDGTHS 352
Query: 359 VRLGI 363
V GI
Sbjct: 353 VSNGI 357
>gi|223946977|gb|ACN27572.1| unknown [Zea mays]
gi|238014788|gb|ACR38429.1| unknown [Zea mays]
gi|413946556|gb|AFW79205.1| aspartic proteinase oryzasin-1 isoform 1 [Zea mays]
gi|413946557|gb|AFW79206.1| aspartic proteinase oryzasin-1 isoform 2 [Zea mays]
Length = 510
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/343 (63%), Positives = 266/343 (77%), Gaps = 6/343 (1%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKER---YMGGAGVSGVRHRLGDSDEDILPLKNF 80
SS GL R+ LKK +D + AAR++ +ER + GA G GD D D++ LKN+
Sbjct: 24 SSEGLVRVALKKLPVDQNGRVAARLSAEERQRLLLRGANALGSG---GDDDSDVIALKNY 80
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
M+AQYFGEIG+GSP Q F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + G
Sbjct: 81 MNAQYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNG 140
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI
Sbjct: 141 KPAAIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEI 200
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+VG+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+
Sbjct: 201 SVGNATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTR 260
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS EC
Sbjct: 261 KGYWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQEC 320
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
K VVSQYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 321 KTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 363
>gi|226503984|ref|NP_001148782.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195622118|gb|ACG32889.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 510
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 215/340 (63%), Positives = 263/340 (77%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
SS GL R+ LKK +D + AAR++ +ER S GD D D++ LKN+M+A
Sbjct: 24 SSEGLVRVALKKLPVDQNGRVAARLSAEERQRLLLRGSNALGSGGDDDSDVIALKNYMNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYFGEIG+GSP Q F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + GK
Sbjct: 84 QYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPA 143
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FDGI+GLGF+EI+VG
Sbjct: 144 AIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVG 203
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGY
Sbjct: 204 NATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGY 263
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK V
Sbjct: 264 WQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTV 323
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
VSQYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 324 VSQYGQQILDLLLAETQPTKICSQVGLCTFDGTHGVSAGI 363
>gi|22330379|ref|NP_176419.2| phytepsin [Arabidopsis thaliana]
gi|79320483|ref|NP_001031219.1| phytepsin [Arabidopsis thaliana]
gi|75331143|sp|Q8VYL3.1|APA2_ARATH RecName: Full=Aspartic proteinase A2; AltName: Full=Aspartic
protease 57; Short=AtASP57; Flags: Precursor
gi|17979428|gb|AAL49856.1| putative aspartic protease [Arabidopsis thaliana]
gi|23297031|gb|AAN13225.1| putative aspartic protease [Arabidopsis thaliana]
gi|222424000|dbj|BAH19961.1| AT1G62290 [Arabidopsis thaliana]
gi|332195825|gb|AEE33946.1| phytepsin [Arabidopsis thaliana]
gi|332195826|gb|AEE33947.1| phytepsin [Arabidopsis thaliana]
Length = 513
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 214/341 (62%), Positives = 268/341 (78%), Gaps = 2/341 (0%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DS-DEDILPLKNFMD 82
++G R+GLKK +LD ++ A R K+ + + + LG DS D DI+PLKN++D
Sbjct: 27 NDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLRSYNNNLGGDSGDADIVPLKNYLD 86
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SCYFH++YKS +S+TY + GK
Sbjct: 87 AQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCYFHAKYKSSRSSTYKKSGKR 146
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I+YGSGSISGFFS D V VGD+VVKDQ FIE T E LTFL+A+FDG++GLGF+EIAV
Sbjct: 147 AAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIETTSEPGLTFLVAKFDGLLGLGFQEIAV 206
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++G
Sbjct: 207 GNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRG 266
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQF++G++LI +STG C GC+AI DSGTSLLAGPT VV IN AIG GVVS +CK
Sbjct: 267 YWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTAVVAMINKAIGASGVVSQQCKT 326
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
VV QYG I DLL++ P+K+C QIGLCA++G V +GI
Sbjct: 327 VVDQYGQTILDLLLAETQPKKICSQIGLCAYDGTHGVSMGI 367
>gi|356532081|ref|XP_003534602.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 216/359 (60%), Positives = 270/359 (75%), Gaps = 14/359 (3%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-- 67
FCLW L L+ A ++GL RIGLKK +L+ H + + R S +H L
Sbjct: 12 FCLWTLLFSLVFCAPNDGLGRIGLKKVKLNTHDVEGLKEFRS---------SIRKHHLQN 62
Query: 68 ---GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
G + D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKCYFSI+C+
Sbjct: 63 ILGGAEETDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACFM 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H+RY+S +S+TY E G S I YG+G+ISGFFS D+V+VGD+VVKDQ FIEATRE +TF
Sbjct: 123 HARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIEATREPGVTF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLGF++I+VG AVPVW +MVEQGLV + VFSFWLNR P+ E GGE+VFGG
Sbjct: 183 VAAKFDGILGLGFQDISVGYAVPVWYSMVEQGLVKDPVFSFWLNRKPEEENGGELVFGGA 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+KGKHTYVPVT+KGYWQF++GD+LI + TG C C+AI DSGTSLLAGPT VVT
Sbjct: 243 DPAHYKGKHTYVPVTRKGYWQFDMGDVLIAGKPTGYCADDCSAIADSGTSLLAGPTTVVT 302
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IN AIG GVVS EC+ VV+QYG I +LL++ P+K+C QIGLC F+G V +GI
Sbjct: 303 MINQAIGASGVVSKECRSVVNQYGQTILELLLAEAKPKKICSQIGLCTFDGTHGVSMGI 361
>gi|12231176|dbj|BAB20971.1| aspartic proteinase 3 [Nepenthes alata]
Length = 507
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 232/364 (63%), Positives = 284/364 (78%), Gaps = 10/364 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+ S+F +L L+ S++GL RIGLKK+ D ++ AAR+ +E G A S +R
Sbjct: 1 MPSLFVFIILLP-LVFSDSNDGLLRIGLKKKIFDQNNRIAARLETEE---GEARRSSLRK 56
Query: 66 -----RLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
LG+ +E DI+ LKN+MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS
Sbjct: 57 YYLHGNLGNPEETDIVALKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFS 116
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+ CYFH++YKS S++Y + GKS +I+YG+G+ISGFFS+DNV+VGD+ VK Q FIEA+RE
Sbjct: 117 VPCYFHAKYKSSISSSYKKNGKSADIHYGTGAISGFFSEDNVQVGDLAVKAQEFIEASRE 176
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
S+TFL+A+FDGI+GLGF+EI+VG+A PVW NMV QGLV E VFSFWLNR EEGGEI
Sbjct: 177 PSVTFLVAKFDGILGLGFQEISVGNATPVWYNMVNQGLVKEPVFSFWLNRKVGEEEGGEI 236
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDP HFKG H+YVPVT KGYWQF++GD+LI ++T CEGGC+AI DSGTSLLAGP
Sbjct: 237 VFGGVDPNHFKGTHSYVPVTHKGYWQFDMGDVLIDGKATEYCEGGCSAIADSGTSLLAGP 296
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T VVT INHAIG GVVS ECK VVSQYG I DLL++ + PEK+C QIGLC F+G V
Sbjct: 297 TSVVTMINHAIGATGVVSEECKAVVSQYGQTIMDLLLAEVSPEKICSQIGLCTFDGTRGV 356
Query: 360 RLGI 363
+GI
Sbjct: 357 SIGI 360
>gi|255556616|ref|XP_002519342.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
gi|223541657|gb|EEF43206.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
Length = 500
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 214/348 (61%), Positives = 271/348 (77%), Gaps = 13/348 (3%)
Query: 10 FCLWVLASCL---LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
F ++A CL L +SS+ L +IGLKKRRLDL+S+NAARIT ++
Sbjct: 9 FRFLLVALCLGAWLGASSSSRLVKIGLKKRRLDLYSINAARIT----------IADASAS 58
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
G D++ LKN++D QY+GE+ IGSPPQ F+V+FDTGSSNLWVPSSKC SI+CYFHS
Sbjct: 59 FGWPKADVVYLKNYLDTQYYGEVAIGSPPQTFTVVFDTGSSNLWVPSSKCVLSITCYFHS 118
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+++++ S TYT+IG C+I+YGSGSISGFFSQD V++GD V+DQ F+E TREG L FL
Sbjct: 119 KFRAKMSRTYTKIGLPCKIDYGSGSISGFFSQDYVKLGDATVRDQEFVEVTREGLLAFLG 178
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+FDGI+GLGF+EI VG A PVW NMV QG V++++FS WLNRDP A GGEIVFGG+D
Sbjct: 179 TQFDGILGLGFQEITVGQATPVWYNMVRQGHVNQKLFSLWLNRDPTAGMGGEIVFGGLDW 238
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+HF+G+HTYVPVT+KGYWQ E+GD+ I +STG+CE GCAAIVDSGTS +AGPT +VT+I
Sbjct: 239 RHFRGEHTYVPVTEKGYWQIEVGDVFIAKKSTGMCEYGCAAIVDSGTSFIAGPTTIVTQI 298
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
NHAIG +G+VS ECK VV+++GDLIW+ L+SGL PE VC IGLC +N
Sbjct: 299 NHAIGAQGIVSLECKSVVTKFGDLIWESLISGLRPEIVCVDIGLCVYN 346
>gi|109675118|gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
Length = 508
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 211/343 (61%), Positives = 265/343 (77%), Gaps = 2/343 (0%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
++ S++GL R+G+KKR+LD +N A A + +GDSD DI+ LK
Sbjct: 21 MVFSVSNDGLIRVGIKKRKLD--QINQAFGGIDSNGANSARTYHLGGNIGDSDTDIIALK 78
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N++DAQYFGEI IGSPPQ F+VIFDTGSSNLWVPS++CYFS++CY H +YKS S+TY +
Sbjct: 79 NYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARCYFSLACYLHPKYKSSHSSTYKK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G S I YG+GSISG+FS DNV+VGD++VKDQ FIEATRE +TFL A+FDGI+GLGF+
Sbjct: 139 NGTSAAIRYGTGSISGYFSNDNVKVGDLIVKDQDFIEATREPGITFLAAKFDGILGLGFQ 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
EI+VG +VPVW NMV QGLV + VFSFW NR+ EEGGE+VFGGVDP HFKGKHTYVPV
Sbjct: 199 EISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELVFGGVDPNHFKGKHTYVPV 258
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T KGYWQF++GD+L+G ++TG C GGC+AI DSGTSLLAGPT ++T+INH IG GVVS
Sbjct: 259 THKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPTTIITQINHVIGASGVVSQ 318
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRL 361
ECK +V++YG I DLL S P+K+C QIGLC+ +G+ V +
Sbjct: 319 ECKSLVTEYGKTILDLLESKAAPQKICSQIGLCSSDGSRDVSM 361
>gi|223929912|gb|ACN24614.1| aspartic acid protease [Phaseolus vulgaris]
Length = 513
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 217/360 (60%), Positives = 278/360 (77%), Gaps = 8/360 (2%)
Query: 10 FCLWVLASCLLLPASS----NGLRRIGLKKRRLDLHSLNAARI-TRKERYMGGAGVSGVR 64
+CL+V + LLL A S +GLRRIGLKK +LD ++ AARI ++ + + ++
Sbjct: 10 WCLFV--TTLLLSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSKDDSFRASIRKFHLQ 67
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ G + D DI+ LKN++DAQYFGEI IG+ PQ F+VIFDTGSSNLWVPSS C FS++CY
Sbjct: 68 NNFGGTEDTDIVALKNYLDAQYFGEIAIGTSPQKFTVIFDTGSSNLWVPSSLCTFSVACY 127
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH++Y+S KS+TY + G + I YG+G+ISGFFS D+V VGD+VVK Q FIEATRE +
Sbjct: 128 FHAKYRSSKSSTYKKNGTAAAIQYGTGAISGFFSYDSVRVGDIVVKSQEFIEATREPGVV 187
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFW NR P+ EEGGEIVFGG
Sbjct: 188 FLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWFNRKPEEEEGGEIVFGG 247
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDP H+KGKHTYVPVT+KGYW+F++GD+LIG + TG C GC AI DSGTSLLAGPT ++
Sbjct: 248 VDPAHYKGKHTYVPVTRKGYWRFDMGDVLIGGKPTGYCADGCLAIADSGTSLLAGPTTII 307
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG G++S ECK VV++YG I +LL++ P+K+C QIGLC F+G + +GI
Sbjct: 308 TMINHAIGAAGIMSQECKTVVAEYGQTILNLLLAETQPKKICSQIGLCTFDGTRGIDMGI 367
>gi|449503193|ref|XP_004161880.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 218/366 (59%), Positives = 284/366 (77%), Gaps = 7/366 (1%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
KL +V + ++ AS++G RIGLK+R+ ++ A++I KE V
Sbjct: 6 KLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKNSVEKY 65
Query: 64 R--HRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD+ DI+ LKN+++AQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKC FS+
Sbjct: 66 QPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC-FSV 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQV---FIEAT 177
+C HS+YKS++S+TY + GKS I YG+G+ISG+FS+DNV+VGD++VK++ FIEAT
Sbjct: 125 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKNRSLFDFIEAT 184
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
RE SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GG
Sbjct: 185 REPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGG 244
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
EIVFGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLA
Sbjct: 245 EIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLA 304
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPT ++T++NHAIG GVVS ECK VV++YG+ I +L++ P+K+C +GLCAF+G
Sbjct: 305 GPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPKKICSTLGLCAFDGER 364
Query: 358 YVRLGI 363
V +GI
Sbjct: 365 GVSMGI 370
>gi|261264941|gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
Length = 513
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 221/344 (64%), Positives = 279/344 (81%), Gaps = 3/344 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGDSDE-DILPLKN 79
AS+ GL RIGLKK +LD ++ AA++ K+ + A + +R GD ++ DI+ LKN
Sbjct: 25 ASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YKS S+TY +
Sbjct: 85 YMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GK +I+YG+G+ISG+FSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFKE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LI Q+TG C GC+AI DSGTSLLAGPT ++TE+NHAIG GVVS E
Sbjct: 265 QKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
CK VV++YG+ I +L+ P K+C QIGLC F+G V + I
Sbjct: 325 CKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGVRGVSMDI 368
>gi|297809619|ref|XP_002872693.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318530|gb|EFH48952.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 278/360 (77%), Gaps = 9/360 (2%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G+
Sbjct: 8 FLLVFLLSCLILISTALCERKGDGTIRIGLKKRKLDRSNRLASQLFLKNR---GSWSPKD 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D++ D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 65 YFRLNDANADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 124
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 125 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATTEPGIT 184
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV + VFSFWLNR+P +EGGEIVFGG
Sbjct: 185 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKDPVFSFWLNRNPQDQEGGEIVFGG 244
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HTYVPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 245 VDPKHFKGEHTYVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 304
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG +G+VS ECK VV QYG + + L++ P+KVC QIG+CA++G V + I
Sbjct: 305 TMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPKKVCSQIGVCAYDGTHSVSMDI 364
>gi|20800441|gb|AAB03843.2| aspartic proteinase [Vigna unguiculata]
gi|33339734|gb|AAQ14346.1| aspartic proteinase [Vigna unguiculata]
Length = 513
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 215/363 (59%), Positives = 278/363 (76%), Gaps = 6/363 (1%)
Query: 7 RSVFCLWVLASCLLLPASS----NGLRRIGLKKRRLDLHSLNAARI-TRKERYMGGAGVS 61
++V L + + LL A S +GLRRIGLKK +LD ++ AARI + + +
Sbjct: 5 KNVISLCLFVTTLLFSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSNDDSFRASIRKF 64
Query: 62 GVRHRL-GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+++ G + DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPSS+C FS+
Sbjct: 65 HLQNNFAGTGETDIVALKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSRCTFSL 124
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFH++Y+S +S+TY G + I YG+G+I+GFFS DNV VGD+VVK+Q FIEATRE
Sbjct: 125 ACYFHAKYRSGRSSTYRRNGTAAAIQYGTGAIAGFFSYDNVRVGDIVVKNQEFIEATREP 184
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ FL A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFWLNR + EEGGE+V
Sbjct: 185 GVVFLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWLNRKTEEEEGGELV 244
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVT+KGYWQF++GD+LIG + TG C GGCAAI DSGTSLLAGPT
Sbjct: 245 FGGVDPAHYKGEHTYVPVTRKGYWQFDMGDVLIGGKPTGYCAGGCAAIADSGTSLLAGPT 304
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
++T INHAIG GV+S ECK VV++YG I +LL++ P+K+C QIGLC F+G V
Sbjct: 305 AIITMINHAIGASGVMSQECKTVVAEYGQTILNLLLAETQPKKICSQIGLCTFDGTRGVD 364
Query: 361 LGI 363
+GI
Sbjct: 365 MGI 367
>gi|15233518|ref|NP_192355.1| phytepsin [Arabidopsis thaliana]
gi|75338508|sp|Q9XEC4.1|APA3_ARATH RecName: Full=Aspartic proteinase A3; Flags: Precursor
gi|4773885|gb|AAD29758.1|AF076243_5 putative aspartic protease [Arabidopsis thaliana]
gi|13937238|gb|AAK50111.1|AF372974_1 AT4g04460/T26N6_7 [Arabidopsis thaliana]
gi|7267203|emb|CAB77914.1| putative aspartic protease [Arabidopsis thaliana]
gi|332656990|gb|AEE82390.1| phytepsin [Arabidopsis thaliana]
Length = 508
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 277/360 (76%), Gaps = 8/360 (2%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G
Sbjct: 8 FLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNR--GSHWSPKH 65
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D + D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 66 YFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 125
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 126 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATSEPGIT 185
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV E +FSFWLNR+P EGGEIVFGG
Sbjct: 186 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGG 245
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HT+VPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 246 VDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 305
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG +G+VS ECK VV QYG + + L++ P+KVC QIG+CA++G + V +GI
Sbjct: 306 TMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPKKVCSQIGVCAYDGTQSVSMGI 365
>gi|261264943|gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
Length = 513
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/344 (63%), Positives = 279/344 (81%), Gaps = 3/344 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGDSDE-DILPLKN 79
AS+ GL RIGLKK +LD ++ AA++ K+ + A + +R GD ++ DI+ LKN
Sbjct: 25 ASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDAQYFGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFS++CYFHS+YKS S+TY +
Sbjct: 85 YMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
GK +I+YG+G+ISG+FSQD+V+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+E
Sbjct: 145 GKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFKE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGLV E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LI Q+TG C C+AI DSGTSLLAGPT ++TE+NHAIG GVVS E
Sbjct: 265 QKGYWQFDMGDVLIDGQTTGFCVTTCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
CK VV++YG+ I +L+ P K+C QIGLC F+G + V + I
Sbjct: 325 CKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGTQGVSMDI 368
>gi|356556454|ref|XP_003546541.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 505
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 213/355 (60%), Positives = 266/355 (74%), Gaps = 6/355 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNG-LRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M+ K L C+W + S++G L RIGLK+R LDL L AARI + G
Sbjct: 1 MDFKYLLVGMCVWAWFGSITFATSNDGRLMRIGLKRRTLDLQCLKAARIKEAGHHRDLGG 60
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V+ DEDI+ LKN++DAQYFGEI IGSPPQ F+V+FDTGSSNLWVPSSKC FS
Sbjct: 61 VN-----RNCCDEDIVYLKNYLDAQYFGEISIGSPPQYFNVVFDTGSSNLWVPSSKCIFS 115
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CYFHS+Y+S+ S+TYTEIG C+I YG GSI GFFSQDNV+VGD+++KDQ F E TRE
Sbjct: 116 IACYFHSKYRSKISSTYTEIGIPCKIPYGQGSIFGFFSQDNVQVGDIIIKDQEFAEITRE 175
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
GSL FDGI+GLGF++ +VG PVW NM+E GL+S ++FS WLN+DP E GGEI
Sbjct: 176 GSLALPALPFDGILGLGFQDTSVGKVTPVWYNMLEGGLISHKIFSLWLNQDPSEEMGGEI 235
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGG+D +HF+G+HTYVP+++KGYWQ +LGDIL+ N STG+CEGGCAA+VDSGTSL+AGP
Sbjct: 236 VFGGIDYRHFRGEHTYVPLSQKGYWQIDLGDILLANNSTGLCEGGCAAVVDSGTSLIAGP 295
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
T VVT+INHAIG EG S ECK ++ YGD IW+ L++GL P+ +C IG C+ N
Sbjct: 296 TTVVTQINHAIGAEGYTSFECKSILHNYGDSIWESLIAGLYPDIICSAIGFCSNN 350
>gi|334186351|ref|NP_001190671.1| phytepsin [Arabidopsis thaliana]
gi|332656991|gb|AEE82391.1| phytepsin [Arabidopsis thaliana]
Length = 504
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 210/360 (58%), Positives = 276/360 (76%), Gaps = 12/360 (3%)
Query: 10 FCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
F L L SCL+L +++ +G RIGLKKR+LD + A+++ K R G
Sbjct: 8 FLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNR--GSHWSPKH 65
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
RL D + D++PLKN++DAQY+G+I IG+PPQ F+VIFDTGSSNLW+PS+KCY S++CY
Sbjct: 66 YFRLNDENADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKCYLSVACY 125
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FHS+YK+ +S++Y + GK I YG+G+ISG+FS D+V+VGD+VVK+Q FIEAT E +T
Sbjct: 126 FHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEFIEATSEPGIT 185
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FLLA+FDGI+GLGF+EI+VG++ PVW NMVE+GLV E +FSFWLNR+P EGGEIVFGG
Sbjct: 186 FLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKDPEGGEIVFGG 245
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKHFKG+HT+VPVT KGYWQF++GD+ I + TG C GC+AI DSGTSLL GP+ V+
Sbjct: 246 VDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGTSLLTGPSTVI 305
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG +G+VS ECK VV QYG +++ LL +KVC QIG+CA++G + V +GI
Sbjct: 306 TMINHAIGAQGIVSRECKAVVDQYG----KTMLNSLLAQKVCSQIGVCAYDGTQSVSMGI 361
>gi|2160151|gb|AAB60773.1| Strong similarity to Brassica aspartic protease (gb|X77260)
[Arabidopsis thaliana]
Length = 433
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 214/354 (60%), Positives = 268/354 (75%), Gaps = 15/354 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DS-DEDILPLKNFMD 82
++G R+GLKK +LD ++ A R K+ + + + LG DS D DI+PLKN++D
Sbjct: 27 NDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRSSLRSYNNNLGGDSGDADIVPLKNYLD 86
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC+FS+SCYFH++YKS +S+TY + GK
Sbjct: 87 AQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKCFFSLSCYFHAKYKSSRSSTYKKSGKR 146
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I+YGSGSISGFFS D V VGD+VVKDQ FIE T E LTFL+A+FDG++GLGF+EIAV
Sbjct: 147 AAIHYGSGSISGFFSYDAVTVGDLVVKDQEFIETTSEPGLTFLVAKFDGLLGLGFQEIAV 206
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+A PVW NM++QGL+ VFSFWLNRDP +EEGGEIVFGGVDPKHF+G+HT+VPVT++G
Sbjct: 207 GNATPVWYNMLKQGLIKRPVFSFWLNRDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRG 266
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT-------------PVVTEINHA 309
YWQF++G++LI +STG C GC+AI DSGTSLLAGPT VV IN A
Sbjct: 267 YWQFDMGEVLIAGESTGYCGSGCSAIADSGTSLLAGPTVSKYHEFIVLFQLAVVAMINKA 326
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
IG GVVS +CK VV QYG I DLL++ P+K+C QIGLCA++G V +GI
Sbjct: 327 IGASGVVSQQCKTVVDQYGQTILDLLLAETQPKKICSQIGLCAYDGTHGVSMGI 380
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/341 (64%), Positives = 277/341 (81%), Gaps = 7/341 (2%)
Query: 28 LRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHRLGDSDE-DILPLKNFMD 82
L R+GLKKR+LD SL + KE G+ + LG+SD+ DI+ LKN+M+
Sbjct: 30 LLRVGLKKRKLDQINRFSSLYGCK--GKESINPAIRKYGLGNGLGNSDDADIISLKNYMN 87
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
AQYFGEIGIG+PPQ F++IFDTGSSNLWVPS+KCYFSI+CYFHS+YKS S++YT+ GKS
Sbjct: 88 AQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAKCYFSIACYFHSKYKSSLSSSYTKNGKS 147
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
EI+YG+G+ISGFFSQD+V++GD+VV++Q FIEATRE S+TF+ A+FDGI+GLGF+EI+V
Sbjct: 148 AEIHYGTGAISGFFSQDHVKLGDLVVENQDFIEATREPSITFVAAKFDGILGLGFQEISV 207
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
G+AVPVW NMV+QGLV+E VFSFWLNR+ EEGGEIVFGGVDP H+KG+HT+VPVT KG
Sbjct: 208 GNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEEGGEIVFGGVDPNHYKGEHTFVPVTHKG 267
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQF++ D+L+G ++TG C GGC+AI DSGTSLLAGPT +V +INHAIG GVVS ECK
Sbjct: 268 YWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIVAQINHAIGASGVVSQECKA 327
Query: 323 VVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
VV+QYG I D+L+S P+K+C QIGLC F+G V +GI
Sbjct: 328 VVAQYGTAILDMLISETQPKKICSQIGLCTFDGKRGVSVGI 368
>gi|148910494|gb|ABR18322.1| unknown [Picea sitchensis]
Length = 471
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 219/357 (61%), Positives = 269/357 (75%), Gaps = 17/357 (4%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-------GVRHRLGDSDE--- 72
A+++ L RI LKK+ LD +L AARI +E G+S G+R L S+
Sbjct: 19 AANDCLARIELKKKGLDQKTLQAARIVARE-----GGLSNEVNRKYGLRGGLSYSESARG 73
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+ +PLKN++DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPS+KCY SI+CYFHS+YK+ +
Sbjct: 74 EYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHSKYKASQ 133
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S++Y GK I YGSGS+SG+ QD+V GD+VVKDQVF E T+E LTFL A+FDGI
Sbjct: 134 SSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF++I+VG+ VPVW NMV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGK
Sbjct: 194 LGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
HTYVPVT++GYWQF +GD LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG
Sbjct: 254 HTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGA 313
Query: 313 EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPITRVL 369
G+ S ECK VVSQYGDLI +LL++ P+KVC QIGLC +G V G+ I VL
Sbjct: 314 SGLASQECKSVVSQYGDLIMELLMAQTNPQKVCSQIGLCLSDGTRDV--GMRIASVL 368
>gi|302144105|emb|CBI23210.3| unnamed protein product [Vitis vinifera]
Length = 429
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 205/283 (72%), Positives = 245/283 (86%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQYFGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ CYFHS+YKS +S+TY + G
Sbjct: 1 MDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
KS +I+YG+G+ISGFFS+DNV+VGD+VVK+Q FIEATRE S+TFL+A+FDGI+GLGF+EI
Sbjct: 61 KSADIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+VG+AVPVW NMV+QGLV E VFSFWLNR D +EGGE+VFGGVDP HFKG+HTYVPVT+
Sbjct: 121 SVGNAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQ 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQF++G++LI ++TG C GGCAAI DSGTSLLAGPT VV INHAIG GVVS EC
Sbjct: 181 KGYWQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
K VV+QYG+ I DLL+S P+K+C QIGLC F+G V +GI
Sbjct: 241 KTVVAQYGETIMDLLLSEASPQKICSQIGLCTFDGTRGVGMGI 283
>gi|296089849|emb|CBI39668.3| unnamed protein product [Vitis vinifera]
Length = 430
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 199/277 (71%), Positives = 240/277 (86%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI+C FHS+Y SR S TY ++G
Sbjct: 1 MDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSIACLFHSKYNSRLSTTYIDLG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K EI+YGSGSISG FSQDNV+VG + +K+QVFIEATRE SL F+L +FDGI+GLGF EI
Sbjct: 61 KEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREASLVFVLGKFDGILGLGFEEI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIVFGGVD +HFKG+HTY +T+
Sbjct: 121 VVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQ 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT +VTEINHAIG EG+VS EC
Sbjct: 181 KGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
K VVSQYG++IWDLL+S + P+ VC QIGLC FNG++
Sbjct: 241 KEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQ 277
>gi|226506070|ref|NP_001150729.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195641348|gb|ACG40142.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 518
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 207/338 (61%), Positives = 262/338 (77%), Gaps = 1/338 (0%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKERY-MGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
GL R+ LKK+ +D ++ AAR++ +ER + G + + GD D D++ L + +AQY
Sbjct: 34 GLVRVALKKQPVDQNARVAARLSAEERQRLLLRGANALGSAGGDDDSDVIALNXYXNAQY 93
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIG+G+PPQ F+VIFDTGSSNLWVPSSKCYFSI+CYFHSRYKS +S+TY + GK I
Sbjct: 94 FGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPAAI 153
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+G+I+GFFS+D+V++GD+ V DQ FIEAT+E LTF++A+FDGI+GLGF+EI+VG+A
Sbjct: 154 QYGTGAIAGFFSEDSVKLGDLDVNDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVGNA 213
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PVW NMV+QGL+S+ VFSFW NR EGGEIVFGG+D H+KG HTYVPVT+KGYWQ
Sbjct: 214 TPVWYNMVKQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDSSHYKGDHTYVPVTQKGYWQ 273
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
F +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVS
Sbjct: 274 FNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVS 333
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
QYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 334 QYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSTGI 371
>gi|357450315|ref|XP_003595434.1| Aspartic proteinase [Medicago truncatula]
gi|355484482|gb|AES65685.1| Aspartic proteinase [Medicago truncatula]
Length = 507
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 211/357 (59%), Positives = 265/357 (74%), Gaps = 5/357 (1%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K + V CLW+ + L S++ L RI LKKR LD+ SLN +RI ++ + +
Sbjct: 1 MSLKYMLVVTCLWIWSLSLAYTISNDNLMRISLKKRNLDIQSLNTSRI---KKVIHERDL 57
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
V G +D++ LKN+ D QY+GEIGIGSPPQ F+V+FDTGSSNLWVPSS+C FSI
Sbjct: 58 ESVDTNYGS--KDVVYLKNYFDVQYYGEIGIGSPPQYFNVVFDTGSSNLWVPSSRCIFSI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CYFHS+Y+S S+TY EIG CEI Y G I GFFSQDNV+VGD+ VKDQ F E TREG
Sbjct: 116 ACYFHSKYRSGISSTYNEIGVPCEIPYDEGYIYGFFSQDNVKVGDINVKDQEFCEITREG 175
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ L FDGI+GLGF++++VG PVW NM+EQG +S++VFS W N+DP AE GGEIV
Sbjct: 176 NFALLALPFDGILGLGFQDVSVGKVTPVWYNMIEQGHISDKVFSLWFNKDPMAEVGGEIV 235
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HF+G HTY P+++KGYWQ E+GDIL+ N +TG+CEGGCAAIVDSGTSL+AGPT
Sbjct: 236 FGGVDKRHFRGDHTYFPISQKGYWQIEVGDILLANNTTGLCEGGCAAIVDSGTSLIAGPT 295
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
VVT+INH IG EG VS ECK +V YG+LIW+ L+SGL PE +C I LC+ NG +
Sbjct: 296 GVVTQINHVIGTEGYVSYECKNIVHNYGNLIWESLISGLNPEILCADIRLCSDNGFQ 352
>gi|148906206|gb|ABR16259.1| unknown [Picea sitchensis]
Length = 509
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 219/357 (61%), Positives = 269/357 (75%), Gaps = 17/357 (4%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-------GVRHRLGDSDE--- 72
A+++ L RI LKK+ LD +L AARI +E G+S G+R L S+
Sbjct: 19 AANDCLARIELKKKGLDQKTLQAARIVARE-----GGLSNEVNRKYGLRGGLSYSESARG 73
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+ +PLKN++DAQY+GEIG+G+PPQ F+VIFDTGSSNLWVPS+KCY SI+CYFHS+YK+ +
Sbjct: 74 EYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHSKYKASQ 133
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S++Y GK I YGSGS+SG+ QD+V GD+VVKDQVF E T+E LTFL A+FDGI
Sbjct: 134 SSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF++I+VG+ VPVW NMV QGL+ E VFSFW+NR EEGGEIVFGGVDP HFKGK
Sbjct: 194 LGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDPNHFKGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
HTYVPVT++GYWQF +GD LIG QSTG C GGCAAIVDSGTSLLAGP+ +V +IN AIG
Sbjct: 254 HTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQINEAIGA 313
Query: 313 EGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPITRVL 369
G+ S ECK VVSQYGDLI +LL++ P+KVC QIGLC +G V G+ I VL
Sbjct: 314 SGLASQECKSVVSQYGDLIMELLMAQTNPQKVCSQIGLCLSDGTRDV--GMRIASVL 368
>gi|509163|emb|CAA48939.1| cyprosin [Cynara cardunculus]
Length = 474
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 207/332 (62%), Positives = 261/332 (78%), Gaps = 16/332 (4%)
Query: 33 LKKRRLDL------HS-LNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
LKKR++++ H+ N A RK GVR DSD +++ LKN+MDAQY
Sbjct: 1 LKKRKVNILNHPGEHAGSNDANARRK---------YGVRGNFRDSDGELIALKNYMDAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C FHS+Y+S S TY + GKS I
Sbjct: 52 FGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAI 111
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GSISGFFSQD+V++GD++VK+Q FIEAT+E +TFL A+FDGI+GLGF+EI+VGDA
Sbjct: 112 QYGTGSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDA 171
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
VPVW M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQ
Sbjct: 172 VPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQ 231
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
FE+GD+LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V
Sbjct: 232 FEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVD 291
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
QYG + ++L+S PEK+C Q+ LC+F+G+
Sbjct: 292 QYGKSMIEMLLSEEQPEKICSQMKLCSFDGSH 323
>gi|357480353|ref|XP_003610462.1| Aspartic proteinase [Medicago truncatula]
gi|355511517|gb|AES92659.1| Aspartic proteinase [Medicago truncatula]
Length = 519
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 221/376 (58%), Positives = 279/376 (74%), Gaps = 15/376 (3%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS------NGLRRIGLKKRRLDLHSLNAARIT----- 49
M KL V CL L S LL+ A S +GLRRI LKK +LD ++ AA
Sbjct: 1 MGNKLHVIVLCL--LVSTLLISAVSIAASSSDGLRRIALKKIQLDRNNKLAAAAAAAAGG 58
Query: 50 RKERYMGGAGVSGVRHRLGDS--DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
R+ + S ++ L ++ + DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSS
Sbjct: 59 RRTKDTDSLQSSIRKYNLANNYQETDIVALKNYLDAQYYGEISIGTSPQKFTVIFDTGSS 118
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPSSKC FS++CYFH++YKS KS TY + G + I YG+G+ISGFFS D+V+VGD+V
Sbjct: 119 NLWVPSSKCTFSVACYFHAKYKSTKSTTYRKNGTAAAIQYGTGAISGFFSYDSVKVGDIV 178
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
VK+Q FIEAT+E +TFL+A+FDGI+GLGF+EI+VG+AVPVW NMVEQGL+ E VFSFWL
Sbjct: 179 VKNQEFIEATKEPGVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIQEPVFSFWL 238
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
NR P+ EEGGEIVFGGVDP H+KG HTYVPV +KGYWQF++GD+ I +STG C GC+A
Sbjct: 239 NRKPEEEEGGEIVFGGVDPAHYKGNHTYVPVKRKGYWQFDMGDVTIDGKSTGYCVDGCSA 298
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQ 347
I DSGTSLLAGPT V+T INHAIG GVVS ECK +V++YG I +LL++ P+K+C +
Sbjct: 299 IADSGTSLLAGPTTVITMINHAIGASGVVSKECKTIVAEYGQTILNLLLAEAQPKKICSE 358
Query: 348 IGLCAFNGAEYVRLGI 363
IGLC F+G V L I
Sbjct: 359 IGLCTFDGTHGVDLAI 374
>gi|356522015|ref|XP_003529645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 514
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 219/344 (63%), Positives = 273/344 (79%), Gaps = 3/344 (0%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--VRHRLGDSDE-DILPLKN 79
A ++GLRRIGLKK +LD + AARI K+ A + +++ G S+E DI+ LKN
Sbjct: 25 APNDGLRRIGLKKIKLDPKNRLAARIGSKDVDSFRASIRKFHLQNNFGGSEETDIVALKN 84
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPSSKC FS++CYFH++YKS KS+TY +
Sbjct: 85 YLDAQYYGEIAIGTSPQKFTVIFDTGSSNLWVPSSKCTFSVACYFHAKYKSSKSSTYKKN 144
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + I YG+G+ISGFFS D+V VGD+ VK+Q FIEATRE +TFL A+FDGI+GLGF+E
Sbjct: 145 GTAAAIQYGTGAISGFFSYDSVRVGDIFVKNQEFIEATREPGVTFLAAKFDGILGLGFQE 204
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+VG+AVPVW NMV+QGL+ E VFSFW NR P+ EEGGEIVFGGVDP H+KGKHTYVPVT
Sbjct: 205 ISVGNAVPVWYNMVDQGLIKEPVFSFWFNRKPEEEEGGEIVFGGVDPAHYKGKHTYVPVT 264
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+LIG + TG C GC+AI DSGTSLLAGPT V+T INHAIG GV+S E
Sbjct: 265 RKGYWQFDMGDVLIGGKPTGYCADGCSAIADSGTSLLAGPTTVITMINHAIGASGVMSQE 324
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
CK VV++YG I DLL+S P+K+C +IGLCAF+G V +GI
Sbjct: 325 CKTVVAEYGQTILDLLLSETQPKKICSRIGLCAFDGTRGVDVGI 368
>gi|1169175|sp|P40782.2|CYPR1_CYNCA RecName: Full=Cyprosin; Flags: Precursor
gi|1585067|prf||2124255A cyprosin
Length = 473
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 207/332 (62%), Positives = 261/332 (78%), Gaps = 16/332 (4%)
Query: 33 LKKRRLDL------HS-LNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
LKKR++++ H+ N A RK GVR DSD +++ LKN+MDAQY
Sbjct: 1 LKKRKVNILNHPGEHAGSNDANARRK---------YGVRGNFRDSDGELIALKNYMDAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FGEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS++C FHS+Y+S S TY + GKS I
Sbjct: 52 FGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAI 111
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GSISGFFSQD+V++GD++VK+Q FIEAT+E +TFL A+FDGI+GLGF+EI+VGDA
Sbjct: 112 QYGTGSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDA 171
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
VPVW M+ QGLV E VFSFWLNR+ D +EGGE+VFGGVDP HFKG+HTYVPVT+KGYWQ
Sbjct: 172 VPVWYTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQ 231
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
FE+GD+LIG+++TG C GCAAI DSGTSLLAG T +VT+IN AIG GV+S +CK +V
Sbjct: 232 FEMGDVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVD 291
Query: 326 QYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
QYG + ++L+S PEK+C Q+ LC+F+G+
Sbjct: 292 QYGKSMIEMLLSEEQPEKICSQMKLCSFDGSH 323
>gi|556819|emb|CAA57510.1| cyprosin [Cynara cardunculus]
Length = 509
Score = 433 bits (1113), Expect = e-119, Method: Compositional matrix adjust.
Identities = 213/339 (62%), Positives = 267/339 (78%), Gaps = 14/339 (4%)
Query: 24 SSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
S+ GL R+GLKKR++D H ++ RK+ GGA L DS DI+ L
Sbjct: 26 SNGGLLRVGLKKRKVDQINQLSGHGVSMEAKARKDFGFGGA--------LRDSGSDIIAL 77
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+TY
Sbjct: 78 KNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTYK 137
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+GLGF
Sbjct: 138 KNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILGLGF 197
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VG +VP+W NMV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYVP
Sbjct: 198 QEISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYVP 257
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++GD+LI +++TG C GCAAI DSGTSLLAGPT ++TEINHAIG +GV+S
Sbjct: 258 VTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHAIGAKGVMS 317
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGA 356
+CK +VSQYG + ++L+S P+K+C Q+ LC F+GA
Sbjct: 318 QQCKTLVSQYGKTMIEMLLSEAQPDKICSQMKLCTFDGA 356
>gi|351724625|ref|NP_001237064.1| aspartic proteinase 1 precursor [Glycine max]
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 221/370 (59%), Positives = 284/370 (76%), Gaps = 9/370 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMG 56
M ++ V CL L S LL+ A + GLRRIGLKK +LD + AAR+ K+
Sbjct: 1 MGNRMNAIVLCL--LVSTLLVSAVYCAPNAGLRRIGLKKIKLDPKNRLAARVGSKDVDSF 58
Query: 57 GAGVS--GVRHRLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS 113
A + +++ G ++E DI+ LKN++DAQY+GEI IG+ PQ F+VIFDTGSSNLWVPS
Sbjct: 59 RASIRQFHLQNNFGGTEETDIVALKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPS 118
Query: 114 SKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
SKC FS++CYFH++YKS KS+T+ + G + I YG+G+ISGFFS D+V VG++VVK+Q F
Sbjct: 119 SKCTFSVACYFHAKYKSSKSSTFKKNGTAAAIQYGTGAISGFFSYDSVRVGEIVVKNQEF 178
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
IEATRE +TFL A+FDGI+GLGF+EI+VG+A PVW NMV+QGL+ E VFSFW NR+P+
Sbjct: 179 IEATREPGVTFLAAKFDGILGLGFQEISVGNAAPVWYNMVDQGLLKEPVFSFWFNRNPEE 238
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
EEGGEIVFGGVDP H+KGKHTYVPVT+KGYWQF++GD+LIG + TG C GC+AI DSGT
Sbjct: 239 EEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAIADSGT 298
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
SLLAGPT V+T INHAIG GV+S ECK +V++YG I DLL++ P+K+C +IGLCAF
Sbjct: 299 SLLAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTILDLLLAETQPKKICSRIGLCAF 358
Query: 354 NGAEYVRLGI 363
+G V +GI
Sbjct: 359 DGTHGVDVGI 368
>gi|356545806|ref|XP_003541325.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 495
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/350 (58%), Positives = 260/350 (74%), Gaps = 16/350 (4%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
LL + C W S + +S +G+ R+ LK+R LD++SLN+ARI ++ GV
Sbjct: 7 LLVTSVCAW-FVSLAVTTSSGDGVTRVSLKRRSLDINSLNSARIKGVVNHLKADGVY--- 62
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
LKN++DAQYFGEIGIGSPPQ+F V+FDTGSSNLWVPS+KC SI+CYF
Sbjct: 63 ------------LKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKCVLSIACYF 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y+S+ SNTYT+IG C+I YG G + GF SQDN+ VGD+++KDQ F E T+EG L F
Sbjct: 111 HSKYRSKLSNTYTKIGTPCKIPYGHGHVPGFISQDNLRVGDIIIKDQQFAEITKEGPLAF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
L FDGI+GLGF+ +V PVW NM+EQGLV++++FS WLN+DP A+ GGEIVFGG+
Sbjct: 171 LAMHFDGILGLGFQNKSVRQVTPVWYNMIEQGLVTQKIFSLWLNQDPVAKLGGEIVFGGI 230
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D +HFKG+HTYVP+T+K YWQ E+GDI I N TG+CEGGCAAI+DSGTSL+AGPT +VT
Sbjct: 231 DWRHFKGEHTYVPLTQKDYWQIEVGDIQIANNPTGLCEGGCAAIIDSGTSLIAGPTKIVT 290
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
+INHAIG EG VS ECK ++ YGD IW+ ++SGL PE +C IGLC+ N
Sbjct: 291 QINHAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEIICVDIGLCSRN 340
>gi|356565563|ref|XP_003551009.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 494
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 203/349 (58%), Positives = 258/349 (73%), Gaps = 16/349 (4%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L+ + C W S ++ +S +GL R+ LK+R LD+ SLN+A+I ++ GV
Sbjct: 7 LVVTCVCAW-FGSLVVTTSSGDGLMRVSLKRRSLDISSLNSAKIKEVVNHLKADGVY--- 62
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
LKN++DAQYFGEIGIGSPPQ+F V+FDTGSSNLWVPS+KC SI+CYF
Sbjct: 63 ------------LKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKCVLSIACYF 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y+S+ SNTYT+IG C+I YG G I GF SQDN+ VGD+++KDQ F E T+EG L F
Sbjct: 111 HSKYRSKLSNTYTKIGTPCKIPYGRGHIPGFISQDNIRVGDIIIKDQQFAEITKEGPLAF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
L FDGI+GLGF+ +VG PVW NM+EQG VS+++FS WLN+DP A+ GGEIVFGG+
Sbjct: 171 LAMHFDGILGLGFQNKSVGQVTPVWYNMIEQGHVSQKIFSLWLNQDPVAKVGGEIVFGGI 230
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D +HFKG HTYVP+T+K YWQ E+GDILI N TG+CEGGCAAI+DSGTSL+AGPT +VT
Sbjct: 231 DWRHFKGDHTYVPLTQKDYWQIEVGDILIANNPTGLCEGGCAAIIDSGTSLIAGPTKIVT 290
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
+IN AIG EG VS ECK ++ YGD IW+ ++SGL PE +C IGLC+
Sbjct: 291 QINRAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEIICVDIGLCSL 339
>gi|1168536|sp|P42210.1|ASPR_HORVU RecName: Full=Phytepsin; AltName: Full=Aspartic proteinase;
Contains: RecName: Full=Phytepsin 32 kDa subunit;
Contains: RecName: Full=Phytepsin 29 kDa subunit;
Contains: RecName: Full=Phytepsin 16 kDa subunit;
Contains: RecName: Full=Phytepsin 11 kDa subunit; Flags:
Precursor
gi|18904|emb|CAA39602.1| aspartic proteinase [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/346 (62%), Positives = 272/346 (78%), Gaps = 5/346 (1%)
Query: 20 LLPASSN--GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
+LPA+S GL RI LKKR +D +S A ++ E +G + +R + + DI+ L
Sbjct: 20 VLPAASEAEGLVRIALKKRPIDRNSRVATGLSGGEEQPLLSGANPLRS---EEEGDIVAL 76
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN+M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY HSRYK+ S+TY
Sbjct: 77 KNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYK 136
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ GK I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF
Sbjct: 137 KNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGF 196
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EI+VG AVPVW M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVP
Sbjct: 197 KEISVGKAVPVWYKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVP 256
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS
Sbjct: 257 VTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVS 316
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK +VSQYG I DLL++ P+K+C Q+GLC F+G V GI
Sbjct: 317 QECKTIVSQYGQQILDLLLAETQPKKICSQVGLCTFDGTRGVSAGI 362
>gi|2811025|sp|O04057.1|ASPR_CUCPE RecName: Full=Aspartic proteinase; Flags: Precursor
gi|1944181|dbj|BAA19607.1| aspartic endopeptidase [Cucurbita pepo]
Length = 513
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 225/360 (62%), Positives = 287/360 (79%), Gaps = 5/360 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
+ CL++L S ++ ++SN GL R+GLKK +LD + AAR+ K+ + A +
Sbjct: 9 AFLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARVESKDAEILKAAFRKYNPK 68
Query: 67 --LGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWV +C FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWV-LCECLFSVACH 127
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S++Y + G S I YG+G++SGFFS DNV+VGD+VVK+QVFIEATRE SLT
Sbjct: 128 FHARYKSSRSSSYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKEQVFIEATREPSLT 187
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG+AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 188 FLVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGG 247
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VDPKH++GKHTYVPVT+KGYWQF++GD+LI + TG C+GGC+AI DSGTSLLAGPTPV+
Sbjct: 248 VDPKHYRGKHTYVPVTQKGYWQFDMGDVLIDGEPTGFCDGGCSAIADSGTSLLAGPTPVI 307
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
T INHAIG +GVVS +CK VV+QYG I DLL+S P+K+C QI LC F+G V +GI
Sbjct: 308 TMINHAIGAKGVVSQQCKAVVAQYGQTIMDLLLSEADPKKICSQINLCTFDGTRGVSMGI 367
>gi|1665867|emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
Length = 509
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/343 (62%), Positives = 266/343 (77%), Gaps = 14/343 (4%)
Query: 23 ASSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
AS+ GL R+GLKKR++D H + RK+ GG+ L DSD DI+
Sbjct: 25 ASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARKDFGFGGS--------LRDSDSDIIE 76
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MDAQY+GEIGIGSP Q F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+TY
Sbjct: 77 LKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+GLG
Sbjct: 137 KKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGVTFLAAKFDGILGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG +VPVW NMV QGLV E VFSFW NR+ D EEGGE+VFGGVDP HFKGKHTYV
Sbjct: 197 FQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEEGGELVFGGVDPNHFKGKHTYV 256
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF +GD+LI +++TG C GCAAI DSGTSLLAGPT ++T+INHAIG +GV+
Sbjct: 257 PVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSLLAGPTAIITQINHAIGAKGVM 316
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
S +CK +V QYG I ++L+S P+K+C Q+ LC F+GA V
Sbjct: 317 SQQCKTLVDQYGKTIIEMLLSEAQPDKICSQMKLCTFDGARDV 359
>gi|449433980|ref|XP_004134774.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449526063|ref|XP_004170034.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/346 (59%), Positives = 264/346 (76%), Gaps = 6/346 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE-----DILPL 77
AS+ G RIGLKK + D +S A + K+ G+ V G ++ G++ E DI+PL
Sbjct: 27 ASNEGFLRIGLKKIKYDQNSRFKALLESKKGEFLGSSV-GKHNQWGNNLEESKNADIVPL 85
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
KN++DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+FH++Y+S +S+TY
Sbjct: 86 KNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYK 145
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G S I YGSG+ISGFFS DNV+VGDV+V++Q IEAT ++TF+ A+FDGI+GLGF
Sbjct: 146 RNGTSAAIQYGSGAISGFFSYDNVQVGDVIVRNQELIEATSMSTMTFMAAKFDGILGLGF 205
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+EIA G AVPVW NMV+Q LV E+VFSFWLNR+ + +EGGE+VFGGVDPKHFKG+HTYVP
Sbjct: 206 QEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVP 265
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT KGYWQF++GDILIG ++T C GGC+AI DSGTSLLAGP+ +V IN AIG V
Sbjct: 266 VTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAH 325
Query: 318 AECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK +VSQYG I DLL++ PEK+C +IG+C F+ V L I
Sbjct: 326 PECKAIVSQYGRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKI 371
>gi|147780252|emb|CAN65745.1| hypothetical protein VITISV_037763 [Vitis vinifera]
Length = 504
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/357 (60%), Positives = 266/357 (74%), Gaps = 31/357 (8%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M Q L + FCLW L + LL ASS+GL RIGLKK RLD + + AAR+ R+ + +GG V
Sbjct: 3 MRQGYLWAAFCLWAL-TFPLLQASSDGLVRIGLKKWRLDYNRIRAARMARRAKSIGGV-V 60
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ LGDSD + + L+N+MDAQY+GEIGIG+PPQNF+V+FDTGS+NLWVPS+KC+FSI
Sbjct: 61 KSMYQGLGDSDGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSI 120
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FHS+Y SR S T T+ C + VFIEATRE
Sbjct: 121 ACLFHSKYNSRLSTTSTK----CHFS-------------------------VFIEATREA 151
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SL F+L +FDGI+GLGF EI VG+A PVW N++ QGLV E++FSFWLNRDP A +GGEIV
Sbjct: 152 SLVFVLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIV 211
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +HFKG+HTY +T+KGYWQFE+G+ LIG QSTG CE GCAAIVDSGTSL+AGPT
Sbjct: 212 FGGVDKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPT 271
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
+VTEINHAIG EG+VS ECK VVSQYG++IWDLL+S + P+ VC QIGLC FNG++
Sbjct: 272 AIVTEINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRVQPDAVCSQIGLCNFNGSQ 328
>gi|425892460|gb|AFB73927.2| preprocirsin [Cirsium vulgare]
Length = 509
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 213/345 (61%), Positives = 267/345 (77%), Gaps = 14/345 (4%)
Query: 21 LPASSNGLRRIGLKKRRLDL------HSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
+ S++GL R+GLKKR++D H + RK+ GG L DSD DI
Sbjct: 23 ISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARKDFGFGGT--------LRDSDSDI 74
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+ LKN+MDAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPS+KCYFS++C FHS+YKS S+
Sbjct: 75 IALKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S I YG+GSISGF SQD+V++GD+VVK+Q FIEAT+E +TFL A+FDGI+G
Sbjct: 135 TYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+VG +VPVW NMV QGLV E VFSFW NR+ + EEGGE+VFGGVDP HFKGKHT
Sbjct: 195 LGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF +GD+LI +++TG C GCAAI DSGTSLLAGPT ++TEINHA G +G
Sbjct: 255 YVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKG 314
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
V+S +CK +VSQYG I ++L+S P+K+C Q+ LC F+GA V
Sbjct: 315 VMSQQCKTLVSQYGKSIIEMLLSEAQPDKICSQMKLCTFDGARDV 359
>gi|4589716|dbj|BAA76870.1| aspartic proteinase [Helianthus annuus]
Length = 509
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 199/339 (58%), Positives = 263/339 (77%), Gaps = 6/339 (1%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS--GVRHRLGDSDEDILPLKNF 80
++ GL R+GLKKR+ + + R++ M G G L +S+ D++ LKN+
Sbjct: 25 STKGGLLRVGLKKRKTNQFN----RVSEHGLSMEGTDRRNFGFYDTLRNSEGDVIVLKNY 80
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQYFGEIGIG+PPQ F+V+FDTGS+NLWVPSSKC+ S++C FH +YK+ +S+TY + G
Sbjct: 81 MDAQYFGEIGIGTPPQKFTVVFDTGSANLWVPSSKCFLSVACLFHQKYKASRSSTYKKNG 140
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
+ I YG+G+ISG FS+D+V++GD+VVK+Q FIEATRE +TFL A+FDGI+GLG+++I
Sbjct: 141 TAAAIQYGTGAISGVFSRDSVKLGDLVVKEQDFIEATREPGITFLAAKFDGILGLGYQDI 200
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+VG AVPVW NMV QGLV E VFSFW NR EEGGE+VFGGVDP HFKGKHTYVPVT+
Sbjct: 201 SVGKAVPVWYNMVNQGLVQEPVFSFWFNRHTGEEEGGELVFGGVDPNHFKGKHTYVPVTQ 260
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
KGYWQF++GD+LIG+++TG C GGCAAI DSGTSLLAGPT ++T+INHAIG GV+S +C
Sbjct: 261 KGYWQFDMGDVLIGDKTTGFCSGGCAAIADSGTSLLAGPTTIITQINHAIGAAGVMSQQC 320
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
K +V QYG I ++L+S P+K+C ++ LC F+G+ V
Sbjct: 321 KTLVDQYGKTIIEMLLSEAQPDKICSRMNLCTFDGSRDV 359
>gi|73912433|dbj|BAE20413.1| aspartic proteinase [Triticum aestivum]
Length = 508
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 215/344 (62%), Positives = 271/344 (78%), Gaps = 7/344 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARIT-RKERYMGGAGVSGVRHRLGDSDE-DILPLKNF 80
+ + GL RI LKKR +D +S A ++ R+E ++ G G + L +E DI+ LKN+
Sbjct: 23 SEAEGLVRIALKKRAIDRNSRVAKSLSDREEVHLLG----GASNTLPSEEEGDIVSLKNY 78
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
M+AQYFGEIG+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY H+RYK+ S+TY + G
Sbjct: 79 MNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHARYKAGASSTYKKNG 138
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
K I YG+GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF+EI
Sbjct: 139 KPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGVTFLVAKFDGILGLGFKEI 198
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+VG AVPVW NMVEQGL+S+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVP T
Sbjct: 199 SVGKAVPVWYNMVEQGLISDPVFSFWLNRHADDEGEGGEIIFGGMDPKHYVGEHTYVPAT 258
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+KGYWQF++GD+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS E
Sbjct: 259 QKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQE 318
Query: 320 CKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
CK +VSQYG I DLL++ P+KVC Q+GLC F+G V GI
Sbjct: 319 CKTIVSQYGQQILDLLLAETQPKKVCSQVGLCTFDGTRGVSAGI 362
>gi|115439013|ref|NP_001043786.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|113533317|dbj|BAF05700.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|215701483|dbj|BAG92907.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188796|gb|EEC71223.1| hypothetical protein OsI_03158 [Oryza sativa Indica Group]
gi|222618996|gb|EEE55128.1| hypothetical protein OsJ_02912 [Oryza sativa Japonica Group]
gi|385717674|gb|AFI71272.1| unnamed protein [Oryza sativa Japonica Group]
Length = 522
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 206/347 (59%), Positives = 257/347 (74%), Gaps = 10/347 (2%)
Query: 27 GLRRIGLKKRRLD--------LHSLNAARI-TRKERYMGGAGVSGVRHRLGDSDE-DILP 76
G+ RI LKKR++D L +A R+ R+ ++ + E DI+
Sbjct: 30 GVVRIALKKRQVDETGRVGGHLAGEDAQRLLARRHGFLTNDAARAASRKARAEAEGDIVA 89
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+++AQY+GEI IG+PPQ F+VIFDTGSSNLWVPSSKC+ SI+CYFHSRYK+ +S+TY
Sbjct: 90 LKNYLNAQYYGEIAIGTPPQMFTVIFDTGSSNLWVPSSKCHLSIACYFHSRYKAGQSSTY 149
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ GK I+YG+G+ISG+FSQD+V+VGDV VK+Q FIEATRE S+TF++A+FDGI+GLG
Sbjct: 150 KKNGKPASIHYGTGAISGYFSQDSVKVGDVAVKNQDFIEATREPSITFMVAKFDGILGLG 209
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG+AVP+W NMV QGLV + VFSFW NR D +GGEIVFGG+DP H+KG HTYV
Sbjct: 210 FKEISVGNAVPIWYNMVRQGLVVDPVFSFWFNRHADEGQGGEIVFGGIDPNHYKGNHTYV 269
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+KGYWQF +GD+LIG STG C GCAAI DSGTSLL GPT ++T+IN IG GVV
Sbjct: 270 PVTRKGYWQFNMGDVLIGGNSTGFCAAGCAAIADSGTSLLTGPTAIITQINEKIGATGVV 329
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
S ECK VVSQYG I D L + P KVC +GLC F+G V GI
Sbjct: 330 SQECKAVVSQYGQQILDQLRAETKPAKVCSSVGLCTFDGTHGVSAGI 376
>gi|5822248|pdb|1QDM|A Chain A, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822249|pdb|1QDM|B Chain B, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822250|pdb|1QDM|C Chain C, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase
Length = 478
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 210/334 (62%), Positives = 264/334 (79%), Gaps = 3/334 (0%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEI 89
RI LKKR +D +S A ++ E +G + +R + + DI+ LKN+M+AQYFGEI
Sbjct: 2 RIALKKRPIDRNSRVATGLSGGEEQPLLSGANPLRS---EEEGDIVALKNYMNAQYFGEI 58
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
G+G+PPQ F+VIFDTGSSNLWVPS+KCYFSI+CY HSRYK+ S+TY + GK I YG+
Sbjct: 59 GVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYKKNGKPAAIQYGT 118
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GSI+G+FS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GLGF+EI+VG AVPVW
Sbjct: 119 GSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFKEISVGKAVPVW 178
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
M+EQGLVS+ VFSFWLNR D EGGEI+FGG+DPKH+ G+HTYVPVT+KGYWQF++G
Sbjct: 179 YKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQFDMG 238
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD 329
D+L+G +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK +VSQYG
Sbjct: 239 DVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQYGQ 298
Query: 330 LIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
I DLL++ P+K+C Q+GLC F+G V GI
Sbjct: 299 QILDLLLAETQPKKICSQVGLCTFDGTRGVSAGI 332
>gi|224106994|ref|XP_002314336.1| predicted protein [Populus trichocarpa]
gi|222863376|gb|EEF00507.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 201/331 (60%), Positives = 257/331 (77%), Gaps = 9/331 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDA 83
SS+GL R+GLKKR LDL+S++AARITR + S R S+ +I+ LKN++D
Sbjct: 10 SSDGLARVGLKKRNLDLNSIHAARITRPQ------ATSFARVT---SNAEIVYLKNYLDT 60
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+GEIGIGSPPQ F+V+FDTGSSNLWVPSSKC SI+CYFHS++ +R S TYT+IG C
Sbjct: 61 QYYGEIGIGSPPQIFTVVFDTGSSNLWVPSSKCLLSITCYFHSKFIARLSRTYTKIGIPC 120
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+I YGSGS+SGF SQD+V+VGD ++ +QV +++EG L L +FDGI+GL F++IAV
Sbjct: 121 KIQYGSGSVSGFLSQDHVKVGDDIIINQVSSASSKEGFLALLGVQFDGILGLAFQDIAVA 180
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
A PVW NM EQG VS++VFS WLNR+P +E GGE+VFGG+D +HFKG HTYVPVT +GY
Sbjct: 181 KATPVWYNMAEQGHVSQKVFSLWLNRNPSSELGGEVVFGGLDWRHFKGDHTYVPVTGRGY 240
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQ ++GDI I N STG+C GGC+AIVDSGTS L+GPT +V +INHAIG G+VS ECK V
Sbjct: 241 WQIQVGDIFIANNSTGLCAGGCSAIVDSGTSFLSGPTRIVAQINHAIGARGIVSLECKEV 300
Query: 324 VSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
VS+Y + IWD ++SGL PE +C +GLC +N
Sbjct: 301 VSKYWNSIWDSMISGLRPEIICVDVGLCLYN 331
>gi|226532912|ref|NP_001146573.1| hypothetical protein [Zea mays]
gi|219887869|gb|ACL54309.1| unknown [Zea mays]
gi|413917600|gb|AFW57532.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 494
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 189/299 (63%), Positives = 242/299 (80%), Gaps = 1/299 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
RLG S +PL ++++ QY+G +GIG+PPQNF+VIFDTGSSNLWVPSS+CYFSI+CY H
Sbjct: 55 RLGASGGGDVPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYLH 114
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKS KS+TY G++C+I YGSGSI+GFFS D+V VGD+ VK+Q FIE TRE S+TF+
Sbjct: 115 HRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITFI 174
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGV 244
+ +FDGI+GLG+ EI+VG A P+W +M EQ L++E+VFSFWLNR PD A GGE+VFGGV
Sbjct: 175 IGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGV 234
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP HF G HTYVPV++KGYWQF++GD+LI STG C GCAAIVDSGTSLLAGPT ++
Sbjct: 235 DPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIA 294
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
++N AIG +G++S ECK VVSQYG++I D+L++ P++VC Q+GLC F+GA V GI
Sbjct: 295 QVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGI 353
>gi|223949795|gb|ACN28981.1| unknown [Zea mays]
gi|413917601|gb|AFW57533.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917602|gb|AFW57534.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 509
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/299 (63%), Positives = 242/299 (80%), Gaps = 1/299 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
RLG S +PL ++++ QY+G +GIG+PPQNF+VIFDTGSSNLWVPSS+CYFSI+CY H
Sbjct: 70 RLGASGGGDVPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYLH 129
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKS KS+TY G++C+I YGSGSI+GFFS D+V VGD+ VK+Q FIE TRE S+TF+
Sbjct: 130 HRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITFI 189
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-AEEGGEIVFGGV 244
+ +FDGI+GLG+ EI+VG A P+W +M EQ L++E+VFSFWLNR PD A GGE+VFGGV
Sbjct: 190 IGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGV 249
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP HF G HTYVPV++KGYWQF++GD+LI STG C GCAAIVDSGTSLLAGPT ++
Sbjct: 250 DPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIA 309
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
++N AIG +G++S ECK VVSQYG++I D+L++ P++VC Q+GLC F+GA V GI
Sbjct: 310 QVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGI 368
>gi|388517285|gb|AFK46704.1| unknown [Medicago truncatula]
Length = 510
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 210/364 (57%), Positives = 265/364 (72%), Gaps = 21/364 (5%)
Query: 10 FCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSL----------NAARITRKERYMGGAG 59
CLW L L+ A + GLRRIGLKK +L+ +L ++ R + +GGAG
Sbjct: 12 LCLWTLLFSLVSCAPNEGLRRIGLKKNKLEPKNLLGSKGCESSWSSIRNYASKNILGGAG 71
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+ D++ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSN WVPS KCYFS
Sbjct: 72 -----------EADVVALKNYLDAQYYGEISIGTPPQTFTVIFDTGSSNTWVPSVKCYFS 120
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
++C H++YKS +S+TY G I YG+G++SGFFS DNV+VGDVVVKD FIEATRE
Sbjct: 121 LACLVHAKYKSSQSSTYKPNGTHAAIQYGTGAVSGFFSYDNVKVGDVVVKDVEFIEATRE 180
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
LTF+ A+FDG++GLGF+EI+VG+AVP+W MV+QGLV + VFSFWLNR+P+ E+GGE+
Sbjct: 181 PGLTFVAAKFDGLLGLGFQEISVGNAVPIWYKMVKQGLVKDPVFSFWLNRNPNEEQGGEL 240
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
VFGGVDP HFKG+HTYVPVT+KGYWQF +GD+LI + TG C C+AI DSGTSLLAGP
Sbjct: 241 VFGGVDPAHFKGEHTYVPVTRKGYWQFAMGDVLIDGKPTGYCANDCSAIADSGTSLLAGP 300
Query: 300 TPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
T V+T IN AIG GV S EC+ VV QYG I LLV+ P+KVC QIGLC F+G + +
Sbjct: 301 TTVITMINQAIGASGVYSQECRTVVDQYGHSILQLLVAEAQPKKVCSQIGLCTFDGTQGI 360
Query: 360 RLGI 363
+GI
Sbjct: 361 SMGI 364
>gi|40641523|emb|CAE52913.1| putative vacuaolar aspartic proteinase [Physcomitrella patens]
Length = 504
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 198/329 (60%), Positives = 243/329 (73%), Gaps = 4/329 (1%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RRI LKK+ + L S+ A +R + L D EDI+ L N++DAQYFGE
Sbjct: 28 RRIALKKKPVTLQSVRNAASRTIQR---AKTFTRSEDELRDG-EDIVALNNYLDAQYFGE 83
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IGIGSPPQ F+VIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I YG
Sbjct: 84 IGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAIQYG 143
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ GF SQD+V +GD+ VK QVF EAT+E LTF++A+FDGI+GLGF+EI+V P
Sbjct: 144 TGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRVTPP 203
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
W NM++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF+L
Sbjct: 204 WYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDL 263
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
GD+ I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV QYG
Sbjct: 264 GDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQYG 323
Query: 329 DLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
D I ++L++ + P KVC +GLC F E
Sbjct: 324 DQIVEMLLAQMNPGKVCTTLGLCNFGAGE 352
>gi|168029783|ref|XP_001767404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681300|gb|EDQ67728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 198/329 (60%), Positives = 243/329 (73%), Gaps = 4/329 (1%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RRI LKK+ + L S+ A +R + L D EDI+ L N++DAQYFGE
Sbjct: 28 RRIALKKKPVTLQSVRNAASRTIQR---AKTFTRSEDELRDG-EDIVALNNYLDAQYFGE 83
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IGIGSPPQ F+VIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I YG
Sbjct: 84 IGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAIQYG 143
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ GF SQD+V +GD+ VK QVF EAT+E LTF++A+FDGI+GLGF+EI+V P
Sbjct: 144 TGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRVTPP 203
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
W NM++QGLV E VFSFWLNR+PD GGE+V GGVDPKHFKG+H Y PVT+KGYWQF+L
Sbjct: 204 WYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDL 263
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
GD+ I ++TG C GC AI DSGTSLLAGP+ +V EIN AIG GVVS +CK+VV QYG
Sbjct: 264 GDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQQYG 323
Query: 329 DLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
D I ++L++ + P KVC +GLC F E
Sbjct: 324 DQIVEMLLAQMNPGKVCTTLGLCNFGAGE 352
>gi|110162110|emb|CAL07969.1| aspartic proteinase [Cynara cardunculus]
Length = 506
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 200/339 (58%), Positives = 252/339 (74%), Gaps = 6/339 (1%)
Query: 24 SSNGLRRIGLKKRRLD-LHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFM 81
S+ GL R+GLKKR++D L L A + +G A G R L S I+ L N
Sbjct: 26 SNGGLLRVGLKKRKVDRLDQLRAHGV----HMLGNARKDFGFRRTLRVSGSGIVALTNDR 81
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D Y+GEIGIG+PPQNF+VIFDTGSS+LWVPSSKCY S++C H RY+S S+TY G
Sbjct: 82 DTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSSKCYTSLACVIHPRYESGDSSTYKRNGT 141
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+ I YG+G+I GF+SQD+VEVGD+VV+ Q FIE T E FL FDGI+GLGF+EI+
Sbjct: 142 TASIQYGTGAIVGFYSQDSVEVGDLVVEQQDFIETTEEDDTVFLARDFDGILGLGFQEIS 201
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
G AVPVW NMV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+K
Sbjct: 202 AGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRK 261
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQFE+GD+LIG++S+G C GGCAAI DSGTSL+AGPT ++T+IN AIG +GV++ +CK
Sbjct: 262 GYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSLIAGPTAIITQINQAIGAKGVLNQQCK 321
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
+VSQYG + +L S + P+++C Q+ LC F+GA +VR
Sbjct: 322 TLVSQYGKNMIQMLTSEVQPDQICSQMKLCTFDGARHVR 360
>gi|75338567|sp|Q9XFX4.1|CARDB_CYNCA RecName: Full=Procardosin-B; Contains: RecName: Full=Cardosin-B
heavy chain; AltName: Full=Cardosin-B 34 kDa subunit;
Contains: RecName: Full=Cardosin-B light chain; AltName:
Full=Cardosin-B 14 kDa subunit; Flags: Precursor
gi|4582534|emb|CAB40349.1| preprocardosin B [Cynara cardunculus]
Length = 506
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/339 (58%), Positives = 247/339 (72%), Gaps = 6/339 (1%)
Query: 24 SSNGLRRIGLKKRRLD-LHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFM 81
S+ GL R+GLKKR++D L L A + +G A G R L DS I+ L N
Sbjct: 26 SNGGLLRVGLKKRKVDRLDQLRAHGV----HMLGNARKDFGFRRTLSDSGSGIVALTNDR 81
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
D Y+GEIGIG+PPQNF+VIFDTGSS+LWVPS+KC S++C H RY S S+TY G
Sbjct: 82 DTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSTKCDTSLACVIHPRYDSGDSSTYKGNGT 141
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+ I YG+G+I GF+SQD+VEVGD+VV+ Q FIE T E FL + FDGI+GLGF+EI+
Sbjct: 142 TASIQYGTGAIVGFYSQDSVEVGDLVVEHQDFIETTEEDDTVFLKSEFDGILGLGFQEIS 201
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
G AVPVW NMV QGLV E VFSFWLNR+ D EEGGE+VFGGVDP HF+G HTYVPVT+K
Sbjct: 202 AGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRK 261
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
GYWQFE+GD+LIG++S+G C GGCAAI DSGTS AGPT ++T+IN AIG +GV++ +CK
Sbjct: 262 GYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSFFAGPTAIITQINQAIGAKGVLNQQCK 321
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
+V QYG + +L S + P+K+C + LC F+GA VR
Sbjct: 322 TLVGQYGKNMIQMLTSEVQPDKICSHMKLCTFDGAHDVR 360
>gi|302761354|ref|XP_002964099.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
gi|300167828|gb|EFJ34432.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
Length = 497
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/348 (56%), Positives = 247/348 (70%), Gaps = 12/348 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V G
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVHG---- 62
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 63 -ANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RYKSRKS+TY S I YGSG ++GFFS D V +GDVVVKDQ F E+T E L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGSGQMAGFFSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVD 245
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLL GPT +V +
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQ 301
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C
Sbjct: 302 INQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDKVCAQAGVCTL 349
>gi|302820804|ref|XP_002992068.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
gi|300140190|gb|EFJ06917.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
Length = 499
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/348 (56%), Positives = 247/348 (70%), Gaps = 12/348 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V G
Sbjct: 11 LLVVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVHG---- 62
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 63 -ANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RYKSRKS+TY S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVD 245
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV+
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLLAGPT +V +
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQ 301
Query: 306 INHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C
Sbjct: 302 INQAIGATGLVSEECKMVVTQYGDLIVELLLAQVTPDKVCAQAGVCTL 349
>gi|168033581|ref|XP_001769293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679399|gb|EDQ65847.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/329 (59%), Positives = 240/329 (72%), Gaps = 18/329 (5%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RRI LKK+ +DL S+ +A +R AG S R GD+ + L N+MDAQYFGE
Sbjct: 28 RRIPLKKKSIDLQSVRSAAARTLQRANALAG-SANSLRGGDA----VDLNNYMDAQYFGE 82
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IGIGSPPQ FSVIFDTGSSNLWVPS+KCY S++CYFH RYKS KS+TY E G S I YG
Sbjct: 83 IGIGSPPQPFSVIFDTGSSNLWVPSAKCYLSLACYFHRRYKSSKSSTYKEDGTSFAIQYG 142
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ GF SQD+V +GD+ VK QVF EAT+E +TF+ A+FDGI+GLGF+EI+V PV
Sbjct: 143 TGSMEGFLSQDDVTLGDLTVKWQVFAEATKEPGVTFVSAKFDGILGLGFKEISVDRVTPV 202
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
W NM++QGLV E VFSFWLNRD D +GGE+VFGGVDP HFKG+HTY PVT+KGYWQF+L
Sbjct: 203 WYNMLDQGLVKEPVFSFWLNRDSDESDGGELVFGGVDPDHFKGEHTYTPVTRKGYWQFDL 262
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
GD GC+AI DSGTSLLAGP+ +V EIN AIG G+VS +CK+VV QYG
Sbjct: 263 GD-------------GCSAIADSGTSLLAGPSGIVAEINQAIGATGIVSQQCKMVVQQYG 309
Query: 329 DLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
+ I ++LV+ + P KVC +GLC E
Sbjct: 310 EQIVEMLVAQMNPGKVCASLGLCQLAAGE 338
>gi|302761358|ref|XP_002964101.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
gi|300167830|gb|EFJ34434.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
Length = 505
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/349 (56%), Positives = 247/349 (70%), Gaps = 8/349 (2%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVRDANSL 66
Query: 67 LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
+ S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 67 TKNGSVPDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLH 126
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKSRKS+TY G S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 127 HRYKSRKSSTYKPDGTSIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFL 186
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGV 244
+A+FDGI+GLGF+ I+ G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV
Sbjct: 187 VAKFDGILGLGFKAISKGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGV 246
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ FKGKH Y PVT++GYWQF +GD+ + QSTG C GCAAI DSGTSLLAGPT +V
Sbjct: 247 NKDRFKGKHVYTPVTREGYWQFNMGDVAVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVA 306
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
+IN AIG G+VS ECK+VV+QYGDLI +LL++ + P++VC Q G+C+
Sbjct: 307 QINQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDRVCAQAGVCSL 355
>gi|302761356|ref|XP_002964100.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
gi|300167829|gb|EFJ34433.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
Length = 503
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/349 (55%), Positives = 245/349 (70%), Gaps = 8/349 (2%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAA--RITRKERYMGGAGVSGVRHR 66
+ +W L SCL+ + + + LKKR L L A + RK +G V
Sbjct: 11 LLAVWGL-SCLI---AVTAVEVVPLKKRPLTAERLRLAVKSVPRKAHALGFHNVRDANSL 66
Query: 67 LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
+ S DI PL+N++DAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPSS+C FS +C+ H
Sbjct: 67 TKNGSVPDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLH 126
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RYKSRKS+TY S I YG+G ++GF S D V +GDVVVKDQ F E+T E L FL
Sbjct: 127 RRYKSRKSSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFL 186
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGV 244
A+FDGI+GLGF+ I++G PVW NM+ Q L+S+ VFSFWLNRD D E+GGEIVFGGV
Sbjct: 187 FAKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGV 246
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ FKGKH Y PVT++GYWQF +GD+++ QSTG C GCAAI DSGTSLL GPT +V
Sbjct: 247 NKDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVA 306
Query: 305 EINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
+IN AIG G+VS ECK+VV+QYGDLI +LL++ + P+KVC Q G+C
Sbjct: 307 QINQAIGATGLVSEECKMVVAQYGDLIVELLLAQVTPDKVCAQAGVCTL 355
>gi|357135633|ref|XP_003569413.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 560
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 182/306 (59%), Positives = 231/306 (75%), Gaps = 2/306 (0%)
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
G+ G R + D ++I+PLKN+M+AQYFG+IG+G PPQNF+V+FDTGSSN+WVPS+KC F
Sbjct: 111 GIRGNR-SVHDGQQNIIPLKNYMNAQYFGQIGVGCPPQNFTVVFDTGSSNIWVPSAKCIF 169
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S++CYFH +Y SR S+TY E G I+YGSG+I GF+S+D V +G++VVK+Q FIE T
Sbjct: 170 SLACYFHPKYVSRWSSTYKENGTPASIHYGSGAIYGFYSEDQVTIGNLVVKNQEFIETTY 229
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E TFL A+FDGI+GLGF+EI+V + PVW NM++QGLV E+ FSFWLNRD + EGGE
Sbjct: 230 EHGFTFLAAKFDGILGLGFKEISVEGSDPVWYNMIDQGLVKEKSFSFWLNRDANDGEGGE 289
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
IVFGG DPKH+KG HTY VT+K YWQFE+GD LIG +STG+C GCAAI DSGTSL+AG
Sbjct: 290 IVFGGSDPKHYKGSHTYTRVTRKAYWQFEMGDFLIGGKSTGICVDGCAAIADSGTSLIAG 349
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLL-VSGLLPEKVCQQIGLCAFNGAE 357
P V+ +IN IG GV + ECK VV+ YG + +LL P +VC +IGLC F+G
Sbjct: 350 PVAVIAQINEKIGANGVANEECKQVVAGYGQQMIELLEAKQTAPAQVCSKIGLCTFDGTR 409
Query: 358 YVRLGI 363
V GI
Sbjct: 410 AVSAGI 415
>gi|293335451|ref|NP_001169605.1| uncharacterized protein LOC100383486 precursor [Zea mays]
gi|224030337|gb|ACN34244.1| unknown [Zea mays]
Length = 556
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 205/411 (49%), Positives = 262/411 (63%), Gaps = 52/411 (12%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS----NGLRRIGLKKRRL------DL---------- 40
M ++ + F L + S LPASS +GL RI LKKR + DL
Sbjct: 1 MGRRTCGTAFILLYVLSTSTLPASSSNTGDGLIRIPLKKRSIMDTIYGDLLPKPSAPEEK 60
Query: 41 --------------------HSL--NAARITRKERYM---GGAGVSGVRHRLGDSDED-- 73
H + AA R+ RY GAG G RL D +
Sbjct: 61 EKQAVDDPVRDAIARARERQHEMLVQAAATERRRRYYWSYSGAGGKGNGSRLHDGGQGEG 120
Query: 74 -----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y
Sbjct: 121 SGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKY 180
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
SR+S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+
Sbjct: 181 DSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGFTFLLAK 240
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+EI+V ++PVW NMV Q LV++ VFSFWLNR+P EGGEIVFGG D +H
Sbjct: 241 FDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQH 300
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+KG HTY VT+KGYWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN
Sbjct: 301 YKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINE 360
Query: 309 AIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
IG GVV+ ECK VV+ YG I LL + P +VC ++GLC F+G V
Sbjct: 361 QIGAAGVVNQECKQVVAGYGLQIAGLLEAQTPPSEVCSKVGLCTFDGTRGV 411
>gi|357130655|ref|XP_003566963.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 520
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/350 (55%), Positives = 249/350 (71%), Gaps = 7/350 (2%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAAR-----ITRKERYMGGAGVSGVRHRLGDSDED- 73
LL A + GL R+ LKK +D H L A + R+ ++ +G + + +
Sbjct: 26 LLAAPAEGLVRVALKKHPVDEHGLAAGEEAQRLLLRRYGHVFNDASAGASSKPSTAAKGG 85
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
+ LKN ++AQY+GE+GIG+PPQNF+VIFDTGS+NLWVPSS CYFSI+CYFH RY + +S
Sbjct: 86 SVTLKNCLNAQYYGEVGIGTPPQNFTVIFDTGSANLWVPSSNCYFSIACYFHPRYNAGQS 145
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
TY + GK EI+YG+G+ISG+ SQD+V+VG VVVK Q FIEAT E S+TF+ +FDGI+
Sbjct: 146 KTYKKNGKHVEIHYGTGAISGYLSQDSVQVGGVVVKKQDFIEATGEPSITFMFGKFDGIL 205
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GLGF+E+ +P+W NMV QGLV + +FSFW NR +GGEIVFGG+DP H KG H
Sbjct: 206 GLGFKEMLYLSVLPIWYNMVSQGLVGDLIFSFWFNRHAGEGQGGEIVFGGIDPSHHKGNH 265
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
TYVPV KKGYWQF++ D+LIG STG C+ GCAA+ DSGTSLL+GPT +VT+IN IG
Sbjct: 266 TYVPVPKKGYWQFDMSDVLIGGNSTGFCKDGCAAMADSGTSLLSGPTAIVTQINKKIGAT 325
Query: 314 GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
GVVS ECK VVSQYG I DLL+ +K+C +GLC F+GA V GI
Sbjct: 326 GVVSQECKAVVSQYGKQILDLLLK-YSRKKICSSVGLCTFDGAHGVSAGI 374
>gi|414881317|tpg|DAA58448.1| TPA: hypothetical protein ZEAMMB73_088821 [Zea mays]
Length = 557
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 205/412 (49%), Positives = 261/412 (63%), Gaps = 53/412 (12%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASS----NGLRRIGLKKRRL------DL---------- 40
M ++ + F L + S LPASS +GL RI LKKR + DL
Sbjct: 1 MGRRTCGTAFILLYVLSTSTLPASSSNTGDGLIRIPLKKRSIMDTIYGDLLPKPSAPEEK 60
Query: 41 --------------------HSL--NAARITRKERYM---GGAGVSGVRHRLGDSDED-- 73
H + AA R+ RY GAG G RL D +
Sbjct: 61 EKQAVDDPVRDAIARARERQHEMLVQAAATERRRRYYWSYSGAGGKGNGSRLHDGGQGEG 120
Query: 74 -----ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y
Sbjct: 121 SGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKY 180
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
SR+S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+
Sbjct: 181 DSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGFTFLLAK 240
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+EI+V ++PVW NMV Q LV++ VFSFWLNR+P EGGEIVFGG D +H
Sbjct: 241 FDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFGGSDEQH 300
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+KG HTY VT+KGYWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN
Sbjct: 301 YKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVAIAQINE 360
Query: 309 AIGGEGVVSAECKLVVSQYGDLIWDLL-VSGLLPEKVCQQIGLCAFNGAEYV 359
IG GVV+ ECK VV+ YG I LL P +VC ++GLC F+G V
Sbjct: 361 QIGAAGVVNQECKQVVAGYGLQIAGLLEAQQTPPSEVCSKVGLCTFDGTRGV 412
>gi|356542078|ref|XP_003539498.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase oryzasin-1-like
[Glycine max]
Length = 449
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/293 (60%), Positives = 227/293 (77%), Gaps = 16/293 (5%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
D I+ LKN+M+AQYFGEIGIG+ PQ F+VIFDTGSSNLWVPSSKCYFS++CY HSRYKS
Sbjct: 27 DTSIIRLKNYMNAQYFGEIGIGTLPQKFTVIFDTGSSNLWVPSSKCYFSVACYLHSRYKS 86
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+S+T + G S EI+YG+G ISGFF+QD+V+V D+VV DQ FIEATR
Sbjct: 87 SQSSTCNKNGSSAEIHYGTGHISGFFTQDHVKVXDLVVYDQDFIEATR------------ 134
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
+GF+EI+VG+A P+W NM+ Q +++ VFSFWLNR+ + E+GG+IVFGG+D H+K
Sbjct: 135 ----VGFQEISVGNAAPIWYNMLNQHFLTQPVFSFWLNRNTNEEQGGQIVFGGIDSDHYK 190
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+HTYVPVT+KGYWQ E+GD+LI ++TG+C C AIVDSGTSLLAGPT V+ +INHAI
Sbjct: 191 GEHTYVPVTQKGYWQIEIGDVLINGKTTGLCAAKCLAIVDSGTSLLAGPTGVIAQINHAI 250
Query: 311 GGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
G G+VS ECK +V+QYG I D L++ LP+++C QIGLC F+G + V +GI
Sbjct: 251 GAVGIVSQECKALVAQYGKTILDKLINEALPQQICSQIGLCTFDGTQGVSIGI 303
>gi|56182674|gb|AAV84086.1| aspartic proteinase 12 [Fagopyrum esculentum]
Length = 387
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 172/261 (65%), Positives = 216/261 (82%)
Query: 103 DTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVE 162
DTGSSNLWVPS+KCYFSI+C+FHS+YKS KS T+ + G S I YG+G+ISGFFS+DNV+
Sbjct: 1 DTGSSNLWVPSAKCYFSIACFFHSKYKSSKSITHVKNGTSAAIRYGTGAISGFFSRDNVK 60
Query: 163 VGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
+GD+VV++Q FIEATRE S+TF+ A+FDGI+GLGF+EI+VG AVPVW NM++QGL+SE V
Sbjct: 61 IGDLVVENQEFIEATREPSITFIAAKFDGILGLGFQEISVGKAVPVWYNMIDQGLISEPV 120
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
FSFW NR+ + EEGGE+VFGG+DP HF+G+HTYVPVT+KGYWQF++ D+LI STG C
Sbjct: 121 FSFWFNRNAEEEEGGELVFGGIDPDHFRGQHTYVPVTQKGYWQFDMDDVLIDGMSTGFCA 180
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPE 342
GGCAAI DSGTSLLAGP VV +INHAIG G+VS ECK VV++YG I ++L+S P
Sbjct: 181 GGCAAIADSGTSLLAGPMAVVAQINHAIGATGIVSQECKTVVAEYGKEIIEMLLSEAQPL 240
Query: 343 KVCQQIGLCAFNGAEYVRLGI 363
K+C Q+GLC F+G V +GI
Sbjct: 241 KICSQVGLCTFDGTRGVSMGI 261
>gi|168031065|ref|XP_001768042.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680680|gb|EDQ67114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 455
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/291 (61%), Positives = 220/291 (75%), Gaps = 1/291 (0%)
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
G R + S D + L N++DAQY+G I IG+P Q F+V+FDTGSSNLWVPS+KCY S++
Sbjct: 9 GTRGQGVGSGGDEVALVNYLDAQYYGVIEIGTPKQEFTVVFDTGSSNLWVPSAKCYLSLA 68
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+FH RYK+RKS+TY + G I YG+GS+ GF S D+V +GD+ VK QVF EAT+E
Sbjct: 69 CFFHHRYKARKSSTYKQDGTPFAIQYGTGSMEGFLSIDDVTLGDLTVKAQVFAEATKEPG 128
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+TFL A DGI+GLGF+EI+V D PVW NM+ Q LV E VFSFWLNRD + E+GGE+V
Sbjct: 129 VTFLAAEMDGILGLGFKEISVNDVNPVWYNMLYQKLVQEPVFSFWLNRDVEGEKGGELVL 188
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVDP HFKG HTY PVT+ GYWQF++GD+L+ QSTG C GGCAAI DSGTSLLAGPT
Sbjct: 189 GGVDPHHFKGNHTYTPVTRLGYWQFDMGDVLLDGQSTGFCAGGCAAIADSGTSLLAGPTG 248
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLL-PEKVCQQIGLC 351
+V EIN+AIG G++S ECKLVV QY D I +L+S LL P K+C + G C
Sbjct: 249 IVAEINYAIGATGIISGECKLVVDQYADFIIQMLMSKLLTPLKICAKAGAC 299
>gi|218188712|gb|EEC71139.1| hypothetical protein OsI_02961 [Oryza sativa Indica Group]
Length = 540
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/289 (59%), Positives = 220/289 (76%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
L LKNF++AQYFGEIG+G PPQNF+V+FDTGSSNLWVPS+KC FS++CYFH +Y+SR S+
Sbjct: 144 LALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKCVFSLACYFHRKYESRSSS 203
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G I+YG+GSI G++SQD V +GD+VV +Q FIEAT E LTFL A+FDGI+G
Sbjct: 204 TYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIEATHEPGLTFLAAKFDGILG 263
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+V A PVW NM++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HT
Sbjct: 264 LGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHT 323
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y VT+K YWQFE+GD LIG +STG+C GCA I DSGTSL+AGP + +I+ IG G
Sbjct: 324 YTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATG 383
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V + ECK VV+++G + +LL P +VC +IGLC +GA + GI
Sbjct: 384 VANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGLCKSDGAHGISDGI 432
>gi|115438741|ref|NP_001043650.1| Os01g0631900 [Oryza sativa Japonica Group]
gi|55297073|dbj|BAD68642.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113533181|dbj|BAF05564.1| Os01g0631900 [Oryza sativa Japonica Group]
Length = 522
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/289 (59%), Positives = 220/289 (76%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
L LKNF++AQYFGEIG+G PPQNF+V+FDTGSSNLWVPS+KC FS++CYFH +Y+SR S+
Sbjct: 129 LALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKCVFSLACYFHRKYESRSSS 188
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G I+YG+GSI G++SQD V +GD+VV +Q FIEAT E LTFL A+FDGI+G
Sbjct: 189 TYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIEATHEPGLTFLAAKFDGILG 248
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF+EI+V A PVW NM++Q LV+++VFSFWLNR+ + GGEIVFGG D H+KG HT
Sbjct: 249 LGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDINGGEIVFGGADESHYKGDHT 308
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y VT+K YWQFE+GD LIG +STG+C GCA I DSGTSL+AGP + +I+ IG G
Sbjct: 309 YTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSLIAGPIAAIAQIHAHIGATG 368
Query: 315 VVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
V + ECK VV+++G + +LL P +VC +IGLC +GA + GI
Sbjct: 369 VANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGLCKSDGAHGISDGI 417
>gi|75267434|sp|Q9XFX3.1|CARDA_CYNCA RecName: Full=Procardosin-A; Contains: RecName: Full=Cardosin-A
intermediate form 35 kDa subunit; Contains: RecName:
Full=Cardosin-A heavy chain; AltName: Full=Cardosin-A 31
kDa subunit; Contains: RecName: Full=Cardosin-A
intermediate form 30 kDa subunit; Contains: RecName:
Full=Cardosin-A light chain; AltName: Full=Cardosin-A 15
kDa subunit; Flags: Precursor
gi|4581209|emb|CAB40134.1| preprocardosin A [Cynara cardunculus]
Length = 504
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 192/348 (55%), Positives = 240/348 (68%), Gaps = 6/348 (1%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L+ L S + S +GL RIGLKKR++D ++ R R G R + DS
Sbjct: 14 LFYLLSPTVFSVSDDGLIRIGLKKRKVD--RIDQLRGRRALMEGNARKDFGFRGTVRDSG 71
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSKC S +C HS Y+S
Sbjct: 72 SAVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYESS 131
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY E G I YG+GSI+GFFSQD+V +GD+VVK+Q FIEAT E FL FDG
Sbjct: 132 DSSTYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFDG 191
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL F+ I+V PVW NM+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G
Sbjct: 192 ILGLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRG 247
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 248 DHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 307
Query: 312 GEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYV 359
GV++ +CK VVS+YG I ++L S + P+K+C + LC F+GA V
Sbjct: 308 ANGVMNQQCKTVVSRYGRDIIEMLRSKIQPDKICSHMKLCTFDGARDV 355
>gi|302756359|ref|XP_002961603.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
gi|300170262|gb|EFJ36863.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
Length = 423
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/272 (63%), Positives = 212/272 (77%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS KC S SC+FH RYK+ +S+TY G
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRYKAGQSSTYKPNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
S I YGSGS+SGF S D+V +G + VK +VF EAT E LTF+ A+FDGI+GLGF+ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
A VP+W ++VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+GYW+ +GD+LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS EC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
KL+V QYG++I +LL++ + P+KVC Q+G+C+
Sbjct: 241 KLIVDQYGNIIINLLLAQVSPDKVCSQLGVCS 272
>gi|449533814|ref|XP_004173866.1| PREDICTED: aspartic proteinase-like, partial [Cucumis sativus]
Length = 290
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 178/282 (63%), Positives = 224/282 (79%), Gaps = 4/282 (1%)
Query: 8 SVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV--R 64
+ CL++L S ++ + SN GL R+GLKK LD + AAR+ K+ + A
Sbjct: 9 AFLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYSPN 68
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
LG+S D DI+ LKN++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS+KC FS++C+
Sbjct: 69 GNLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACH 128
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
FH+RYKS +S+TY + G S I YG+G++SGFFS DNV+VGD+VVK+Q+FIEATRE LT
Sbjct: 129 FHARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLT 188
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL+A+FDG++GLGF+EIAVG AVPVW NMVEQGLV E VFSFWLNR+ + EEGGEIVFGG
Sbjct: 189 FLVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGG 248
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGC 285
VDPKH+ GKHTYVPVT+KGYWQF++GD+LI + TG CEGGC
Sbjct: 249 VDPKHYTGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGC 290
>gi|302775562|ref|XP_002971198.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
gi|300161180|gb|EFJ27796.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
Length = 423
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 171/272 (62%), Positives = 212/272 (77%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
MDAQY+GEIGIGSPPQ F+VIFDTGSSNLWVPS KC S SC+FH R+K+ +S+TY G
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRFKAGQSSTYKPNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
S I YGSGS+SGF S D+V +G + VK +VF EAT E LTF+ A+FDGI+GLGF+ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
A VP+W ++VEQ LV E VFSFWLNRD GGE+V GGVDPKHFKGKH Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+GYW+ +GD+LI TG+C GCAAIVDSGTSLLAGP+ ++ EINHAIG GVVS EC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 321 KLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
KL+V QYG++I +LL++ + P+KVC Q+G+C+
Sbjct: 241 KLIVDQYGNIIINLLLAQVSPDKVCSQLGVCS 272
>gi|242053731|ref|XP_002456011.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
gi|241927986|gb|EES01131.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
Length = 567
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 174/293 (59%), Positives = 222/293 (75%), Gaps = 2/293 (0%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
+I+ LKNF++AQYFG+IG+G PPQNF+V+FDTGS+NLWVPS+KC+FS++C FH +Y S +
Sbjct: 135 NIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKYDSSQ 194
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G I+YG+G I+GF+SQD V VG++VV++Q FIEAT E TFLLA+FDGI
Sbjct: 195 SSTYKPNGTPASIHYGTGGIAGFYSQDEVTVGNLVVQNQEFIEATHEPGFTFLLAKFDGI 254
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEIVFGGVDPKHFKG 251
+GL F+EI+V +VPVW NMV Q LV + VFSFWLNR+P D EEGGEIVFGG D +H+KG
Sbjct: 255 LGLAFQEISVEGSVPVWYNMVNQSLVPQPVFSFWLNRNPFDGEEGGEIVFGGSDEQHYKG 314
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
HTY VT+K YWQFE+GD LIG +STG+C GCAAI DSGTSL+AGP + +IN IG
Sbjct: 315 SHTYTRVTRKAYWQFEMGDFLIGERSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIG 374
Query: 312 GEGVVSAECKLVVSQYG-DLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
GVV+ ECK VV+ YG +++ L P +VC +IGLC +G V GI
Sbjct: 375 AAGVVNHECKQVVAGYGLEMVELLKAQQTPPSQVCSKIGLCTLDGTHGVSAGI 427
>gi|87241358|gb|ABD33216.1| Peptidase A1, pepsin [Medicago truncatula]
Length = 396
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 173/248 (69%), Positives = 206/248 (83%)
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
++CY H+ YK++KS TY + G SC+I+YG+GSISG+FSQDNV+VG VVK Q FIEAT
Sbjct: 6 LQLACYTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEAT 65
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
REGSL+FL +FDGI GLGF+EI+V A+PVW NM+EQ L+ E+VFSFWLN +P+A++GG
Sbjct: 66 REGSLSFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGG 125
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E+VFGGVDPKHFKGKHTYVPVT+KGYWQ E+GD IG STGVCEGGCAAIVDSGTSLLA
Sbjct: 126 ELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLA 185
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
GPTPVV EINHAIG EGV+S ECK VVSQYG+LIWDLLVSG+ P VC Q+GLC+ G +
Sbjct: 186 GPTPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGVKPGDVCSQVGLCSIRGDQ 245
Query: 358 YVRLGIPI 365
GI +
Sbjct: 246 SNSAGIEM 253
>gi|418731269|gb|AFX67029.1| aspartic protease, partial [Solanum tuberosum]
Length = 372
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 164/227 (72%), Positives = 194/227 (85%)
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+SC I YG+GSISG FS DNV+VGD+VVKDQVFIEATRE S+TF++A+FDGI+GLG
Sbjct: 1 TRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATREPSITFIVAKFDGILGLG 60
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+EI+VG+ PVW NMV QGLV E VFSFW NRD +A+EGGE+VFGGVDPKHFKG HTYV
Sbjct: 61 FQEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYV 120
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T+KGYWQF +GD LIGN STG C GGCAAIVDSGTSLLAGPT +VT+INHAIG EG+V
Sbjct: 121 PLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIV 180
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
S ECK +VSQYG++IWDLLVSG+ P++VC Q GLC +GA++V I
Sbjct: 181 SMECKTIVSQYGEMIWDLLVSGVRPDQVCSQAGLCFVDGAQHVSSNI 227
>gi|356547093|ref|XP_003541952.1| PREDICTED: LOW QUALITY PROTEIN: cyprosin-like, partial [Glycine
max]
Length = 470
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 182/345 (52%), Positives = 235/345 (68%), Gaps = 32/345 (9%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
++L SNG+ R+GL+K + D +++ GG S D I+ LK
Sbjct: 12 VVLSGPSNGIIRVGLEKNKFD----------QRKTPFGGYENS--------DDTSIIRLK 53
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
N+M+AQYFGEIGIG+P Q F+VIFDTGSSNLWVPSSKCYFS++CY HSRYKS +S+T +
Sbjct: 54 NYMNAQYFGEIGIGTP-QKFTVIFDTGSSNLWVPSSKCYFSVACYLHSRYKSSQSSTQNK 112
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G S EI YG+G ISGFFSQD V+VGD++V TR LL +I L F+
Sbjct: 113 NGSSAEIRYGTGQISGFFSQDYVKVGDLIV-------LTR----XILLNEHFCVI-LQFK 160
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+VG P+W NM+ Q L+++ VFSFWLNR+ D ++GG+IVFGGVD H+ G+HTYVPV
Sbjct: 161 SISVGKVSPIWYNMLNQHLLAQPVFSFWLNRNTDEKQGGQIVFGGVDSDHYXGEHTYVPV 220
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T KGYWQ E+GD+LI ++T C C+AI DSGTSLLAGPT + +INHAIG GVV+
Sbjct: 221 THKGYWQTEIGDVLIDRKTTEFCASKCSAIDDSGTSLLAGPTGAIAQINHAIGAVGVVNQ 280
Query: 319 ECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
ECK VV+QYG I D L++ LP++VC Q LC F+G + V +GI
Sbjct: 281 ECKAVVAQYGKTILDKLINEALPQQVCSQX-LCTFDGTKGVSMGI 324
>gi|222424506|dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]
Length = 389
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 159/243 (65%), Positives = 196/243 (80%)
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +YKS +S+TY + GK+ I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E
Sbjct: 1 ACLLHPKYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEP 60
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF++A+FDGI+GLGF+EI+VG A PVW NM++QGL+ E VFSFWLNR+ D EEGGE+V
Sbjct: 61 GITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELV 120
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP HFKGKHTYVPVT+KGYWQF++GD+LIG TG CE GC+AI DSGTSLLAGPT
Sbjct: 121 FGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPT 180
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVR 360
++T INHAIG GVVS +CK VV QYG I DLL+S P+K+C QIGLC F+G V
Sbjct: 181 TIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPKKICSQIGLCTFDGTRGVS 240
Query: 361 LGI 363
+GI
Sbjct: 241 MGI 243
>gi|307103455|gb|EFN51715.1| hypothetical protein CHLNCDRAFT_59800 [Chlorella variabilis]
Length = 523
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 171/339 (50%), Positives = 230/339 (67%), Gaps = 12/339 (3%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
CL+ A + G ++ L R+L L + + K R + A + ++D + +P+
Sbjct: 13 CLVATAQATGPLKVHL--RKLPLVAEQRQHLKDKHRLVTLAPAA-------ENDAEPVPI 63
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
NFMDAQY+GEIG+GSPPQ+F VIFDTGSSNLWVPSSKC Y S++CY HS+Y + +S+TY
Sbjct: 64 TNFMDAQYYGEIGLGSPPQSFQVIFDTGSSNLWVPSSKCSYLSVACYLHSKYYAERSHTY 123
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
E G+ I YGSG +SGF SQD + +G + V+ QVF EAT E SL F+ ARFDGI+G+G
Sbjct: 124 KEDGREFAIQYGSGQLSGFLSQDTLSMGGLKVEGQVFAEATMEPSLAFIAARFDGILGMG 183
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F EIAVG P + NM++Q L+ E VFSFWLNR + EEGGE+V GGVDP HF G+HT+V
Sbjct: 184 FPEIAVGKVTPPFQNMLQQSLLPEPVFSFWLNRKVEGEEGGELVLGGVDPDHFVGEHTWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT++G+WQF++ + + C+GGC AI D+GTSLL GP V+ IN AIG E V+
Sbjct: 244 PVTRRGFWQFKMDGMEVEGGGE-FCKGGCQAIADTGTSLLVGPPDVIDAINAAIGAEPVL 302
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNG 355
+CK +V QY I L++ + P+ VCQ +GLC+ G
Sbjct: 303 VEQCKEMVHQYLPEIIK-LINNMPPQAVCQSVGLCSAAG 340
>gi|218196057|gb|EEC78484.1| hypothetical protein OsI_18377 [Oryza sativa Indica Group]
Length = 389
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 150/237 (63%), Positives = 196/237 (82%)
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+ +S+KS++Y G++C+I YGSG+ISGFFS+DNV VGD+VVK+Q FIEATRE S+TF++
Sbjct: 12 QIQSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFII 71
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+FDGI+GLG+ EI+VG A P+W +M EQ L++++VFSFWLNRDPDA GGE+VFGG+DP
Sbjct: 72 GKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDP 131
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
KH+KG HTYVPV++KGYWQF +GD+LI STG C GCAAIVDSGTSLLAGPT +V ++
Sbjct: 132 KHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQV 191
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
NHAIG EG++S ECK VVS+YG++I +LL++ P+KVC Q+GLC F+G V GI
Sbjct: 192 NHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQKVCSQVGLCMFDGKRSVSNGI 248
>gi|384245845|gb|EIE19337.1| putative aspartic protease [Coccomyxa subellipsoidea C-169]
Length = 508
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 169/354 (47%), Positives = 232/354 (65%), Gaps = 9/354 (2%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M K+ R+ F + S LL + R+ LKKR LD + A + R V
Sbjct: 2 MGTKMKRAGFLSLLCLSIGLLAQAQQSPLRVPLKKRTLDAEQVRATQTALHAR-----NV 56
Query: 61 SGVRHRL-GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
V + L G+ +E +PL +F+DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPSS+C YF
Sbjct: 57 RNVANALRGEPEEADIPLLDFLDAQYYGEIGLGTPEQKFTVVFDTGSSNLWVPSSQCSYF 116
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
++C H+++ + KS TY G I YGSGS+SGFFS D + +G + V++Q F EAT+
Sbjct: 117 DLACLLHNKFYASKSRTYQANGTDFAIQYGSGSLSGFFSTDVLSLGSLNVQNQTFAEATK 176
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F+ A+FDGI+GL F EI++G+ P + NMV+QGLV E VFSFWLNR+ + GGE
Sbjct: 177 EPGLAFVAAKFDGILGLAFPEISIGEVTPPFQNMVQQGLVPEPVFSFWLNRNDPSGPGGE 236
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVDP H+ G+H +V VT++ YWQF+LG I + ++ C GC AI DSGTSL+ G
Sbjct: 237 LVLGGVDPSHYTGEHLWVNVTRRAYWQFDLGGISVPGTNS-PCADGCQAIADSGTSLIVG 295
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCA 352
P+ + EIN AIG +GV+ AEC+ +V QY I ++S L E+VC IGLC+
Sbjct: 296 PSDEIAEINRAIGAKGVLPAECRELVRQYVPEIMKAVIS-LPEEQVCGAIGLCS 348
>gi|33352213|emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 320 bits (821), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 169/372 (45%), Positives = 235/372 (63%), Gaps = 19/372 (5%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M + + ++ L +++ L + A G+ R+ L+K + L +L R Y+
Sbjct: 1 MARSYVPALIALAAVSALLGVAAEQQAGMLRVTLRKTEM-LTTLG-----RPRPYL---- 50
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
G + LG SD+ + LKNFMDAQY+GEIG+G+PPQ F+VIFDTGS+NLWVPSSKC F
Sbjct: 51 -LGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFNVIFDTGSANLWVPSSKCALF 109
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+I+C H +Y + KS TY G I YG+GS+ G+ SQD + G + +KDQ F EA
Sbjct: 110 NIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQDVLTWGGLTIKDQGFAEAIN 169
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF+ A+FDGI+G+GF I+V P + +VE+G ++ VFSFWLNRDP+A GGE
Sbjct: 170 EPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGE 229
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GG+DP HF G+HT+VPVT++GYWQF + + +G S +C GCAAI D+GTSL+AG
Sbjct: 230 LVLGGIDPTHFTGEHTWVPVTRQGYWQFTMEGLDLGPGSQKMCAKGCAAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLP-EKVCQQIGLCAFNGAE 357
P+ V +NHAIG +SA+C+ +V Y I L LP ++VC IGLC A
Sbjct: 290 PSDEVAALNHAIGATSALSAQCRQLVRDYLPQIIAQLHD--LPLDQVCASIGLCPMAAAS 347
Query: 358 YVRLGIPITRVL 369
++ P R+L
Sbjct: 348 TIK---PARRLL 356
>gi|413946558|gb|AFW79207.1| hypothetical protein ZEAMMB73_486493 [Zea mays]
Length = 382
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 149/233 (63%), Positives = 183/233 (78%), Gaps = 1/233 (0%)
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
+K+ TY GK I YG+GSI+GFFS+D+V +GD+VVKDQ FIEAT+E LTF++A+FD
Sbjct: 4 KKTKTYMS-GKPAAIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFD 62
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLGF+EI+VG+A PVW NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+K
Sbjct: 63 GILGLGFQEISVGNATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYK 122
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G HT+VPVT+KGYWQF +GD+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN I
Sbjct: 123 GDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKI 182
Query: 311 GGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
G GVVS ECK VVSQYG I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 183 GAAGVVSQECKTVVSQYGQQILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 235
>gi|510880|emb|CAA56373.1| putative aspartic protease [Brassica oleracea]
Length = 255
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 154/250 (61%), Positives = 197/250 (78%), Gaps = 12/250 (4%)
Query: 14 VLASCLLLPASS---NGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSGVRHRLGD 69
+++ L L AS+ +G R+GLKK +LD S AAR+ K+ + + G G LGD
Sbjct: 13 IVSFLLFLSASAERNDGTFRVGLKKLKLDRKSRIAARVGSKQLKPLRGYG-------LGD 65
Query: 70 S-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
S D DI+ LKN++DAQY+GEI IG+PPQ F+V+FDTGSSNLWVPSSKCYFSI+C FHS+Y
Sbjct: 66 SGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSIACLFHSKY 125
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
KS +S+TY + GKS I+YG+G+I+GFFS D V VGD+VVKDQ FIEAT+E +TF+LA+
Sbjct: 126 KSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVLAK 185
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLGF+EI+VG+A PVW NM++QGL E VFSFWLNR+ + EEGGE+VFGGVDP H
Sbjct: 186 FDGILGLGFQEISVGNAAPVWYNMLKQGLYKEPVFSFWLNRNAEDEEGGELVFGGVDPNH 245
Query: 249 FKGKHTYVPV 258
+KG+H YVPV
Sbjct: 246 YKGEHIYVPV 255
>gi|302840660|ref|XP_002951885.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
gi|300262786|gb|EFJ46990.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
Length = 559
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 218/340 (64%), Gaps = 20/340 (5%)
Query: 14 VLASCLLLPASSNG-LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
VL +C +L + +G L R+ LKK++L L A R Y+ + LG +
Sbjct: 14 VLVACTVLASGDSGALHRVQLKKKQLSL-----ATYGRPRPYL--------NNMLGYGGD 60
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSR 131
+PL NFMDAQY+GE+ +G+P Q F VIFDTGSSNLWVPSSKC +F+I+C H RY +
Sbjct: 61 --VPLHNFMDAQYYGEVSLGTPQQYFQVIFDTGSSNLWVPSSKCSFFNIACRLHRRYYAA 118
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S TY G + I YGSGS+ GF S+D + G + V +Q F EA E LTF+ A+FDG
Sbjct: 119 RSKTYKANGTAFSIQYGSGSLDGFISEDILGWGGLAVPEQGFAEAVNEPGLTFVAAKFDG 178
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+GF I+V VP + +V+ GL+SE VFSFWLNRD A GGE+V GGVDP HF G
Sbjct: 179 ILGMGFPAISVSGVVPPFTRLVDSGLLSEPVFSFWLNRDSSAAVGGELVLGGVDPAHFTG 238
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+HT+V VT++GYWQF L I +G+Q +C GC AI D+GTSL+AGP V INHAIG
Sbjct: 239 EHTWVDVTRRGYWQFNLDGIHLGSQR--LCTQGCPAIADTGTSLIAGPVDEVAAINHAIG 296
Query: 312 GEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
+SA+C+ +V +Y I L L ++VC IGLC
Sbjct: 297 ATSALSAQCRTLVREYLPEIVAAL-HNLPLDQVCASIGLC 335
>gi|440803835|gb|ELR24718.1| aspartic proteinase, partial [Acanthamoeba castellanii str. Neff]
Length = 489
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 204/319 (63%), Gaps = 19/319 (5%)
Query: 35 KRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
+R+ +L + A +KE + GG GV P+ NF+DAQY+GEI IG+P
Sbjct: 59 QRKAELKKVEA---MKKEVFGGGKGVE--------------PISNFLDAQYYGEISIGNP 101
Query: 95 PQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ F+V+ DTGSSNLWVPS +C ++ I+C H +Y KS+TY G + +I YGSG++S
Sbjct: 102 PQYFNVVLDTGSSNLWVPSIQCPWYEIACDLHHKYDHSKSSTYKANGTNFQIQYGSGAMS 161
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S DNV + + K Q+F EA E L F+ A+FDGI+GLGF I+V PVW ++
Sbjct: 162 GFLSADNVVIAGLTAKGQLFAEAVAEPGLAFVAAQFDGILGLGFDTISVDGVPPVWYTLL 221
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q V+E VF+FWLNRDP GGE+V GGVD H+ G TY P+TK+GYWQF D LI
Sbjct: 222 AQSQVAEPVFAFWLNRDPSGISGGELVLGGVDESHYTGDFTYTPITKEGYWQFLAHDFLI 281
Query: 274 GNQSTGVCE-GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIW 332
+S G C GGC AI D+GTSLLAGP+ +V +IN I G++ +EC ++V+QY I
Sbjct: 282 NGKSMGFCPAGGCKAIADTGTSLLAGPSKIVAQINKMINATGILESECDMLVNQYAGQII 341
Query: 333 DLLVSGLLPEKVCQQIGLC 351
++ GL P++VC + LC
Sbjct: 342 QYILQGLQPDQVCSAVNLC 360
>gi|4389326|pdb|1B5F|A Chain A, Native Cardosin A From Cynara Cardunculus L.
gi|6729875|pdb|1B5F|C Chain C, Native Cardosin A From Cynara Cardunculus L
Length = 239
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 151/238 (63%), Positives = 180/238 (75%), Gaps = 4/238 (1%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSKC S +C HS Y+S S
Sbjct: 4 VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYESSDS 63
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY E G I YG+GSI+GFFSQD+V +GD+VVK+Q FIEAT E FL FDGI+
Sbjct: 64 STYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFDGIL 123
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GL F+ I+V PVW NM+ QGLV E FSFWLNR+ D EEGGE+VFGG+DP HF+G H
Sbjct: 124 GLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRGDH 179
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
TYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INHAIG
Sbjct: 180 TYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIG 237
>gi|145352062|ref|XP_001420378.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580612|gb|ABO98671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 454
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 150/290 (51%), Positives = 197/290 (67%), Gaps = 11/290 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
+ N+MDAQY+GEI IG+P Q F V+FDTGSSNLWVPSSKC + I C H+++ SR S T
Sbjct: 18 VHNYMDAQYYGEIEIGNPRQKFQVVFDTGSSNLWVPSSKCGFLQIPCDLHAKFDSRASET 77
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SGF S+D V+VGD+VV+ Q F EAT+E + FL ++FDGI+GL
Sbjct: 78 YEADGTPFAIQYGSGSLSGFLSKDEVKVGDLVVQGQYFAEATKEPGIAFLFSKFDGILGL 137
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPD-----AEEGGEIVFGGVDPKHFK 250
GF IAV PV+ NM+EQGLV ++FSFWLNR +E GGE++FGG DP HF
Sbjct: 138 GFDNIAVDKVKPVFYNMMEQGLVENKMFSFWLNRTSTKDGMPSEVGGELIFGGSDPDHFI 197
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG--GCAAIVDSGTSLLAGPTPVVTEINH 308
G+HTY PVT++GYWQ ++ D + +S G C+G GC I D+GTSLLAGPT +V +IN
Sbjct: 198 GEHTYAPVTREGYWQIKMDDFKVDGRSLGACDGDDGCQVIADTGTSLLAGPTEIVNKIND 257
Query: 309 AIGGEGVVSAECKLVVSQYGD-LIWDLLVSGLLPEKVCQQIGLCAFNGAE 357
IG ++ EC+L++ QY + + DL E++C IG C +G E
Sbjct: 258 YIGAHSMIGEECRLLIDQYAEQFVEDL--ENYSSEQICASIGACDADGVE 305
>gi|291223847|ref|XP_002731917.1| PREDICTED: putative gut cathepsin D-like aspartic protease-like
[Saccoglossus kowalevskii]
Length = 389
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 209/309 (67%), Gaps = 7/309 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
+ L CLL + GL+RI L K R L+ +T K+ + G+ +++ G
Sbjct: 1 MRTLLICLLFVGLACGLQRIHLHKFRSVRRQLSDVGVTIKDLALSGS----LKYTQGAPI 56
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKS 130
++L KN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS KC + I+C FH +Y S
Sbjct: 57 PEVL--KNYLDAQYYGEIGLGTPQQKFNVVFDTGSSNLWVPSKKCPITDIACLFHKKYDS 114
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G EI YGSGS+ GF S+D++ + DVV K Q F EAT+E L F+ A+FD
Sbjct: 115 TKSSTYKVNGTKFEIQYGSGSMEGFLSEDSIAISDVVAKSQTFAEATKEPGLAFVAAKFD 174
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+G+ +I+V VPV DNM++Q L+ + VFSF+L+R+ + +GGE+ GG DPK++
Sbjct: 175 GILGMGYPQISVDGVVPVIDNMIQQQLIEKPVFSFYLDRNVNDSQGGELFLGGSDPKYYT 234
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G TYVPVT+KGYWQF++ I +G ++ C+GGC AI D+GTSL+AGPT V IN AI
Sbjct: 235 GNFTYVPVTRKGYWQFKMDGITLGGSASQFCKGGCQAIADTGTSLIAGPTEEVQAINKAI 294
Query: 311 GGEGVVSAE 319
G +VS E
Sbjct: 295 GATPIVSGE 303
>gi|218944225|gb|ACL13150.1| cathepsin D [Azumapecten farreri]
Length = 396
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 203/315 (64%), Gaps = 12/315 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
V C++ L + + A S+ L RI L K R L + + K RY G+S
Sbjct: 3 VLCIFALLAVI---ACSSALHRIKLHRVKTVRRSLQEVGTSINLLKNRY---TGLSDRNG 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
RL D + PL N++DAQY+G I IG+P Q F V+FDTGSSNLWVPS KC S I+C
Sbjct: 57 RLLGPDPE--PLSNYLDAQYYGAIQIGTPAQEFKVVFDTGSSNLWVPSKKCKLSDIACLL 114
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S KS+TY + G EI YG+GS++GF S D+V +GD+ VK Q F EA + +TF
Sbjct: 115 HNKYDSTKSSTYKQNGTHFEIRYGTGSLTGFLSTDSVTIGDITVKGQTFAEAITQPGITF 174
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+G+ I+V VPV+ NMV+Q LV VFSF+L+RDPDA GGE++ GG
Sbjct: 175 VAAKFDGILGMGYDTISVDHVVPVFYNMVQQKLVDSPVFSFYLDRDPDASAGGELIIGGS 234
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DPKH+ G +Y P+TKKGYWQF++ I +G +++ C GGC+AI D+GTSLL GPT V
Sbjct: 235 DPKHYSGNFSYAPITKKGYWQFDMAGIQVGGKASAYCNGGCSAIADTGTSLLVGPTAEVQ 294
Query: 305 EINHAIGGEGVVSAE 319
++N IG E
Sbjct: 295 QLNKQIGATPFAGGE 309
>gi|117662285|gb|ABK55693.1| aspartic proteinase [Cucumis sativus]
Length = 196
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 137/196 (69%), Positives = 168/196 (85%)
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS+YKS++S+TY + GKS I YG+G+ISG FS+DNV+VGD++VK Q FIEATRE
Sbjct: 1 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGCFSEDNVKVGDLIVKKQDFIEATREP 60
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
SLTF+LA+FDGI+GLGF+EI+VGDAVPVW NMV+Q LV E VFSFW NR+ D E+GGEIV
Sbjct: 61 SLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIV 120
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVDP H+KG+HTYVPVTKKGYWQF++GD+LI +TG C GGC+AI DSGTSLLAGPT
Sbjct: 121 FGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPT 180
Query: 301 PVVTEINHAIGGEGVV 316
++T++NHAIG GVV
Sbjct: 181 TIITQVNHAIGASGVV 196
>gi|303285091|ref|XP_003061836.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457166|gb|EEH54466.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 151/330 (45%), Positives = 214/330 (64%), Gaps = 12/330 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L R+ L KR +D +++A R+ A ++ + G + + + N+MDAQYFG
Sbjct: 54 LPRVSLSKRVVDARAVHA-RVVATRANEANARLNSM---YGADADARVSITNYMDAQYFG 109
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEIN 146
+ IG+PPQ+F V+FDTGSSNLWVPSSKC F+ I C H +Y ++ S+T+ + G I
Sbjct: 110 AVSIGTPPQSFDVVFDTGSSNLWVPSSKCKFTQIPCDLHHKYDAKASSTHAQNGTDFAIQ 169
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YGSGS+SGF S D V G + + Q F EATRE L F+ A+FDGI+G+G+ I+V V
Sbjct: 170 YGSGSLSGFLSADVVGWGGLEIASQTFAEATREPGLAFMFAKFDGILGMGWDTISVDKVV 229
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRD---PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P + N QGLV ++VFSFWLNRD PD GGE+V GGVDP H+ G+H ++PVT++GY
Sbjct: 230 PPFYNAYAQGLVPDDVFSFWLNRDESHPDG-PGGELVLGGVDPAHYVGEHAWLPVTREGY 288
Query: 264 WQFELGDILIGNQSTGVCE--GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
WQ + D+++ S G C+ GCAAI+D+GTSLLAGP V+ +IN IG +++ EC+
Sbjct: 289 WQVRMDDVIVDGASAGECDETDGCAAILDTGTSLLAGPKDVIEKINAKIGARPILNEECR 348
Query: 322 LVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
+++ QYG+ + D V P+ +C GLC
Sbjct: 349 VMIEQYGEELID-DVKKFGPKAICVSAGLC 377
>gi|336454164|gb|AEI58896.1| cathepsin D [Pinctada maxima]
Length = 390
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 142/287 (49%), Positives = 199/287 (69%), Gaps = 8/287 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL---GDSDEDILPLKNFMDAQYFGEIGIGS 93
R+ LH + + R T +E G + ++ + G + PL N++DAQY+G IGIG+
Sbjct: 21 RIKLHKIKSVRRTLQEV---GTSIESLQQKYSGYGITGPAPEPLSNYLDAQYYGVIGIGT 77
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
P QNF V+FDTGSSNLWVPS KC + I+C H++Y S KS+TY + G EI YG+GS+
Sbjct: 78 PAQNFKVVFDTGSSNLWVPSKKCKVTDIACLLHNKYDSSKSSTYKKNGTDFEIRYGTGSL 137
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+ F +I+V VPV+ NM
Sbjct: 138 TGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMAFEKISVDGVVPVFYNM 197
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
V+QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYWQFE+ +
Sbjct: 198 VKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWQFEMDGVS 257
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+G S C GGC AI D+GTSL+AGPT ++++N AIG + +V+ E
Sbjct: 258 VGG-SAKFCSGGCNAIADTGTSLIAGPTSEISKLNKAIGAKPLVAGE 303
>gi|255085919|ref|XP_002508926.1| predicted protein [Micromonas sp. RCC299]
gi|226524204|gb|ACO70184.1| predicted protein [Micromonas sp. RCC299]
Length = 557
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 172/371 (46%), Positives = 231/371 (62%), Gaps = 30/371 (8%)
Query: 5 LLRSVFCLWVLASCLLLPA-------SSNGLRRIGLKKRRLDL-----HSLNAARITRKE 52
+LRS+ L+++ + L A S+ L R + KR L ++ AR R E
Sbjct: 4 ILRSIVALFLVCALCLAAAPGASALVESSHLPRAKVHKRALGPPETVKKCVDVARRARYE 63
Query: 53 RYMGGAGVSGVRHRLGDSDEDILP------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGS 106
R+ A + HR D D L + N+MDAQY+G + IG+PPQ+F V+FDTGS
Sbjct: 64 RF--SARLHDEPHR--DPDGPTLAGGTPECISNYMDAQYYGAVSIGTPPQSFLVVFDTGS 119
Query: 107 SNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGD 165
SNLW+PS+KC F I C H +Y+S S+TY +G I YGSGS+SGF SQD V
Sbjct: 120 SNLWIPSAKCSFLQIPCDLHQKYRSGDSSTYKALGDPFAIQYGSGSLSGFLSQDTVTWAG 179
Query: 166 VVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSF 225
+ +KDQVF EAT+E + FL ++FDGI+G+G+ I+V P + N V+QGLV E VFSF
Sbjct: 180 LEIKDQVFAEATKEPGIAFLFSKFDGILGMGWDTISVNGVKPPFYNAVDQGLVVENVFSF 239
Query: 226 WLNR---DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC- 281
WLNR + EGGEIV GGVDP HF G+HT++ VT++GYWQ + D+L+G S G C
Sbjct: 240 WLNRDADEGGDGEGGEIVLGGVDPAHFVGEHTWLNVTREGYWQIAMDDVLLGGVSVGQCG 299
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD-LIWDLLVSGLL 340
+ GCAAIVD+GTSLLAGPT VV +N IG + V+ EC++++ QYGD LI DL +
Sbjct: 300 KKGCAAIVDTGTSLLAGPTKVVEALNKRIGAKSVLGEECRVMIDQYGDELIRDL--AEFS 357
Query: 341 PEKVCQQIGLC 351
+C +GLC
Sbjct: 358 ATDICTSVGLC 368
>gi|329754204|gb|AEC03508.1| cathepsin-D [Polyrhachis vicina]
Length = 384
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 144/284 (50%), Positives = 193/284 (67%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + +AR KE S ++ + + PL N++DAQY+G I IG+PPQ
Sbjct: 20 RIPLHKIKSARKHFKEVDTEICPTSILQGGMPHPE----PLSNYLDAQYYGAISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
NF VIFDTGSSNLWVPS KC+F+ I+C H++Y + KS+TY + G I+YGSGS+SG+
Sbjct: 76 NFKVIFDTGSSNLWVPSKKCHFTNIACLLHNKYDTTKSSTYKKNGTDFAIHYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + VKDQ F EA E L F+ A+FDGI+G+ + I+V PV+ NMV+Q
Sbjct: 136 LSTDTVTIGGLKVKDQTFAEAMSEPGLAFVAAKFDGILGMAYTTISVDGVTPVFYNMVKQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLVS+ VFSF+LNRDPDA+EGGE++ GG DP H+KG TYVPV +K YWQF++ + IG+
Sbjct: 196 GLVSQPVFSFYLNRDPDAKEGGELILGGSDPNHYKGDFTYVPVDRKAYWQFKMDSVQIGS 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+C+ GC AI D+GTSL+AGP + IN AIG +V E
Sbjct: 256 D-LKLCKQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIVGGE 298
>gi|405951067|gb|EKC19012.1| Lysosomal aspartic protease [Crassostrea gigas]
Length = 439
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 144/297 (48%), Positives = 195/297 (65%), Gaps = 3/297 (1%)
Query: 36 RRLDLHSLN-AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
+R+ LH ++ R T ER + +R ++ + PL N+MDAQY+G I IG+P
Sbjct: 21 QRIKLHKIDKTVRETLLERGTTAEYLKRKYNRY-ETGPEPEPLSNYMDAQYYGPISIGTP 79
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQNF VIFDTGSSNLWVPS KC S I+C H++Y S KS+TY G EI YG+GS+
Sbjct: 80 PQNFKVIFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTKSSTYKANGTDFEIRYGTGSLK 139
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S D V VGD+ VKDQ F EAT + +TF+ A+FDGI+G+GF EI+V PV++NMV
Sbjct: 140 GFLSTDTVTVGDIKVKDQTFAEATEQPGITFVAAKFDGILGMGFPEISVKGVTPVFNNMV 199
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q LV +FSF+L+R+P GGE++ GG DPK++ G TYV VT+KGYWQF++ + +
Sbjct: 200 AQKLVPAPIFSFYLDRNPTGTPGGEMILGGSDPKYYSGNFTYVNVTRKGYWQFKMDGVKV 259
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+++ C GGC AI D+GTSLLAGP+ V +N IG + + E + S+ G L
Sbjct: 260 NGKASKYCSGGCNAIADTGTSLLAGPSTEVKSLNAMIGAKPFAAGEYTVDCSKIGSL 316
>gi|224548868|dbj|BAH24176.1| aspartic proteinase [Sitophilus zeamais]
Length = 389
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 138/279 (49%), Positives = 189/279 (67%), Gaps = 3/279 (1%)
Query: 42 SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI 101
SL + R G V V+ R D PL N++DAQY+G I IG+PPQNF+VI
Sbjct: 25 SLTKGKSVRNTLRDVGTHVQQVKLRYVSVDPSPEPLTNYLDAQYYGPISIGTPPQNFNVI 84
Query: 102 FDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDN 160
FDTGSSNLWVPS KC +I+C H++Y + KS+TY E G I YGSGS+SG+ S D+
Sbjct: 85 FDTGSSNLWVPSKKCELLNIACLLHNKYDATKSSTYKENGTEFAITYGSGSLSGYLSTDS 144
Query: 161 VEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSE 220
+ VG V VKDQ F EA +E LTF+ A+FDGI+G+ + I+V PV+ NM++Q LV+
Sbjct: 145 LSVGSVQVKDQTFGEAIKEPGLTFIAAKFDGILGMAYPRISVDGVTPVFYNMIDQNLVAA 204
Query: 221 EVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
+FSF+LNRDP+A+ GGEI+ GG DP +++G TY+PV ++ YWQF++ + + +QS +
Sbjct: 205 PIFSFYLNRDPNAQTGGEIILGGSDPNYYEGDFTYLPVDRQAYWQFKMDSVQVADQS--L 262
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
C+GGC AI D+GTSL+AGPT + +N AIG +V E
Sbjct: 263 CKGGCEAIADTGTSLIAGPTEEIAALNKAIGASAIVGGE 301
>gi|33347413|gb|AAQ15289.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 132/192 (68%), Positives = 161/192 (83%)
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + GK I YG+G+ISGFFS+D+V VGD+VVKDQ FIEAT+E +TFL A+FDGI+GL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLAAKFDGILGL 64
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF+EI+VG+AVPVW NMV QGL+ E VFSFW NR+ D EEGGEIVFGGVDP H+KGKHTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSFWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++GD++I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 316 VSAECKLVVSQY 327
VS ECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|380746491|gb|AFE48185.1| cathepsin D [Pinctada margaritifera]
Length = 390
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 139/298 (46%), Positives = 201/298 (67%), Gaps = 8/298 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL---GDSDEDILPLKNFMDAQYFGEIGIGS 93
R+ LH + + R T +E G + ++ + G + PL N++DAQY+G IGIG+
Sbjct: 21 RIKLHKIKSVRRTLQEV---GTSIESLQQKYSGYGITGPAPEPLSNYLDAQYYGVIGIGT 77
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
P QNF V+FDTGSSNLWVPS KC FS I+C H++Y S KS+TY + + EI YG+GS+
Sbjct: 78 PAQNFKVVFDTGSSNLWVPSKKCKFSDIACLLHNKYDSSKSSTYKKNDTTFEIRYGTGSL 137
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+ F +I+V VPV+ NM
Sbjct: 138 TGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMAFDKISVDGVVPVFYNM 197
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
++QGLV + +FSF+L+RDP A EGGE++ GG D KH+KG TY+PVT++GYW+F++ +
Sbjct: 198 IKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWEFKMDGVS 257
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+G ++ C GGC I D+GTSL+AGP+ V ++N AIG + E + ++ DL
Sbjct: 258 VG-ENHKFCTGGCNTIADTGTSLIAGPSSEVKKLNAAIGATAIPGGEYMIDCTKIPDL 314
>gi|320165710|gb|EFW42609.1| lysosomal aspartic protease [Capsaspora owczarzaki ATCC 30864]
Length = 462
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 145/302 (48%), Positives = 199/302 (65%), Gaps = 8/302 (2%)
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
A ++ R LG + + L NF +AQY+GEI IG+PPQ F V+FDTGSSN WVPS+ C
Sbjct: 37 AAINPNRRSLGANPA--VNLGNFENAQYYGEIEIGTPPQKFKVVFDTGSSNAWVPSATCK 94
Query: 118 FS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+ + C H +Y S KS+TY G + I YGSGS++G+ SQD V + V +QVF EA
Sbjct: 95 ITDLPCDLHKKYHSEKSSTYVANGTTFAIQYGSGSLTGYLSQDTFTVAGLKVTNQVFAEA 154
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA--E 234
T E L F+LARFDG++GLGF+EI+V + VPV+ NMV QGL++ F+FWL+R+ + +
Sbjct: 155 TNEPGLAFVLARFDGLLGLGFQEISVLNVVPVFYNMVAQGLLNSASFAFWLSRNGTSILK 214
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE+V GGVDP H+ G TY+PV+K GYWQF L + +G+ + G G I DSGTS
Sbjct: 215 PGGELVLGGVDPSHYTGAFTYIPVSKPGYWQFALDSVQVGSTTFGANTQG---IADSGTS 271
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFN 354
LLAGP V +IN IG G+++ EC +++ QY +I + LV L P +C++IG C N
Sbjct: 272 LLAGPVADVKKINAQIGAIGILAEECDMIIEQYEPIIVEGLVQRLDPVTICKEIGSCKAN 331
Query: 355 GA 356
+
Sbjct: 332 AS 333
>gi|33347411|gb|AAQ15288.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 131/192 (68%), Positives = 161/192 (83%)
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + GK I YG+G+ISGFFS+D+V VGD+VVKDQ FIEAT+E +TFL+A+FDGI+GL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGL 64
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF+EI+VG+AVPVW NMV QGL+ E VFS W NR+ D EEGGEIVFGGVDP H+KGKHTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSLWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++GD++I Q+TG C GC+AI DSGTSLL GPT ++TE+NHAIG G+
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 316 VSAECKLVVSQY 327
VS ECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|336454162|gb|AEI58895.1| cathepsin D [Pteria penguin]
Length = 392
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 140/291 (48%), Positives = 195/291 (67%), Gaps = 13/291 (4%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GDSDEDILPLKNFMDAQYFGEI 89
+R+ LH + R T +E G + ++++ G + E PL N+MDAQY+G+I
Sbjct: 20 QRIKLHKFKSVRRTLQEV---GTSIEALQNKYNVYKVEGPAPE---PLSNYMDAQYYGDI 73
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IG+P Q+F VIFDTGSSNLWVPS KC S I+C H++Y S KS+TY G EI YG
Sbjct: 74 TIGTPGQSFKVIFDTGSSNLWVPSKKCKLSDIACLLHNKYDSSKSSTYKANGTDFEIRYG 133
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS++GF S D V V + VK Q F EAT++ +TF+ A+FDGI+G+G++ I+V VPV
Sbjct: 134 TGSLTGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAKFDGILGMGYQTISVDGVVPV 193
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+ NMV+Q LV VFSF+LNRDP A +GGE++ GG D K++KG TY+PVTK+GYW+F++
Sbjct: 194 FYNMVKQNLVPASVFSFYLNRDPGASDGGELILGGSDSKYYKGNFTYLPVTKQGYWRFKM 253
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
I++ +++ C GGC AI D+GTSLLAGP V +N IG + + E
Sbjct: 254 DGIMMNGKASKYCSGGCKAIADTGTSLLAGPKTEVDALNKQIGATPLAAGE 304
>gi|227018334|gb|ACP18833.1| aspartic proteinase 1 [Chrysomela tremula]
Length = 386
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 200/321 (62%), Gaps = 24/321 (7%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + + SVFC+ +C + R+ LH ++ A+ T + R G
Sbjct: 1 MLRIFVLSVFCVLATVNCDFV---------------RVPLHKMDTAKSTLQSR-----GY 40
Query: 61 SGVRHRLGDSDED-ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
+ + D PL N+MDAQY+GEI IG+P Q F+VIFDTGSSNLW+PS KC
Sbjct: 41 KSNENLVKKYTTDGYAPLTNYMDAQYYGEITIGTPGQKFNVIFDTGSSNLWIPSHKCKLL 100
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+++C H++Y S KS+TYT G I YGSGS+ GF S D VEV + VKDQ+F EAT
Sbjct: 101 NVACRTHNQYNSDKSSTYTSNGTDFSITYGSGSLKGFLSSDIVEVAGLTVKDQIFAEATE 160
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F+ +FDGI+GL + I+V P + ++EQG+V E VFSF+LNRDP+AE GGE
Sbjct: 161 EPGLAFIAGKFDGILGLAYDTISVNQVTPFFYKLIEQGVVKEPVFSFYLNRDPNAEVGGE 220
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
IVFGG DPK++ G TY+PVT+KGYWQ ++ ++ S +C+GGC AIVD+GTSL+ G
Sbjct: 221 IVFGGSDPKYYTGDFTYLPVTRKGYWQIKMDKAVV--DSNTLCDGGCQAIVDTGTSLITG 278
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P+ + +I A+G + + E
Sbjct: 279 PSDEIEKIVKAVGATAITAGE 299
>gi|195332251|ref|XP_002032812.1| GM20753 [Drosophila sechellia]
gi|194124782|gb|EDW46825.1| GM20753 [Drosophila sechellia]
Length = 392
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 194/310 (62%), Gaps = 21/310 (6%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLK 78
A N + GL R+ LH +AR R+ G +R+ GD E PL
Sbjct: 17 AHPNSQEKPGL--LRVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLS 66
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYT 137
N+MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT
Sbjct: 67 NYMDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYT 126
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G I+YGSGS+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+
Sbjct: 127 KNGTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGY 186
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+P
Sbjct: 187 SSISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLP 246
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T IN IGG ++
Sbjct: 247 VTRKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIG 304
Query: 318 AE----CKLV 323
+ C L+
Sbjct: 305 GQYVVSCDLI 314
>gi|91093044|ref|XP_966517.1| PREDICTED: similar to cathepsin D isoform 1 [Tribolium castaneum]
gi|270002651|gb|EEZ99098.1| hypothetical protein TcasGA2_TC004989 [Tribolium castaneum]
Length = 384
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 192/292 (65%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ L+ + +AR + +E G V VR R G + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLYKVKSARRSLQEV---GTHVQQVRMRYGGPTPE--PLSNYLDAQYYGPISIGNPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
NF V+FDTGSSNLWVPS KC+++ I+C H++Y S +S TY + G I YGSGS+SGF
Sbjct: 76 NFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSQSKTYKKNGTDFAIQYGSGSLSGF 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + V+ Q F EA E L F+ A+FDGI+G+ + I+V PV+ NM++Q
Sbjct: 136 LSTDIVTVGGLKVQQQTFAEAMSEPGLAFVAAKFDGILGMAYNRISVDGVTPVFYNMIQQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV++ VFSF+LNRDP A +GGEI+ GG DP H+KG TY+ V ++ YWQF++ I +G
Sbjct: 196 NLVAQPVFSFYLNRDPSAAQGGEIILGGSDPAHYKGDFTYLSVDRQAYWQFKMDSISVGG 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
++T C GC AI D+GTSL+AGP V IN AIG +V E C L+
Sbjct: 256 KNT-FCANGCEAIADTGTSLIAGPVSEVQGINKAIGATPIVGGEYMVDCNLI 306
>gi|327259983|ref|XP_003214815.1| PREDICTED: cathepsin D-like [Anolis carolinensis]
Length = 399
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 203/320 (63%), Gaps = 24/320 (7%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK----RRL------DLHSLNAARITRKERYMG-GAGV 60
L L L + A+ + L RI LKK R + DL L+ K +Y G GAG
Sbjct: 3 LRALVLLLSVAAAYSALIRIPLKKFPSPRSIYAEYGTDLQDLDKLGEMLKYKYGGPGAGT 62
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C
Sbjct: 63 PTPET-----------LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCRLLD 111
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C H +Y S KSNTY + G I+YG+GS+SGF SQD V +GD+ VK+Q+F EAT E
Sbjct: 112 IACMLHHKYDSSKSNTYVQNGTKFAIHYGTGSLSGFISQDTVTIGDIAVKNQMFGEATSE 171
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+TFL A+FDGI+GLGF +I+V P +DN ++QGL+ + +FSF+LNRDP + GGEI
Sbjct: 172 PGITFLAAKFDGILGLGFPKISVDKVTPFFDNAMKQGLLDKNMFSFFLNRDPSSSPGGEI 231
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGGVDPK++ G +V VT+K YWQ + + + + T VC+ GC AIVD+GTSL+ GP
Sbjct: 232 IFGGVDPKYYSGDFNWVNVTRKAYWQVHMDRVEVPSGLT-VCKNGCEAIVDTGTSLITGP 290
Query: 300 TPVVTEINHAIGGEGVVSAE 319
T V + AIG + ++ +
Sbjct: 291 TDEVKALQKAIGAKPIIKGQ 310
>gi|257228998|gb|ACV53024.1| cathepsin D2 [Homarus americanus]
Length = 385
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 186/292 (63%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + + R T +E V+ + G+ PL N+MDAQY+G I IG+PPQ
Sbjct: 19 RIPLHKIKSVRRTLQEV---DTAVTRAHRKWGNRGPMPEPLSNYMDAQYYGPISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS +C+++ I+C H++Y +RKS+TY + G I YGSGS+SG+
Sbjct: 76 SFRVVFDTGSSNLWVPSKQCHYTNIACMIHNKYDARKSSTYKKNGTDFAIQYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + V+ Q F EA E L F+ A+FDGI+G+GF IAV PV+ NMV+Q
Sbjct: 136 LSTDTVAVGSLAVRQQTFAEALSEPGLAFVAAKFDGILGMGFDNIAVDGVTPVFYNMVKQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
L+ VFSF+LNRDP + EGGE++ GG DP ++ G TY+PV +KGYWQ ++ I +
Sbjct: 196 SLIPAPVFSFYLNRDPSSPEGGELILGGSDPNYYSGNFTYIPVDRKGYWQIKMDGIQMNG 255
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
CEGGC AI D+GTSL+A P IN IG + + S E C L+
Sbjct: 256 ARVPFCEGGCEAIADTGTSLIAAPVEEARSINKKIGAKPIASGEWSVDCSLI 307
>gi|159468321|ref|XP_001692331.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158278517|gb|EDP04281.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 303
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 200/302 (66%), Gaps = 13/302 (4%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSN-GLRRIGLKKRRLDLHSLNAARITRKERYMGGAG 59
M + + ++ L +++ L + A G+ R+ L+K + L +L R Y+
Sbjct: 1 MARSYVPALIALAAVSALLGVAAEQQAGMLRVTLRKTEM-LTTLG-----RPRPYL---- 50
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YF 118
G + LG SD+ + LKNFMDAQY+GEIG+G+PPQ F+VIFDTGS+NLWVPSSKC F
Sbjct: 51 -LGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFNVIFDTGSANLWVPSSKCALF 109
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+I+C H +Y + KS TY G I YG+GS+ G+ SQD + G + +KDQ F EA
Sbjct: 110 NIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQDVLTWGGLTIKDQGFAEAIN 169
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF+ A+FDGI+G+GF I+V P + +VE+G ++ VFSFWLNRDP+A GGE
Sbjct: 170 EPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGLAAPVFSFWLNRDPNAPNGGE 229
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GG+DP HF G+HT+VPVT++GYWQF + + +G S +C GCAAI D+GTSL+AG
Sbjct: 230 LVLGGIDPTHFTGEHTWVPVTRQGYWQFNMEGLDLGPGSQKMCAKGCAAIADTGTSLIAG 289
Query: 299 PT 300
P+
Sbjct: 290 PS 291
>gi|332024025|gb|EGI64243.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 381
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 140/288 (48%), Positives = 193/288 (67%), Gaps = 11/288 (3%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
+R+ LH ++ R KE G ++ VR + PL N++DAQY+G I IG+PP
Sbjct: 20 QRIPLHKTDSIRKALKEV---GTDLTQVRTFTTTDNYTPEPLSNYLDAQYYGVISIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
QNF VIFDTGSSNLWVPS KC+ + I+C H++Y S KS TY + G I YGSGS+SG
Sbjct: 77 QNFKVIFDTGSSNLWVPSKKCHITNIACLLHNKYTSEKSTTYKKNGTIFAIRYGSGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S+D V V + V+ Q F EA E + F+ A+FDGI+G+G+ I+V PV+ NMV+
Sbjct: 137 FLSEDVVTVAGLAVQHQTFAEAISEPGIAFVAAKFDGILGMGYSTISVDGVTPVFYNMVK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
Q LVS+ VFSF+LNRD A EGGE++ GG DP H++G+ TY+PVT+KGYWQF++ + +
Sbjct: 197 QNLVSQAVFSFYLNRDSSAAEGGEMILGGSDPDHYEGEFTYIPVTRKGYWQFKMDGVQVK 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH-----AIGGEGVVS 317
+ + C+ GC AI D+GTSL+AGPT + +IN +IGGE +V+
Sbjct: 257 DHA--FCKEGCQAIADTGTSLIAGPTSEIKDINEMIGATSIGGEAMVN 302
>gi|194863696|ref|XP_001970568.1| GG10707 [Drosophila erecta]
gi|190662435|gb|EDV59627.1| GG10707 [Drosophila erecta]
Length = 390
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 193/308 (62%), Gaps = 21/308 (6%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNF 80
SN + GL R+ LH +AR R+ G +R+ GD E PL N+
Sbjct: 17 SNPQEKPGL--LRVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNY 66
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+
Sbjct: 67 MDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKN 126
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YGSGS+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+
Sbjct: 127 GTEFAIQYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSS 186
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT
Sbjct: 187 ISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVT 246
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T IN IGG ++ +
Sbjct: 247 RKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQ 304
Query: 320 ----CKLV 323
C L+
Sbjct: 305 YVVSCDLI 312
>gi|312861579|gb|ADR10277.1| cathepsin D [Branchiostoma belcheri]
Length = 395
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 148/328 (45%), Positives = 203/328 (61%), Gaps = 15/328 (4%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
K L +F L V AS L RI L K + L IT + + SG
Sbjct: 2 KFLSVLFALVVFASAL---------HRIPLTKMKTVRRQLADVGITYDQ--VLDKDYSGK 50
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ + D+ E PL N++DAQY+G I IG+P QNF V+FDTGSSNLWVPS KC S I+C
Sbjct: 51 YYNIKDAPE---PLTNYLDAQYYGPISIGTPAQNFQVVFDTGSSNLWVPSKKCKLSDIAC 107
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H++Y S +S+TY + G I YGSGS++GF S+D V +G + V++Q F EA + +
Sbjct: 108 LLHNKYDSTQSSTYMKNGTDFAIRYGSGSLTGFLSEDTVTIGGLKVQNQTFAEAVTQPGI 167
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+ A+FDGI+G+G+ I+V VP + NMV+Q LV + VFSF+LNRDP + GE++ G
Sbjct: 168 TFVAAKFDGILGMGYDTISVDGVVPPFYNMVQQKLVDKPVFSFYLNRDPSSTTRGELLLG 227
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DPK++ G T++ VTK GYWQF++ I+I ++T C+GGCAAI D+GTSL+AGPT
Sbjct: 228 GTDPKYYTGDFTFLDVTKPGYWQFKMDGIMINGKATDYCKGGCAAIADTGTSLIAGPTTE 287
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDL 330
V +N IG + E + SQ L
Sbjct: 288 VQALNKQIGATPIPGGEYMVDCSQVSSL 315
>gi|380018765|ref|XP_003693293.1| PREDICTED: lysosomal aspartic protease-like [Apis florea]
Length = 385
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 200/319 (62%), Gaps = 18/319 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ R++ CL C + ++ + RI LH +++ R KE +
Sbjct: 1 MFRAILCL-----CAFIAIANADITRI-------PLHKIDSIRKQFKEY---NTEIYQTH 45
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCY 123
GD + PL N++DAQY+G I IG+PPQ+F VIFDTGSSNLWVPS KC+ + I+C
Sbjct: 46 ILQGDFPQP-EPLSNYLDAQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACK 104
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y + KS+TY + G I YGSGS+SG+ S D V++ + + DQ F EA E L
Sbjct: 105 LHRKYDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDTVDIAGMKISDQTFAEALSEPGLA 164
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +IAV D PV+ NMV+QGLV + VFSF+LNR+PD + GGE++ GG
Sbjct: 165 FVAAKFDGILGMAYSKIAVDDVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKYGGELILGG 224
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H++G TYVPV KKGYWQF++ I IG+ VC+ GC AI D+GTSL+AGP V
Sbjct: 225 SDPNHYEGSFTYVPVDKKGYWQFKMDSIQIGSD-LKVCQQGCEAIADTGTSLIAGPVKEV 283
Query: 304 TEINHAIGGEGVVSAECKL 322
IN AIG + + E +
Sbjct: 284 GAINKAIGATPIAAGEAMI 302
>gi|332376487|gb|AEE63383.1| unknown [Dendroctonus ponderosae]
Length = 388
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 199/315 (63%), Gaps = 18/315 (5%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
L C + + L R+ L K + + I R+ G V VR R E +
Sbjct: 8 LIICFIATITCENLVRVPLTKGK------SPKNILREV----GTHVQQVRLRYTSGAEPV 57
Query: 75 L-PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
PL N++DAQYFG I IG+PPQ F V+FDTGSSNLWVPS KC F+ I+C H++Y S K
Sbjct: 58 PEPLSNYLDAQYFGAISIGTPPQKFVVVFDTGSSNLWVPSKKCSFTNIACLLHNKYDSSK 117
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY E G I YGSGS+SGF S D V V D+ VK Q F EA E L F+ A+FDGI
Sbjct: 118 SSTYKENGTEFAIRYGSGSLSGFLSTDVVGVSDINVKGQTFAEALSEPGLAFVAAKFDGI 177
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+V VP++ NMV QG+VS+ VFSF+LNR+PD + GGE++FGG DP ++ G
Sbjct: 178 LGLAYSRISVDGVVPLFYNMVNQGIVSQAVFSFYLNRNPDGKVGGELIFGGSDPNYYSGN 237
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY+PV ++ YWQF++ ++++G ++ C+GGC AI D+GTSL+AGP V +N AIG
Sbjct: 238 FTYLPVDRQAYWQFKMDEVIVGQKT--FCKGGCEAIADTGTSLIAGPVDEVKALNEAIGA 295
Query: 313 EGVVSAE----CKLV 323
+V E C L+
Sbjct: 296 TPLVGGEYAVDCSLI 310
>gi|383859202|ref|XP_003705085.1| PREDICTED: lysosomal aspartic protease-like [Megachile rotundata]
Length = 384
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 145/297 (48%), Positives = 190/297 (63%), Gaps = 8/297 (2%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDAQYFGEIGIGSP 94
RR+ LH ++ R KE V+ R+ D + PL N++DAQY+G I IG+P
Sbjct: 19 RRIKLHKIDRIRSQLKEY-----DTDLVQTRIVQGDVILPEPLSNYLDAQYYGVINIGTP 73
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ F VIFDTGSSNLWVPS KC+ + I+C H +Y S KS+TY + G I YGSGS+S
Sbjct: 74 PQKFRVIFDTGSSNLWVPSKKCHLTNIACKLHYKYDSTKSSTYKKNGTDFSIRYGSGSLS 133
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
G+ S D V+V + V DQ F EA E L F+ A+FDGI+G+ + IAV PV+ NMV
Sbjct: 134 GYLSTDMVDVAGIKVNDQTFAEALSEPGLAFVAAKFDGIMGMAYSTIAVDGVTPVFYNMV 193
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
+QGLVS+ VFSF+LNRDP+AE GGE++ GG DP H+ G TYVPV KKGYWQF + + +
Sbjct: 194 KQGLVSQPVFSFYLNRDPNAEFGGEMILGGSDPNHYVGPFTYVPVDKKGYWQFAMDRVEV 253
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
G+ VCE GC AI D+GTSL+AGP + +N IG + + E + + DL
Sbjct: 254 GSD-VKVCEKGCEAIADTGTSLIAGPVKEIELLNKKIGATPIAAGEAMVECDKIPDL 309
>gi|21355083|ref|NP_652013.1| cathD [Drosophila melanogaster]
gi|6685167|gb|AAF23824.1|AF220040_1 cathepsin D precursor [Drosophila melanogaster]
gi|7304149|gb|AAF59186.1| cathD [Drosophila melanogaster]
gi|15292549|gb|AAK93543.1| SD07085p [Drosophila melanogaster]
gi|220946566|gb|ACL85826.1| cathD-PA [synthetic construct]
Length = 392
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 188/296 (63%), Gaps = 19/296 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
R+ LH +AR R+ G +R+ GD E PL N+MDAQY+G I IG
Sbjct: 29 RVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNYMDAQYYGPIAIG 80
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
SPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+ G I YGSGS
Sbjct: 81 SPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYGSGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P +
Sbjct: 141 LSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYNSISVDKVKPPFYA 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
>gi|195474504|ref|XP_002089531.1| GE23596 [Drosophila yakuba]
gi|194175632|gb|EDW89243.1| GE23596 [Drosophila yakuba]
Length = 392
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/296 (48%), Positives = 188/296 (63%), Gaps = 19/296 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS----GVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
R+ LH +AR R+ G +R+ GD E PL N+MDAQY+G I IG
Sbjct: 29 RVPLHKFQSAR-----RHFADVGTELQQLRIRYGGGDVPE---PLSNYMDAQYYGPIAIG 80
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
SPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+ G I YGSGS
Sbjct: 81 SPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYGSGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P +
Sbjct: 141 LSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDKVKPPFYA 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT+K YWQ ++
Sbjct: 201 MYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAA 260
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
IG+ +C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 SIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLI 314
>gi|146217392|gb|ABQ10738.1| cathepsin D [Penaeus monodon]
Length = 386
Score = 283 bits (725), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 189/292 (64%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +AR + +E V V + G+ PL N+MDAQY+G I IG+PPQ
Sbjct: 20 RIKLHKFKSARRSLQEV---DTAVKVVHRKWGNKGPMPEPLSNYMDAQYYGPITIGTPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS +C+F+ I+C H++Y + KS+TY + G +I YGSGS+SG+
Sbjct: 77 SFRVVFDTGSSNLWVPSKQCHFTNIACLIHNKYDATKSSTYKKNGTKFDIQYGSGSLSGY 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG V VKDQ F EA E L F+ A+FDGI+G+ + IAV PV+ NMV Q
Sbjct: 137 LSTDTVSVGSVSVKDQTFAEAMSEPGLAFVAAKFDGILGMAYDRIAVDGVTPVFYNMVNQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
+V +FSF+LNRDP A EGGE++ GG DP ++ G TYVPV ++GYWQF++ + +
Sbjct: 197 NVVPAPIFSFYLNRDPAAAEGGELILGGSDPAYYTGDFTYVPVDRQGYWQFKMDGLQMNG 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
+ C+GGC AI D+GTSL+A P+ IN IG + ++ S +C L+
Sbjct: 257 TTVPFCDGGCEAIADTGTSLIAAPSEEARLINKKIGAKPIMGGEWSVDCNLI 308
>gi|238816835|gb|ACR56788.1| aspartic protease 4 [Strongyloides ratti]
Length = 428
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 138/281 (49%), Positives = 189/281 (67%), Gaps = 12/281 (4%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFS 99
L+ L RI + E+Y G HRL DS+E L+N+MDAQY+GEI IG+P QNFS
Sbjct: 34 LNFLENERINKGEKY-------GAVHRLMDSEE---ILRNYMDAQYYGEISIGTPGQNFS 83
Query: 100 VIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQ 158
VIFDTGSSNLW+PS KC ++I+C H++Y S S+TY G++ I YG+GS+ GF S+
Sbjct: 84 VIFDTGSSNLWIPSKKCPIYNIACLLHNKYDSSSSSTYVTDGRTMAIQYGTGSMKGFLSK 143
Query: 159 DNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
D V + D+ +DQ F EAT E +TF+ A+FDGI+G+ ++ IAV PV++ +++Q V
Sbjct: 144 DKVCIADLCAEDQTFAEATSEPGVTFIAAKFDGILGMAYQNIAVLGVKPVFNTLIDQHKV 203
Query: 219 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 278
+ +F+FWLNR D +GGEI GG+DPKH+KG TYVPV++KGYWQF++ D +G+
Sbjct: 204 PQPIFAFWLNRIADDSDGGEITLGGMDPKHYKGDITYVPVSRKGYWQFKM-DGFVGDNEK 262
Query: 279 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
C+ GC AI D+GTSL+AGP V I IG E + E
Sbjct: 263 IACKNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPLARGE 303
>gi|170063951|ref|XP_001867326.1| lysosomal aspartic protease [Culex quinquefasciatus]
gi|167881401|gb|EDS44784.1| lysosomal aspartic protease [Culex quinquefasciatus]
Length = 387
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 134/253 (52%), Positives = 176/253 (69%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N+MDAQYFG I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYMDAQYFGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V VG V ++ Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFAIQYGSGSLSGYLSTDTVTVGGVAIQKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPPFYNMYNQGLIDSPVFSFYLNRDPSAAEGGEIIFGGSDSAKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+PV +K YWQF++ + +G+ T C GC AI D+GTSL+AGPT VT IN AIGG
Sbjct: 239 YLPVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPTSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IINGEYMVDCSLI 309
>gi|350411706|ref|XP_003489428.1| PREDICTED: lysosomal aspartic protease-like [Bombus impatiens]
Length = 386
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 194/316 (61%), Gaps = 17/316 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ R+ CL C + ++ L+RI LH +++ R KE V+
Sbjct: 1 MYRAALCL-----CACIALANADLQRI-------TLHKMDSVRKQFKEYNTEVYQAHMVQ 48
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCY 123
+ PL N++DAQY+G I IG+P Q+F VIFDTGSSNLWVPS KC+ + I+C
Sbjct: 49 GGFPQPE----PLSNYLDAQYYGVISIGTPSQDFKVIFDTGSSNLWVPSQKCHLTNIACK 104
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y + KS+TY + G I YGSGS+SG+ S D V + + V DQ F EA E +
Sbjct: 105 LHHKYDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDVVNIAGLKVSDQTFAEALSEPGMA 164
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + IAV PV+ NMV+QGLV + VFSF+LNR+PD + GGE++ GG
Sbjct: 165 FVAAKFDGILGMAYSRIAVDGVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKAGGELILGG 224
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H++G TYVPV +KGYWQF + I +G+Q +CE GC AI D+GTSL+AGP V
Sbjct: 225 SDPNHYEGPFTYVPVDRKGYWQFRMDGIKVGSQHLAICEKGCEAIADTGTSLIAGPVKEV 284
Query: 304 TEINHAIGGEGVVSAE 319
IN AIG + + E
Sbjct: 285 EAINSAIGATNIAAGE 300
>gi|321472775|gb|EFX83744.1| hypothetical protein DAPPUDRAFT_92408 [Daphnia pulex]
Length = 379
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 202/302 (66%), Gaps = 7/302 (2%)
Query: 33 LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGEIG 90
+K +R+ L + + R T + G + ++ + G S+ P LKN+MDAQY+G+I
Sbjct: 5 VKLQRVTLEKVPSVRKTLESV---GTSIKVIQKKWGASEAGPTPEELKNYMDAQYYGQIT 61
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
+G+PPQ F+V+FDTGS+NLWVPS+ C+ + ++C H++Y KS TY G I YGS
Sbjct: 62 LGTPPQTFNVVFDTGSANLWVPSTHCHLTNLACLLHNKYNGGKSQTYKANGTDFAIQYGS 121
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
G +SG+ S D + +G +VKDQ F EA E SLTF+ A+FDGI+G+ + I+V PV+
Sbjct: 122 GKLSGYLSTDTLGLGGALVKDQTFAEAISEPSLTFVAAKFDGILGMSYPSISVNGVPPVF 181
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
+NM+EQGLV + VFSFWL+R+PDA +GGEI FGG DP+ + G+ ++ PVT+K YWQF++
Sbjct: 182 NNMIEQGLVEDPVFSFWLSRNPDAAQGGEITFGGADPERYTGEISWAPVTRKAYWQFKVD 241
Query: 270 DILIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYG 328
+ + N++ G C+GGC I D+GTSL+AGP + ++N IGG +++ E + S+
Sbjct: 242 GVQVSNEADGAFCQGGCQMIADTGTSLIAGPVDEIKKLNTLIGGIPIMAGEYFINCSRID 301
Query: 329 DL 330
+L
Sbjct: 302 EL 303
>gi|195429864|ref|XP_002062977.1| GK21682 [Drosophila willistoni]
gi|194159062|gb|EDW73963.1| GK21682 [Drosophila willistoni]
Length = 389
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 155/336 (46%), Positives = 209/336 (62%), Gaps = 23/336 (6%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKK-RRLDLHSLNAARITRKERYMGGAGVS 61
QKLL + +V+A+ S GL R+ LKK + H + ++ R
Sbjct: 2 QKLLILLAIGFVVAAEA---GDSAGLLRVPLKKFQSARRHFADVGTELQQLR-------- 50
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
+++ GD+ E PL N+MDAQY+G I IG+P Q+F V+FDTGSSNLWVPS KC+F+ I
Sbjct: 51 -IKYGGGDAPE---PLSNYMDAQYYGPISIGTPAQSFKVVFDTGSSNLWVPSKKCHFTNI 106
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H++Y + KSNTY + G I+YGSGS+SG+ S D V +G + +K Q F EA E
Sbjct: 107 ACLMHNKYDATKSNTYAKNGTEFAIHYGSGSLSGYLSTDTVGIGGLNIKGQTFAEALSEP 166
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F+ A+FDGI+GLG+ I+V P + M EQGL+S VFSF+LNRDP A EGGEI+
Sbjct: 167 GLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQGLISSPVFSFYLNRDPSAPEGGEII 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGG DP H+ G TY+PVT+K YWQ ++ +G+ VC+GGC I D+GTSL+A P
Sbjct: 227 FGGSDPNHYTGDFTYLPVTRKAYWQIKMDSASVGDLQ--VCQGGCQVIADTGTSLIAAPL 284
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLV 336
T IN IGG ++ + VVS DLI +L V
Sbjct: 285 SEATSINQKIGGTPIIGGQ--YVVSC--DLIPNLPV 316
>gi|224050910|ref|XP_002199093.1| PREDICTED: cathepsin D [Taeniopygia guttata]
Length = 396
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 145/293 (49%), Positives = 198/293 (67%), Gaps = 13/293 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
+RRI LK+ ++ +N+ IT +Y G G +IL KN+MDAQYFG
Sbjct: 30 MRRI-LKEAGSEIPDMNS--ITEAIKYKLGFA------EAGKPTPEIL--KNYMDAQYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
IGIG+PPQNF+VIFDTGSSNLWVPS C I+C H +Y S KS+TY + G I
Sbjct: 79 VIGIGTPPQNFTVIFDTGSSNLWVPSVHCSLLDIACMVHHKYDSAKSSTYVKNGTKFAIR 138
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+SG+ SQD V +GD+ + DQ+F EAT++ +TF+ A+FDGI+GL F +I+V A
Sbjct: 139 YGTGSLSGYLSQDIVTLGDLKIMDQIFGEATKQPGITFIAAKFDGILGLAFPKISVEGAE 198
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P +DN+++Q LV + +FSF+LNRDP GGE+V GG DPK++KG+ ++ VT+K YWQ
Sbjct: 199 PFFDNVMKQKLVEKNMFSFYLNRDPSGVPGGEMVLGGTDPKYYKGEFSWFNVTRKAYWQI 258
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + +GN T VCEGGC AIVD+GTSL+ GPT V +I AIG + ++ E
Sbjct: 259 HMDSVDVGNGPT-VCEGGCEAIVDTGTSLITGPTKEVKKIQEAIGAKPLIKGE 310
>gi|260810438|ref|XP_002599971.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
gi|229285255|gb|EEN55983.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
Length = 388
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 143/313 (45%), Positives = 196/313 (62%), Gaps = 8/313 (2%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
L + A++N L RI L K + L + + SG + + + PL
Sbjct: 8 LAIVATANALHRIPLTKMKTVRRHLAEVGVPYDKII---KDYSGKYYNMTGPQPE--PLS 62
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYT 137
N++DAQYFG I IG+PPQ+F V+FDTGSSNLWVPS KC++S I+C H++Y + KS+TY
Sbjct: 63 NYLDAQYFGPISIGTPPQSFQVVFDTGSSNLWVPSKKCHYSNIACLLHNKYDASKSSTYK 122
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G+ I YGSGS+SGF SQD V V + VKDQ F EA E + F+ A+FDGI+G+G+
Sbjct: 123 KNGEKFAIQYGSGSLSGFLSQDTVSVAGIEVKDQTFAEALSEPGMAFVAAKFDGILGMGY 182
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
IAV VP + NMV QG V E VFSF+LNRDP A GGE++ GG DP ++ G T++
Sbjct: 183 SNIAVDGVVPPFYNMVSQGAVPEPVFSFYLNRDPSATAGGELILGGADPNYYTGDFTFLD 242
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+KGYWQF++ I +G + C+ GC AI D+GTSL+AGP V +++ IG +
Sbjct: 243 VTRKGYWQFKMDGINVGGST--FCQEGCQAIADTGTSLIAGPIEEVNKLHKQIGATPLAG 300
Query: 318 AECKLVVSQYGDL 330
E K+ S+ L
Sbjct: 301 GEYKVDCSKVTSL 313
>gi|322796189|gb|EFZ18765.1| hypothetical protein SINV_10075 [Solenopsis invicta]
Length = 366
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 135/254 (53%), Positives = 175/254 (68%), Gaps = 6/254 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N++DAQY+GEI IG+PPQ F VIFDTGSSNLWVPS KC Y +I+C H++Y SRKS
Sbjct: 38 PLSNYLDAQYYGEITIGTPPQKFKVIFDTGSSNLWVPSKKCRYTNIACLLHNKYDSRKSI 97
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I YG+GS+SGF S D V V + V++Q F EA E LTF+ A+FDGI+G
Sbjct: 98 TYQKNGTPFAIRYGTGSLSGFLSTDVVNVAGLNVQNQTFAEAVSEPGLTFVAAKFDGILG 157
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ I+V PV+ NMV+Q LV + +FSF+LNRDP A +GGE++ GG DP+H+ G T
Sbjct: 158 MGYSTISVDGVTPVFYNMVKQKLVPQPIFSFYLNRDPTAAQGGEMILGGSDPEHYVGSMT 217
Query: 255 YVPVTKKGYWQFELGDILIGNQSTG--VCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YV VT+KGYWQF + I +G+ S +C+ C AI D+GTSL+AGPT + EIN IG
Sbjct: 218 YVDVTRKGYWQFTMDRITVGDSSPSHILCKNTCQAIADTGTSLIAGPTVEINEINKQIGA 277
Query: 313 E---GVVSAECKLV 323
G C +V
Sbjct: 278 TMIGGQALVNCAMV 291
>gi|17981530|gb|AAL51056.1|AF454831_1 cathepsin D [Apriona germari]
Length = 386
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 197/321 (61%), Gaps = 22/321 (6%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV 60
M + L SVFC+++ +C L+ R+ L +AR T +E V
Sbjct: 1 MSRLFLMSVFCVFITVNCDLI---------------RVPLERGKSARRTLQEV---NTHV 42
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS- 119
VR R G PL N++DAQYFG I IG+PPQ F V+FDTGSSNLWVPS KC+++
Sbjct: 43 QQVRFRYGVGGPAPEPLSNYLDAQYFGPISIGNPPQKFKVVFDTGSSNLWVPSKKCHYTN 102
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+C H++Y S KS+TY + G I YGSGS+SGF S D V VG + VKDQ F EA E
Sbjct: 103 IACLLHNKYDSSKSSTYKKNGTDFSIKYGSGSLSGFLSTDVVTVGSLAVKDQTFAEAMSE 162
Query: 180 GSLTFLLARFDGIIGLGFRE-IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
L F+ A+FD G ++ + ++P + NM+ QGLVS+ VFSF+LNRDPDA EGGE
Sbjct: 163 PGLAFVAAKFDEYPWHGLQQDLGSRASLPFFYNMITQGLVSQPVFSFYLNRDPDAAEGGE 222
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+ GG DPK++KG TY+ V ++ YWQF++ I +G T C+ GC AI D+GTSL+AG
Sbjct: 223 LSLGGSDPKYYKGNFTYLSVDRQAYWQFKMDKIQLGK--TVFCKSGCQAIADTGTSLVAG 280
Query: 299 PTPVVTEINHAIGGEGVVSAE 319
P VT IN IGG ++ E
Sbjct: 281 PVDEVTSINKLIGGTPIIGGE 301
>gi|194757447|ref|XP_001960976.1| GF11236 [Drosophila ananassae]
gi|190622274|gb|EDV37798.1| GF11236 [Drosophila ananassae]
Length = 388
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/302 (48%), Positives = 190/302 (62%), Gaps = 13/302 (4%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDAQYFGEIGIGSPP 95
R+ L AR R+ G + R+ D+ PL N+MDAQY+G I IGSPP
Sbjct: 25 RVPLQKFTTAR-----RHFADVGTELQQLRIKYGGGDVPEPLSNYMDAQYYGPISIGSPP 79
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
QNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS +Y + G I YGSGS+SG
Sbjct: 80 QNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKSYVKNGTEFAIQYGSGSLSG 139
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
+ S D V +G + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P + M E
Sbjct: 140 YLSTDTVSIGGLNIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDRVKPPFYAMYE 199
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QGL+S +FSF+LNRDP EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++ IG
Sbjct: 200 QGLISAPIFSFYLNRDPAGPEGGEIIFGGSDPKHYSGDFTYLPVTRKAYWQIKMDAASIG 259
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL 334
+ +C+GGC I D+GTSL+A P T IN IGG ++ + VVS DLI +L
Sbjct: 260 DLE--LCKGGCQVIADTGTSLIAAPMSEATSINQKIGGTPIIGGQ--YVVSC--DLIPNL 313
Query: 335 LV 336
V
Sbjct: 314 PV 315
>gi|156553448|ref|XP_001600543.1| PREDICTED: lysosomal aspartic protease-like [Nasonia vitripennis]
Length = 384
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/285 (49%), Positives = 189/285 (66%), Gaps = 7/285 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ L+ + +AR T +E G + ++ R G +D PL N++DAQY+GEIGIGSP Q
Sbjct: 21 RVPLYRVKSARRTLQEV---GTELHQIKLRYG-ADPVPEPLSNYLDAQYYGEIGIGSPMQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+VIFDTGSSNLWVPS KC+ + I+C H++Y SRKS +Y G I YGSGS+SGF
Sbjct: 77 KFTVIFDTGSSNLWVPSKKCHITNIACLLHNKYDSRKSKSYKANGTDFSIRYGSGSLSGF 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V + V VKD F EA E L F+ A+FDGI+G+ + I+V PV+ NMV+Q
Sbjct: 137 LSTDVVTIAGVDVKDTTFAEAMSEPGLAFVAAKFDGILGMAYDRISVDGVPPVFYNMVKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV + +FSF+LNRDP+A+ GGE++ GG D H+ G TYVPV++K YWQF++ I IG+
Sbjct: 197 NLVPQPIFSFYLNRDPNAKIGGEMILGGSDSAHYTGDFTYVPVSRKAYWQFKMDKITIGD 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+ CE GC AI D+GTSL+AGP + IN IG +V+ E
Sbjct: 257 KL--FCENGCEAIADTGTSLIAGPVGEIEGINKKIGATPIVAGEA 299
>gi|412987808|emb|CCO19204.1| cathepsin D (lysosomal aspartyl protease) [Bathycoccus prasinos]
Length = 628
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 158/349 (45%), Positives = 213/349 (61%), Gaps = 28/349 (8%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----------LPLK 78
+ +K +L + E+Y A S + + +S ED +P+
Sbjct: 85 QTKMKASKLRAKHAEMKKKQMVEKYTRNAETSLMEDKKMESSEDAAIGGEGGATSSVPIA 144
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYT 137
N+MDAQY+G + IG+P Q F V FDTGSSNLWVPSSKC FS I C H +Y S KS +Y
Sbjct: 145 NYMDAQYYGPVEIGTPGQKFQVCFDTGSSNLWVPSSKCKFSQIPCDAHEKYDSEKSRSYE 204
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVV-VKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ I YGSGS+SGF S D V +G+ + +KDQ F EAT+E LTFL A+FDGI+GLG
Sbjct: 205 PNGEDFAIQYGSGSLSGFLSSDTVRLGNSIEIKDQTFAEATKEPGLTFLFAKFDGILGLG 264
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE---EGGEIVFGGVDPKHFKGKH 253
F+EIAV PV+DN V Q V ++ FSFWLNRD D + +GGE+VFGGVD KHF G+H
Sbjct: 265 FKEIAVDGVTPVFDNAVAQNQVEKDQFSFWLNRDQDGDGVVDGGELVFGGVDEKHFVGEH 324
Query: 254 TYVPVTKKGYWQFELGDILIG--------NQSTGVCEGGCA---AIVDSGTSLLAGPTPV 302
+V +TKKGYWQF+L D+ +G N T V AI D+GTSLLAGP+ V
Sbjct: 325 VWVDLTKKGYWQFDLDDVKVGEFSFIDDKNDKTTVSFSSSTKHQAIADTGTSLLAGPSAV 384
Query: 303 VTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLC 351
+ +IN AIG E ++ ECK+ + +YG+ D + + ++C+ + +C
Sbjct: 385 IDKINDAIGAENLMIQECKIAIKRYGEEFLDDIET-YDSSQICESLNIC 432
>gi|56118817|ref|NP_001008172.1| MGC89016 protein precursor [Xenopus (Silurana) tropicalis]
gi|51950197|gb|AAH82490.1| MGC89016 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 207/330 (62%), Gaps = 24/330 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRR---LDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
++ +CLL A SNGL RI L + + LH + +A + +Y V + + +
Sbjct: 7 LVVTCLLFVAFSNGLERIKLHRFKSVARTLHDVGSAVEHVRMKY--------VDNHMKSA 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYK 129
E PL N+MD QY+G I IG+PPQ+F V+FDTGSSNLWVPS KC ++ I+C+ H +Y
Sbjct: 59 PE---PLTNYMDVQYYGVISIGTPPQSFRVVFDTGSSNLWVPSKKCKWTDIACWLHRKYD 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+KS+TY G I+YG+GS++GF S D V VG + VK Q F EA + +TF+ A+F
Sbjct: 116 SKKSSTYKANGTEFAIHYGTGSLTGFLSTDTVSVGSLSVKSQTFAEAITQPGITFVAAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + I+V VPV++NMV Q LV + +FSF+L+RD A+EGGEI+ GG DP H+
Sbjct: 176 DGILGMAYPSISVDGVVPVFNNMVNQKLVDQAIFSFYLSRDASAKEGGEIILGGSDPDHY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGV---------CEGGCAAIVDSGTSLLAGPT 300
G TY+ VT+K YWQ ++ + + ++S + C+GGC AI D+GTSL+ GP+
Sbjct: 236 VGNFTYLDVTRKAYWQIKMDSVTVSSESECMNAMMVGGEYCKGGCQAIADTGTSLIVGPS 295
Query: 301 PVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
V ++N IG ++S E + S+ L
Sbjct: 296 SDVEKLNAEIGALPIISGEYWINCSKIASL 325
>gi|46309251|dbj|BAD15111.1| cathepsin D [Todarodes pacificus]
Length = 392
Score = 278 bits (710), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 135/295 (45%), Positives = 187/295 (63%), Gaps = 22/295 (7%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGV----------RHRLGDSDEDILPLKNFMDAQY 85
+R+ LH + +AR+ ++ G+G S R+R + PL N++DAQY
Sbjct: 23 QRIQLHKITSARM-----HLIGSGTSNSTLKMISQLQQRYRAPTPE----PLSNYLDAQY 73
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCE 144
+G I IG+P QNF V+FDTGSSNLWVPS KC S I+C H++Y S +S+TY G
Sbjct: 74 YGVISIGTPAQNFKVVFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTQSSTYKANGTDFH 133
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YGSGS+ GF S D V +G V +K Q F EAT + L F+ A+FDGI+G+ + I+V
Sbjct: 134 IQYGSGSLDGFLSTDTVAIGSVAIKAQTFAEATNQPGLVFVAAKFDGILGMAYDTISVDK 193
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
PV+ ++ Q LV + VFSF+LNRDP +EGGE++ GG DPKH+ G TY+PVT+KGYW
Sbjct: 194 VTPVFYQIISQKLVDQPVFSFYLNRDPSGKEGGELILGGSDPKHYTGNFTYLPVTRKGYW 253
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
Q ++ ++ G + C GGC AI D+GTSL+AGP + ++N AIGG + E
Sbjct: 254 QIKMDKVVSGENT--FCSGGCQAIADTGTSLIAGPVDEIKKLNEAIGGRALPGGE 306
>gi|218847782|ref|NP_001136375.1| cathepsin D-like precursor [Xenopus (Silurana) tropicalis]
gi|159155417|gb|AAI54878.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 399
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 203/319 (63%), Gaps = 22/319 (6%)
Query: 12 LWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAGVS 61
+WV LAS LL P S+ L RI LKK H+ A KE +Y G S
Sbjct: 6 VWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFPPS 63
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
G + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I
Sbjct: 64 G--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++
Sbjct: 116 ACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQP 175
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++
Sbjct: 176 GVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELL 235
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP
Sbjct: 236 LGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPL 294
Query: 301 PVVTEINHAIGGEGVVSAE 319
VT + AIG ++ +
Sbjct: 295 EEVTALQKAIGAVPLIQGQ 313
>gi|326920173|ref|XP_003206349.1| PREDICTED: cathepsin D-like [Meleagris gallopavo]
Length = 397
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 197/286 (68%), Gaps = 5/286 (1%)
Query: 63 VRHRLGDSD-EDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF- 118
++ +LG SD + P LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+
Sbjct: 52 LKFKLGFSDLAEPTPEILKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLL 111
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
I+C H +Y + KS+TY E G I+YG+GS+SGF SQD V +G++ +K+Q+F EA +
Sbjct: 112 DIACLLHHKYDASKSSTYVENGTEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVK 171
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +TF+ A+FDGI+G+ F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE
Sbjct: 172 QPGITFIAAKFDGILGMAFPRISVDKVTPFFDNVMKQKLIEKNIFSFYLNRDPTAQPGGE 231
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GG DPK+++G ++V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ G
Sbjct: 232 LLLGGTDPKYYRGDFSWVNVTRKAYWQVHMDSVNVANGLT-LCKGGCEAIVDTGTSLITG 290
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKV 344
PT V E+ AIG + ++ + + + L L+ G P K+
Sbjct: 291 PTKEVKELQTAIGAKPLIKGQYIIPCDKISSLPVVTLMLGGKPYKL 336
>gi|116284100|gb|AAI23963.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 396
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 203/319 (63%), Gaps = 22/319 (6%)
Query: 12 LWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAGVS 61
+WV LAS LL P S+ L RI LKK H+ A KE +Y G S
Sbjct: 3 VWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFPPS 60
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
G + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I
Sbjct: 61 G--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDI 112
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++
Sbjct: 113 ACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQP 172
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++
Sbjct: 173 GVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELL 232
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP
Sbjct: 233 LGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPL 291
Query: 301 PVVTEINHAIGGEGVVSAE 319
VT + AIG ++ +
Sbjct: 292 EEVTALQKAIGAVPLIQGQ 310
>gi|66560290|ref|XP_392857.2| PREDICTED: lysosomal aspartic protease [Apis mellifera]
Length = 385
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 137/287 (47%), Positives = 187/287 (65%), Gaps = 6/287 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +++ R KE + GD + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RIPLHKIDSIRKQFKEY---NTEIYQTHIFQGDLPQP-EPLSNYLDAQYYGVISIGTPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLWVPS KC+ + I+C H +Y + KS+TY + G I YGSGS+SG+
Sbjct: 77 DFRVIFDTGSSNLWVPSKKCHLTNIACKLHRKYDNTKSSTYKKNGTDFAIRYGSGSLSGY 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V++ + + DQ F EA E L F+ A+FDGI+G+ + +I+V PV+ NMV+Q
Sbjct: 137 LSTDTVDIAGMKISDQTFAEALSEPGLAFVAAKFDGILGMAYSKISVDGVTPVFYNMVKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV KKGYWQF + I IG+
Sbjct: 197 GLVPQPVFSFYLNRNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFRMDSIQIGS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
VC+ GC AI D+GTSL+AGP + IN AIG + + E +
Sbjct: 257 D-LKVCQQGCEAIADTGTSLIAGPVKEIEAINKAIGATPIAAGEAMI 302
>gi|195997283|ref|XP_002108510.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589286|gb|EDV29308.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 389
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 204/321 (63%), Gaps = 13/321 (4%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK-RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
L V+A+ L+ SS+ L R+ L K ++ L IT + + ++ LG S
Sbjct: 4 LLVIAALFLI--SSDALVRVPLYKFKKTPREHLAEVGIT--------SSMLSEKYELGAS 53
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYK 129
L N++DAQY+GEI IG+PPQ F V+FDTGSSNLWVPSSKC F +I+C FHS+Y
Sbjct: 54 RNATEMLNNYLDAQYYGEISIGTPPQKFKVLFDTGSSNLWVPSSKCSFLNIACLFHSKYD 113
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TY + I YG+GS++GF S D V + V VK+Q F EA E LTF+ A+F
Sbjct: 114 HSKSSTYKKNSTKFSIRYGTGSLTGFLSVDTVRIQGVSVKNQGFAEAVSEPGLTFVAAQF 173
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+G++EIAV PV++N++ Q V + VFSF+LNR A+ GGE++ GG D KH+
Sbjct: 174 DGILGMGYQEIAVDGVPPVFNNIMAQKQVGKSVFSFYLNRKEGAKPGGELILGGSDSKHY 233
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TY+PVTKKGYWQF++ I + + + C+GGC AI D+GTSLLAGPT V +I
Sbjct: 234 SGNFTYLPVTKKGYWQFKMDGISVKGKGS-FCKGGCQAIADTGTSLLAGPTAEVNKIQTL 292
Query: 310 IGGEGVVSAECKLVVSQYGDL 330
IG +++ E + S+ L
Sbjct: 293 IGATPLLNGEYTIDCSKISSL 313
>gi|66911216|gb|AAH96630.1| LOC613063 protein, partial [Xenopus (Silurana) tropicalis]
Length = 395
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 203/319 (63%), Gaps = 22/319 (6%)
Query: 12 LWV--LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--------RYMGGAGVS 61
+WV LAS LL P S+ L RI LKK H+ A KE +Y G S
Sbjct: 2 VWVVLLASSLLQPGSA--LIRIPLKKFPSIRHTFTEAGKDVKELLANEVPLKYSPGFPPS 59
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
G + LKN++DAQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I
Sbjct: 60 G--------EPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S KS+TY + G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++
Sbjct: 112 ACWMHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+G+ + I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++
Sbjct: 172 GVTFVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELL 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP
Sbjct: 232 LGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPL 290
Query: 301 PVVTEINHAIGGEGVVSAE 319
VT + AIG ++ +
Sbjct: 291 EEVTALQKAIGAVPLIQGQ 309
>gi|307167890|gb|EFN61279.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 354
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 137/266 (51%), Positives = 175/266 (65%), Gaps = 14/266 (5%)
Query: 63 VRHRLGDSDEDILP-----------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
+R+ L + D D+ P L N++DAQY+G I IG+PPQ F VIFDTGSSNLWV
Sbjct: 4 IRNSLKEVDADLQPVHLTGGITPEPLSNYLDAQYYGVISIGTPPQEFKVIFDTGSSNLWV 63
Query: 112 PSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKD 170
PS C+F+ I+C H +Y S+KS+TY G S I YGSGS+SG+ S D V V + V
Sbjct: 64 PSKNCHFTNIACQLHHKYNSKKSSTYEPNGASFAIQYGSGSLSGYLSADVVNVAGLNVTS 123
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
QVF EA E L F+ A+FDGI+G+G+ IAV PV+ NMV+Q LV + VFSF+LNRD
Sbjct: 124 QVFAEAISEPGLAFVAAKFDGILGMGYSTIAVDGVTPVFYNMVKQKLVPKAVFSFYLNRD 183
Query: 231 PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVD 290
P AE GGE++ GG DP H++ TYVPVT+KGYWQF + I +GN++ C GC AI D
Sbjct: 184 PSAEVGGELILGGSDPDHYEADLTYVPVTRKGYWQFSMDGIEVGNRT--FCNNGCQAIAD 241
Query: 291 SGTSLLAGPTPVVTEINHAIGGEGVV 316
+GTSL+AGP V IN IG +
Sbjct: 242 TGTSLIAGPVADVAAINKLIGASAIA 267
>gi|31197673|ref|XP_307784.1| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|347969584|ref|XP_003436430.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|347969586|ref|XP_003436431.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|347969588|ref|XP_003436432.1| AGAP003277-PD [Anopheles gambiae str. PEST]
gi|30179074|gb|EAA03535.2| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|333466215|gb|EGK96172.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|333466216|gb|EGK96173.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|333466217|gb|EGK96174.1| AGAP003277-PD [Anopheles gambiae str. PEST]
Length = 389
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 177/255 (69%), Gaps = 7/255 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQYFG I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 61 PLSNYLDAQYFGAISIGTPPQSFKVVFDTGSSNLWVPSKQCSFTNIACLMHNKYDAKKSS 120
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
++ + G + I YG+GS+SG+ S D V VG V V+ Q F EA +E L F+ A+FDGI+G
Sbjct: 121 SFEKNGTAFHIQYGTGSLSGYLSTDTVTVGGVPVEKQTFAEAIQEPGLVFVAAKFDGILG 180
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L ++ I+V +PV+ NM QG + VFSF+LNRDP A EGGEI+FGG D KH+ G T
Sbjct: 181 LAYKSISVDGVMPVFYNMFNQGKIDAPVFSFYLNRDPSAAEGGEIIFGGSDSKHYTGDFT 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 241 YLSVDRKAYWQFKMDSVTVGDAQ--YCNNGCEAIADTGTSLIAGPVAEVTAINKAIGGTP 298
Query: 315 VVSAE----CKLVVS 325
V++ E C L+ S
Sbjct: 299 VLNGEYMVDCSLIPS 313
>gi|351712803|gb|EHB15722.1| Cathepsin D, partial [Heterocephalus glaber]
Length = 390
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 141/303 (46%), Positives = 200/303 (66%), Gaps = 23/303 (7%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH--------RLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ + H +L +P LKN+MDAQY+
Sbjct: 3 RIPLHKFKSIRRTMTE--VGGSVEDLIAHGPLTKYSPQLSTKTTGPVPETLKNYMDAQYY 60
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPSS+C I+C+FH +Y S KS+TY + G S +I
Sbjct: 61 GEIGIGTPPQCFTVVFDTGSSNLWVPSSRCNMLDIACWFHHKYHSDKSSTYVKNGSSFDI 120
Query: 146 NYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+YGSGS+SG+ SQD V V ++ V+ Q F EAT++ +TF+ A+FDGI+G+
Sbjct: 121 HYGSGSLSGYLSQDTVSVPCQSAESNPRNLRVEKQTFGEATKQPGITFIAAKFDGILGMA 180
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V + +PV+DN++ Q LV + VFSF+LNRDP A+ GGE++ GG+D K++KG TY+
Sbjct: 181 YPRISVNNVLPVFDNLMSQKLVDKNVFSFYLNRDPSAQPGGELMLGGIDSKYYKGSFTYL 240
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
VT+K YWQ + + +G+ +C+GGC AIVD+GTSLL GP V E+ AIG ++
Sbjct: 241 NVTRKAYWQVHMDQLEVGS-GLNLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAIPLI 299
Query: 317 SAE 319
E
Sbjct: 300 QGE 302
>gi|125807245|ref|XP_001360320.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|195149648|ref|XP_002015768.1| GL11239 [Drosophila persimilis]
gi|54635492|gb|EAL24895.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|194109615|gb|EDW31658.1| GL11239 [Drosophila persimilis]
Length = 388
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 187/292 (64%), Gaps = 8/292 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
RL LN + R+ G + +R R G D PL N+MDAQY+G I IGSPPQ
Sbjct: 22 RLLRVPLNRFQSARRHFADVGTELQQLRIRYGGGDVP-EPLSNYMDAQYYGPISIGSPPQ 80
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+ + I+C H++Y + KS+TY + G + I YGSGS+SG+
Sbjct: 81 SFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSSTYAKNGTTFAIQYGSGSLSGY 140
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + +G + +K Q F EA E L F+ A+FDGI+GLG+ I+V P + M EQ
Sbjct: 141 LSTDTLSMGGLDIKGQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQ 200
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S VFSF+LNRDP + EGGEI+FGG DPKH+ G TY+PVT+K YWQ ++ +G+
Sbjct: 201 GLISSPVFSFYLNRDPASPEGGEIIFGGSDPKHYTGDFTYLPVTRKAYWQIKMDSAALGD 260
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
+C+GGC I D+GTSL+A P T IN IGG ++ + C L+
Sbjct: 261 LE--LCKGGCQVIADTGTSLIAAPMTEATSINQKIGGTPIIGGQYIVSCDLI 310
>gi|45384002|ref|NP_990508.1| cathepsin D precursor [Gallus gallus]
gi|461696|sp|Q05744.1|CATD_CHICK RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|259835|gb|AAB24157.1| prepro-cathepsin D [Gallus gallus]
Length = 398
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 178/244 (72%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+ I+C H +Y + KS+T
Sbjct: 70 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDASKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YG+GS+SGF SQD V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 130 YVENGTEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++
Sbjct: 190 AFPRISVDKVTPFFDNVMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + +
Sbjct: 250 VNVTRKAYWQVHMDSVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPL 308
Query: 316 VSAE 319
+ +
Sbjct: 309 IKGQ 312
>gi|348565205|ref|XP_003468394.1| PREDICTED: cathepsin D-like [Cavia porcellus]
Length = 407
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 11/307 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKN 79
P S+ L RI L K + H++ A E + ++ +L +P L N
Sbjct: 15 PFSTTALIRIPLHKFKSIRHTMTEAG-GSVENLIARDPLTKYSPQLSTKATGPVPEPLSN 73
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTE 138
+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS+KC I+C+FH +Y KS+TY +
Sbjct: 74 YMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSAKCKMLDIACWFHHKYHGDKSSTYVK 133
Query: 139 IGKSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARFDGI 192
G S +I+YGSGS+SG+ SQD V V V V Q F EAT++ + F+ A+FDGI
Sbjct: 134 NGTSFDIHYGSGSLSGYLSQDTVSVPCKSSNSSVKVSKQTFGEATKQPGIVFVAAKFDGI 193
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+V + +PV+DN++EQ LV + +FSF+LNRDP A+ GGE+V GG+D K++KG
Sbjct: 194 LGLAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRDPTAQPGGELVLGGIDSKYYKGS 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 254 FTYLNVTRKAYWQVHMDQLQVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGA 312
Query: 313 EGVVSAE 319
++ E
Sbjct: 313 LPLIQGE 319
>gi|225717994|gb|ACO14843.1| Lysosomal aspartic protease precursor [Caligus clemensi]
Length = 386
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 131/271 (48%), Positives = 183/271 (67%), Gaps = 3/271 (1%)
Query: 50 RKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNL 109
R+ + G+ + +R R PL N++DAQY+G I IG+PPQ+F+VIFDTGSSNL
Sbjct: 32 RRHFFEVGSSIQLIRRRWNSVGAHPEPLSNYLDAQYYGPITIGTPPQSFNVIFDTGSSNL 91
Query: 110 WVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
WVPS C+ + I+C H ++ KS++Y G I YGSGS+ GF S D+V +G V +
Sbjct: 92 WVPSKSCHITNIACLLHHKFDHSKSSSYVVNGTEFAIQYGSGSLFGFLSTDSVSMGGVEI 151
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +QGL+ E VFSF+LN
Sbjct: 152 GSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQGLIQEPVFSFYLN 211
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
RDP+A+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +++ C+ GC AI
Sbjct: 212 RDPNAQVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDGMKVSSKT--FCQNGCQAI 269
Query: 289 VDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+AGP+ V +N +GG +V+ E
Sbjct: 270 ADTGTSLIAGPSVEVNALNQLLGGMPIVNGE 300
>gi|324507249|gb|ADY43078.1| Cathepsin D [Ascaris suum]
Length = 437
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 136/272 (50%), Positives = 185/272 (68%), Gaps = 6/272 (2%)
Query: 51 KERYMGGAG--VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSN 108
K+ + G A V +R + G+ +++L KN+MDAQY+G+I IG+PPQNF+VIFDTGS+N
Sbjct: 54 KKHFYGIANHRVHSLRGQSGNEIDELL--KNYMDAQYYGDISIGTPPQNFTVIFDTGSAN 111
Query: 109 LWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
LWVPS KC F+ I+C H +Y + KS+TY E G+ +I YG+GS+ GF S DNV V DV
Sbjct: 112 LWVPSRKCPFTDIACLLHHKYDAAKSSTYAEDGRKLQIQYGTGSMKGFISLDNVCVADVC 171
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
+Q F EAT E LTF+ A+FDGI+G+ F EIAV PV+ M++Q L++ VF+FWL
Sbjct: 172 ATEQPFAEATSEPGLTFIAAKFDGILGMAFPEIAVLGVKPVFHTMIDQQLLAAPVFAFWL 231
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
+R+PD + GGEI FGG D K + TY PVT++GYWQF++ D ++G ++ C GC A
Sbjct: 232 DRNPDDQIGGEITFGGTDTKRYVEPITYTPVTRRGYWQFKM-DKVVGEEAVLACANGCQA 290
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
I D+GTSL+AGP V I IG E + E
Sbjct: 291 IADTGTSLIAGPKQQVDTIQKFIGAEPLFRGE 322
>gi|156406785|ref|XP_001641225.1| predicted protein [Nematostella vectensis]
gi|156228363|gb|EDO49162.1| predicted protein [Nematostella vectensis]
Length = 370
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/284 (49%), Positives = 189/284 (66%), Gaps = 3/284 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + R + KE + + G + + PL N+MDAQY+GEI IG+PPQ
Sbjct: 3 RIPLHKMPTPRQSLKEVGISVEQLLGKYGGKYEGGDVPEPLINYMDAQYYGEITIGTPPQ 62
Query: 97 NFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+V+FDTGSSNLWVPS KC + +I+C H +Y S KS+TY + G I YGSGS+SGF
Sbjct: 63 KFTVVFDTGSSNLWVPSKKCSWTNIACLLHDKYDSTKSSTYKKNGTEFAIRYGSGSLSGF 122
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V VG + VK Q F EA +E LTF+ A+FDGI+G+GF I+V VPV+ +MV Q
Sbjct: 123 LSIDTVSVGGIDVKGQTFAEALKEPGLTFVAAKFDGILGMGFSSISVDQVVPVFYDMVLQ 182
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV VFSF+LNR+P A GGE++ GG DPK++KG +YVPVT++GYWQF++ I +
Sbjct: 183 KLVPAPVFSFYLNREPGASPGGELLLGGSDPKYYKGNFSYVPVTQEGYWQFKMDGISVKE 242
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
S C GC AI D+GTSL+AGPT + ++N+ IG + ++ E
Sbjct: 243 GS--FCSDGCQAIADTGTSLIAGPTDEIEKLNNLIGAKIIIGGE 284
>gi|242013446|ref|XP_002427417.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
gi|212511797|gb|EEB14679.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
Length = 383
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 135/284 (47%), Positives = 184/284 (64%), Gaps = 8/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ L+ +AR T + G V +R R G + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLYKFQSARRTLRGV---GTDVEHLRMRYGGPTPE--PLSNYLDAQYYGPISIGTPPQ 75
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F VIFDTGSSNLW+PS KC FS I+C H++Y S +S+TY G I YGSGS+SG+
Sbjct: 76 QFKVIFDTGSSNLWIPSKKCLFSNIACLLHNKYDSSRSSTYIRNGTEFSIQYGSGSLSGY 135
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G + +K Q F EA E L F+ A+FDGI+G+G+ IAV VP + NM EQ
Sbjct: 136 LSTDDVTLGGLTIKRQTFAEAISEPGLAFVAAKFDGILGMGYMSIAVDGVVPPFYNMYEQ 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV +FSF+LNR+P+ + GGE++ GG DP ++KG TY+PV +K YWQF++ +++
Sbjct: 196 RLVDSPIFSFYLNRNPNEKVGGELLLGGSDPNYYKGNFTYLPVNRKAYWQFQMDKVMM-- 253
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ VC GGC AI D+GTSL+AGP V +IN + G V E
Sbjct: 254 EDITVCRGGCQAIADTGTSLIAGPVEDVNKINKKLNGVPVSGGE 297
>gi|226437842|gb|ACO56332.1| putative gut cathepsin D-like aspartic protease [Callosobruchus
maculatus]
Length = 389
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/291 (48%), Positives = 193/291 (66%), Gaps = 13/291 (4%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVS-----GVRHR-LGDSDEDILPLKNFMDAQYFGEI 89
R+ L+ + R T +E G VS G ++R LG + PL N++DAQY+G I
Sbjct: 19 HRIPLYKFKSIRRTFQEV---GTDVSQVVLNGNKYRNLGGPVPE--PLSNYLDAQYYGPI 73
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
IG+PPQ F VIFDTGSSNLWVPS C+F+ I+C H++Y S KS+TY + G + I YG
Sbjct: 74 SIGTPPQTFKVIFDTGSSNLWVPSKLCHFTNIACLLHNKYDSSKSSTYKKNGTAFAIRYG 133
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
SGS+ GF S D+V G + V++Q F EA E + F+ A+FDGI+G+G+ IAV PV
Sbjct: 134 SGSLDGFLSTDHVSFGGLKVENQTFAEAMNEPGMAFVAAKFDGILGMGYSRIAVDGVPPV 193
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+ NMV Q LVS+ VFSF+LNRDP A +GGE++ GG D H+KG+ TY+PV ++ YWQF++
Sbjct: 194 FYNMVSQKLVSQPVFSFYLNRDPAAPQGGELILGGSDKAHYKGEFTYLPVDRQAYWQFKM 253
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ +G ++T +C GC AI D+GTSL+AGP+ V IN AIG ++ E
Sbjct: 254 DKVQVGPETT-LCAKGCEAIADTGTSLIAGPSEEVKAINKAIGATPIMGGE 303
>gi|195027894|ref|XP_001986817.1| GH21578 [Drosophila grimshawi]
gi|193902817|gb|EDW01684.1| GH21578 [Drosophila grimshawi]
Length = 388
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 128/245 (52%), Positives = 170/245 (69%), Gaps = 3/245 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS+
Sbjct: 60 PLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDATKSS 119
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I+YGSGS+SG+ S D V + + +KD F EA E L F+ A+FDGI+G
Sbjct: 120 TYVKNGTEFAIHYGSGSLSGYLSTDTVNIAGLDIKDHTFAEALSEPGLVFVAAKFDGILG 179
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + M EQGL+S+ VFSF+LNRDP A EGGEI+FGG DP H+ G T
Sbjct: 180 LGYSSISVDGVKPSFYAMYEQGLISDPVFSFYLNRDPKAPEGGEIIFGGSDPNHYTGDFT 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+PVT+KGYWQ ++ + + +C+GGC I D+GTSL+A P T IN AIGG
Sbjct: 240 YLPVTRKGYWQIKMDSAQLNDIE--LCKGGCQVIADTGTSLIAAPQDEATSINQAIGGTP 297
Query: 315 VVSAE 319
++ +
Sbjct: 298 ILGGQ 302
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 199/317 (62%), Gaps = 11/317 (3%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+++ + L AS LL G + K+ + L + R + G + +R
Sbjct: 1 MIKYILFLLFEASVLL-----QGFHAV--KEEKFIRVPLTRIKTARNYFHEVGTELQQLR 53
Query: 65 HRLGDS-DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ G + D PL N++DAQY+G I IG+P Q+F V+FDTGSSNLWVPS +CYF+ I+C
Sbjct: 54 LKYGSANDVRPEPLSNYLDAQYYGPISIGTPSQDFKVVFDTGSSNLWVPSKQCYFTNIAC 113
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H++Y + KS++Y + G I+YGSGS+SG+ S D V + + ++ Q F EA E L
Sbjct: 114 LMHNKYDANKSSSYKKNGTEFAIHYGSGSLSGYLSTDTVNIAGLGIEGQTFAEALSEPGL 173
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F+ A+FDGI+GLG+ IAV P + M EQGL+S+ VFSF+LNRDP A EGGEI+FG
Sbjct: 174 VFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQPVFSFYLNRDPKAPEGGEIIFG 233
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DP H+KG+ TY+PVT+K YWQ ++ +GN + +C+GGC I D+GTSL+A P
Sbjct: 234 GSDPNHYKGEFTYLPVTRKAYWQIKMDSASMGNLN--LCQGGCQVIADTGTSLIALPPSE 291
Query: 303 VTEINHAIGGEGVVSAE 319
T IN AIGG ++ +
Sbjct: 292 ATSINKAIGGTPIMGGQ 308
>gi|158523297|gb|ABW70789.1| cathepsin D [Scophthalmus maximus]
Length = 396
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 196/316 (62%), Gaps = 17/316 (5%)
Query: 15 LASCLL-----LPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVR 64
+ SCLL L S + L RI LKK R L A + + + +G G
Sbjct: 1 MRSCLLVVFVSLALSGDALVRIPLKKFHSVRRELTDSGRKAEELLADKHSLKYSG--GFP 58
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
G + E LKNF+DAQY+G+I +GSPPQ FSV+FDTGSSNLWVPS C I+C
Sbjct: 59 SSNGPTPE---MLKNFLDAQYYGDIALGSPPQTFSVVFDTGSSNLWVPSVHCSLLDIACL 115
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS+TY + G + I YGSGS+SGF SQD +GDV V++QVF EAT++ +
Sbjct: 116 LHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGFLSQDTCTIGDVTVENQVFGEATKQPGVA 175
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ F I+V VPV+DN++ Q V + VFSF+LNR+PD GGE++ GG
Sbjct: 176 FIAAKFDGILGMAFPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLLGG 235
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DPK++ G Y+ +T+K YWQ + + +G+Q T +C GGC IVD+GTSL+ GP V
Sbjct: 236 TDPKYYTGDFNYINITRKAYWQIHMDGLAVGSQLT-LCNGGCEVIVDTGTSLITGPAAEV 294
Query: 304 TEINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 295 KALQKAIGAVPLIQGE 310
>gi|347451476|gb|AEO94539.1| aspartate protease cathepsin D [Triatoma infestans]
Length = 393
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 196/326 (60%), Gaps = 28/326 (8%)
Query: 14 VLASCLLLPAS-------SNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSG 62
+LA LLL +S S+ L R+ L K RR + A +Y G GV G
Sbjct: 1 MLAHTLLLISSFCGVLLGSDNLVRVPLTKIQSARRF-FQDVGTAVEQLTLKYDTGNGVEG 59
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSIS 121
PL N++DAQY+G I +GSPPQ+F V+FDTGSSNLWVPS KC F+I+
Sbjct: 60 PFPE---------PLSNYLDAQYYGAITLGSPPQSFRVVFDTGSSNLWVPSKKCSRFNIA 110
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ H +Y S S TY G+ I YGSGS+SGF SQD + +G V V +Q F EA E
Sbjct: 111 CWVHRKYDSSNSKTYVPNGEKFAIQYGSGSLSGFLSQDQLSIGGVTVANQTFAEAVNEPG 170
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+GLG+ I+V P + NM +QG V VFSF+LNRDP A GGEI+F
Sbjct: 171 MVFVAAKFDGILGLGYDTISVDKVTPPFYNMYQQGAVQNPVFSFYLNRDPAAAVGGEIIF 230
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DP+ + G TYVPV K+GYWQF + +++ ++ C+GGC AI D+GTSL+AGPT
Sbjct: 231 GGSDPEKYVGDFTYVPVDKQGYWQFNMDKVIVNGKT--FCKGGCQAIADTGTSLIAGPTE 288
Query: 302 VVTEINHAIGGEGVVSAE----CKLV 323
V +N +GG + E C L+
Sbjct: 289 DVIALNKLLGGTPIAGGEYMISCDLI 314
>gi|449280808|gb|EMC88033.1| Cathepsin D, partial [Columba livia]
Length = 387
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 178/244 (72%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C+ I+C H +Y S KS+T
Sbjct: 59 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDSSKSST 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YG+GS+SG+ SQD V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 119 YVENGTDFAIHYGTGSLSGYLSQDTVTLGNLKIKNQIFGEALKQPGITFIAAKFDGILGM 178
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F I+V P +DN+++Q L+ + +FSF+LNRDP A+ GGE++ GG DPK++ G ++
Sbjct: 179 AFPRISVDKVTPFFDNIMQQKLIEKNIFSFYLNRDPSAQPGGELLLGGTDPKYYSGDFSW 238
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ + + + N T +C+GGC AIVD+GTSL+ GPT V E+ AIG + +
Sbjct: 239 VNVTRKAYWQVHMDAVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPL 297
Query: 316 VSAE 319
+ +
Sbjct: 298 IKGQ 301
>gi|387015018|gb|AFJ49628.1| Cathepsin D [Crotalus adamanteus]
Length = 399
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 125/247 (50%), Positives = 179/247 (72%), Gaps = 2/247 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+P Q F+V+FDTGSSNLWVPSS C I+C H +Y S KS+T
Sbjct: 68 LKNYMDAQYYGEIGIGTPQQRFTVVFDTGSSNLWVPSSHCTLLDIACLIHHKYDSSKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I+YG+GS+SG+ SQD V +GD+ VK+Q+F EAT++ +TF+ A+FDGI+G+
Sbjct: 128 YVKNGTDFAIHYGTGSLSGYLSQDTVTIGDMCVKNQLFGEATKQPGITFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ EI+V P +DN++EQGL+ + +FSF+LNRDP E GGE++FGG D +++ G ++
Sbjct: 188 AYPEISVDKVAPFFDNVMEQGLLEKNLFSFYLNRDPKGETGGELLFGGTDSQYYSGDFSW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V V++K YWQ + + + N T VC+ GC AIVD+GTSL+ GPT + E+ AIG + +
Sbjct: 248 VNVSRKAYWQVHMDKVDVANGLT-VCKDGCEAIVDTGTSLITGPTKEIKELQKAIGAKPI 306
Query: 316 VSAECKL 322
+ + L
Sbjct: 307 IKGQYML 313
>gi|443723962|gb|ELU12180.1| hypothetical protein CAPTEDRAFT_225009 [Capitella teleta]
Length = 364
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 128/238 (53%), Positives = 171/238 (71%), Gaps = 1/238 (0%)
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGK 141
AQY+G I IG+P Q F V+FDTGSSNLWVPS KC ++ I+C+ H+RY S KS +Y + G
Sbjct: 23 AQYYGAITIGTPAQTFKVVFDTGSSNLWVPSQKCKWTDIACWLHNRYDSTKSTSYKKNGT 82
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
+I YGSGS+SGF S D V +GDV V Q F EAT + +TF+ A+FDGI+G+G+ I+
Sbjct: 83 EFKIQYGSGSLSGFLSTDIVTIGDVSVTAQTFAEATAQPGITFVAAKFDGILGMGYPTIS 142
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V PV++NMV+Q VS VFSF+LNRDP A EGGE++ GG DPK+++G TY+PV+KK
Sbjct: 143 VDGVTPVFNNMVKQKSVSSPVFSFFLNRDPSASEGGELILGGSDPKYYEGNFTYLPVSKK 202
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GYWQF++ + + ST C+GGC AI D+GTSLLAGP+ V ++N +GG + E
Sbjct: 203 GYWQFKMDGMKLAGSSTSYCDGGCQAIADTGTSLLAGPSAEVQKLNQELGGTAIPGGE 260
>gi|83319201|dbj|BAE53722.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 176/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G++ +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y S+KS+
Sbjct: 62 PLKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ G S D +GD+ ++ Q F E RE L F+ A+FDGI+G
Sbjct: 122 TYAKNGTQFEIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V + PV+DNMV QG+ + VFS +L+R+ GGE++FGG+D H+ G T
Sbjct: 182 LGYPQISVLNVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMNGVKVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FMAGE 305
>gi|157779726|gb|ABV71391.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 176/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G++ +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y S+KS+
Sbjct: 62 PLKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ G S D +GD+ ++ Q F E RE L F+ A+FDGI+G
Sbjct: 122 TYAKNGTQFEIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V + PV+DNMV QG+ + VFS +L+R+ GGE++FGG+D H+ G T
Sbjct: 182 LGYPQISVLNVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGCAAI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMNGVKVGDNAT-FCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FMAGE 305
>gi|60678793|gb|AAX33731.1| Blo t allergen [Blomia tropicalis]
Length = 402
Score = 274 bits (700), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 205/316 (64%), Gaps = 18/316 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
++ L LA+ LL+ A L RI L+K + SL R E + A + H
Sbjct: 1 MKYSLVLVFLATILLVDAK---LHRIKLQKAQ----SLRK-RFVEVESPIKLAYTTHHYH 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
+ + PL N+ DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPS KC F+ ++C
Sbjct: 53 HWYNGFPE--PLSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLACLL 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS++Y G S EI YG+GS++GF S D V V + +++Q F EA E +TF
Sbjct: 111 HHKYDSSKSSSYVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
+ A+FDGI+GLGF I+V D VP V+D+MV+QGLV + VFSF+LNRD + + GGEI+FGG
Sbjct: 171 VFAKFDGILGLGFNTISV-DGVPTVFDSMVKQGLVQQPVFSFYLNRDTNGKVGGEIIFGG 229
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTG-----VCEGGCAAIVDSGTSLLAG 298
DP ++KG TY P+TK GYWQF++ IL+ N+S VCE GC AI D+GTSL+AG
Sbjct: 230 SDPAYYKGDFTYAPLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEG 314
P+ V +N A+G G
Sbjct: 290 PSDQVEHLNRALGAIG 305
>gi|262232673|gb|ACY38599.1| cathepsin D-like aspartic protease [Anisakis simplex]
Length = 453
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 130/244 (53%), Positives = 168/244 (68%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L+N+MDAQY+G I IG+PPQNF+VIFDTGSSNLWVPS KC ++ I+C+ H +Y + KS+T
Sbjct: 100 LRNYMDAQYYGVISIGTPPQNFTVIFDTGSSNLWVPSRKCKWTDIACWLHHKYDAAKSST 159
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G+ +I YG+GS+ GF S D V V ++ +DQ F EA E +TF+ A+FDGI+G+
Sbjct: 160 HKADGRELQIQYGTGSMKGFISLDTVCVAELCARDQPFAEAASEPGITFVAAKFDGILGM 219
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EIA + PV++ MV Q LV+E VF+FWLNR PD E GGEI FGG DPKHF Y
Sbjct: 220 AFPEIAALNVTPVFNTMVNQQLVAEPVFAFWLNRTPDDEIGGEITFGGTDPKHFVEPIVY 279
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PVT++ YWQF++ D + G T C GC AI D+GTSL+AGP V I IG E +
Sbjct: 280 APVTRRAYWQFKM-DKISGQDGTLACSDGCQAIADTGTSLIAGPKQQVQLIQKYIGAEPL 338
Query: 316 VSAE 319
S E
Sbjct: 339 FSGE 342
>gi|340729556|ref|XP_003403066.1| PREDICTED: lysosomal aspartic protease-like [Bombus terrestris]
Length = 385
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 132/273 (48%), Positives = 178/273 (65%), Gaps = 5/273 (1%)
Query: 36 RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
+R+ LH ++ R KE V+ + PL N++DAQY+G I IG+P
Sbjct: 19 QRITLHKIDTVRKQFKEYNTEVYQAHMVQGNFPQPE----PLSNYLDAQYYGVISIGTPS 74
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q+F VIFDTGSSNLWVPS KC+ + I+C H +Y + KS+TY + G I YGSGS+SG
Sbjct: 75 QDFKVIFDTGSSNLWVPSKKCHLTNIACKLHHKYDNTKSSTYKKNGTDFAIRYGSGSLSG 134
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
+ S D V V + V DQ F EA E + F+ A+FDGI+G+ + +IAV PV+ NMV+
Sbjct: 135 YLSTDVVNVAGLKVSDQTFAEALSEPGMAFVAAKFDGILGMAYSKIAVDGVTPVFYNMVK 194
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QGLV + VFSF+LNR+PD + GGE++ GG DP H++G TYVPV +KGYWQF + I +G
Sbjct: 195 QGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGIKVG 254
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+Q +C+ GC AI D+GTSL+AGP V IN
Sbjct: 255 SQHLAICQKGCEAIADTGTSLIAGPVKEVEAIN 287
>gi|227336874|gb|ACP21315.1| aspartic proteinase precursor [Rhipicephalus microplus]
Length = 391
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/245 (51%), Positives = 172/245 (70%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C+ H +Y S KS
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSKST 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + EI YGSGS+ G S D +G+V V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYQKNGTAFEIRYGSGSVKGVLSADMFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V PV+DNMV QG+ + VFS +L+R+ +GGE++FGG+D H+ G T
Sbjct: 182 LGYPRISVLGVPPVFDNMVAQGVAANPVFSVYLDRNTSDPQGGEVLFGGIDKAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G +T C GGC AI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGVTVGTNAT-FCNGGCEAIADTGTSLIAGPTAEIQKLNMAIGAAP 300
Query: 315 VVSAE 319
++ E
Sbjct: 301 FLAGE 305
>gi|190576608|gb|ACE79095.1| cathepsin D precursor (predicted) [Sorex araneus]
Length = 405
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 131/258 (50%), Positives = 183/258 (70%), Gaps = 5/258 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS KC I+C+ H +Y S KS+T
Sbjct: 72 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVKCQLLDIACWLHHKYNSAKSST 131
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---GDVVVKDQVFIEATREGSLTFLLARFDGI 192
Y + G + +I+YGSGS+SG+ SQD V V + V Q+F EAT++ +TF+ A+FDGI
Sbjct: 132 YVKNGTAFDIHYGSGSLSGYLSQDTVSVPCNSGIQVARQLFGEATKQPGVTFIAAKFDGI 191
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+ + I+V + PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K+FKG
Sbjct: 192 LGMAYPRISVNNVPPVFDNLMQQKLVDKNIFSFYLNRDPTAQPGGELMLGGIDSKYFKGS 251
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY VT++ YWQ + I +GN T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 252 MTYHNVTRQAYWQVHMDQIDVGNGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGA 310
Query: 313 EGVVSAECKLVVSQYGDL 330
++ E + + DL
Sbjct: 311 VPLIQGEYIIPCEKLPDL 328
>gi|293230|gb|AAA29350.1| aspartic protease [Aedes aegypti]
Length = 387
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V +G V V Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VPV+ NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ T C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 239 YLSVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IMNGEYMVDCSLI 309
>gi|157112486|ref|XP_001657556.1| cathepsin d [Aedes aegypti]
gi|205831550|sp|Q03168.2|ASPP_AEDAE RecName: Full=Lysosomal aspartic protease; Flags: Precursor
gi|108878060|gb|EAT42285.1| AAEL006169-PA [Aedes aegypti]
Length = 387
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ+F V+FDTGSSNLWVPS +C F+ I+C H++Y ++KS+
Sbjct: 59 PLSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G + I YGSGS+SG+ S D V +G V V Q F EA E L F+ A+FDGI+G
Sbjct: 119 TFEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VPV+ NM QGL+ VFSF+LNRDP A EGGEI+FGG D + G T
Sbjct: 179 LGYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFT 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ V +K YWQF++ + +G+ T C GC AI D+GTSL+AGP VT IN AIGG
Sbjct: 239 YLSVDRKAYWQFKMDSVKVGD--TEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTP 296
Query: 315 VVSAE----CKLV 323
+++ E C L+
Sbjct: 297 IMNGEYMVDCSLI 309
>gi|122938524|gb|ABM69086.1| aspartic proteinase AspMD03 [Musca domestica]
Length = 390
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 183/284 (64%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ + + +AR K Y G + +R G PL N++DAQY+G I IG+PPQ
Sbjct: 26 RVPIQKIKSAR---KHFYEVGTELQQLRLTYGAGGVTPEPLSNYLDAQYYGPISIGTPPQ 82
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+ + I+C H++Y + KS T+ + G I+YGSGS+SG+
Sbjct: 83 DFKVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDATKSKTFKQNGTEFAIHYGSGSLSGY 142
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + +KDQ F EA E L F+ A+FDGI+GLG+ I+V P + M EQ
Sbjct: 143 LSTDTVNIGGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPPFYAMYEQ 202
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S+ +FSF+LNRDP A EGGEI+FGG DP H+ G TY+PVT+K YWQ ++ +G+
Sbjct: 203 GLISQPIFSFYLNRDPKAPEGGEIIFGGSDPDHYTGDFTYLPVTRKAYWQIKMDSASMGD 262
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+GGC I D+GTSL+A P T IN AIGG ++ +
Sbjct: 263 LK--CAKGGCQVIADTGTSLIALPPSEATSINQAIGGTPIMGGQ 304
>gi|3378673|emb|CAA08878.1| Cathepsin D [Podarcis siculus]
Length = 399
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 215/328 (65%), Gaps = 28/328 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKK----RRL------DLHSLNAARITRKERYM 55
LRS L +LAS ++ +S+ L RI LKK R + ++ LN K ++
Sbjct: 3 LRS---LILLASLVV---ASSALIRIPLKKFPSMRTIYTEYGTNVQDLNELGEMLKYKF- 55
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
GGAGV LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS K
Sbjct: 56 GGAGVGAPTPE---------ALKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVK 106
Query: 116 CYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C+ I+C H +Y S KS++Y + G I+YG+GS+SGF SQD+V +GD++V++Q+F
Sbjct: 107 CHLLDIACLLHHKYDSSKSSSYVKNGTDFAIHYGTGSLSGFLSQDHVTIGDLIVQNQLFG 166
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA ++ +TF+ A+FDGI+GL + +I+V +P +DN ++Q L+ + +FSF+LNRDP
Sbjct: 167 EAVKQPGITFIAAKFDGILGLAYPKISVDKVLPFFDNAMKQALMEKNLFSFYLNRDPKGA 226
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++FGGVDP+++ G T+V VT+K YWQ + + + N T VC+ GC AIVD+GTS
Sbjct: 227 TGGELLFGGVDPQYYTGDFTWVNVTRKAYWQIHMEKVDVDNGLT-VCKDGCEAIVDTGTS 285
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAECKL 322
L+ GPT + ++ AIG + ++ + L
Sbjct: 286 LITGPTDEIKQLQKAIGAKPIIKGQYML 313
>gi|387915174|gb|AFK11196.1| cathepsin D1 [Callorhinchus milii]
Length = 394
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 136/289 (47%), Positives = 189/289 (65%), Gaps = 27/289 (9%)
Query: 63 VRHRLGDSD---EDILP------------------LKNFMDAQYFGEIGIGSPPQNFSVI 101
+R L DS ED+LP LKN++DAQY+GE+GIG+PPQ F+V+
Sbjct: 31 IRRALSDSGRSVEDLLPENKYKTDSPGINGPTPETLKNYLDAQYYGEVGIGTPPQPFTVV 90
Query: 102 FDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDN 160
FDTGSSNLWVPS C F I+C H +Y S KS++Y G I YGSGS+SG+ S+D
Sbjct: 91 FDTGSSNLWVPSVHCSMFDIACLLHHKYNSDKSSSYVRNGTKFAIRYGSGSLSGYLSKDT 150
Query: 161 VEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSE 220
V +G++ V+ Q+F EA ++ L F+ A+FDGI+G+G+ I+V +PV+DN+V Q LV
Sbjct: 151 VLIGNIKVQSQLFGEAIKQPGLAFIAAKFDGILGMGYPLISVDGVIPVFDNIVTQKLVPN 210
Query: 221 EVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
VFSF+LNR+PD+ GGE++ GG DPK++ G Y+ VT+K YWQ ++ ++ IG Q T +
Sbjct: 211 NVFSFYLNRNPDSLPGGELILGGTDPKYYTGDFHYLNVTRKAYWQVKMDEVSIGEQLT-L 269
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
C+GGCAAIVD+GTSL+ GP + + AIG ++ E CK V S
Sbjct: 270 CKGGCAAIVDTGTSLITGPAQEIKALQKAIGAIPLIQGEYLIDCKKVAS 318
>gi|148229393|ref|NP_001085403.1| MGC82347 protein precursor [Xenopus laevis]
gi|48734644|gb|AAH72252.1| MGC82347 protein [Xenopus laevis]
Length = 401
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/304 (45%), Positives = 196/304 (64%), Gaps = 4/304 (1%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP 76
S LL P S+ L RI LKK H+L A KE G + +
Sbjct: 15 SSLLHPGSA--LIRIPLKKFPSIRHTLTEAGGDAKELLGNGMPLKYSTGFPPNGKATPEA 72
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+T
Sbjct: 73 LMNYLDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSMFDIACWMHHKYDSSKSST 132
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG+GS+SG+ S+D V +G++ +K+Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 133 YVKNGTEFAIQYGTGSLSGYLSKDTVTIGNLGIKEQLFGEAIKQPGVTFIAAKFDGILGM 192
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q LV VFSF+LNR+PD + GGE++ GG DPK++ G Y
Sbjct: 193 AYPIISVDGVSPVFDNIMAQKLVESNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHY 252
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ + + +G+Q T +C+GGC AIVD+GTSL+ GP VT + AIG +
Sbjct: 253 LNVTRKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLITGPLEEVTALQKAIGAVPL 311
Query: 316 VSAE 319
+ +
Sbjct: 312 IQGQ 315
>gi|60678795|gb|AAX33732.1| Blo t allergen isoform 2 [Blomia tropicalis]
Length = 402
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 202/316 (63%), Gaps = 18/316 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
++ L LA+ LL+ A L RI L+K + + R E + A + H
Sbjct: 1 MKYSLVLVFLATILLVDAK---LHRIKLQKAQS-----HRKRFVEVESPIKLAYTTHHYH 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
+ + PL N+ DAQY+GEI IGSPPQ F+VIFDTGSSNLWVPS KC F+ + C
Sbjct: 53 HWYNGFPE--PLSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLVCLL 110
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS++Y G S EI YG+GS++GF S D V V + +++Q F EA E +TF
Sbjct: 111 HHKYDSSKSSSYVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITF 170
Query: 185 LLARFDGIIGLGFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
+ A+FDGI+GLGF I+V D VP V+D+MV+QGLV VFSF+LNRD + + GGEI+FGG
Sbjct: 171 VFAKFDGILGLGFNTISV-DGVPTVFDSMVKQGLVQHPVFSFYLNRDTNGKVGGEIIFGG 229
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTG-----VCEGGCAAIVDSGTSLLAG 298
DP ++KG TY P+TK GYWQF++ IL+ N+S VCE GC AI D+GTSL+AG
Sbjct: 230 SDPAYYKGDFTYAPLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAG 289
Query: 299 PTPVVTEINHAIGGEG 314
P+ V +N A+G G
Sbjct: 290 PSDQVEHLNRALGAIG 305
>gi|348530268|ref|XP_003452633.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 396
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 207/325 (63%), Gaps = 26/325 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV-- 63
+R++F L+V+A+ L +++ L RI LKK R R+E G G+ +
Sbjct: 1 MRTLF-LFVIAALAL---TNDALVRIPLKK----------FRSIRRELTDSGKGIEELVA 46
Query: 64 -RHRLG-----DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
+H L S P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS
Sbjct: 47 DKHSLKYNFGFPSSNGPTPETLKNYLDAQYYGEITLGTPPQKFTVVFDTGSSNLWVPSVH 106
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +F I+C+ H +Y S KS+TY + G S I YGSGS+SG+ SQD +GD+ V+ Q+F
Sbjct: 107 CSFFDIACWLHHKYNSAKSSTYVKNGTSFAIQYGSGSLSGYLSQDTCSIGDISVEKQIFG 166
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA ++ + F+ A+FDGI+G+ + I+V VPV+DNM+ Q V + VFSF+LNR+PD E
Sbjct: 167 EAIKQPGVAFIAAKFDGILGMAYPSISVDGVVPVFDNMMNQKKVEKNVFSFYLNRNPDTE 226
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GGE++ GG DPK++ G Y ++++ YWQ + + +G+Q + +C+GGC AIVD+GTS
Sbjct: 227 PGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGMTVGSQLS-LCKGGCEAIVDTGTS 285
Query: 295 LLAGPTPVVTEINHAIGGEGVVSAE 319
L+ GP V + AIG ++ E
Sbjct: 286 LITGPAAEVKALQKAIGAIPLIQGE 310
>gi|332264729|ref|XP_003281384.1| PREDICTED: cathepsin D [Nomascus leucogenys]
Length = 412
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 141/305 (46%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ S L E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPSSKYSQALPAVTEGPVPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGSVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|170649686|gb|ACB21270.1| cathepsin D preproprotein (predicted) [Callicebus moloch]
Length = 412
Score = 271 bits (692), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQGMPTVPAGPVPEILKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSAKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVLVPCRSSSSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + + + T +C+GGC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|403305561|ref|XP_003943328.1| PREDICTED: cathepsin D [Saimiri boliviensis boliviensis]
Length = 522
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 181/255 (70%), Gaps = 13/255 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSAKSST 95
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTF 184
Y + G S +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF
Sbjct: 96 YVKNGTSFDIHYGSGSLSGYLSQDTVLVPCRPSSSASALGGVKVERQVFGEATKQPGITF 155
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG
Sbjct: 156 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGT 215
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D K++KG +Y+ VT+K YWQ + + + + T +C+GGC AIVD+GTSL+ GP V
Sbjct: 216 DSKYYKGSLSYLNVTRKAYWQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVR 274
Query: 305 EINHAIGGEGVVSAE 319
E+ AIG ++ E
Sbjct: 275 ELQKAIGAVPLIQGE 289
>gi|25452827|sp|Q9DEX3.1|CATD_CLUHA RecName: Full=Cathepsin D; Flags: Precursor
gi|11037777|gb|AAG27733.1|AF312364_1 muscular cathepsin D [Clupea harengus]
Length = 396
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 200/308 (64%), Gaps = 12/308 (3%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDEDILP--LKNF 80
+S+ + RI LKK R +L+ + + ++ AG + ++H G S P LKN+
Sbjct: 15 TSDAIVRIPLKKFRSIRRTLSDSGLNVEQLL---AGTNSLQHNQGFPSSNAPTPETLKNY 71
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+GEIG+G+P Q F+V+FDTGSSNLW+PS C F+ I+C H +Y KS+TY +
Sbjct: 72 MDAQYYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIACLLHHKYNGAKSSTYVKN 131
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YGSGS+SG+ SQD+ +GD+VV+ Q+F EA ++ + F+ A+FDGI+G+ +
Sbjct: 132 GTEFAIQYGSGSLSGYLSQDSCTIGDIVVEKQLFGEAIKQPGVAFIAAKFDGILGMAYPR 191
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V PV+D M+ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YVPVT
Sbjct: 192 ISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVPVT 251
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
++ YWQ + + IG+Q T +C+ GC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 252 RQAYWQIHMDGMSIGSQLT-LCKDGCEAIVDTGTSLITGPPAEVRALQKAIGAIPLIQGE 310
Query: 320 ----CKLV 323
CK V
Sbjct: 311 YMIDCKKV 318
>gi|195581342|ref|XP_002080493.1| GD10217 [Drosophila simulans]
gi|194192502|gb|EDX06078.1| GD10217 [Drosophila simulans]
Length = 324
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 129/248 (52%), Positives = 169/248 (68%), Gaps = 7/248 (2%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
MDAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C H++Y + KS TYT+
Sbjct: 1 MDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKN 60
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS+SG+ S D V + + +KDQ F EA E L F+ A+FDGI+GLG+
Sbjct: 61 GTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSS 120
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V P + M EQGL+S VFSF+LNRDP + EGGEI+FGG DP H+ G+ TY+PVT
Sbjct: 121 ISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVT 180
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+K YWQ ++ IG+ +C+GGC I D+GTSL+A P T IN IGG ++ +
Sbjct: 181 RKAYWQIKMDAASIGDLQ--LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPIIGGQ 238
Query: 320 ----CKLV 323
C L+
Sbjct: 239 YLVSCDLI 246
>gi|157644743|gb|ABV59077.1| cathepsin D [Lates calcarifer]
gi|396084116|gb|AFN84539.1| cathepsin D [Lates calcarifer]
Length = 396
Score = 270 bits (691), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 205/318 (64%), Gaps = 12/318 (3%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+RS+F L V A+ L SS+ L RI LKK R L + TR E + A +++
Sbjct: 1 MRSLF-LVVFAALAL---SSDALVRIPLKKFRSIRRELTDSG-TRLEELL--ADKHSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSIS 121
G S P LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+
Sbjct: 54 NFGFPSSNGPTPETLKNYLDAQYYGDISLGTPPQTFSVVFDTGSSNLWVPSVHCSLLDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y S KS+TY + G + I YGSGS+SG+ S+D +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGYLSEDTCTIGDISVEKQLFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + I+V VPV+DN++ Q V + VFSF+LNR+PD GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV +T++ YWQ + ++++G Q + +C+GGC AIVD+GTSL+ GP+
Sbjct: 234 GGTDPKYYTGDFNYVNITRQAYWQIHMDELVVGTQLS-LCKGGCEAIVDTGTSLITGPSA 292
Query: 302 VVTEINHAIGGEGVVSAE 319
V + AIG ++ E
Sbjct: 293 EVKALQKAIGAIPLIQGE 310
>gi|184185542|gb|ACC68942.1| cathepsin D (predicted) [Rhinolophus ferrumequinum]
Length = 410
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 133/281 (47%), Positives = 191/281 (67%), Gaps = 11/281 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCNSALLGLGGVKVERQVFGEATKQPGITFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+++KG +Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V E+
Sbjct: 251 RYYKGALSYLNVTRKAYWQVHMDQVDVGNSLT-LCKAGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQ 347
AIG ++ E + + L +L G K+C +
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVSSLPEVILKLGGKDYKLCAE 350
>gi|427789779|gb|JAA60341.1| Putative cathepsin d isoform 1 protein [Rhipicephalus pulchellus]
Length = 391
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 125/245 (51%), Positives = 172/245 (70%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C+ H +Y S +S
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSRST 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + EI YGSGS+ G S D +G+V V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYQKNGTAFEIRYGSGSVKGVLSTDVFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V PV+DNMV QG+ ++ VFS +L+R+ +GGE++FGG+D H+ G T
Sbjct: 182 LGYPRISVLGVPPVFDNMVAQGVAAKPVFSVYLDRNASDPQGGEVLFGGIDKAHYTGNIT 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G +T C GGC AI D+GTSL+AGP+ + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGVTVGTNTT-FCNGGCEAIADTGTSLIAGPSEEIQKLNLAIGAAP 300
Query: 315 VVSAE 319
+ E
Sbjct: 301 FTAGE 305
>gi|357627475|gb|EHJ77155.1| cathepsin D [Danaus plexippus]
Length = 358
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 173/253 (68%), Gaps = 8/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G I IG+PPQ F V+FDTGSSNLWVPS KC+++ I+C H++Y S KS
Sbjct: 31 PLSNYLDAQYYGPISIGNPPQTFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSKSK 90
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G I+YGSGS+SGF S D+V +G + VK Q F EA E L F+ A+FDGI+G
Sbjct: 91 SYHKNGTEFAIHYGSGSLSGFLSVDDVTLGGMTVKSQTFAEAMSEPGLAFVAAKFDGILG 150
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ F IAV PV+DNMV+QGLV+ VFSF+LNRD A +GGE+V GG DP H++G T
Sbjct: 151 MAFASIAVDGVTPVFDNMVKQGLVA-PVFSFYLNRDASAAQGGELVLGGSDPAHYRGPLT 209
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVP++K YWQF++ +L+ S C+ GC AI D+GTSL+ GP V +N IG
Sbjct: 210 YVPLSKDTYWQFQMDGVLVNGSS--FCKRGCQAIADTGTSLIGGPVEEVAALNAKIGATP 267
Query: 314 ---GVVSAECKLV 323
G + +C L+
Sbjct: 268 MAFGQFALDCSLI 280
>gi|268581165|ref|XP_002645565.1| C. briggsae CBR-ASP-4 protein [Caenorhabditis briggsae]
Length = 446
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 20/307 (6%)
Query: 28 LRRIGLKKR-RLDLHSLNAARITRKERYMGGAGVSGVRHR-------------LGDSDED 73
LR I LKK+ L L A R+ G ++H LG+ DE
Sbjct: 27 LRTISLKKQPTLRETLLQAGTFETFARHRHGYQKKFLKHHGNHHFDKYNGVKPLGEIDE- 85
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C H RY S+
Sbjct: 86 --LLRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWVPSKKCPFYDIACMLHHRYDSKS 143
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY E G+ I YG+GS+ GF S+D+V V + +DQ F EAT E +TF+ A+FDGI
Sbjct: 144 SSTYKEDGRKMAIQYGTGSMKGFISKDSVCVAGICAEDQPFAEATSEPGITFVAAKFDGI 203
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+ + EIAV PV++ + EQ V VFSFWLNR+PD+E GGEI FGG+D + +
Sbjct: 204 LGMAYPEIAVLGVQPVFNTLFEQKKVPSNVFSFWLNRNPDSELGGEITFGGIDARRYVEP 263
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY PVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP + I + IG
Sbjct: 264 ITYTPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGA 321
Query: 313 EGVVSAE 319
E ++ E
Sbjct: 322 EPLIKGE 328
>gi|123993743|gb|ABM84473.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
Length = 412
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|60654209|gb|AAX29797.1| cathepsin D [synthetic construct]
Length = 413
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIEGE 324
>gi|30584113|gb|AAP36305.1| Homo sapiens cathepsin D (lysosomal aspartyl protease) [synthetic
construct]
gi|60653917|gb|AAX29651.1| cathepsin D [synthetic construct]
Length = 413
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|197099366|ref|NP_001125492.1| cathepsin D precursor [Pongo abelii]
gi|55728229|emb|CAH90861.1| hypothetical protein [Pongo abelii]
Length = 412
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAMPAVTEGPVPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHRKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|4503143|ref|NP_001900.1| cathepsin D preproprotein [Homo sapiens]
gi|115717|sp|P07339.1|CATD_HUMAN RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|29678|emb|CAA28955.1| cathepsin D [Homo sapiens]
gi|179948|gb|AAA51922.1| cathepsin D [Homo sapiens]
gi|181180|gb|AAB59529.1| preprocathepsin D [Homo sapiens]
gi|16740920|gb|AAH16320.1| Cathepsin D [Homo sapiens]
gi|30582659|gb|AAP35556.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|48146011|emb|CAG33228.1| CTSD [Homo sapiens]
gi|54697170|gb|AAV38957.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|61356567|gb|AAX41260.1| cathepsin D [synthetic construct]
gi|61362282|gb|AAX42193.1| cathepsin D [synthetic construct]
gi|119622866|gb|EAX02461.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622867|gb|EAX02462.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622868|gb|EAX02463.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|123994405|gb|ABM84804.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
gi|261860344|dbj|BAI46694.1| cathepsin D [synthetic construct]
Length = 412
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|426366854|ref|XP_004050458.1| PREDICTED: cathepsin D [Gorilla gorilla gorilla]
Length = 412
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|417400425|gb|JAA47158.1| Putative cathepsin d [Desmodus rotundus]
Length = 409
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 180/252 (71%), Gaps = 10/252 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C +C+ H +Y S KS T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDFACWIHHKYNSGKSTT 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV--------GDVVVKDQVFIEATREGSLTFLLA 187
Y + G + +I+YGSGS+SG+ SQD V V V V+ QVF EAT++ +TF+ A
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNSAASGSGVKVERQVFGEATKQPGVTFIAA 190
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + I+V + +PV+DN+++Q LV E VFSF+LNRDP+A+ GGE++ GGVD K
Sbjct: 191 KFDGILGMAYPRISVNNVLPVFDNLMQQKLVDENVFSFYLNRDPNAQPGGELMLGGVDSK 250
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++KG TY+ VT+K YWQ + ++ +G+ T +C+ GC AIVD+GTSLL GP V E+
Sbjct: 251 YYKGPITYLNVTRKAYWQVHMDEVAVGSGLT-LCKEGCEAIVDTGTSLLVGPVEEVRELQ 309
Query: 308 HAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 KAIGAVPLIQGE 321
>gi|60820131|gb|AAX36524.1| cathepsin D [synthetic construct]
gi|61363243|gb|AAX42359.1| cathepsin D [synthetic construct]
Length = 412
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 198/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIEGE 324
>gi|213625094|gb|AAI69806.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 127/283 (44%), Positives = 185/283 (65%), Gaps = 25/283 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 32 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 91
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 92 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 151
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 152 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 211
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 212 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 271
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
T +C+GGC AIVD+GTSL+ GP VT + AIG ++ E
Sbjct: 272 LT-LCKGGCEAIVDTGTSLITGPVEEVTALQRAIGAIPLIRGE 313
>gi|339237491|ref|XP_003380300.1| lysosomal aspartic protease [Trichinella spiralis]
gi|316976887|gb|EFV60084.1| lysosomal aspartic protease [Trichinella spiralis]
Length = 405
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 128/244 (52%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MDAQY+GEI IG+PPQNF+VIFDTGSSNLWVPSSKC +F I+C+ H+RY S+KS+T
Sbjct: 73 LHNYMDAQYYGEISIGTPPQNFTVIFDTGSSNLWVPSSKCSFFDIACWLHNRYNSKKSST 132
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ EI YGSGS+ GF S+D V + + VK Q F EAT + L F+ A FDGI+G+
Sbjct: 133 YEASGETIEIRYGSGSMRGFKSKDTVCIASLCVKGQGFAEATSQPGLAFIFAHFDGILGM 192
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAVG PV+ M+EQ L+SE VF+FWLNR+P+ + GG I FG VD K++ G T+
Sbjct: 193 AFPSIAVGGIQPVFQAMIEQNLISEAVFAFWLNRNPEDDLGGLISFGTVDEKYYIGNITW 252
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+ + YW+F + I +G++ G C GC I D+GTSL+AGP V + AIG + +
Sbjct: 253 VPLVNQRYWEFNMETIKVGDEHVG-CIDGCTTIADTGTSLIAGPKDEVERLQEAIGAKPL 311
Query: 316 VSAE 319
+ +
Sbjct: 312 IMGQ 315
>gi|31559113|gb|AAP50847.1| cathepsin D [Bombyx mori]
gi|90992734|gb|ABE03014.1| aspartic protease [Bombyx mori]
Length = 385
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 133/292 (45%), Positives = 185/292 (63%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + AR E G + +R + + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLHRMKTARTHFHEV---GTELELLRLKYDVTGPSPEPLSNYLDAQYYGVISIGTPPQ 77
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+++ I+C H++Y SRKS +Y G I YGSGS+SGF
Sbjct: 78 SFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSRKSKSYVANGTQFAIQYGSGSLSGF 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V VG + V+ Q F EA E L F+ A+FDGI+G+ F IAV PV+DNMV Q
Sbjct: 138 LSTDDVTVGGLKVRRQTFAEAVSEPGLAFVAAKFDGILGMAFSTIAVDHVTPVFDNMVAQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + + +
Sbjct: 198 GLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNV-- 254
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
++ C GC+AI D+GTSL+AGP+ V +N A+G + + +C L+
Sbjct: 255 NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYAVDCSLI 306
>gi|112983576|ref|NP_001037351.1| cathepsin D precursor [Bombyx mori]
gi|66269351|gb|AAY43135.1| CathD [Bombyx mori]
Length = 384
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/292 (45%), Positives = 184/292 (63%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + AR E G + +R + + PL N++DAQY+G I IG+PPQ
Sbjct: 21 RVPLHRMKTARTHFHEV---GTELELLRLKYDVTGPSPEPLSNYLDAQYYGVISIGTPPQ 77
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS KC+++ I+C H++Y SRKS TY G I YGSGS+SGF
Sbjct: 78 SFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSRKSKTYVANGTQFAIQYGSGSLSGF 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V VG + V+ Q F EA E L F+ A+FDGI+G+ F IAV PV+DNMV Q
Sbjct: 138 LSTDDVTVGGLKVRRQTFAEAVSEPGLAFVAAKFDGILGMAFSTIAVDHVTPVFDNMVAQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+LNRDP A GGE++ GG DP H++G VP+ + YW+F + + +
Sbjct: 198 GLV-QPVFSFYLNRDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNV-- 254
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV----SAECKLV 323
++ C GC+AI D+GTSL+AGP+ V +N A+G + +C L+
Sbjct: 255 NASRFCAQGCSAIADTGTSLIAGPSKEVEALNAAVGATAIAFGQYVVDCSLI 306
>gi|56417363|gb|AAV90625.1| cathepsin D protein [Sus scrofa]
Length = 395
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 194/301 (64%), Gaps = 19/301 (6%)
Query: 37 RLDLHSLNAARITRKE------RYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGE 88
R+ LH + R T E + +S + + +P LKN+MDAQY+GE
Sbjct: 8 RIPLHKFTSIRRTMSEVGGPVENLIAKGPISKYSQGVPAVTQGPIPEVLKNYMDAQYYGE 67
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G + I+Y
Sbjct: 68 IGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTYVKNGTTFAIHY 127
Query: 148 GSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
GSGS+SG++SQD V V G + V+ Q F EAT++ LTF+ A+FDGI+G+ +
Sbjct: 128 GSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFIAAKFDGILGMAYP 187
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D K++KG Y V
Sbjct: 188 RISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDSKYYKGSLDYHNV 247
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+ AIG ++
Sbjct: 248 TRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQG 306
Query: 319 E 319
E
Sbjct: 307 E 307
>gi|148232796|ref|NP_001083566.1| napsin A aspartic peptidase precursor [Xenopus laevis]
gi|38197533|gb|AAH61685.1| MGC68767 protein [Xenopus laevis]
Length = 392
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 201/316 (63%), Gaps = 22/316 (6%)
Query: 19 LLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI 74
LLL ++G+ RI LKK RR+ S+ A + GA ++ ++
Sbjct: 9 LLLFWDTDGVIRIPLKKFPSIRRMLSDSMTAEELK-------GATKENLQQQMFPEK--- 58
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
L N++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC +F +C+ H +Y+S+ S
Sbjct: 59 --LTNYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSVKCSFFDFACWVHKKYRSQNS 116
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY + + I YG+GS+SGF SQD V +G + V +Q F EA ++ + F+ A FDGI+
Sbjct: 117 STYRQNNTAFAIQYGTGSLSGFLSQDTVSIGSIEVANQTFAEAIKQPGIVFVFAHFDGIL 176
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+G+ +I+V VPV+DNM++Q L+ E VFSF+L+RDP A GGE++ GG DP ++ G
Sbjct: 177 GMGYPDISVDGVVPVFDNMMQQNLLEENVFSFYLSRDPMATVGGELILGGTDPNYYTGDF 236
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ VT+ YWQ + ++ + NQ +C+GGC AIVD+GTSL+ GP + ++ AIG
Sbjct: 237 HYLNVTRMAYWQIKADEVRVNNQLV-LCKGGCQAIVDTGTSLITGPKEEIRALHKAIGAF 295
Query: 314 GVVSAE----CKLVVS 325
+ + E CK + S
Sbjct: 296 PLFAGEYFINCKRIQS 311
>gi|17549909|ref|NP_510191.1| Protein ASP-4 [Caenorhabditis elegans]
gi|3879202|emb|CAA90633.1| Protein ASP-4 [Caenorhabditis elegans]
Length = 444
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 129/244 (52%), Positives = 172/244 (70%), Gaps = 3/244 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLW+PS KC ++ I+C H RY S+ S+T
Sbjct: 86 LRNYMDAQYFGTISIGTPAQNFTVIFDTGSSNLWIPSKKCPFYDIACMLHHRYDSKSSST 145
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G+ I YG+GS+ GF S+D+V V V +DQ F EAT E +TF+ A+FDGI+G+
Sbjct: 146 YKEDGRKMAIQYGTGSMKGFISKDSVCVAGVCAEDQPFAEATSEPGITFVAAKFDGILGM 205
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ EIAV PV++ + EQ V +FSFWLNR+PD+E GGEI FGG+D + + TY
Sbjct: 206 AYPEIAVLGVQPVFNTLFEQKKVPSNLFSFWLNRNPDSEIGGEITFGGIDSRRYVEPITY 265
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP + I + IG E +
Sbjct: 266 VPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPL 323
Query: 316 VSAE 319
+ E
Sbjct: 324 IKGE 327
>gi|54020914|ref|NP_001005701.1| napsin A aspartic peptidase precursor [Xenopus (Silurana)
tropicalis]
gi|49522956|gb|AAH75272.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 402
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 197/312 (63%), Gaps = 14/312 (4%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
LLL +++ L RI LKK +L+ + KE + G + + + L
Sbjct: 9 LLLVWTTDALIRIPLKKFPSIRRTLSDS--MTKEEFNGATKEFLKQQTIPEK------LT 60
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYT 137
N++DAQY+GEI IG+PPQ F+VIFDTGSSNLWVPS KC +F +C+ H +Y+S+ S+TY
Sbjct: 61 NYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSIKCSFFDFACWLHKKYRSKDSSTYQ 120
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ I YG+GS+SGF SQD V VG + V +Q F EA ++ + F+ A FDGI+G+G+
Sbjct: 121 QNNTEFAIQYGTGSLSGFLSQDTVTVGSIDVANQTFAEAVKQPGIVFVFAHFDGILGMGY 180
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V VPV+DNM+EQ L+ E VFSF+L+RDP A GGE+V GG DP ++ G Y+
Sbjct: 181 PNISVDGVVPVFDNMMEQKLLEENVFSFYLSRDPMAMVGGELVLGGTDPNYYTGDFHYLN 240
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+ YWQ + ++ + NQ +C+GGC AIVD+GTSL+ GP + ++ AIG + S
Sbjct: 241 VTRMAYWQIKADEVRVANQLV-LCKGGCQAIVDTGTSLITGPREEIRALHKAIGAFPLFS 299
Query: 318 AE----CKLVVS 325
E CK + S
Sbjct: 300 GEYFVNCKRIQS 311
>gi|290561455|gb|ADD38128.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 138/284 (48%), Positives = 190/284 (66%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ +H +AR K Y G+ + +R R PL N++DAQY+G I IGSPPQ
Sbjct: 20 RVPVHKFQSAR---KHFYEVGSSIQLIRKRWNTVGAHPEPLSNYLDAQYYGPITIGSPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLW+PS C+ + I+C H +Y KS+TY G I YGSGS+SGF
Sbjct: 77 SFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSSTYVANGTEFAIQYGSGSLSGF 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G+V + Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +Q
Sbjct: 137 LSSDSVSMGEVEIGSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +
Sbjct: 197 GLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 257 KS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
>gi|237874218|ref|NP_001153867.1| cathepsin D [Acyrthosiphon pisum]
Length = 393
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 187/295 (63%), Gaps = 7/295 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH +++ R + R G R+ +++ PL N++DAQY+G I IG+PPQ
Sbjct: 30 RVKLHKIDSVRNQLRGRTSNLFGFVQRRYDPLNAE----PLSNYLDAQYYGPITIGTPPQ 85
Query: 97 NFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+V+FDTGSSNLWVPS +C +I+C H++Y KS TY + G I+YGSGS+SG+
Sbjct: 86 PFNVVFDTGSSNLWVPSKQCSVLNIACMLHNKYNMAKSTTYXKNGTEFSIHYGSGSLSGY 145
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + + + +Q F EA +E L F+ A+FDGI+GLG+ IAV VP + NMV Q
Sbjct: 146 LSTDVMSMDGTSIVNQTFAEAIQEPGLAFVAAKFDGILGLGYNTIAVDGVVPPFYNMVNQ 205
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G++ +FSF+LNRDP + GGEI+FGG DP+ + G TYVPVT+ GYWQF L ++++GN
Sbjct: 206 GIIKSAIFSFYLNRDPSSTPGGEIIFGGSDPEKYTGPFTYVPVTRHGYWQFGLDEVIVGN 265
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
T + G AI D+GTSL+AGP + +IN +GG + E + Q +L
Sbjct: 266 --TSIVSGALQAIADTGTSLIAGPVDNIKQINELLGGTAIPGGEYIIACDQIDNL 318
>gi|397490270|ref|XP_003816129.1| PREDICTED: cathepsin D [Pan paniscus]
Length = 603
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 197/305 (64%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E +GG+ VS + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--VGGSVEDLIAKGPVSKYSQAVPSVTAGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S +I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSFDI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+L+RDPDA+ GGE++ GG D K++KG +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|346469557|gb|AEO34623.1| hypothetical protein [Amblyomma maculatum]
Length = 391
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/245 (51%), Positives = 174/245 (71%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PLKN++DAQY+G+I +G+PPQ F V+FDTGSSNLWVPSSKC F+ I+C H +Y ++KS+
Sbjct: 62 PLKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYYAKKSS 121
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS++G S D +GDV V+ Q F E E L F+ A+FDGI+G
Sbjct: 122 TYVKNGTKFEIRYGSGSVTGELSTDVFGLGDVRVQSQTFAEILHESGLAFIAAKFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ +I+V PV+DNMV QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +
Sbjct: 182 LGYPQISVLGVPPVFDNMVAQGVATKPVFSVYLDRNATDPNGGEVLFGGIDEAHYTGNIS 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT+KGYWQF + + +G+ +T C GGC AI D+GTSL+AGPT + ++N AIG
Sbjct: 242 YVPVTRKGYWQFHMDGLKVGDNAT-FCNGGCEAIADTGTSLIAGPTEEIQKLNLAIGAAP 300
Query: 315 VVSAE 319
+ E
Sbjct: 301 FTAGE 305
>gi|332514729|gb|AEE69372.1| cathepsin D [Fasciola gigantica]
Length = 429
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 145/303 (47%), Positives = 188/303 (62%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + RI L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRIKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG V V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
>gi|311324976|gb|ADP89523.1| cathepsin D [Miichthys miiuy]
Length = 396
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 201/319 (63%), Gaps = 16/319 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
LL SVF L +++ L RI LKK R L + R E + A ++
Sbjct: 4 LLLSVFAALAL--------TNDALVRIPLKKFRSIRRELTDSG-KRAEELL--ADRHSLK 52
Query: 65 HRLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
+ G S P LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 53 YNFGFPSSNGPTPELLKNYLDAQYYGEIGLGTPPQLFTVVFDTGSSNLWVPSVHCQILDI 112
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y S KS+TY + G + I YGSGS+SGF SQD +GD+ V++Q+F EAT++
Sbjct: 113 ACLLHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGFLSQDTCTIGDISVQNQLFGEATKQP 172
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+ A+FDGI+G+ + I+V PV+DN++ Q V + VFSF+LNR+PD + GGE++
Sbjct: 173 GVAFIAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELL 232
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK++ G YV +T++ YWQ + + +G+Q T +C+ GC AIVD+GTSL+ GP+
Sbjct: 233 LGGTDPKYYSGDFHYVNITRQAYWQIHVDGMAVGSQLT-LCKSGCEAIVDTGTSLITGPS 291
Query: 301 PVVTEINHAIGGEGVVSAE 319
V + AIG ++ E
Sbjct: 292 AEVRSLQKAIGAIPLIQGE 310
>gi|74198620|dbj|BAE39786.1| unnamed protein product [Mus musculus]
Length = 410
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSGQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DNM++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNMMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVGEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|148231809|ref|NP_001085308.1| cathepsin D precursor [Xenopus laevis]
gi|62739292|gb|AAH94178.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 184/283 (65%), Gaps = 25/283 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 32 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 91
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 92 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 151
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 152 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 211
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 212 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 271
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
T +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 272 LT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGE 313
>gi|431920733|gb|ELK18506.1| Napsin-A [Pteropus alecto]
Length = 760
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 188/306 (61%), Gaps = 14/306 (4%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH--------RLGDSDEDILPLKNFMDA 83
G ++ R RI + Y G ++ +R R+GD +PL NFM+A
Sbjct: 3 GPREGRPSCRCPPPHRIPLRRVYTGRRTLNPLRRWGNPEEPLRMGDPKFISVPLSNFMNA 62
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKS 142
QY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +CYF S+ C+FH R+ S+ S+++ G
Sbjct: 63 QYYGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCYFFSLPCWFHHRFDSKASSSFKPNGTK 122
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+G +SG S+D + +G + F EA E SLTF+ ARFDGI+GLGF +AV
Sbjct: 123 FAIQYGTGRLSGVLSEDKLTIGGITGASVTFGEALWEPSLTFIFARFDGILGLGFPALAV 182
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
P D +V QGL+ + VFSF+L RDP+ +GGE+V GG DP H+ TYVPVT
Sbjct: 183 EGVRPPLDMLVAQGLLDKPVFSFYLTRDPEEADGGELVLGGSDPTHYIPPLTYVPVTVPA 242
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE--- 319
YWQ + + +G T +C GCAAI+D+GTSL+ GP+ + ++ AIGG ++ E
Sbjct: 243 YWQIHMERVQVGTGLT-LCAHGCAAILDTGTSLITGPSEEIRALHRAIGGISLLVGEYLI 301
Query: 320 -CKLVV 324
C L+
Sbjct: 302 QCSLIT 307
>gi|339460405|gb|AEJ76922.1| aspartic protease [Dimocarpus longan]
Length = 222
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 131/216 (60%), Positives = 166/216 (76%), Gaps = 6/216 (2%)
Query: 9 VFCLWVLASCLLLP----ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS--G 62
F + + S LL P A +GL RIGLKK++LD S + +I E A +
Sbjct: 7 AFWVALFLSLLLSPTAFSAPKDGLVRIGLKKKKLDQISRVSGQINSNEGEAIRAPIKKYN 66
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+R LGDSD DI+ LKN+MDAQYFGE+GIG+P Q F+VIFDTGSSNLWVPSSKCYFS++C
Sbjct: 67 LRSNLGDSDTDIVSLKNYMDAQYFGEVGIGTPSQTFTVIFDTGSSNLWVPSSKCYFSVAC 126
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
YFHS+Y+S +S+TY + G S I YG+G++SGFFSQD+V+VGD+ VK+Q FIEAT+E S+
Sbjct: 127 YFHSKYRSTQSSTYKKNGTSAAIQYGTGAVSGFFSQDSVKVGDLFVKNQDFIEATKEASI 186
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
TFL A+FDGI+GLGF+EI+VG+AVPVWDNMV QGLV
Sbjct: 187 TFLAAKFDGILGLGFQEISVGNAVPVWDNMVNQGLV 222
>gi|205289916|gb|ACI02330.1| aspartic protease 1 [Uncinaria stenocephala]
Length = 447
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 200/326 (61%), Gaps = 24/326 (7%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAAR-ITRKERYMGG---------------- 57
LA C L AS + RR + R + S++ +R T +ER +G
Sbjct: 8 LALCTLAVASIH--RRTFHQPARRHVQSVSLSRQPTLRERLLGTGSWEDYQKQRYHYQRK 65
Query: 58 --AGVSGVR-HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
A +G + +L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS
Sbjct: 66 LLAKYAGNKASKLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSR 125
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++ I+C H RY S S+TY E G+ I YG+GS+ GF S+DNV + + ++Q F
Sbjct: 126 KCPFYDIACMLHHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPF 185
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EAT E LTF+ A+FDGI+G+ F EI+V PV+ +EQ V +F+FWLNR+PD+
Sbjct: 186 AEATSEPGLTFIAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPMFAFWLNRNPDS 245
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
E GGEI GG+DP+ + T+ PVT++GYWQF++ D++ G S+ C GC AI D+GT
Sbjct: 246 ELGGEITLGGMDPRRYVEPLTWTPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGT 304
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAE 319
SL+AGP V I IG E ++ E
Sbjct: 305 SLIAGPKAQVEAIQKFIGAEPLMRGE 330
>gi|26354406|dbj|BAC40831.1| unnamed protein product [Mus musculus]
Length = 445
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|225713714|gb|ACO12703.1| Lysosomal aspartic protease precursor [Lepeophtheirus salmonis]
gi|290462953|gb|ADD24524.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/284 (48%), Positives = 189/284 (66%), Gaps = 6/284 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ +H +AR K Y G+ + +R R PL N++DAQY+G I IGSPPQ
Sbjct: 20 RVPVHKFQSAR---KHFYEVGSSIQLIRKRWNTVGAHPEPLSNYLDAQYYGPITIGSPPQ 76
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F VIFDTGSSNLW+PS C+ + I+C H +Y KS+TY G I YGSGS+SGF
Sbjct: 77 SFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSSTYVANGTEFAIQYGSGSLSGF 136
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D+V +G V + Q F EA E + F+ A+FDGI+G+G+ IAV VP + NM +Q
Sbjct: 137 LSSDSVSMGGVEIGSQTFGEAMSEPGMAFVAAKFDGILGMGYSNIAVDGVVPPFYNMFKQ 196
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+ E +FSF+LNR+PDA+ GGEI+FGG DP H+KG TY+PVTKKGYWQF++ + + +
Sbjct: 197 GLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNS 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+S C+ GC AI D+GTSL+AGP+ V +N +GG +++ E
Sbjct: 257 KS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPIINGE 298
>gi|49522906|gb|AAH75134.1| LOC443721 protein, partial [Xenopus laevis]
Length = 398
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 184/283 (65%), Gaps = 25/283 (8%)
Query: 61 SGVRHRLGDSDEDILPLK-----------------------NFMDAQYFGEIGIGSPPQN 97
+ +R + D+D+D L L N++DAQY+GEI IG+PPQ
Sbjct: 31 TSIRRAMSDTDKDSLKLSGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQP 90
Query: 98 FSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFF 156
F+V+FDTGSSNLWVPS C F I+C+ H +Y S KS+TY G + I YGSGS++G+
Sbjct: 91 FTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSSTYVNNGTAFAIQYGSGSLTGYL 150
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S+D V +GD+ VK Q+F EA ++ +TF+ A+FDGI+G+G+ I+V PV+D+++EQ
Sbjct: 151 SKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILGMGYPRISVDGVPPVFDDIMEQK 210
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LV +FSF+LNR+PD + GGE++ GG DP ++ G +Y+ VT+K YWQ + + +G+Q
Sbjct: 211 LVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQ 270
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
T +C+GGC AIVD+GTSL+ GP V + AIG ++ E
Sbjct: 271 LT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIPLIRGE 312
>gi|355566182|gb|EHH22561.1| Cathepsin D [Macaca mulatta]
Length = 450
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 203/330 (61%), Gaps = 33/330 (10%)
Query: 18 CLLLPASSNGLRRIGLKKR------RLDLHSLNAARITRKERYMGGA--------GVSGV 63
C +L ASS RR L R+ LH + R T E MGG +S
Sbjct: 38 CAMLAASSG--RREDLPDMPQPLVDRIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKY 93
Query: 64 RHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
+ E +P LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 94 SQAMPAVTEGPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDI 153
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV-----------GDVVVK 169
+C+ H +Y S KS+TY + G S I+YGSGS+SG+ SQD V V G V V+
Sbjct: 154 ACWLHHKYNSDKSSTYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVE 213
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
QVF EA ++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNR
Sbjct: 214 RQVFGEAIKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNR 273
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
DP A+ GGE++ GG D K+++G +Y+ VT+K YWQ L + + + T +C+ GC AIV
Sbjct: 274 DPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIV 332
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+ GP V E+ AIG ++ E
Sbjct: 333 DTGTSLMVGPVDEVRELQKAIGAVPLIQGE 362
>gi|281182624|ref|NP_001162374.1| cathepsin D precursor [Papio anubis]
gi|160904227|gb|ABX52210.1| cathepsin D (predicted) [Papio anubis]
Length = 412
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 194/305 (63%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHRKYNSDKSSTYVKNGTSFAI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|386869594|ref|NP_001247483.1| cathepsin D precursor [Macaca mulatta]
gi|67971186|dbj|BAE01935.1| unnamed protein product [Macaca fascicularis]
gi|384939322|gb|AFI33266.1| cathepsin D preproprotein [Macaca mulatta]
Length = 412
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 194/305 (63%), Gaps = 25/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + E +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSSTYVKNGTSFAI 140
Query: 146 NYGSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+YGSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILG 200
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +
Sbjct: 201 MAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLS 260
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 261 YLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVP 319
Query: 315 VVSAE 319
++ E
Sbjct: 320 LIQGE 324
>gi|116282368|gb|ABJ97285.1| cathepsin D-like aspartic protease [Fasciola hepatica]
Length = 429
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 188/303 (62%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + RI L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRIKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQVNGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG + V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLITVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
>gi|74207446|dbj|BAE30902.1| unnamed protein product [Mus musculus]
Length = 410
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|224460527|gb|ACN43675.1| cathepsin D [Paralichthys olivaceus]
Length = 396
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/244 (50%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGDIALGTPPQTFSVVFDTGSSNLWVPSVHCSILDIACWLHHKYNSAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SGF SQD +GD+ V+ QVF EAT++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTTFAIQYGSGSLSGFLSQDTCTIGDLTVEKQVFGEATKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V E VFSF+LNR+PD GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVAPVFDNIMSQKKVEENVFSFYLNRNPDMAPGGELLLGGTDPKYYSGDFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ +G + G+Q T +C+ GC AIVD+GTSL+ GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIHMGGMGAGSQLT-LCKDGCEAIVDTGTSLITGPSAEVKALQKAIGAVPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
>gi|395851770|ref|XP_003798425.1| PREDICTED: cathepsin D [Otolemur garnettii]
Length = 405
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 182/250 (72%), Gaps = 8/250 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPSSKC I+C+ H+RY S +S T
Sbjct: 69 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSSKCKMLDIACWLHNRYHSDRSTT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG------DVVVKDQVFIEATREGSLTFLLARF 189
Y + G + +I+YGSGS+SG+ SQD V + +V V+ QVF EAT++ +TF+ A+F
Sbjct: 129 YVKNGTAFDIHYGSGSLSGYLSQDTVLMPCKSVSVNVKVEKQVFGEATKQPGITFIAAKF 188
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + I+V + +P +DN++EQ LV + +FSF+LNRDP+A+ GGE++ GGVD K++
Sbjct: 189 DGILGMAYPRISVDNVLPFFDNLMEQKLVEKNIFSFYLNRDPNAQPGGELMLGGVDSKYY 248
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G +Y+ VT+K YW+ + + + + T +C+GGC AIVD+GTSL+ GP V E+ A
Sbjct: 249 TGSLSYLNVTRKAYWEVHMEQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKA 307
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 308 IGAIPLIQGE 317
>gi|205363469|gb|ACI04164.1| cathepsin D-like aspartic protease precursor [Fasciola hepatica]
Length = 429
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/303 (47%), Positives = 187/303 (61%), Gaps = 14/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGL---KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
VL CLL A+ + R L K R +L + +R G R G
Sbjct: 4 VLLICLLFSAALCDVLRTKLRPFKTTRQELSEYGSLDWESSQRLFGKYA-----GRNGSI 58
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
E L N++DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPS +C Y S +C+ H++Y
Sbjct: 59 PEQ---LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKYN 115
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S+TY G + I YG+GS+SGF S D+ EVG V VK Q F EA +E + F+ A+F
Sbjct: 116 YAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAKF 175
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+GFR I+VG V V++NM+ QGLV E VFSF+LNR+ GGE++ GG+DP ++
Sbjct: 176 DGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNYY 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G TYVPVT + YWQF++ I S +C GC AI D+GTSL+AGP V +N
Sbjct: 236 TGDITYVPVTHEAYWQFKVDKIEFPGVS--ICADGCQAIADTGTSLIAGPKKEVDALNEQ 293
Query: 310 IGG 312
IGG
Sbjct: 294 IGG 296
>gi|122114359|gb|AAY42145.2| cathepsin D [Sus scrofa]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 177/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + I+YGSGS+SG+ SQD V V G + V+ Q F EAT++ LTF+
Sbjct: 131 YVKNGTTFAIHYGSGSLSGYLSQDTVSVPCNSASSGVGGIKVERQTFGEATKQPGLTFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ G E++ GG+D
Sbjct: 191 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGSELMLGGIDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGSLDYHNVTRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|71043798|ref|NP_001020792.1| cathepsin D precursor [Canis lupus familiaris]
gi|85540968|sp|Q4LAL9.1|CATD_CANFA RecName: Full=Cathepsin D; Flags: Precursor
gi|70561318|emb|CAJ14973.1| cathepsin D [Canis lupus familiaris]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 125/253 (49%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q F EAT++ +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSALSGLAGIKVERQTFGEATKQPGITFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG +Y+ VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAIVDTGTSLIVGPVDEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|355681641|gb|AER96810.1| cathepsin D [Mustela putorius furo]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 130/268 (48%), Positives = 185/268 (69%), Gaps = 13/268 (4%)
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
GV GD ++L +N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I
Sbjct: 58 GVPSVAGDPVPEVL--RNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDI 115
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQ 171
+C+ H +Y S KS+TY + G S +I+YGSGS+SG+ SQD V V V V+ Q
Sbjct: 116 ACWIHHKYNSGKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSGLSSLAGVKVERQ 175
Query: 172 VFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP 231
F EAT++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP
Sbjct: 176 TFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIFSFYLNRDP 235
Query: 232 DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
A+ GGE++ GG D K++KG +Y+ VT+K YWQ + + +G+ T +C+GGC AIVD+
Sbjct: 236 GAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEXVDVGSSLT-LCKGGCEAIVDT 294
Query: 292 GTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GTSL+ GP V E+ AIG ++ E
Sbjct: 295 GTSLIVGPVDEVRELQKAIGAVPLIQGE 322
>gi|74198040|dbj|BAE35200.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|6753556|ref|NP_034113.1| cathepsin D precursor [Mus musculus]
gi|115718|sp|P18242.1|CATD_MOUSE RecName: Full=Cathepsin D; Flags: Precursor
gi|50299|emb|CAA37067.1| cathepsin D [Mus musculus]
gi|50301|emb|CAA37423.1| unnamed protein product [Mus musculus]
gi|817945|emb|CAA48453.1| cathepsin d [Mus musculus]
gi|32452040|gb|AAH54758.1| Cathepsin D [Mus musculus]
gi|34785578|gb|AAH57931.1| Cathepsin D [Mus musculus]
gi|74139562|dbj|BAE40918.1| unnamed protein product [Mus musculus]
gi|74139905|dbj|BAE31791.1| unnamed protein product [Mus musculus]
gi|74151769|dbj|BAE29674.1| unnamed protein product [Mus musculus]
gi|74177956|dbj|BAE29773.1| unnamed protein product [Mus musculus]
gi|74178091|dbj|BAE29834.1| unnamed protein product [Mus musculus]
gi|74181413|dbj|BAE29980.1| unnamed protein product [Mus musculus]
gi|74184920|dbj|BAE39078.1| unnamed protein product [Mus musculus]
gi|74185047|dbj|BAE39131.1| unnamed protein product [Mus musculus]
gi|74185557|dbj|BAE30245.1| unnamed protein product [Mus musculus]
gi|74186716|dbj|BAE34813.1| unnamed protein product [Mus musculus]
gi|74189047|dbj|BAE39288.1| unnamed protein product [Mus musculus]
gi|74191359|dbj|BAE30262.1| unnamed protein product [Mus musculus]
gi|74191542|dbj|BAE30346.1| unnamed protein product [Mus musculus]
gi|74197068|dbj|BAE35086.1| unnamed protein product [Mus musculus]
gi|74197198|dbj|BAE35144.1| unnamed protein product [Mus musculus]
gi|74199016|dbj|BAE30724.1| unnamed protein product [Mus musculus]
gi|74204247|dbj|BAE39883.1| unnamed protein product [Mus musculus]
gi|74207294|dbj|BAE30833.1| unnamed protein product [Mus musculus]
gi|74207430|dbj|BAE30895.1| unnamed protein product [Mus musculus]
gi|74212520|dbj|BAE31001.1| unnamed protein product [Mus musculus]
gi|74212556|dbj|BAE31018.1| unnamed protein product [Mus musculus]
gi|74212558|dbj|BAE31019.1| unnamed protein product [Mus musculus]
gi|74213416|dbj|BAE35523.1| unnamed protein product [Mus musculus]
gi|74214708|dbj|BAE31193.1| unnamed protein product [Mus musculus]
gi|74217133|dbj|BAE31236.1| unnamed protein product [Mus musculus]
gi|74219445|dbj|BAE29499.1| unnamed protein product [Mus musculus]
gi|74220283|dbj|BAE31319.1| unnamed protein product [Mus musculus]
gi|74220373|dbj|BAE31412.1| unnamed protein product [Mus musculus]
gi|74220638|dbj|BAE31529.1| unnamed protein product [Mus musculus]
gi|74220740|dbj|BAE31342.1| unnamed protein product [Mus musculus]
gi|74222921|dbj|BAE42305.1| unnamed protein product [Mus musculus]
gi|74225262|dbj|BAE31566.1| unnamed protein product [Mus musculus]
gi|74225282|dbj|BAE31575.1| unnamed protein product [Mus musculus]
gi|148686195|gb|EDL18142.1| cathepsin D, isoform CRA_a [Mus musculus]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|74220304|dbj|BAE31329.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|90076280|dbj|BAE87820.1| unnamed protein product [Macaca fascicularis]
Length = 412
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 192/303 (63%), Gaps = 21/303 (6%)
Query: 37 RLDLHSLNAARITRKE------RYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGE 88
R+ LH + R T E + +S + E +P LKN+MDAQY+GE
Sbjct: 23 RIPLHKFTSIRRTMSEIGGPVEDLIAKGPISKYSQAMPAVTEGPIPEVLKNYMDAQYYGE 82
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S I+Y
Sbjct: 83 IGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSSTYVKNGTSFAIHY 142
Query: 148 GSGSISGFFSQDNVEV-----------GDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GSGS+SG+ SQD V V G V V+ QVF EA ++ +TF+ A+FDGI+G+
Sbjct: 143 GSGSLSGYLSQDTVSVPCKSAPSTAALGGVKVERQVFGEAIKQPGITFIAAKFDGILGMA 202
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V + +PV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K+++G +Y+
Sbjct: 203 YPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGTDSKYYRGSLSYL 262
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
VT+K YWQ L + + + T +C+ GC AIVD+GTSL+ GP V E+ AIG ++
Sbjct: 263 NVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLI 321
Query: 317 SAE 319
E
Sbjct: 322 QGE 324
>gi|74142218|dbj|BAE31874.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|432850599|ref|XP_004066827.1| PREDICTED: cathepsin D-like isoform 1 [Oryzias latipes]
Length = 396
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 138/310 (44%), Positives = 196/310 (63%), Gaps = 8/310 (2%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C +Y
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYN 121
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S KS+TY + G S I YGSGS+SG+ SQD +GD+ V++QVF EA ++ + F+ A+F
Sbjct: 122 SAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAAKF 181
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + I+V VPV+DN+++Q V VFSF+LNR+PD E GGE++ GG DPK++
Sbjct: 182 DGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYY 241
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G YV ++++ YWQ + + +G+Q + +C+GGC AIVD+GTSLL GP+ V + A
Sbjct: 242 SGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQKA 300
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 301 IGAIPLIQGE 310
>gi|42476045|ref|NP_599161.2| cathepsin D precursor [Rattus norvegicus]
gi|38303993|gb|AAH62032.1| Cathepsin D [Rattus norvegicus]
gi|149061703|gb|EDM12126.1| cathepsin D, isoform CRA_c [Rattus norvegicus]
Length = 407
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 201/305 (65%), Gaps = 9/305 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL-KNFM 81
ASS+ L RI L+K ++ + ++ + G + E + L KN++
Sbjct: 16 ASSSALIRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSSPRTKEPVSELLKNYL 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G
Sbjct: 76 DAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSSTYVKNG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
S +I+YGSGS+SG+ SQD V V G + V+ Q+F EAT++ + F+ A+FDGI+G
Sbjct: 136 TSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKFDGILG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP + GGE++ GG D +++ G+ +
Sbjct: 196 MGYPFISVNNVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGELS 255
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 256 YLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVP 314
Query: 315 VVSAE 319
++ E
Sbjct: 315 LIQGE 319
>gi|27803878|gb|AAO22152.1| cathepsin D-like aspartic protease [Ancylostoma ceylanicum]
Length = 446
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 171/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+DNV + + +Q F EAT E LTF
Sbjct: 136 HHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAVEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EI+V PV+ +EQ V VF+FWLNR+PD+E GGEI GG+
Sbjct: 196 IAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGM 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+ + T+ PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP V
Sbjct: 256 DPRRYVEPITWTPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKFIGAEPLMKGE 329
>gi|354496335|ref|XP_003510282.1| PREDICTED: cathepsin D [Cricetulus griseus]
gi|344248735|gb|EGW04839.1| Cathepsin D [Cricetulus griseus]
Length = 408
Score = 264 bits (675), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 125/251 (49%), Positives = 181/251 (72%), Gaps = 9/251 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV-------GDVVVKDQVFIEATREGSLTFLLAR 188
+ + G S +I+YGSGS+SG+ SQD V V G + V+ Q+F EA ++ +TF+ A+
Sbjct: 131 FVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSEQPGGLKVEKQIFGEAIKQPGITFIAAK 190
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+G+ I+V + VPV+DN+++Q LV + +FSF+LNRDP + GGE++ GG+D K+
Sbjct: 191 FDGILGMGYPSISVNNVVPVFDNLMQQKLVEKNIFSFFLNRDPTGQPGGELMLGGIDSKY 250
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++G+ +Y+ VT+K YWQ + + + N T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 YEGELSYLNVTRKAYWQVHMDQLDVANGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQK 309
Query: 309 AIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 AIGAVPLIQGE 320
>gi|146286061|sp|O93428.2|CATD_CHIHA RecName: Full=Cathepsin D; Flags: Precursor
Length = 396
Score = 264 bits (675), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 120/244 (49%), Positives = 170/244 (69%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ + Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + + +G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIRVDSMAVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
>gi|74191361|dbj|BAE30263.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNTFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|312097106|ref|XP_003148873.1| aspartic protease BmAsp-2 [Loa loa]
gi|307755962|gb|EFO15196.1| aspartic protease BmAsp-2 [Loa loa]
Length = 417
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 193/306 (63%), Gaps = 20/306 (6%)
Query: 30 RIGLKKR---RLDL---------HSLNAARITRK--ERYMG-GAGVSGVRHRLGDSDEDI 74
RI L+K+ R DL + L +I RK +R +G G++ + ++DE
Sbjct: 4 RIALRKQNSLRADLIKTGSLESYNKLLNFQIQRKKTQRKIGLDFGLASRPRTISETDE-- 61
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKS 133
LKN+MDAQY+G+I IG+P QNFSV+FDTGSSNLW+PS KC FS I+C FH++YK +S
Sbjct: 62 -ILKNYMDAQYYGQISIGTPAQNFSVVFDTGSSNLWIPSVKCPFSDIACLFHNKYKGAQS 120
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
TY G+ +I YG GS+ GF S D V + D+ V DQ F EAT E +TF++A+FDGI+
Sbjct: 121 TTYKPDGRKIKIQYGRGSMEGFISSDTVCIADICVTDQPFAEATSEPGVTFVMAKFDGIL 180
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+ F EIAV PV+ M++Q V E +F+FWL+R+P+ E GGEI GG+D F
Sbjct: 181 GMAFPEIAVLGLSPVFHTMIKQKTVKESLFAFWLDRNPNDEIGGEITLGGIDVNRFVAPL 240
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y P++K GYWQF++ D + G+ C GC AI D+GTSL+AGP + +I IG E
Sbjct: 241 VYTPISKHGYWQFQM-DSIQGDGKAISCANGCQAIADTGTSLIAGPKSQIDKIQKYIGAE 299
Query: 314 GVVSAE 319
+ + E
Sbjct: 300 HLYADE 305
>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
Length = 396
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 197/315 (62%), Gaps = 11/315 (3%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+F L V A+ L LP S+ L RI L K R L + T +E A + +++ LG
Sbjct: 3 LFLLGVFAA-LALP--SDALIRIPLTKFRSIRRELTDSGRTAEELL---ADKNSLKYNLG 56
Query: 69 -DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
S P LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C
Sbjct: 57 FPSSNGPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSILDIACLL 116
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H +Y S KS+TY + G + I YG+GS+SG+ SQD +GD+ V Q+F EA ++ + F
Sbjct: 117 HHKYNSAKSSTYVKNGTAFAIQYGTGSLSGYLSQDTCTIGDISVDKQLFGEAIKQPGVAF 176
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ + I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG
Sbjct: 177 IAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTEPGGELLLGGT 236
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DPK++ G YV +T++ YWQ + + +G Q +C GC AIVD+GTSL+ GP+ V
Sbjct: 237 DPKYYSGDFHYVNITRQAYWQIHMDGMAVGGQ-LNLCTSGCEAIVDTGTSLITGPSAEVR 295
Query: 305 EINHAIGGEGVVSAE 319
+ AIG + E
Sbjct: 296 SLQKAIGAIPFIQGE 310
>gi|147743000|sp|P85137.1|CARDF_CYNCA RecName: Full=Cardosin-F; Contains: RecName: Full=Cardosin-F heavy
chain; Contains: RecName: Full=Cardosin-F light chain
Length = 281
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 142/261 (54%), Positives = 169/261 (64%), Gaps = 35/261 (13%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D Y+GEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 2 DSGSAVVALTNDRDTSYYGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSMY 53
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY SQD+V +GD+VVK+Q FIEAT E FL
Sbjct: 54 ESSGSSTYK-------------------SQDSVTIGDLVVKEQDFIEATEEADNVFLNRL 94
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I+V PVW NM+ QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 95 FDGILGLSFQTISV----PVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 148
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 149 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 208
Query: 309 AIGGEGVVSAECK--LVVSQY 327
AIG G K L QY
Sbjct: 209 AIGANGSEELNVKFGLTPEQY 229
>gi|432102593|gb|ELK30160.1| Napsin-A [Myotis davidii]
Length = 357
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 134/281 (47%), Positives = 179/281 (63%), Gaps = 8/281 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPLKNFMDAQYFGEIGIG 92
R+ LH + A +R + G G LG +PL N+M+AQY+G+IG+G
Sbjct: 29 RIPLHRVYAG--SRTPNPLRGWGSPEEPRGLGAPPPGGKSAFVPLSNYMNAQYYGKIGLG 86
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ + S+T+ G I YGSG
Sbjct: 87 TPPQNFSVVFDTGSSNLWVPSRRCSFFSLPCWFHHRFDPKASSTFKPNGTKFAIQYGSGQ 146
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+SG S+D + +G + VF EA E SL F+ A FDGI+GLGF +AVG P D
Sbjct: 147 LSGILSEDKLTIGGIKNASVVFGEALWEPSLVFVFAHFDGILGLGFPVLAVGGVRPPLDT 206
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
MV+QGL+ + VFSF+LNRDP+A EGGE+V GG DP H+ TYVPVT YWQ + +
Sbjct: 207 MVDQGLLDKPVFSFYLNRDPEAAEGGELVLGGSDPAHYIPPLTYVPVTVPAYWQVHMERV 266
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+G T +C GC AI+D+GTSL+ GPT + ++ AIGG
Sbjct: 267 TVGPGLT-LCAQGCPAILDTGTSLITGPTEEIRALHRAIGG 306
>gi|83523775|ref|NP_001032810.1| cathepsin D precursor [Sus scrofa]
gi|65330113|gb|AAY42144.1| cathepsin D [Sus scrofa]
Length = 410
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 178/253 (70%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQ +GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYMDAQNYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + I+YGSGS+SG++SQD V V G + V+ Q F EAT++ LTF+
Sbjct: 131 YVKNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG+D
Sbjct: 191 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYKGSLDYHNVTRKAYWQIHMDQVAVGSSLT-LCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|21907889|dbj|BAC05689.1| aspartic protease BmAsp-2 [Brugia malayi]
Length = 452
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 126/244 (51%), Positives = 167/244 (68%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEI IG+PPQNFSV+FDTGSSNLWVPS KC F I+C FH++YK KS T
Sbjct: 91 LKNYMDAQYYGEISIGTPPQNFSVVFDTGSSNLWVPSVKCPFLDIACLFHNKYKGTKSTT 150
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ +I YG+GS+ GF S D V + ++ V Q F EAT E TF++A+FDGI+G+
Sbjct: 151 YKPDGRKIQIQYGTGSMEGFISLDTVCIANICVTGQPFAEATSEPGATFVMAKFDGILGM 210
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ M+ Q +V + VF+FWL+R+P + GGEI FGG+D F TY
Sbjct: 211 AFPEISVLGLNPVFHTMISQKVVHQPVFAFWLDRNPSDKIGGEITFGGIDANRFVSPITY 270
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PV++ GYWQF++ +L ++ G C GC AI D+GTSL+AGP + +I IG E V
Sbjct: 271 TPVSRHGYWQFKMDRVLGRGKAIG-CGNGCQAIADTGTSLIAGPKSQIDKIQEYIGAEHV 329
Query: 316 VSAE 319
+ E
Sbjct: 330 YAGE 333
>gi|3378161|emb|CAA07719.1| cathepsin D precursor [Chionodraco hamatus]
Length = 396
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 132/300 (44%), Positives = 189/300 (63%), Gaps = 11/300 (3%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGG-AGVSGVRHRLGDSDEDIL-PLKNF 80
A SN L+ I H +A R+ + R G + V+ S+ LKN+
Sbjct: 19 ACSNSLKEI-------PFHQTSADRLWEESRGAPGRPSLPEVQLSFPASNAPTPETLKNY 71
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
+DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+TY +
Sbjct: 72 LDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSSTYVKN 131
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + I YGSGS+SG+ SQD +GD+ + Q+F EA ++ + F+ A+FDGI+G+ +
Sbjct: 132 GTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILGMAYPR 191
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I+V PV+DN++ Q V + VFSF+LNR+PD E GGE++ GG DPK++ G YV VT
Sbjct: 192 ISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVNVT 251
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
++ YWQ + + +G+Q + +C GGC AIVDSGTSL+ GP+ V + AIG ++ E
Sbjct: 252 RQAYWQIRVDSMAVGDQLS-LCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPLIQGE 310
>gi|226822856|gb|ACO83090.1| cathepsin D preproprotein (predicted) [Dasypus novemcinctus]
Length = 410
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 128/263 (48%), Positives = 181/263 (68%), Gaps = 15/263 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+P Q F V+FDTGSSNLWVPS C +C+ H +Y S +S+T
Sbjct: 71 LRNYMDAQYYGEIGIGTPAQCFRVVFDTGSSNLWVPSIHCRLLDFACWLHRKYNSGRSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVV---------VKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V +V V QVF EAT++ +TFL+
Sbjct: 131 YVKNGSAFDIHYGSGSLSGYLSQDTVSVSPLVPCSAPVGVSVGKQVFGEATKQPGITFLM 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+VG +PV+DN+++Q LV + VFSF+LNRDP A+ GGE+V GG+DP
Sbjct: 191 AKFDGILGMAYPSISVGGVLPVFDNLMQQKLVDKNVFSFYLNRDPTAQPGGELVLGGMDP 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+H+ G Y+ +T+K YWQ + + +G+ T +C+ GC AIVD+GTSL+ GP V E+
Sbjct: 251 RHYTGSVDYLNITRKAYWQVHMDRLEVGDGLT-LCKQGCEAIVDTGTSLMVGPVAEVREL 309
Query: 307 NHAIGGEGVVSAE----CKLVVS 325
AIG ++ E C+ V S
Sbjct: 310 QKAIGAVPLIQGEYMISCEKVAS 332
>gi|74191270|dbj|BAE39462.1| unnamed protein product [Mus musculus]
gi|74204799|dbj|BAE35462.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
A G ++ E
Sbjct: 310 QKATGAVPLIQGE 322
>gi|301769501|ref|XP_002920177.1| PREDICTED: cathepsin D-like [Ailuropoda melanoleuca]
Length = 371
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 194/305 (63%), Gaps = 27/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDAQ 84
R+ LH + R T E +GG GV G +IL KN+MDAQ
Sbjct: 23 RIPLHKFTSIRRTMSE--LGGPVEDLIAKGPISKYAQGVPSVAGGPIPEIL--KNYMDAQ 78
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSF 138
Query: 144 EINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+I+YGSGS+SG+ SQD V V V V+ Q F EA ++ +TF+ A+FDGI+G
Sbjct: 139 DIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIAAKFDGILG 198
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +
Sbjct: 199 MAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLS 258
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG
Sbjct: 259 YLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVP 317
Query: 315 VVSAE 319
++ E
Sbjct: 318 LIQGE 322
>gi|115720|sp|P24268.1|CATD_RAT RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D 12
kDa light chain; Contains: RecName: Full=Cathepsin D 9
kDa light chain; Contains: RecName: Full=Cathepsin D 34
kDa heavy chain; Contains: RecName: Full=Cathepsin D 30
kDa heavy chain; Flags: Precursor
gi|55882|emb|CAA38349.1| preprocathepsin D [Rattus norvegicus]
Length = 407
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 180/250 (72%), Gaps = 8/250 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV------GDVVVKDQVFIEATREGSLTFLLARF 189
Y + G S +I+YGSGS+SG+ SQD V V G + V+ Q+F EAT++ + F+ A+F
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKF 190
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+G+ I+V +PV+DN+++Q LV + +FSF+LNRDP + GGE++ GG D +++
Sbjct: 191 DGILGMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYY 250
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ +Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+ A
Sbjct: 251 HGELSYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKA 309
Query: 310 IGGEGVVSAE 319
IG ++ E
Sbjct: 310 IGAVPLIQGE 319
>gi|358255149|dbj|GAA56870.1| cathepsin D [Clonorchis sinensis]
Length = 425
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/238 (53%), Positives = 173/238 (72%), Gaps = 5/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALGA 295
>gi|118429511|gb|ABK91803.1| aspartic protease precursor [Clonorchis sinensis]
Length = 425
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/238 (53%), Positives = 173/238 (72%), Gaps = 5/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALGA 295
>gi|9581805|emb|CAC00543.1| necepsin II [Necator americanus]
Length = 446
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQY+G I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSANEIDELLRNYMDAQYYGVIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+D V + + ++Q F EAT E LTF
Sbjct: 136 HHRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDIVCIAGICAEEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EIAV PV+ +EQ V VF+FWLNR+P++E GGEI FGGV
Sbjct: 196 IAAKFDGILGMAFPEIAVLGVTPVFHTFIEQKKVPSPVFAFWLNRNPESEIGGEITFGGV 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D + + T+ PVT++GYWQF++ D++ G S+ C GC AI D+GTSL+AGP V
Sbjct: 256 DTRRYVEPITWTPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKYIGAEPLMKGE 329
>gi|74192771|dbj|BAE34900.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +G++ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|281344446|gb|EFB20030.1| hypothetical protein PANDA_008874 [Ailuropoda melanoleuca]
Length = 345
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/305 (44%), Positives = 194/305 (63%), Gaps = 27/305 (8%)
Query: 37 RLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDAQ 84
R+ LH + R T E +GG GV G +IL KN+MDAQ
Sbjct: 8 RIPLHKFTSIRRTMSE--LGGPVEDLIAKGPISKYAQGVPSVAGGPIPEIL--KNYMDAQ 63
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G S
Sbjct: 64 YYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVKNGTSF 123
Query: 144 EINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+I+YGSGS+SG+ SQD V V V V+ Q F EA ++ +TF+ A+FDGI+G
Sbjct: 124 DIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIAAKFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + I+V + +PV+DN++EQ LV + +FSF+LNR+P A+ GGE++ GG D K++KG +
Sbjct: 184 MAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYYKGPLS 243
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y+ VT+K YWQ + + +G+ T +C+GGC AI+D+GTSL+ GP V E+ AIG
Sbjct: 244 YLNVTRKAYWQVHMEQVDVGSSLT-LCKGGCEAILDTGTSLIVGPVDEVRELQKAIGAVP 302
Query: 315 VVSAE 319
++ E
Sbjct: 303 LIQGE 307
>gi|341884635|gb|EGT40570.1| CBN-ASP-4 protein [Caenorhabditis brenneri]
Length = 447
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 176/254 (69%), Gaps = 6/254 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG+ DE L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLW+PS KC ++ I+C H
Sbjct: 80 LGEIDEL---LRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWIPSKKCPFYDIACMLH 136
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY S+ S+TY E G+ I YG+GS+ GF S+D+V + + +DQ F EAT E +TF+
Sbjct: 137 HRYDSKASSTYKEDGRKMAIQYGTGSMKGFISKDSVCLAGICAEDQPFAEATSEPGITFV 196
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + EIAV PV++ + EQ V +F+FWLNR+PD++ GGEI FGG+D
Sbjct: 197 AAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPANLFAFWLNRNPDSDLGGEITFGGID 256
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
+ + TY PVT+KGYWQF++ D ++G+ G C GC AI D+GTSL+AGP +
Sbjct: 257 SRRYVEPITYAPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIADTGTSLIAGPKAQIEA 314
Query: 306 INHAIGGEGVVSAE 319
I + IG E ++ E
Sbjct: 315 IQNFIGAEPLIKGE 328
>gi|74204520|dbj|BAE35336.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 181/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
+ + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 HVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|167524529|ref|XP_001746600.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774870|gb|EDQ88496.1| predicted protein [Monosiga brevicollis MX1]
Length = 381
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/290 (47%), Positives = 176/290 (60%), Gaps = 19/290 (6%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
G+++ R L A T+ + M G V PL N+ DAQYFGEI I
Sbjct: 25 GMERTRDSLRRQGAMLTTKYQNIMAGTNV---------------PLSNYEDAQYFGEISI 69
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
G+P Q F VIFDTGSSNLWVPSS+C +I+C H++Y S S+TY G I YG+G
Sbjct: 70 GTPAQKFKVIFDTGSSNLWVPSSQCPKTNIACDVHAKYDSSASSTYKANGTKFAIQYGTG 129
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S+SGF S D +GD+ VKDQ F EA E +TF+ A+FDGI+G+GF I+V VPVW
Sbjct: 130 SLSGFLSTDTACIGDLCVKDQTFAEALEEPGVTFVAAKFDGILGMGFSTISVDHVVPVWY 189
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+Q +V + ++SF+LNR+P+ GGE+ GG D HF G + VT GYWQF +
Sbjct: 190 NMVQQQVVEQNMYSFYLNRNPNGVSGGELTLGGYDESHFAGPIHWTDVTVDGYWQFTMTG 249
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
+ I N T C C AI D+GTSLLAGPT VV +IN AIG + + E
Sbjct: 250 LSIEN--TPYCT-NCKAIADTGTSLLAGPTDVVKQINKAIGATTIAAGEA 296
>gi|410982348|ref|XP_003997519.1| PREDICTED: napsin-A [Felis catus]
Length = 422
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/296 (44%), Positives = 187/296 (63%), Gaps = 7/296 (2%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S L RI L++ +LN R K G GD + ++PL N+M+ Q
Sbjct: 21 SASLIRIPLRRVHTGHRTLNPPRGWGKPAATPALGAPSP----GD-NPTVIPLSNYMNVQ 75
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 YYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFFSLPCWLHHRFNPKASSSFQPNGTKF 135
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+I YG+G ++G S+D + +G ++ +F EA E SL F LARFDGI+GL F +AVG
Sbjct: 136 DIQYGTGRLAGILSEDKLTIGGMMNASVIFGEALWESSLVFTLARFDGILGLAFPVLAVG 195
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT Y
Sbjct: 196 GVRPPLDVLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAY 255
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
WQ + + +G T +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E
Sbjct: 256 WQIHMERMKVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALNTAIGGISLLVGE 310
>gi|74219443|dbj|BAE29498.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+G SLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGASLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|391329068|ref|XP_003738999.1| PREDICTED: lysosomal aspartic protease-like [Metaseiulus
occidentalis]
Length = 384
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 121/249 (48%), Positives = 170/249 (68%), Gaps = 2/249 (0%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSR 131
++ P+ N+MDAQY+G I IG+PPQ F V+FDTGSSNLWVPS+ C + ++C H++Y S
Sbjct: 52 NVEPIANYMDAQYYGPISIGNPPQPFQVVFDTGSSNLWVPSANCPITNVACLLHNKYHSS 111
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
KS +Y G + I YGSG++SG S D+V V V + Q F E +E L F+ +FDG
Sbjct: 112 KSTSYLANGTTFSIQYGSGAVSGLLSADDVSVNGVNITRQTFAEILKESGLGFIAGKFDG 171
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+G+ +I+V +PV+D MV Q ++ +FSF+L RD D G E+V GG+DPKH KG
Sbjct: 172 ILGMGYPQISVLGVLPVFDQMVAQNAIAAPIFSFYLTRDNDHPTGSELVIGGIDPKHHKG 231
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
+ TY+PV++KGYWQF++ + IG+ S T +C GC AI D+GTSL+AGPT V +N AI
Sbjct: 232 EITYIPVSRKGYWQFKMDSVKIGDVSKTTLCANGCQAIADTGTSLIAGPTSEVKALNKAI 291
Query: 311 GGEGVVSAE 319
G ++ E
Sbjct: 292 GAAPFLNGE 300
>gi|195120065|ref|XP_002004549.1| GI19550 [Drosophila mojavensis]
gi|193909617|gb|EDW08484.1| GI19550 [Drosophila mojavensis]
Length = 387
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 134/258 (51%), Positives = 179/258 (69%), Gaps = 6/258 (2%)
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
+++ GDS E PL N++DAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + I+
Sbjct: 49 IKYGAGDSPE---PLSNYLDAQYYGPISIGTPPQNFKVVFDTGSSNLWVPSKKCHLTNIA 105
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H++Y + KS+TY + G S +I+YGSGS+SG+ S D V + + +K Q F EA E
Sbjct: 106 CLMHNKYDASKSSTYNKNGTSFDIHYGSGSLSGYLSSDTVNIAGLDIKGQTFAEALSEPG 165
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F+ A+FDGI+GLG+ I+V P + NM EQ L+++ VFSF+LNRDP A EGGEI+F
Sbjct: 166 LVFVAAKFDGILGLGYSSISVDGVKPPFYNMFEQSLIAQPVFSFYLNRDPKAPEGGEIIF 225
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DP H+ G TY+PVT+KGYWQ ++ I N +C+GGC I D+GTSL+A P
Sbjct: 226 GGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQINNVE--LCKGGCQVIADTGTSLIAAPAA 283
Query: 302 VVTEINHAIGGEGVVSAE 319
T IN AIGG +V +
Sbjct: 284 EATSINQAIGGTPIVGGQ 301
>gi|73947914|ref|XP_533610.2| PREDICTED: napsin-A [Canis lupus familiaris]
Length = 422
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 194/310 (62%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA ++ L RI L++ L +LN+ R K GV GD + +PL N+M
Sbjct: 19 PARAS-LIRIPLRRVYPGLETLNSLRGWGKPTVPPSLGVPSS----GD-NPVFVPLSNYM 72
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
+ QY+GEIG+G+PPQNFSVIFDTGSSNLWVPS +C +FS+ C+FH RY S+ S+++ G
Sbjct: 73 NVQYYGEIGLGTPPQNFSVIFDTGSSNLWVPSIRCHFFSLPCWFHHRYNSKASSSFQPNG 132
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G V +F EA E SL F LA FDGI+GLGF +
Sbjct: 133 TKFAIQYGTGRLDGILSEDKLTIGGVKSASVIFGEALWEPSLVFTLAHFDGILGLGFPIL 192
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
AVG P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT
Sbjct: 193 AVGGVQPPLDLLVDQGLLDKPVFSFYLNRDPEAVDGGELVLGGSDPAHYIPPLTFLPVTV 252
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G +C GCAAI+D+GTSL+ GPT + +N AIGG ++ E
Sbjct: 253 PAYWQIHMERVKVGTGLI-LCAQGCAAILDTGTSLITGPTEEIQALNAAIGGFSLLLGEY 311
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 312 LIQCSEIPTL 321
>gi|241275826|ref|XP_002406708.1| aspartic protease, putative [Ixodes scapularis]
gi|215496940|gb|EEC06580.1| aspartic protease, putative [Ixodes scapularis]
Length = 345
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 182/284 (64%), Gaps = 5/284 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + ++R + +G R G E PLKN++DAQY+GEI +G+PPQ
Sbjct: 23 RMPLHKMQSSRAHLLDATTPLTRPAGHATRGGPIPE---PLKNYLDAQYYGEITLGTPPQ 79
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
+F V+FDTGSSNLWVPS+KC F+ I+C H +Y SRKS+TY + G EI YGSGS+ G
Sbjct: 80 SFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSSTYVKNGTQFEIRYGSGSVRGE 139
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D + VGD V Q F E E L FL A+FDGI+GLG+ EI+V V+D MV Q
Sbjct: 140 LSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGLGYPEISVLGVPTVFDTMVAQ 199
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ + +GN
Sbjct: 200 GVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYIGNISYVPVSKRGYWQVHMDGTRVGN 259
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 260 NGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 302
>gi|74151850|dbj|BAE29712.1| unnamed protein product [Mus musculus]
gi|74151877|dbj|BAE29725.1| unnamed protein product [Mus musculus]
Length = 410
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 180/253 (71%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ G D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGDTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 126/245 (51%), Positives = 170/245 (69%), Gaps = 2/245 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
PL N++DAQY+G+I IG+PPQ F+V+FDTGSSNLWVPS C + I+C H++YK+ +S+
Sbjct: 59 PLTNYLDAQYYGKIYIGTPPQPFTVVFDTGSSNLWVPSVHCAITDIACLIHNKYKASESS 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G S I YGSGS+SG+ S D V + V K+Q+F EAT+E LTF+ A+FDGI+G
Sbjct: 119 SYKSNGTSFAIQYGSGSLSGYVSSDIVSIAGVKSKNQLFAEATKEPGLTFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ EI+V PV++ M +Q ++ FSF+LNRD +A GGE+ GGVD K F G +
Sbjct: 179 MGYPEISVNGITPVFNQMFKQEALAHNQFSFYLNRDANASSGGELYLGGVDTKKFTGSFS 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
Y PVT KGYWQ + + +G+ ST C GC AIVDSGTSLLAGPT + +IN IG
Sbjct: 239 YHPVTVKGYWQISMDSVSVGS-STSACVSGCKAIVDSGTSLLAGPTDEIEKINKLIGATK 297
Query: 315 VVSAE 319
++ E
Sbjct: 298 FLNGE 302
>gi|205364148|gb|ACI04532.1| aspartic protease 1 precursor [Ancylostoma duodenale]
Length = 446
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 171/255 (67%), Gaps = 2/255 (0%)
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
+L ++E L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS KC ++ I+C
Sbjct: 76 KLQSTNEIDELLRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S S+TY E G+ I YG+GS+ GF S+DNV + + ++Q F EAT E LTF
Sbjct: 136 HRRYDSGASSTYKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTF 195
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+ F EI+V PV+ +EQ V VF+FWLNR+PD+E GGEI GG+
Sbjct: 196 IAAKFDGILGMAFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGM 255
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D + + T+ PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP V
Sbjct: 256 DTRRYVEPITWTPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVE 314
Query: 305 EINHAIGGEGVVSAE 319
I IG E ++ E
Sbjct: 315 AIQKFIGAEPLMKGE 329
>gi|301619112|ref|XP_002938948.1| PREDICTED: cathepsin D-like [Xenopus (Silurana) tropicalis]
Length = 355
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 117/241 (48%), Positives = 171/241 (70%), Gaps = 2/241 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTE 138
++ AQY+GEIG+GSPPQNF+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY +
Sbjct: 30 YLQAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACWMHHKYDSSKSSTYVK 89
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G + I YG+GS+SG+ S+D V +G++ VK Q+F EA ++ +TF+ A+FDGI+G+ +
Sbjct: 90 NGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVTFVAAKFDGILGMAYP 149
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+V PV+DN++ Q LV +FSF+LNR+PD + GGE++ GG DPK++ G Y+ V
Sbjct: 150 VISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSV 209
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSA 318
T+K YWQ + + +G+Q T +C+GGC IVD+GTSL+ GP VT + AIG ++
Sbjct: 210 TRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLIQG 268
Query: 319 E 319
+
Sbjct: 269 Q 269
>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
Length = 396
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+I +G+PPQ FSV+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 68 LKNYLDAQYYGDITLGTPPQTFSVVFDTGSSNLWVPSIHCSLLDIACLLHKKYNSAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + V++Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDTCSIGGLTVENQLFGEAIKQPGIAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V +PV+DN+++Q V VFSF+LNR+PD GGE++ GG DP ++ G+ Y
Sbjct: 188 AYPRISVDGVLPVFDNIMQQKKVESNVFSFYLNRNPDTAPGGELLLGGTDPTYYTGEFNY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + ++ +G+Q T +C+GGC AIVD+GTSLL GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQVSMDELAVGSQLT-LCKGGCQAIVDTGTSLLTGPSAEVKALQKAIGAIPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
>gi|86278345|gb|ABC88426.1| cathepsin D-like aspartic proteinase preproprotein [Meloidogyne
incognita]
Length = 454
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 127/250 (50%), Positives = 165/250 (66%), Gaps = 7/250 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+G I IGSPPQNFSVIFDTGSSNLWVPS KC ++ I+C H +Y S KS++
Sbjct: 82 LRNYMDAQYYGPISIGSPPQNFSVIFDTGSSNLWVPSKKCPFYDIACLLHHKYDSTKSSS 141
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+ +I YG+GS+ GF S+D V + ++ V Q F EA E LTF+ A+FDGI+G+
Sbjct: 142 YKDDGRKMQIQYGTGSMKGFVSKDTVCIANICVAGQEFAEAVSEPGLTFVAAKFDGILGM 201
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ M+ Q V E VFSFWLNRDP ++ GGEI GG D + + Y
Sbjct: 202 AFPEISVLGVQPVFQQMISQQKVPEPVFSFWLNRDPYSKVGGEITIGGTDKRRYVEPLNY 261
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG---- 311
PVT+K YWQF++ + C+ GC AI D+GTSL+AGP + EI H IG
Sbjct: 262 TPVTRKAYWQFKMEGVHNSKGEKIACQNGCEAIADTGTSLIAGPKAQIEEIQHYIGAVPL 321
Query: 312 --GEGVVSAE 319
GE +VS E
Sbjct: 322 MHGEYMVSCE 331
>gi|344307517|ref|XP_003422427.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin D-like [Loxodonta
africana]
Length = 419
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 177/254 (69%), Gaps = 12/254 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 79 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSAKSST 138
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV----------GDVVVKDQVFIEATREGSLTFL 185
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EAT++ +TF+
Sbjct: 139 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCSSASASALGGVRVERQTFGEATKQPGITFI 198
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + I+V VPV+DN++ Q LV + +FSF+LNRDP A+ GGE++ GG+D
Sbjct: 199 AAKFDGILGMAYPRISVNKVVPVFDNLMAQKLVEKNMFSFYLNRDPTAQPGGELMLGGID 258
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
K++ G + VT++ YWQ + + +GN T +C+GGC AIVD+GTSL+ GP +TE
Sbjct: 259 SKYYTGTLNFNKVTREAYWQIHMDRVDVGNGLT-LCKGGCEAIVDTGTSLMVGPVEEITE 317
Query: 306 INHAIGGEGVVSAE 319
+ A+G ++ E
Sbjct: 318 LQKALGAIPLIQGE 331
>gi|326433118|gb|EGD78688.1| cathepsin D [Salpingoeca sp. ATCC 50818]
Length = 385
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 142/351 (40%), Positives = 202/351 (57%), Gaps = 28/351 (7%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRI---GLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
+ L +A+ L+ + NGL R+ G+ + R L + AA + +
Sbjct: 3 MARTMALLAVATLLMAACAVNGLHRVPLTGMPRSRDTLRNAGAALLNK------------ 50
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ LG+ +P+ NF DAQY+GEI IG+PPQ F V+FDTGSSNLWVPS +C S++C
Sbjct: 51 --YSLGNGTN--VPIYNFEDAQYYGEITIGTPPQRFKVVFDTGSSNLWVPSKQCK-SLAC 105
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H +Y S +S+TY G I YGSGS++GF S D VGD+ V+ Q+F EAT E +
Sbjct: 106 DLHHKYDSSQSSTYFPNGTKFAIEYGSGSLTGFLSGDKTCVGDLCVEKQLFAEATNEPGI 165
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
TF+ A+FDGI+G+GF EI+V VP W N+V G V +++FWLNR A GGE+ G
Sbjct: 166 TFVAAKFDGILGMGFVEISVDQVVPYWYNLVSAGKVESNMYTFWLNRVQGAPSGGELTLG 225
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G DPKH G +VP+T+ GYWQF + + + S C C AI D+GTSLLAGPT
Sbjct: 226 GYDPKHMSGPIQWVPLTRDGYWQFAMDSLSVNGDS--YCS-NCQAIADTGTSLLAGPTDA 282
Query: 303 VTEINHAIG----GEGVVSAECKLVVSQYG-DLIWDLLVSGLLPEKVCQQI 348
+ ++N IG +G +CK + + D++ + L P++ Q+
Sbjct: 283 IKKLNKQIGAIPIAQGEYMVDCKKIPTMPNVDIVLNGQKFTLTPQQYVLQV 333
>gi|195380081|ref|XP_002048799.1| GJ21122 [Drosophila virilis]
gi|194143596|gb|EDW59992.1| GJ21122 [Drosophila virilis]
Length = 391
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 195/315 (61%), Gaps = 21/315 (6%)
Query: 12 LWVLASCLLLP---ASSNGLRRIGLKK---RRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L + A CL L A+ L R+ L K R + + +Y G GVS
Sbjct: 5 LLLFAVCLALAWAVAAEPKLLRVPLNKFQSARRHFADVGTELQQLRIKYGGAGGVSPE-- 62
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYF 124
PL N++DAQY+G I IGSPPQNF V+FDTGSSNLWVPS KC+ + I+C
Sbjct: 63 ----------PLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIACLM 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y + KS++Y++ G I+YGSGS+SG+ S D V + + +KDQ F EA E L F
Sbjct: 113 HNKYDASKSSSYSKNGTEFAIHYGSGSLSGYLSSDTVNIAGLDIKDQTFAEALSEPGLVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GLG+ I+V P + +M EQGL+S+ VFSF+LNRDP A EGGEI+FGG
Sbjct: 173 VAAKFDGILGLGYSSISVDGVKPPFYSMFEQGLISQPVFSFYLNRDPKAPEGGEIIFGGS 232
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP H+ G TY+PVT+KGYWQ ++ + N +C+GGC I D+GTSL+A P T
Sbjct: 233 DPNHYTGDFTYLPVTRKGYWQIKMDSAQLNNLE--LCKGGCQIIADTGTSLIAAPVAEAT 290
Query: 305 EINHAIGGEGVVSAE 319
IN AIGG +V +
Sbjct: 291 SINQAIGGTPIVGGQ 305
>gi|71727523|gb|AAZ39883.1| cathepsin D-like aspartic protease [Opisthorchis viverrini]
Length = 425
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 124/237 (52%), Positives = 173/237 (72%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS+ C F+I+C+ H +Y S +S+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFQVVFDTGSSNLWVPSTHCSIFNIACWLHHKYDSARSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V+VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YYPNGTEFSIRYGSGSVSGILSTDYVSVGTVIVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLV E VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKSISV-DGVPTLFDNMISQGLVPEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMSVGGMK--LCENGCQAIADTGTSLIAGPSEEVGKLNDALG 294
>gi|407728652|gb|AFU24355.1| cathepsin D [Ctenopharyngodon idella]
Length = 398
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 201/319 (63%), Gaps = 16/319 (5%)
Query: 17 SCLLLPA----SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSD 71
+CLLL A +S+ + RI L K R +L+ + +E AG +++ LG +
Sbjct: 4 ACLLLAAAFFWTSDAIVRIPLTKFRSIRRTLSDSGRAVEELV---AGSVPLKYNLGFPAS 60
Query: 72 EDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
P LKN++DAQY+GEIG+G+P Q+F+V+FDTGSSNLWVPS C I+C H +Y
Sbjct: 61 NGPTPGTLKNYLDAQYYGEIGLGTPVQSFTVVFDTGSSNLWVPSVHCSLMDIACLLHHKY 120
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
KS+TY + G I YGSGS+SG+ SQD VGD+ V+ Q+F EA ++ + F+ A+
Sbjct: 121 NGGKSSTYVKNGTEFAIQYGSGSLSGYLSQDTCTVGDIAVEKQIFGEAIKQPGVAFIAAK 180
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+ + IAV PV+D M+ Q V + +FSF+LNR+PD + GGE++ GG DPK+
Sbjct: 181 FDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNIFSFYLNRNPDTQPGGELLLGGTDPKY 240
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G YV ++++ YWQ + + IG++ T +C+GGC AIVD+GTSL+ GP + +
Sbjct: 241 YTGDFNYVDISRQAYWQIHMDGMSIGSELT-LCKGGCEAIVDTGTSLITGPATEIKALQK 299
Query: 309 AIGG----EGVVSAECKLV 323
AIG +G +CK V
Sbjct: 300 AIGAIPLIQGEYMVDCKKV 318
>gi|315440803|gb|ADU20407.1| aspartic protease 1 [Clonorchis sinensis]
Length = 425
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 127/238 (53%), Positives = 172/238 (72%), Gaps = 5/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S K +T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKYST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QGLVSE VFSF+L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEIL 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 240 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALGA 295
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 124/253 (49%), Positives = 177/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 95
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y G + +I+YGSGS+SG+ SQD V V V V+ Q+F EAT++ +TF+
Sbjct: 96 YVRNGTAFDIHYGSGSLSGYLSQDTVSVPCKSAPSPPSSVKVERQIFGEATKQPGITFIA 155
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D
Sbjct: 156 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDS 215
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G +Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V +
Sbjct: 216 KYYTGSLSYLNVTRKAYWQVHMEQVDVGNSLT-LCKAGCEAIVDTGTSLVVGPVEEVRAL 274
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 275 QKAIGAVPLIQGE 287
>gi|315274244|gb|ADU03674.1| cathepsin D2 [Ixodes ricinus]
Length = 387
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 182/286 (63%), Gaps = 10/286 (3%)
Query: 37 RLDLHSLNAAR--ITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSP 94
R+ LH + +AR + + V R + + PLKN++DAQY+GEI +G+P
Sbjct: 23 RMPLHKMQSARAHLLDATTPLTRPAVHATRGPIPE------PLKNYLDAQYYGEITLGTP 76
Query: 95 PQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQ+F V+FDTGSSNLWVPS+KC F+ I+C H +Y SRKS+TY + G EI YGSGS+
Sbjct: 77 PQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSSTYVKNGTQFEIRYGSGSVR 136
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
G S D + VGD V Q F E E L FL A+FDGI+GLG+ EI+V V+D MV
Sbjct: 137 GELSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGLGYPEISVLGVPTVFDTMV 196
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
QG+ ++ VFS +L+R+ GGE++FGG+D H+ G +YVPV+K+GYWQ + +
Sbjct: 197 AQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYTGNISYVPVSKRGYWQVHMDGTRV 256
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GN + C GGC AI+D+GTSL+AGP+ + ++N IG S E
Sbjct: 257 GNNGS-FCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGE 301
>gi|118344558|ref|NP_001072052.1| cathepsin D1 precursor [Takifugu rubripes]
gi|55771082|dbj|BAD69801.1| cathepsin D1 [Takifugu rubripes]
Length = 396
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS++
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHHKYNSAKSSS 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIRYGSGSLSGYLSQDTCTLGDLAVEKQLFGEAIKQPGIAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+DN++ Q V + VFSF+LNR+PD + GGE++ GG DPK++ G Y
Sbjct: 188 AYPRISVDGVTPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFDY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT++ YWQ + + +G+Q + +C+ GC AIVD+GTSLL GP+ V + AIG +
Sbjct: 248 VNVTRQAYWQIHMDGMSVGSQLS-LCKSGCEAIVDTGTSLLTGPSEEVKALQKAIGAMPL 306
Query: 316 VSAE 319
+ E
Sbjct: 307 IQGE 310
>gi|147906891|ref|NP_001082550.1| cathepsin D precursor [Xenopus laevis]
gi|28436104|dbj|BAC57431.1| cathepsin D [Xenopus laevis]
Length = 409
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 197/326 (60%), Gaps = 20/326 (6%)
Query: 1 MEQKLLRSVFCLWVLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMG 56
M + S+ CL C L+ + L RI LKK RR A T K+
Sbjct: 1 MASAPVWSLLCL-----CCLVFQPGSSLVRIPLKKFTSIRR-------AMSDTDKDSLKL 48
Query: 57 GAGVSGVRHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
+ ++ + P L N++DAQY+GEI IG+PPQ F+V+FDTGSSNLWV S
Sbjct: 49 SGNEAATKYSAFPKSNNPTPETLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVASV 108
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
C F I+C+ H +Y S KS+TY + G I YG+GSISG+ S+D V +G++ K+Q+F
Sbjct: 109 HCSMFDIACWMHRKYDSSKSSTYVKNGTEFAIQYGTGSISGYLSKDTVTIGNLGYKEQIF 168
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EA ++ +TF+ A+FDGI+G+ + I+V P +DN++ Q LV VFSF+LNR+PD
Sbjct: 169 GEAIKQPGVTFIAAKFDGILGMAYPIISVDGVSPCFDNIMAQKLVESNVFSFYLNRNPDT 228
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
+ GGE++ GG DPK++ G Y+ VT+K YWQ + + +G+Q T +C+GGC AIVD+GT
Sbjct: 229 QPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGT 287
Query: 294 SLLAGPTPVVTEINHAIGGEGVVSAE 319
SL+ GP V + AIG ++ E
Sbjct: 288 SLITGPVEEVAALQRAIGAIPLIRGE 313
>gi|311258028|ref|XP_003127411.1| PREDICTED: napsin-A [Sus scrofa]
Length = 416
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 189/307 (61%), Gaps = 14/307 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDILPLKNFMDAQ 84
L RI L++ L +LN R K S RLG D+ +PL N+++ Q
Sbjct: 23 LIRIPLRRVHAGLRTLNPLRAWEK---------SAEPPRLGAPSPGDKTFVPLSNYLNVQ 73
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIG+G+PPQNFSVIFDTGSSNLWVPS +C+F S+ C+ H RY S+ S+++
Sbjct: 74 YYGEIGLGTPPQNFSVIFDTGSSNLWVPSGRCHFLSLPCWLHHRYHSKASSSFHSNETKF 133
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YG+G ++G S+D + +G + +F EA E SL F A FDGI+GLGF +AVG
Sbjct: 134 AIQYGTGRLNGILSEDKLTIGGLTGASVIFGEALWEPSLVFAFAHFDGILGLGFPVLAVG 193
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P D++V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT Y
Sbjct: 194 GVRPPLDSLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFVPVTVPAY 253
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQ + + +G T +C GCAAI+D+GTSL+ GPT + + AIGG ++ E +
Sbjct: 254 WQVHVERVHVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALQAAIGGIPLLMGEYLIQ 312
Query: 324 VSQYGDL 330
S+ L
Sbjct: 313 CSKIPTL 319
>gi|27503926|gb|AAH42316.1| Ctsd protein [Danio rerio]
gi|38571742|gb|AAH62824.1| Ctsd protein [Danio rerio]
gi|197247273|gb|AAI64814.1| Ctsd protein [Danio rerio]
Length = 398
Score = 260 bits (665), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 203/326 (62%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V+A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVVA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|395858453|ref|XP_003801583.1| PREDICTED: napsin-A [Otolemur garnettii]
Length = 419
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 135/298 (45%), Positives = 185/298 (62%), Gaps = 7/298 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMD 82
S L R+ L++ +LN R R+ + S ++LG ++PL +F+D
Sbjct: 20 PSGATLIRVSLRRVHSGHKTLNLLRRWREPAELSSLEASSPGNKLG-----LVPLSDFLD 74
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 75 VQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFQPNGT 134
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YGSG ++G S+D + +G + VF EA E SLTF A FDGI+GLGF +A
Sbjct: 135 KFAIEYGSGRLNGILSKDKLTIGGLKGASVVFGEALWEPSLTFTFAPFDGILGLGFPILA 194
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDPD +GGE+V GG DP H+ T+VPVT
Sbjct: 195 VEGVRPPLDVLVEQGLLDKPVFSFYLNRDPDVADGGELVLGGSDPAHYIPPLTFVPVTIP 254
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG + E
Sbjct: 255 AYWQIHMERVKVGTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLPPGE 311
>gi|22651403|gb|AAL61540.1| cathepsin D precursor [Danio rerio]
Length = 398
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 202/326 (61%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVAA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|4099023|gb|AAD00524.1| aspartic protease [Onchocerca volvulus]
Length = 422
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/292 (46%), Positives = 182/292 (62%), Gaps = 18/292 (6%)
Query: 21 LPASSNGLRRIGLKKR-RLDLHSLNAA-----------RITRKE-RYMGGAGVSGVRHRL 67
+ A N RI L K+ + H L A +I RK ++ G +
Sbjct: 26 IAAEENHFTRIALHKQDSIHSHLLKAGSWEAYSELVNFQIQRKRIQHKYEFGSRSGKSIA 85
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHS 126
G++DE LKN+MDAQY+GEI IG+PPQNFSVIFDTGSSNLW+PS KC F I+C H+
Sbjct: 86 GETDE---VLKNYMDAQYYGEISIGTPPQNFSVIFDTGSSNLWIPSIKCPFLDIACLLHN 142
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+YK +S TY G+ EI YG GS+ GF S D V + DV V DQ F EAT E +TF++
Sbjct: 143 KYKGTESKTYKSDGRKIEIQYGRGSMKGFVSMDTVCIADVCVTDQPFAEATSEPGVTFIM 202
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ F EIAV PV++ M+ Q ++ + VF+FWL+R+P E GGEI GG+D
Sbjct: 203 AKFDGILGMAFPEIAVLGLSPVFNTMISQKVLQQPVFAFWLDRNPSDEVGGEITLGGIDT 262
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
F TY PV++ GYWQF++ I +++ G C GC AI D+GTSL+AG
Sbjct: 263 NRFVSPITYTPVSRHGYWQFKMDSIQGKDEAIG-CANGCQAIADTGTSLIAG 313
>gi|380036056|ref|NP_001244039.1| cathepsin D1 precursor [Ictalurus punctatus]
gi|330689904|gb|AEC33270.1| cathepsin D1 [Ictalurus punctatus]
Length = 396
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 6/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKN++DAQY+GEIG+GSP Q F+V+FDTGSSNLWVPS C + I+C H +Y KS+T
Sbjct: 68 LKNYLDAQYYGEIGLGSPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHHKYNGAKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+ A+FDGI+G+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDVCTIGDIAVEKQIFGEAIKQPGVAFIAAKFDGILGM 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ IAV PV+D M+ Q V + VFSF+LNR+PD + GGE++ GG DPK + G Y
Sbjct: 188 AYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKFYTGDFHY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
V +T++ YWQ + + IG+Q T +C+GGC AIVD+GTSL+ GP V + AIG
Sbjct: 248 VNITRQAYWQIHMDGMTIGSQLT-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPL 306
Query: 313 -EGVVSAECKLVVS 325
+G +CK V S
Sbjct: 307 IQGEYMVDCKKVPS 320
>gi|260837471|ref|XP_002613727.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
gi|229299116|gb|EEN69736.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
Length = 392
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/268 (47%), Positives = 177/268 (66%), Gaps = 13/268 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKNFMD QY+G I +G+PPQ+F+VIFDTGSSNLWVPS KC +C H RY KS TY
Sbjct: 66 LKNFMDVQYYGVISLGTPPQDFNVIFDTGSSNLWVPSVKCE-GAACANHQRYNHSKSCTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ +I YGSGS+SGF SQD V +G +V+K+Q F EAT E F +FDGI+GL
Sbjct: 125 KADGRPLKITYGSGSLSGFLSQDVVMIGSIVIKNQTFGEATNEPGSAFATGKFDGILGLA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +IAV PV+D +++Q LV + VFSF+L+RDP GGE++ GG DP ++ G TY+
Sbjct: 185 YPQIAVDHIRPVFDMIMDQKLVDKNVFSFYLDRDPSRAPGGELLLGGTDPTYYTGNFTYI 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PV+ +GYWQ + + +G+Q +C GGC AIVD+GTSL+AGP+ + ++ AIG + +
Sbjct: 245 PVSYQGYWQLNMDGVHVGDQK--LCAGGCQAIVDTGTSLIAGPSEEIHKLQAAIGSQQIS 302
Query: 317 SAE----------CKLVVSQYGDLIWDL 334
+ +V Q+GD +++L
Sbjct: 303 PGQYLVDCGRLDDLPVVSFQFGDKLFNL 330
>gi|308483047|ref|XP_003103726.1| CRE-ASP-4 protein [Caenorhabditis remanei]
gi|308259744|gb|EFP03697.1| CRE-ASP-4 protein [Caenorhabditis remanei]
Length = 462
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 133/270 (49%), Positives = 177/270 (65%), Gaps = 22/270 (8%)
Query: 67 LGDSDEDILPLKNFMD----------------AQYFGEIGIGSPPQNFSVIFDTGSSNLW 110
LG+ DE L+N+MD AQYFG I IG+P QNF+VIFDTGSSNLW
Sbjct: 80 LGEIDE---LLRNYMDVRAQRLCCLKSKIIFQAQYFGTISIGTPGQNFTVIFDTGSSNLW 136
Query: 111 VPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
VPS KC ++ I+C H RY S+ S+TY E G+ I YG+GS+ GF S+D+V V V +
Sbjct: 137 VPSKKCPFYDIACMLHHRYDSKSSSTYKEDGRKMAIQYGTGSMKGFISKDSVCVAGVCAE 196
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
+Q F EAT E +TF+ A+FDGI+G+ + EIAV PV++ + EQ V VFSFWLNR
Sbjct: 197 EQPFAEATSEPGITFVAAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPSNVFSFWLNR 256
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+PD++ GGEI FGG+DP+ + TY PVT+KGYWQF++ D ++G+ G C GC AI
Sbjct: 257 NPDSDLGGEITFGGIDPRRYVEPITYTPVTRKGYWQFKM-DKVVGSGVLG-CSNGCQAIA 314
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSL+AGP + I + IG E ++ E
Sbjct: 315 DTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 344
>gi|432870116|ref|XP_004071815.1| PREDICTED: cathepsin D-like [Oryzias latipes]
Length = 397
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 129/300 (43%), Positives = 186/300 (62%), Gaps = 12/300 (4%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------LKNFMDAQYFGEIG 90
R+ LH + R + M + + R+G D P L NFMDAQY+G I
Sbjct: 23 RVPLHKTRSLRRLMSDNGMSLDDLRALGMRVGSLDSSASPELPVERLTNFMDAQYYGLIS 82
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
IG+PPQNFSV+FDTGSSNLWVPS C F ++C+ H RY S+KS++Y + G I YG
Sbjct: 83 IGTPPQNFSVLFDTGSSNLWVPSIHCSFLDVACWVHRRYNSKKSSSYVKNGTEFSIRYGR 142
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GS+SGF SQD V V + V Q F EA ++ +TF +ARFDG++G+ + I+V + PV+
Sbjct: 143 GSLSGFISQDTVSVAGLSVPGQQFGEAVKQPGITFAVARFDGVLGMAYPSISVANVTPVF 202
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
D + L+ + +FS +++RD AE GGE++ GG+DP++F G YV VT+K YWQ ++
Sbjct: 203 DTAMAAKLLPQNIFSVYISRDTAAEVGGELILGGIDPQYFSGDLHYVNVTRKAYWQIQMD 262
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
+ +GNQ T +C+ GC +IVD+GTSL+ GP + ++ AIG ++ E CK + S
Sbjct: 263 RVDVGNQLT-LCKAGCQSIVDTGTSLMVGPAEEIRALHKAIGALPLLMGEYFIDCKKIPS 321
>gi|313219527|emb|CBY30450.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 190/297 (63%), Gaps = 14/297 (4%)
Query: 63 VRHR-LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
++H+ LGD + P+ N+MDAQY+G I IG+PPQ FSVIFDTGSSNLWVPS+KC F+ +
Sbjct: 51 LQHKFLGDGHSE--PITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNV 108
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C+ H +Y S+ S ++ G+ I YGSGS+SGF S D VEV V V+DQ F EA E
Sbjct: 109 ACFLHRKYDSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEP 168
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+GLG+ IAV P +NM+EQGL+S+ +FSF+LNR +AE+GGE+
Sbjct: 169 GITFVAAKFDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELT 228
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC---EGGCAAIVDSGTSLLA 297
GGVD F G ++ VT++ YWQ ++ + + + C E GC IVDSGTSLLA
Sbjct: 229 IGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLA 288
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL------LVSGLLPEKVCQQI 348
P + EINHAIG + E +V ++ D + D+ V L PE +I
Sbjct: 289 VPKNLAEEINHAIGAFQFANGEW-IVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKI 344
>gi|94732449|emb|CAK11131.1| cathepsin D [Danio rerio]
gi|94733132|emb|CAK05390.1| cathepsin D [Danio rerio]
gi|158253911|gb|AAI54316.1| Ctsd protein [Danio rerio]
Length = 398
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 202/326 (61%), Gaps = 16/326 (4%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+R F L V A +S+ + RI LKK R +L+ + + +E + + +++
Sbjct: 1 MRIAFLLLVAA----FFCTSDAIVRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLKY 53
Query: 66 RLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-IS 121
LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+
Sbjct: 54 NLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIA 113
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 114 CLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPG 173
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+ A+FDGI+G+ + I+V PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 174 VAFIAAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLL 233
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSLITGPAA 292
Query: 302 VVTEINHAIGG----EGVVSAECKLV 323
V + AIG +G +CK V
Sbjct: 293 EVKALQKAIGAIPLMQGEYMVDCKKV 318
>gi|344269496|ref|XP_003406588.1| PREDICTED: LOW QUALITY PROTEIN: napsin-A-like [Loxodonta africana]
Length = 396
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 182/295 (61%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L + D +LN+ R RK +S V GD +PL N+M+ QYFG
Sbjct: 26 LIRIPLHRVHPDPRTLNSPRAWRK----AAEHMSLVASSPGDKST-FVPLSNYMNVQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+ H R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFLSLPCWVHHRFNPNASSSFQPNGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G ++G S+D + +G + VF EA E SL F A FDGI+GLGF +AV
Sbjct: 141 YGTGRLTGILSEDKLTIGGIEGTSVVFGEALWEPSLVFTFAPFDGILGLGFPILAVDGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGLV + VFSF+LNRDP+A +GGE+V GG DP H+ ++PVT YWQ
Sbjct: 201 PPLDILVEQGLVDKPVFSFYLNRDPEAPDGGELVLGGSDPAHYIPPLNFMPVTIPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
+ + +G +C GCAAI+D+GTSL+ GP + +N AIGG +++ + +
Sbjct: 261 HMERVKVGT-GLNLCAQGCAAILDTGTSLITGPAEEIQALNSAIGGVALLTGQVR 314
>gi|301764903|ref|XP_002917936.1| PREDICTED: napsin-A-like [Ailuropoda melanoleuca]
Length = 406
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 195/314 (62%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDI-LPL 77
PA ++ L RI L++ +LN R G G V LG D+ I +PL
Sbjct: 19 PAGAS-LIRISLRRVYPGRGTLNPLR---------GWGRPAVPPSLGAPSPGDKPIFVPL 68
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTY 136
N+M+AQY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+ S+++
Sbjct: 69 SNYMNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKASSSF 128
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F A FDG++GLG
Sbjct: 129 HPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGVLGLG 188
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AVG P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++
Sbjct: 189 FPILAVGGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFL 248
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++
Sbjct: 249 PVTIPAYWQIHMERVNVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLL 307
Query: 317 SAECKLVVSQYGDL 330
E + S+ L
Sbjct: 308 VGEYLIQCSKIPTL 321
>gi|13637914|sp|P80209.2|CATD_BOVIN RecName: Full=Cathepsin D; Flags: Precursor
Length = 390
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 51 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 110
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 111 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 170
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 171 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 230
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 231 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 289
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 290 QKAIGAVPLIQGE 302
>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
Length = 398
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 117/244 (47%), Positives = 172/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKNFMDAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C F+ I+C H +Y KS+T
Sbjct: 70 LKNFMDAQYYGEIGLGTPAQTFTVVFDTGSSNLWVPSVHCSFTDIACLLHHKYNGAKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + +++QVF EA ++ + F+ A+FDGI+G+
Sbjct: 130 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGGLSIEEQVFGEAIKQPGVAFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V P +DN++ Q V + VFSF+LNR+P++E GGE++ GG DPK++ G Y
Sbjct: 190 AYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPESEPGGELLLGGTDPKYYSGDFQY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ V+++ YWQ + + +G+Q + +C+GGC AIVD+GTSL+ GPT V + AIG +
Sbjct: 250 LNVSRQAYWQVHMDGMGVGSQLS-LCKGGCEAIVDTGTSLITGPTAEVKALQKAIGATPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IQGE 312
>gi|440899428|gb|ELR50729.1| Cathepsin D, partial [Bos grunniens mutus]
Length = 394
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 55 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 114
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 115 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 174
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 175 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 234
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 235 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 293
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 294 QKAIGAVPLIQGE 306
>gi|21552717|gb|AAM62283.1|AF396662_1 cathepsin D preproprotein [Silurus asotus]
Length = 395
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 202/323 (62%), Gaps = 21/323 (6%)
Query: 17 SCLLLPA----SSNGLRRIGLKKRRLDLHSL-NAARITRKERYMGGAGVS----GVRHRL 67
+CLLL +++ L RI LKK R ++ ++ R + R G + + GV ++
Sbjct: 4 ACLLLLVFIAWTADALVRIPLKKFRSIRRTMSDSGRAVEESR--GNSQNTKYNLGVTNKF 61
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHS 126
G + E LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I+C H
Sbjct: 62 GPTPET---LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHH 118
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+Y KS+TY + G + I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++ + F+
Sbjct: 119 KYNGAKSSTYVKNGTAFAIQYGSGSLSGYLSQDVCSIGDIAVEKQIFGEAIKQPGVAFIA 178
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + IAV PV+D M+ Q + VFSF+LNR+PD + GGE++ GG DP
Sbjct: 179 AKFDGILGMAYPRIAVDGVPPVFD-MMSQKKFEKNVFSFYLNRNPDTQPGGELLLGGTDP 237
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K + G YV +T++ YWQ + + IG+Q + +C GGC AIVD+GTSL+ GP V +
Sbjct: 238 KFYTGDFHYVNITRQAYWQIHMDGMSIGSQLS-LCNGGCEAIVDTGTSLITGPAAEVKAL 296
Query: 307 NHAIGG----EGVVSAECKLVVS 325
AIG +G +CK V S
Sbjct: 297 QKAIGAIPLIQGEYMVDCKKVPS 319
>gi|432850601|ref|XP_004066828.1| PREDICTED: cathepsin D-like isoform 2 [Oryzias latipes]
Length = 398
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 195/312 (62%), Gaps = 10/312 (3%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY--FHSR 127
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACRECPPPS 121
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S KS+TY + G S I YGSGS+SG+ SQD +GD+ V++QVF EA ++ + F+ A
Sbjct: 122 YNSAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAA 181
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + I+V VPV+DN+++Q V VFSF+LNR+PD E GGE++ GG DPK
Sbjct: 182 KFDGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPK 241
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++ G YV ++++ YWQ + + +G+Q + +C+GGC AIVD+GTSLL GP+ V +
Sbjct: 242 YYSGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIVDTGTSLLTGPSAEVKALQ 300
Query: 308 HAIGGEGVVSAE 319
AIG ++ E
Sbjct: 301 KAIGAIPLIQGE 312
>gi|226476812|emb|CAX72322.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 190/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS+ C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSTHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|281348334|gb|EFB23918.1| hypothetical protein PANDA_006240 [Ailuropoda melanoleuca]
Length = 379
Score = 257 bits (657), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 190/306 (62%), Gaps = 15/306 (4%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG---DSDEDI-LPLKNFMDAQY 85
RI L++ +LN R G G V LG D+ I +PL N+M+AQY
Sbjct: 1 RISLRRVYPGRGTLNPLR---------GWGRPAVPPSLGAPSPGDKPIFVPLSNYMNAQY 51
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+ S+++ G
Sbjct: 52 YGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKASSSFHPNGTKFA 111
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G + G S+D + +G + +F EA E SL F A FDG++GLGF +AVG
Sbjct: 112 IQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGVLGLGFPILAVGG 171
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T++PVT YW
Sbjct: 172 VRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYW 231
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
Q + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG ++ E +
Sbjct: 232 QIHMERVNVGTGLT-LCAQGCAAILDTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQC 290
Query: 325 SQYGDL 330
S+ L
Sbjct: 291 SKIPTL 296
>gi|313226363|emb|CBY21507.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 189/297 (63%), Gaps = 14/297 (4%)
Query: 63 VRHR-LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
++H+ LGD + P+ N+MDAQY+G I IG+PPQ FSVIFDTGSSNLWVPS+KC F+ +
Sbjct: 51 LQHKFLGDGHSE--PITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNV 108
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y S+ S ++ G+ I YGSGS+SGF S D VEV V V+DQ F EA E
Sbjct: 109 ACLLHRKYDSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEP 168
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+TF+ A+FDGI+GLG+ IAV P +NM+EQGL+S+ +FSF+LNR +AE+GGE+
Sbjct: 169 GITFVAAKFDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELT 228
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC---EGGCAAIVDSGTSLLA 297
GGVD F G ++ VT++ YWQ ++ + + + C E GC IVDSGTSLLA
Sbjct: 229 IGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLA 288
Query: 298 GPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDL------LVSGLLPEKVCQQI 348
P + EINHAIG + E +V ++ D + D+ V L PE +I
Sbjct: 289 VPKNLAEEINHAIGAFQFANGEW-IVPCRHMDTMPDIDFTLNGKVYTLTPEDYVMKI 344
>gi|226476838|emb|CAX72335.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLKNVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|196123668|gb|ACG70181.1| cathepsin D-like protein [Homarus americanus]
Length = 386
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 140/298 (46%), Positives = 183/298 (61%), Gaps = 21/298 (7%)
Query: 28 LRRIGLKK--RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI LKK + L L R+ RY G+ D++ L N+ DAQY
Sbjct: 18 LHRIPLKKIEKSRTLQDLRRTRVFLNHRYGVGS--------------DVIDLDNYEDAQY 63
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCE 144
+G I IG+P Q F VIFDTGSSNLW+PS KC+ +++ H+RY S KS+TY E G + +
Sbjct: 64 YGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLARRLHNRYDSTKSSTYIENGTAFD 123
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YGSG++ GF S DNVE+G V Q F EAT+E L F++ + DGI+G+ F EI+V
Sbjct: 124 IQYGSGALHGFLSSDNVEMGGVNAMGQTFAEATQEPGLAFIMGKLDGILGMAFTEISVMG 183
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKK 261
V+D MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+K
Sbjct: 184 IPTVFDTMVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKV 243
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GYWQ I +G+ TG C C AIVD+GTSL+AGP V EI H +GG G ++ E
Sbjct: 244 GYWQVTAEAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVKEIVHMLGGYGFIAGE 300
>gi|432099182|gb|ELK28547.1| Cathepsin D [Myotis davidii]
Length = 351
Score = 257 bits (656), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 125/248 (50%), Positives = 174/248 (70%), Gaps = 11/248 (4%)
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
+AQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY E G
Sbjct: 34 EAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTYVENG 93
Query: 141 KSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLLARFDG 191
+ +I+YGSGS+SG+ SQD V V G V V+ QVF EAT++ +TF+ A+FDG
Sbjct: 94 TTFDIHYGSGSLSGYLSQDTVSVPCNSGLASLGGVKVERQVFGEATKQPGITFIAAKFDG 153
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+ + I+V + VPV+DN+++Q LV + +FSF+LNRDP A+ GGE++ GG D K++KG
Sbjct: 154 ILGMAYPRISVNNVVPVFDNLMQQKLVEKNIFSFYLNRDPSAQPGGELMLGGTDSKYYKG 213
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y+ VT+K YWQ + + +GN T +C+ GC AIVD+GTSL+ GP V E+ AIG
Sbjct: 214 PIAYLNVTRKAYWQVHMDQVDVGNGLT-LCKEGCEAIVDTGTSLMVGPVDEVRELQKAIG 272
Query: 312 GEGVVSAE 319
++ E
Sbjct: 273 AVPLIQGE 280
>gi|149757990|ref|XP_001490885.1| PREDICTED: napsin-A [Equus caballus]
Length = 401
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 132/292 (45%), Positives = 185/292 (63%), Gaps = 14/292 (4%)
Query: 47 RITRKERYMG--------GAGVSGVRHRLG---DSDEDI-LPLKNFMDAQYFGEIGIGSP 94
RI + Y G G G R+G D+ I +PL ++M+AQY+GEIG+G+P
Sbjct: 23 RIPLRRVYTGRGVLNPLRGWGKPAKPPRMGAPSPGDKPIFVPLSDYMNAQYYGEIGLGTP 82
Query: 95 PQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PQNFSV+FDTGSSNLWVPS +C +FS+ C+FH R+ + S+++ G I YG+G ++
Sbjct: 83 PQNFSVLFDTGSSNLWVPSVRCHFFSLPCWFHHRFNPKASSSFKPNGTKFAIQYGTGRLN 142
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
G S+D + +G + VF EA E SL F +A FDGI+GLGF +AV P D +V
Sbjct: 143 GILSEDKLTIGGITGASVVFGEALSEPSLIFTIAHFDGILGLGFPILAVEGVRPPLDTLV 202
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
+QGL+ + VFSF+LNRDP+A +GGE+V GG DP H+ T+VPVT YWQ + + +
Sbjct: 203 DQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPSHYIPPLTFVPVTIPAYWQIHMKRVKV 262
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E L S
Sbjct: 263 GTGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYLLQCS 313
>gi|2347147|gb|AAC37302.1| aspartic proteinase precursor [Schistosoma japonicum]
gi|226476814|emb|CAX72323.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476816|emb|CAX72324.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476820|emb|CAX72326.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476822|emb|CAX72327.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476824|emb|CAX72328.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476826|emb|CAX72329.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476834|emb|CAX72333.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476836|emb|CAX72334.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476840|emb|CAX72336.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476842|emb|CAX72337.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476844|emb|CAX72338.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476846|emb|CAX72339.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476852|emb|CAX72342.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476880|emb|CAX72318.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476882|emb|CAX72317.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476886|emb|CAX72315.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476890|emb|CAX72313.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476892|emb|CAX72312.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476894|emb|CAX72311.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476896|emb|CAX72310.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476898|emb|CAX72309.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476900|emb|CAX72308.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226482870|emb|CAX79402.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476888|emb|CAX72314.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476904|emb|CAX72306.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476856|emb|CAX72344.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476902|emb|CAX72307.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|189502972|gb|ACE06867.1| unknown [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476818|emb|CAX72325.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476810|emb|CAX72321.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|226476854|emb|CAX72343.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 435
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 23 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 82
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 83 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 142
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 143 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 202
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 203 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 262
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 263 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 313
>gi|226476830|emb|CAX72331.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|2102722|gb|AAB63357.1| aspartic protease precursor, partial [Schistosoma japonicum]
Length = 428
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 16 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 76 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 135
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 136 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 195
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 196 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 255
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 256 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 306
>gi|299522|gb|AAB26186.1| cathepsin D {EC 3.4.23.5} [cattle, Peptide Partial, 346 aa]
Length = 346
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 176/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 66
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 67 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 126
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 127 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 186
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 187 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 245
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 246 QKAIGAVPLIQGE 258
>gi|226476832|emb|CAX72332.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/293 (44%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|296417651|ref|XP_002838466.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634405|emb|CAZ82657.1| unnamed protein product [Tuber melanosporum]
Length = 396
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 195/327 (59%), Gaps = 31/327 (9%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVR----- 64
+ A+ LL ++ G+ R LKK +L H +N ++YMG +R
Sbjct: 6 IFAAGSLLGSAMAGVHRAPLKKVPLTEQLSHHDINTQMRALGQKYMG------IRPEKID 59
Query: 65 ------HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+ D +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C
Sbjct: 60 EEMFKTQEIKTDDGHPVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCG- 118
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+CY HS+Y S S+TY G S EI YGSGS+SGF SQDN+E+G++ +KDQ F EAT
Sbjct: 119 SIACYLHSKYDSSTSSTYRPNGTSFEIRYGSGSLSGFVSQDNIEIGNLKIKDQTFAEATS 178
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F RFDGI+GLG+ I+V VP + MV+QGL+ E VF+F+L D ++ E
Sbjct: 179 EPGLAFAFGRFDGILGLGYDSISVNHIVPPFYQMVDQGLLDEPVFAFYLG---DKDDQSE 235
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGG+D H++GK +PV +K YW+ E I G +ST E AIVD+GTSL+A
Sbjct: 236 AIFGGIDKAHYQGKLIKLPVRRKAYWEVEFEAITFG-KSTAQFE-NTGAIVDTGTSLIAL 293
Query: 299 PTPVVTEINHAIGGE----GVVSAECK 321
P+ + +N IG + G S EC+
Sbjct: 294 PSTLAELLNKEIGAKKGFNGQYSVECE 320
>gi|185132376|ref|NP_001118183.1| cathepsin D precursor [Oncorhynchus mykiss]
gi|1858020|gb|AAC60301.1| cathepsin D [Oncorhynchus mykiss]
Length = 398
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 117/244 (47%), Positives = 170/244 (69%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
LKNFMDAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C F+ I+C H +Y KS+T
Sbjct: 70 LKNFMDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSFTDIACLLHHKYNGAKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + I YGSGS+SG+ SQD +G + ++DQ F EA ++ + F+ A+FDGI+G+
Sbjct: 130 YVKNGTAFAIQYGSGSLSGYLSQDTCTIGGLSIEDQGFGEAIKQPGVAFIAAKFDGILGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V P +DN++ Q V + VFSF+LNR+PD+E GGE++ GG DPK++ G Y
Sbjct: 190 AYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPDSEPGGELLLGGTDPKYYSGDFQY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ V+++ YWQ + + +G+Q + +C+GGC AIVD+GTSL+ GP V + AIG +
Sbjct: 250 LDVSRQAYWQIHMDGMGVGSQLS-LCKGGCEAIVDTGTSLITGPAAEVKALQRAIGATPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IQGE 312
>gi|18203300|sp|Q9MZS8.1|CATD_SHEEP RecName: Full=Cathepsin D; Flags: Precursor
gi|8886526|gb|AAF80494.1|AF164143_1 cathepsin D [Ovis aries]
Length = 365
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/253 (48%), Positives = 175/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 46 LTNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWVHHKYNSDKSST 105
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 106 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 165
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN++ Q LV + VFSF+LNRDP A+ G E++ GG D
Sbjct: 166 AKFDGILGMAYPRISVNNVLPVFDNLMRQKLVDKNVFSFFLNRDPKAQPGEELMLGGTDS 225
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G TY VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 226 KYYRGSLTYHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLMVGPVDEVREL 284
Query: 307 NHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 285 HKAIGAVPLIQGE 297
>gi|115279794|gb|ABI85390.1| cathepsin D [Hippoglossus hippoglossus]
Length = 399
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 189/298 (63%), Gaps = 10/298 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLK---NFMDAQYFGEIGIG 92
R+ LH + R + M + + G SD + LP++ NFMDAQY+GEIGIG
Sbjct: 27 RVPLHKTRSLRRLMTDNGMSLQELQALASSTGASDSVLSLPVERPTNFMDAQYYGEIGIG 86
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQ F+V+FDTGSSNLW+PS C F+++C+ H RY S+KS+TY + G I YG GS
Sbjct: 87 TPPQPFTVLFDTGSSNLWIPSIHCNLFNVACWLHHRYNSKKSSTYVKNGTEFSIQYGRGS 146
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
++G+ S+D V + + V Q F EA ++ +TF +ARFDG++G+G+ I+V PV+D+
Sbjct: 147 LTGYISEDTVSLAGLSVPGQQFAEAVKQPGITFAVARFDGVLGMGYPSISVDKVKPVFDS 206
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
+ L+ + VFSF+++RD A GGE++ GG DP+++ G YV VT+K YWQ ++ +
Sbjct: 207 AMAAKLLPQNVFSFYISRDASATVGGELILGGTDPQYYTGDLHYVNVTRKAYWQIKMDGV 266
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
+G Q T +C+ GC AIVD+GTSL+ GP V ++ AIG ++ E CK + S
Sbjct: 267 EVGTQLT-LCKAGCQAIVDTGTSLIVGPREEVRALHRAIGALPLIMGEYLIDCKKIPS 323
>gi|226476906|emb|CAX72305.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I +G+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITVGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|74198157|dbj|BAE35255.1| unnamed protein product [Mus musculus]
Length = 335
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 121/248 (48%), Positives = 176/248 (70%), Gaps = 11/248 (4%)
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+TY + G
Sbjct: 1 DAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNG 60
Query: 141 KSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLLARFDG 191
S +I+YGSGS+SG+ SQD V V + V+ Q+F EAT++ + F+ A+FDG
Sbjct: 61 TSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDG 120
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D K++ G
Sbjct: 121 ILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHG 180
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ +Y+ VT+K YWQ + + +GN+ T +C+GGC AIVD+GTSLL GP V E+ AIG
Sbjct: 181 ELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIG 239
Query: 312 GEGVVSAE 319
++ E
Sbjct: 240 AMPLIQGE 247
>gi|66815097|ref|XP_641645.1| cathepsin D [Dictyostelium discoideum AX4]
gi|74960832|sp|O76856.1|CATD_DICDI RecName: Full=Cathepsin D; AltName: Full=Ddp44; Flags: Precursor
gi|3288145|emb|CAA76563.1| preprocathepsin D [Dictyostelium discoideum]
gi|6010025|emb|CAB57223.1| cathepsin D [Dictyostelium discoideum]
gi|60469656|gb|EAL67644.1| cathepsin D [Dictyostelium discoideum AX4]
Length = 383
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 169/255 (66%), Gaps = 8/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCYFHSRYKSRKS 133
+P+ +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC ++ +C H++Y S S
Sbjct: 53 IPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGAS 112
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G I YGSG++SGF SQD+V VG + VKDQ+F EAT E + F A+FDGI+
Sbjct: 113 STYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGIL 172
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GL F+ I+V PV+ NM+ QGLVS +FSFWL+R P A GGE+ FG +D + G
Sbjct: 173 GLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGA-NGGELSFGSIDNTKYTGDI 231
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-- 311
TYVP+T + YW+F + D I QS G C C AI DSGTSL+AGP +T +N +G
Sbjct: 232 TYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGAV 291
Query: 312 ---GEGVVSAECKLV 323
GEGV S +C ++
Sbjct: 292 ILNGEGVFS-DCSVI 305
>gi|226476876|emb|CAX72320.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 189/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q + EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTYGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|209154266|gb|ACI33365.1| Cathepsin D precursor [Salmo salar]
Length = 402
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 200/326 (61%), Gaps = 13/326 (3%)
Query: 11 CLWVL-ASCLLLPASSNGLRRIGLKKRR-----LDLHSLNAARITRKERYMGGAGVSGVR 64
CL +L + LL A S+ + RI L K R + + ++ ++ + GAG + V
Sbjct: 3 CLKILYITIALLIAHSSAIIRIPLHKTRSMRRLMSDNGMSFEQLQDMAKTGCGAG-ANVP 61
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCY 123
+ L NFMDAQY+G I IG+PPQ+F+V+FDTGSSNLWVPS C F ++C+
Sbjct: 62 INAPSPKVPVERLTNFMDAQYYGVISIGTPPQDFTVLFDTGSSNLWVPSIHCSFLDVACW 121
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H RY S+KS+TY + G I YG GS+SGF S D V + + V Q F EA ++ +T
Sbjct: 122 LHHRYNSKKSSTYVQNGTKFSIQYGRGSLSGFISGDTVSLAGMQVTGQQFGEAVKQPGIT 181
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F +ARFDG++G+G+ I+V + PV+D + L+ + +FSF+++RDP A GGE++ GG
Sbjct: 182 FAVARFDGVLGMGYPTISVNNITPVFDTAMAAKLLPQNIFSFYISRDPLAAVGGELMLGG 241
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP ++ G YV VT+K YWQ E+ ++ +GNQ T +C+ GC AIVD+GTSL+ GP V
Sbjct: 242 TDPLYYTGDLHYVNVTRKAYWQIEMSNVEVGNQLT-LCKAGCQAIVDTGTSLIIGPAEEV 300
Query: 304 TEINHAIGGEGVVSAE----CKLVVS 325
++ AIG ++ E CK V S
Sbjct: 301 RVLHKAIGALPLLMGEYWIDCKKVPS 326
>gi|307203870|gb|EFN82801.1| Lysosomal aspartic protease [Harpegnathos saltator]
Length = 374
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/292 (48%), Positives = 188/292 (64%), Gaps = 11/292 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH + R +E G + VR G + PL N++DAQY+G I IG+PPQ
Sbjct: 11 RIQLHKTESIRRILQEV---GTDLHQVR-LYGVTTPTPEPLSNYLDAQYYGVITIGTPPQ 66
Query: 97 NFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F VIFDTGSSNLWVPS KC + I+C H +Y SRKS+TY + G I YGSGS+SGF
Sbjct: 67 EFRVIFDTGSSNLWVPSKKCSITNIACLLHHKYDSRKSSTYQKNGTEFAIRYGSGSLSGF 126
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V +G + V+ Q F EA +E L F+ A+FDGI+G+G+ IAV PV+ NMV+Q
Sbjct: 127 LSSDVVNIGGLNVQGQTFAEAVKEPGLVFVAAKFDGILGMGYSTIAVDGVTPVFYNMVKQ 186
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
LV + VFSF+LNRDPDA+ GGE++ GG D H++G+ TYVPV++KGYWQF + I +
Sbjct: 187 DLVPKAVFSFYLNRDPDAKVGGEMLLGGSDSDHYEGEFTYVPVSRKGYWQFAMDSIQVHG 246
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
+ +C GC AI D+GTSL+AGP V IN IG +++ E C L+
Sbjct: 247 HT--LCASGCQAIADTGTSLIAGPVEEVAVINSLIGATTIIAGEAIVDCDLI 296
>gi|256072903|ref|XP_002572773.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043053|emb|CCD78465.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 430
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 188/292 (64%), Gaps = 8/292 (2%)
Query: 35 KRRLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
+ R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+
Sbjct: 17 RPRIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGT 76
Query: 94 PPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+
Sbjct: 77 PPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSL 136
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
SGF S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM
Sbjct: 137 SGFLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNM 196
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
++QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ +
Sbjct: 197 IQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLT 256
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
I + + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 257 ISDMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 306
>gi|320163747|gb|EFW40646.1| cathepsin D [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 120/227 (52%), Positives = 154/227 (67%), Gaps = 3/227 (1%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I P N+ DAQY+G+I IG+P Q F+V+FDTGS+NLWVPS KC + I+C H++Y S K
Sbjct: 51 IEPQHNYQDAQYYGDITIGTPGQKFTVVFDTGSANLWVPSKKCPVTDIACQLHNKYDSTK 110
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G S I YGSG +SGF S D+V + V Q F EAT E L+F+ A+FDGI
Sbjct: 111 SSTYKVNGTSFAIQYGSGKLSGFLSTDSVSFAGLTVTGQTFAEATAEPGLSFVAAKFDGI 170
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +IAV PVW+N + QG+ + +F FWLNRDP A +GGEI FG +D H+ G
Sbjct: 171 LGLGFPQIAVDGVTPVWNNAILQGVAAAPLFGFWLNRDPTAADGGEIDFGAIDDSHYTGP 230
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
Y PVT++GYWQF LG + + ++ C GC AI DSGTSLL GP
Sbjct: 231 ILYTPVTRQGYWQFALGAVTVSGKN--YCASGCQAIADSGTSLLVGP 275
>gi|24417300|gb|AAN60260.1| unknown [Arabidopsis thaliana]
Length = 168
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 118/168 (70%), Positives = 141/168 (83%)
Query: 157 SQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQG 216
S D V VGD+VVKDQ F+EAT+E +TF++A+ DGI+GLGF+EI+VG A PVW NM++QG
Sbjct: 1 SNDAVTVGDLVVKDQEFMEATKELGITFVVAKXDGILGLGFQEISVGKAAPVWYNMLKQG 60
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
L+ E VFSFWLNR+ D EEGGE+VFGGVDP HFKGKHTYVPVT+KGYWQF++GD+LIG
Sbjct: 61 LIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGA 120
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
TG CE GC+AI DSGTSLLAGPT ++T INHAIG GVVS +CK VV
Sbjct: 121 PTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVV 168
>gi|6978973|dbj|BAA90785.1| aspartic proteinase family member similar to renin [Mus musculus]
Length = 419
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 165/243 (67%), Gaps = 2/243 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D+MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDSMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGG 312
IGG
Sbjct: 298 IGG 300
>gi|74199699|dbj|BAE41511.1| unnamed protein product [Mus musculus]
Length = 419
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 164/243 (67%), Gaps = 2/243 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFATFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGG 312
IGG
Sbjct: 298 IGG 300
>gi|1778026|gb|AAB63442.1| aspartic proteinase [Schistosoma mansoni]
Length = 427
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 187/290 (64%), Gaps = 8/290 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+PP
Sbjct: 16 RIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+SG
Sbjct: 76 QTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSLSG 135
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM++
Sbjct: 136 FLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNMIQ 195
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ + I
Sbjct: 196 QGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTIS 255
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
+ + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 256 DMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 303
>gi|256072901|ref|XP_002572772.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043052|emb|CCD78464.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 428
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 187/290 (64%), Gaps = 8/290 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH L +A+ T E V V R+ D LKN++DAQY+G+I IG+PP
Sbjct: 17 RIPLHPLKSAQRTLIEFETSLEIVKKVWLSRVSGVDPQPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS+TY G ++YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSSTYIPNGTEFSVHYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + I+V PV+ NM++
Sbjct: 137 FLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGMAYPSISVDGVTPVFVNMIQ 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ A GGE++ GG+D K++ G+ YV +T++ YW F++ + I
Sbjct: 197 QGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAEC 320
+ + C GC AI D+GTS++AGPT + +IN +G G+ + C
Sbjct: 257 DMT--ACPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSC 304
>gi|12697815|dbj|BAB21620.1| cathepsin D [Bos taurus]
Length = 386
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 175/253 (69%), Gaps = 11/253 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 47 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 106
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 107 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 166
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+F GI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 167 AKFGGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 226
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 227 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 285
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 286 QKAIGAVPLIQGE 298
>gi|334562337|gb|AEG79714.1| cathepsin D [Apostichopus japonicus]
Length = 372
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 196/304 (64%), Gaps = 8/304 (2%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
LLLP +S L+RI L K L R ++ + G+G+ ++ + + + LK
Sbjct: 9 LLLPIAS-ALQRIPLFKVESARQRLIRTRSSKSDLEAIGSGL-----QVKEVNGSPIILK 62
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYT 137
+++DAQY+G I +G+PPQ+F V+FDTGSSNLWVPSS C + I+C F +Y S+TY
Sbjct: 63 DYLDAQYYGPITLGTPPQDFVVVFDTGSSNLWVPSSTCSWKDIACSFTKKYDHSVSSTYV 122
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ I YGSG+ +GF S D + +G+V VK Q+F EAT E L++++A+FDGI+G+G+
Sbjct: 123 ANDTAFAIPYGSGNCAGFLSYDTLMMGNVAVKSQLFGEATAEPGLSWIMAQFDGILGMGY 182
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V +P +DN++ + L+S +FSF+L++DP A GGE++ GG D K++ G TYV
Sbjct: 183 PTISVDGVIPPFDNIMNRKLISNNIFSFYLSKDPSAAVGGELLLGGTDSKYYTGNFTYVK 242
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
V+KKGYWQF + + IG + G C G C+AI D+GTSL+AGPT + ++N IG ++
Sbjct: 243 VSKKGYWQFAMDKVSIGGKDAGYCTGKNCSAICDTGTSLIAGPTADINDLNKKIGAIPLI 302
Query: 317 SAEC 320
E
Sbjct: 303 KGEA 306
>gi|226476848|emb|CAX72340.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 188/293 (64%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY+G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYYGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FSV+FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
Q +V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QRVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|4927648|gb|AAD33219.1| cathepsin D [Hynobius leechii]
Length = 397
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 132/295 (44%), Positives = 183/295 (62%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP--LKNFMDAQY 85
+ RI L K R H+L A K A V++ + P LKN++DAQY
Sbjct: 20 MVRIPLTKFRSIRHTLTEAGGDIKNLV---ATSDQVKYNCFPKTQQPTPEILKNYLDAQY 76
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEI IG+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S S+TY + G
Sbjct: 77 YGEICIGTPPQCFTVVFDTGSSNLWVPSVHCSLLDIACLVHPKYDSSSSSTYVKNGTEFS 136
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+GS+SG+ QD V VG + V QVF EA ++ + F+ A+FDGI+G+ + I+V
Sbjct: 137 IQYGTGSLSGYLRQDTVSVGGLGVLKQVFGEAIKQPGVAFIAAKFDGILGMAYPRISVDG 196
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
V+DN++ Q LV + VFSF+LNR+PD GGE++ GG DP ++ G TY+ VT K YW
Sbjct: 197 VTTVFDNIMSQKLVEKNVFSFYLNRNPDTRPGGELLLGGTDPNYYTGDFTYLNVTPKAYW 256
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
Q + + +G+Q T +C+GGC AIVD+GTSL+ GP+ VT + AIG ++ E
Sbjct: 257 QIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLIIGPSAEVTALQKAIGAIPLIQGE 310
>gi|449666857|ref|XP_002161366.2| PREDICTED: lysosomal aspartic protease-like [Hydra magnipapillata]
Length = 387
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 125/259 (48%), Positives = 176/259 (67%), Gaps = 5/259 (1%)
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
G + + G+S E L+N+MDAQY+G+I +G+PPQ F V+FDTGSSNLWVPSS C + I
Sbjct: 47 GFQSKWGESPE---VLRNYMDAQYYGDISLGTPPQPFKVVFDTGSSNLWVPSSHCGWTDI 103
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H++Y KS+TY + G I YGSGS SG+ S D ++V D+ VK+Q+F EAT E
Sbjct: 104 ACLTHNKYHGDKSSTYVQNGTKFSIQYGSGSCSGYQSIDTLQVADISVKNQMFGEATSEP 163
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+ A+FDG++G+G+ +I+V VP + NMV+Q LV + VFSF+L+R+ + GGE++
Sbjct: 164 GIAFVAAKFDGLLGMGYSQISVNGVVPPFYNMVDQKLVEDAVFSFYLDRNVNDSTGGELL 223
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GGVD F G TY PVT +GYWQF++ +++ N C GC AI D+GTSL+AGPT
Sbjct: 224 LGGVDSSKFVGDITYTPVTVEGYWQFKMDKVVV-NGEPMFCASGCNAIADTGTSLIAGPT 282
Query: 301 PVVTEINHAIGGEGVVSAE 319
V ++N IG +V E
Sbjct: 283 EEVNKLNQMIGATPIVGGE 301
>gi|12832561|dbj|BAB22158.1| unnamed protein product [Mus musculus]
Length = 419
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 164/243 (67%), Gaps = 2/243 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGG 312
IGG
Sbjct: 298 IGG 300
>gi|318977821|ref|NP_001187407.1| cathepsin D precursor [Ictalurus punctatus]
gi|308322929|gb|ADO28602.1| cathepsin D [Ictalurus punctatus]
Length = 398
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 115/244 (47%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L NFMDAQY+G I IG+PPQ F+V+FDTGSSNLWVPS C +F ++C+ H RY S+KS+T
Sbjct: 70 LSNFMDAQYYGVISIGTPPQEFTVLFDTGSSNLWVPSIHCAFFDLACWLHHRYDSKKSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG GS+SGFFSQD V + + V++Q+F EA ++ + F LA+FDG++G+
Sbjct: 130 YVQNGTQFSIQYGRGSLSGFFSQDTVTLAGLGVQNQMFAEAVKQPGVVFALAKFDGVLGM 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ ++VG P++D+++ L+ + +FSF++NRDP AE GGE++ GG D ++F G Y
Sbjct: 190 AYPILSVGKVRPIFDSIMAGKLLQQNIFSFYINRDPKAEVGGELMLGGCDKQYFDGDLHY 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ ++ + +G+ T +C+ GC AIVDSGTS++ GP + +N AIG +
Sbjct: 250 LNVTRKAYWQIKMDTVEVGSTLT-LCKDGCQAIVDSGTSMITGPVEEIRALNKAIGAVPL 308
Query: 316 VSAE 319
+ E
Sbjct: 309 IMGE 312
>gi|6680552|ref|NP_032463.1| napsin-A precursor [Mus musculus]
gi|6016430|sp|O09043.1|NAPSA_MOUSE RecName: Full=Napsin-A; AltName: Full=KDAP-1; AltName:
Full=Kidney-derived aspartic protease-like protein;
Short=KAP; Flags: Precursor
gi|1906810|dbj|BAA19004.1| kidney-derived aspartic protease-like protein [Mus musculus]
gi|7340352|emb|CAB82907.1| Napsin [Mus musculus]
gi|15928694|gb|AAH14813.1| Napsin A aspartic peptidase [Mus musculus]
gi|74220342|dbj|BAE31398.1| unnamed protein product [Mus musculus]
Length = 419
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 164/243 (67%), Gaps = 2/243 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 59 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 118
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 119 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 179 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 239 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 297
Query: 310 IGG 312
IGG
Sbjct: 298 IGG 300
>gi|195997419|ref|XP_002108578.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589354|gb|EDV29376.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 383
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 197/325 (60%), Gaps = 21/325 (6%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
+RS+ L VLA L A+ L+RI L K + +L A IT + M A S
Sbjct: 1 MRSI--LLVLALVLSCAAA---LQRIKLYKMKTIRQTLLDAGITAE---MLKAKYSKFSA 52
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
GD L N++DAQY+G I IG+PPQNF ++FDTGSS+LWVPS+KC + +C H
Sbjct: 53 SRGDES-----LSNYLDAQYYGPITIGTPPQNFKILFDTGSSDLWVPSTKCNGNAACESH 107
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+Y KS+TY G+ I YGSG+ SGF S+D V V + V++Q F EA E L+F+
Sbjct: 108 DKYDHTKSSTYVSNGQQWSIQYGSGAASGFLSEDVVTVAGISVRNQTFGEAVGEPGLSFV 167
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+G+++++ PV+ NMV+QGLV + VFSF+LNR GGE++ GG D
Sbjct: 168 AAKFDGILGMGYKQLSAERTNPVFVNMVQQGLVRKPVFSFYLNRKQGGAVGGELILGGSD 227
Query: 246 PKHFKGKHTYVPVTKKGYWQFEL--GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
P ++ G+ YVP++++ YWQF + G + G T VC GGC AI D+GT+L+ GP V
Sbjct: 228 PNYYSGQFNYVPLSRESYWQFAMDGGKVATG---TTVCNGGCQAIADTGTTLIVGPPEDV 284
Query: 304 TEINHAIGGE---GVVSAECKLVVS 325
I AIG + G + +C + S
Sbjct: 285 QRIQQAIGAQNAGGQYTVDCSTISS 309
>gi|148690790|gb|EDL22737.1| napsin A aspartic peptidase, isoform CRA_a [Mus musculus]
Length = 393
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 167/250 (66%), Gaps = 2/250 (0%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
+ +PL FM+ QYFG IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+
Sbjct: 34 NPSFVPLSKFMNTQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFN 93
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+ S+++ G I YG+G +SG SQDN+ +G + F EA E SL F LA F
Sbjct: 94 PKASSSFRPNGTKFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHF 153
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF +AVG P D MVEQGL+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 154 DGILGLGFPTLAVGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHY 213
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
T++PVT YWQ + + +G +C GC+AI+D+GTSL+ GP+ + +N A
Sbjct: 214 VPPLTFIPVTIPAYWQVHMESVKVGT-GLSLCAQGCSAILDTGTSLITGPSEEIRALNKA 272
Query: 310 IGGEGVVSAE 319
IGG ++ +
Sbjct: 273 IGGYPFLNGQ 282
>gi|328869722|gb|EGG18099.1| cathepsin D [Dictyostelium fasciculatum]
Length = 476
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 191/318 (60%), Gaps = 28/318 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
V+A ++P N + R L++ +L +K+ + AG +
Sbjct: 103 VVAQAYVVPLGFNKVTRQALRRIPQNL---------QKKYMLAAAGTT------------ 141
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
+PL +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC + I+C H++Y S K
Sbjct: 142 -IPLSDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCPITVIACDLHNKYDSTK 200
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ + G I YGSG++SGF S+D V+VG + VK+Q+F EAT E + F A+FDGI
Sbjct: 201 SSSFVQNGTDFSIQYGSGAMSGFVSEDTVQVGSLSVKNQLFAEATAEPGIAFDFAKFDGI 260
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL F+ I+V + PV+ NM++QGLV++ +F+FWL++ GGE+ FG +D F G
Sbjct: 261 LGLAFQSISVNNIPPVFYNMMDQGLVAQPLFAFWLSKTASPTNGGELSFGSIDNSKFTGA 320
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
TYVP+T + YW+F + D+ S G C + GC AI DSGTSLLAGPT + IN +G
Sbjct: 321 ITYVPLTNRTYWEFSMDDVQYDGNSLGYCGKTGCRAIADSGTSLLAGPTEQIEAINTKLG 380
Query: 312 GEGV----VSAECKLVVS 325
V + C ++ S
Sbjct: 381 AVSVNGEAIFPSCNVISS 398
>gi|74220823|dbj|BAE31380.1| unnamed protein product [Mus musculus]
Length = 404
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 178/253 (70%), Gaps = 17/253 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN++DAQY+G+IGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVG---------DVVVKDQVFIEATREGSLTFLL 186
Y + G S +I+YGSGS+S + SQD V V + V+ Q+F EAT++ + F+
Sbjct: 131 YVKNGTSFDIHYGSGSLSRYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+G+ I+V + +PV+DN+++Q LV + +FSF+LNRDP+ + GGE++ GG D
Sbjct: 191 AKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K++ G+ +Y+ VT+K YW + +GN+ T +C+GGC AIVD+GTSLL GP V E+
Sbjct: 251 KYYHGELSYLNVTRKAYW------LEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKEL 303
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 304 QKAIGAVPLIQGE 316
>gi|432850603|ref|XP_004066829.1| PREDICTED: cathepsin D-like isoform 3 [Oryzias latipes]
Length = 416
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 196/330 (59%), Gaps = 28/330 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG-DSDE 72
VL L S L RI LKK R L + +E + A +++ LG S
Sbjct: 5 VLCVIAALALSGEALIRIPLKKFRSIRRELTD---SGREAHELLADKHSLKYNLGFPSSN 61
Query: 73 DILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYK 129
P LKN++DAQY+GEI +G+PPQ F+V+FDTGSSNLWVPS C I+C +Y
Sbjct: 62 GPTPETLKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYN 121
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV--------------------GDVVVK 169
S KS+TY + G S I YGSGS+SG+ SQD V GD+ V+
Sbjct: 122 SAKSSTYVKNGTSFSIQYGSGSLSGYLSQDTCTVSVGGAVTPPTTHSVETAKAIGDISVE 181
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
+QVF EA ++ + F+ A+FDGI+G+ + I+V VPV+DN+++Q V VFSF+LNR
Sbjct: 182 NQVFGEAIKQPGVAFIAAKFDGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNR 241
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+PD E GGE++ GG DPK++ G YV ++++ YWQ + + +G+Q + +C+GGC AIV
Sbjct: 242 NPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGMAVGSQLS-LCKGGCEAIV 300
Query: 290 DSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
D+GTSLL GP+ V + AIG ++ E
Sbjct: 301 DTGTSLLTGPSAEVKALQKAIGAIPLIQGE 330
>gi|315274255|gb|ADU03675.1| putative cathepsin D3 [Ixodes ricinus]
Length = 398
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/238 (50%), Positives = 162/238 (68%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQ F V+FDTGSSNLWVPS +C + +I+C H +Y +S
Sbjct: 65 PLSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSR 124
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G + + YG+GS++GF S D V + + V +Q F EA E LTF+ A+FDGI+G
Sbjct: 125 SYRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF IAV A V+DNMV Q LV VFSF+LNR+ + GGEI FGG D + + G +
Sbjct: 185 LGFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDIS 244
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV+ KGYWQF + +I++ N S +C GC AI D+GTSL+AGP+ + ++ IG
Sbjct: 245 YVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 302
>gi|18858489|ref|NP_571785.1| cathepsin D [Danio rerio]
gi|12053845|emb|CAC20111.1| cathepsin D enzyme [Danio rerio]
Length = 399
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 202/329 (61%), Gaps = 21/329 (6%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLR-RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+R FC C LLP S+ RI LKK R +L+ + + +E + + ++
Sbjct: 1 MRIRFC------CSLLPFSARRRDCRIPLKKFRTLRRTLSDSGRSLEELV---SSSNSLK 51
Query: 65 HRLG-DSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-I 120
+ LG + D P LKN++DAQY+GEIG+G+P Q F+V+FDTGSSNLWVPS C + I
Sbjct: 52 YNLGFPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDI 111
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H +Y KS+TY + G I YGSGS+SG+ SQD +GD+ V+ Q+F EA ++
Sbjct: 112 ACLLHHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQP 171
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+ A+FDGI+G+ + I+V PV+D M+ Q V + VFSF+LNR+PD + GGE++
Sbjct: 172 GVAFIAAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELL 231
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG--TSLLAG 298
GG DPK++ G YV ++++ YWQ + + IG+ +C+GGC AIVD+G TSL+ G
Sbjct: 232 LGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGS-GLSLCKGGCEAIVDTGTSTSLITG 290
Query: 299 PTPVVTEINHAIGG----EGVVSAECKLV 323
P V + AIG +G +CK V
Sbjct: 291 PAAEVKALQKAIGAIPLMQGEYMVDCKKV 319
>gi|1585311|prf||2124395A Asp protease
Length = 380
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 128/293 (43%), Positives = 187/293 (63%), Gaps = 8/293 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGV-RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ L+ L +AR + E V V R + + LKN++DAQY G+I IG+PP
Sbjct: 17 RVPLYPLKSARRSLIEFETSLENVQKVWFSRFSNVEPRPEYLKNYLDAQYHGDITIGTPP 76
Query: 96 QNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISG 154
Q FS +FDTGSSNLWVPS C YF I+C H +Y S KS TY G I YG+GS+SG
Sbjct: 77 QTFSAVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTTYVPNGTDFSIRYGTGSLSG 136
Query: 155 FFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVE 214
F S D++++G + VK Q F EAT++ L F++A+FDGI+G+ + +AVG PV+ NM++
Sbjct: 137 FLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGMAYPSLAVGGVTPVFVNMIK 196
Query: 215 QGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
QG+V VFSF+L+R+ GGE++ GG+D K++ G+ YV +T+K YW F++ ++ I
Sbjct: 197 QGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTIS 256
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+ S +C GC AI D+GTS++AGPT V +IN +G G+ + C ++
Sbjct: 257 DLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVI 307
>gi|241813645|ref|XP_002416518.1| aspartic protease, putative [Ixodes scapularis]
gi|215510982|gb|EEC20435.1| aspartic protease, putative [Ixodes scapularis]
Length = 392
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 120/238 (50%), Positives = 162/238 (68%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N++DAQY+G I IGSPPQ F V+FDTGSSNLWVPS +C + +I+C H +Y +S
Sbjct: 59 PLSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSR 118
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G + + YG+GS++GF S D V + + V +Q F EA E LTF+ A+FDGI+G
Sbjct: 119 SYRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILG 178
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LGF IAV A V+DNMV Q LV VFSF+LNR+ + GGEI FGG D + + G +
Sbjct: 179 LGFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDIS 238
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV+ KGYWQF + +I++ N S +C GC AI D+GTSL+AGP+ + ++ IG
Sbjct: 239 YVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 296
>gi|330800100|ref|XP_003288077.1| preprocathepsin D [Dictyostelium purpureum]
gi|325081901|gb|EGC35401.1| preprocathepsin D [Dictyostelium purpureum]
Length = 386
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 188/318 (59%), Gaps = 25/318 (7%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
LL + VLA+ L +P S R +K R+ + N I GG +
Sbjct: 4 LLALILTFIVLANALTVPLSFTPASRQAIK--RIPQNVANKYTIAAN----GGTNI---- 53
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCY 123
P+ +F DAQY+G I IG+P Q F V+FDTGSSNLW+PS KC ++ +C
Sbjct: 54 -----------PISDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCSITVPACD 102
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS++Y G S I YGSG++SGF SQD V VG + VK+Q+F EAT E +
Sbjct: 103 LHEKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVTVGSLSVKNQLFAEATAEPGIA 162
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F A+FDGI+GL F+ I+V D PV+ NM++QGLV + +FSFWL++ P GGE+ FG
Sbjct: 163 FDFAKFDGILGLAFQSISVNDIPPVFYNMIDQGLVGQNLFSFWLSKTP-GSNGGELSFGS 221
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPV 302
+D + G TYVP+T YW+F++ D IG QS G C GC AI DSGTSL+AGP
Sbjct: 222 IDSSKYTGPITYVPLTNTTYWEFKMDDFAIGGQSAGFCGSQGCPAIADSGTSLIAGPIDF 281
Query: 303 VTEINHAIGGEGVVSAEC 320
+T +N +G V+S E
Sbjct: 282 ITALNQKLGAV-VISGEA 298
>gi|13928928|ref|NP_113858.1| napsin A aspartic peptidase precursor [Rattus norvegicus]
gi|6689137|emb|CAB65392.1| napsin [Rattus norvegicus]
gi|51260062|gb|AAH78790.1| Napsin A aspartic peptidase [Rattus norvegicus]
gi|149056039|gb|EDM07470.1| napsin A aspartic peptidase, isoform CRA_a [Rattus norvegicus]
Length = 420
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 118/258 (45%), Positives = 172/258 (66%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL FM+ QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS++C +FS++C+FH R+ +
Sbjct: 63 FVPLSKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKA 122
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G +SG S+DN+ +G + F EA E SL F LARFDGI
Sbjct: 123 SSSFRPNGTKFAIQYGTGRLSGILSRDNLTIGGIHNVSVTFGEALWEPSLVFALARFDGI 182
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AVG P D +VEQ L+ + VFSF+LNRD + +GGE+V GG DP H+
Sbjct: 183 LGLGFPTLAVGGVQPPLDALVEQRLLEKPVFSFYLNRDSEGSDGGELVLGGSDPDHYVPP 242
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PVT YWQ + + +G +C GC AI+D+GTSL+ GP+ + +N A+GG
Sbjct: 243 LTFIPVTIPAYWQVHMQSVKVGT-GLNLCAQGCGAILDTGTSLITGPSEEIRALNKAVGG 301
Query: 313 EGVVSAECKLVVSQYGDL 330
+++ + + S+ +L
Sbjct: 302 FPLLTGQYLIQCSKIPEL 319
>gi|403299328|ref|XP_003940441.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 421
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 188/314 (59%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ + + +LN R G G +LG +PL
Sbjct: 22 PAGAT-LIRIPLRRVQPERRTLNLLR---------GWGEPAKLPKLGAPSPGDKPAFVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
N+ D QYFGEIG+G PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 72 SNYRDVQYFGEIGLGMPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YGSG + G S+D + +G + +F EA E SL F A FDGI+GLG
Sbjct: 132 QPNGTKFAIQYGSGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+V
Sbjct: 192 FPVLAVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEKPDGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GTSL+ GPT + +N AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVKVGSGLT-LCARGCAAILDTGTSLITGPTEEIQALNAAIGGFPLL 310
Query: 317 SAECKLVVSQYGDL 330
+ E ++ S+ L
Sbjct: 311 AGEYIILCSEIPKL 324
>gi|355703800|gb|EHH30291.1| hypothetical protein EGK_10923 [Macaca mulatta]
Length = 423
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 190/314 (60%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PARAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + +++G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVMVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL 330
+ E + S+ L
Sbjct: 311 AGEYIIRCSEIPKL 324
>gi|332241362|ref|XP_003269849.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 190/310 (61%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G GD +PL N+
Sbjct: 22 PAGAT-LIRIPLHRVQPERRTLNLMRGWREPAELPKLGAPSP----GDK-PTFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
+I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF +
Sbjct: 136 TKFDIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPVDVLVEQGLLDKPIFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCARGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
>gi|348511299|ref|XP_003443182.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 397
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 127/303 (41%), Positives = 184/303 (60%), Gaps = 3/303 (0%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLK 78
+LL A + R+ L K R L L + + A +G + L
Sbjct: 12 VLLLAQCTAILRVPLYKTR-SLRRLMSDNGMSVDELRALAKSTGSPDSAPSPQLPVERLT 70
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYT 137
NF+D+QY+G I IG+PPQNF+V+FDTGSSNLWVPS C I+C+FH RY S+KS+TY
Sbjct: 71 NFLDSQYYGIISIGTPPQNFTVLFDTGSSNLWVPSIHCSLLDIACWFHHRYNSKKSSTYA 130
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G I YG+GS+SGF S D V + + V Q F EA ++ +TF ARFDG++G+G+
Sbjct: 131 KNGTEFSIQYGTGSLSGFISGDTVTIAGLSVPGQQFGEAVKQPGITFAFARFDGVLGMGY 190
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V + +PV+D + L+ + +FSF+++RDP A GGE++ GG DP+++ G YV
Sbjct: 191 PSISVDNVMPVFDTAMAAKLLPQNIFSFYISRDPTAAVGGELMLGGTDPQYYTGDLHYVN 250
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT+K +WQ + + +GNQ T +C+ GC AIVD+GTSL+ GP V + AIG ++
Sbjct: 251 VTRKAFWQIGMNRVDVGNQLT-LCKAGCQAIVDTGTSLIVGPKEEVKALQKAIGAIPLLM 309
Query: 318 AEC 320
E
Sbjct: 310 GEA 312
>gi|327278613|ref|XP_003224055.1| PREDICTED: cathepsin E-like [Anolis carolinensis]
Length = 396
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 194/315 (61%), Gaps = 15/315 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE--RYMGGAGVSGVRHRLGDS- 70
VL +C +L S GL+R+ LK+ + SL R E ++ V +++ S
Sbjct: 5 VLITCFILFVS--GLQRVPLKRHK----SLRNILRERGELSKFWKSYKVDNIQYTQDCSA 58
Query: 71 -DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
E PL N+ D +YFGEI IG+PPQNF+V+FDTGSSNLWVPS C S +C HSR+
Sbjct: 59 FQEANEPLLNYFDVEYFGEISIGTPPQNFTVLFDTGSSNLWVPSVYCA-SKACVEHSRFH 117
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+S+TY E+G S I+YG+GS++G D+V V + V +Q F E+ E TFL + F
Sbjct: 118 PTESSTYNEVGTSFSIHYGTGSLTGIIGMDSVTVEGITVTNQQFAESVSEPGKTFLDSEF 177
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + +AV PV+DNM+ Q LV +FS +L+R+PD+ GGE++FGG DP F
Sbjct: 178 DGILGLAYPSLAVDGVTPVFDNMMAQNLVELPLFSVYLSRNPDSSIGGELIFGGYDPSLF 237
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G ++PV+KKGYWQ +L +I +G + C GC AIVD+GTSL+ GP+ + ++ +
Sbjct: 238 SGNLNWIPVSKKGYWQIQLDNIQVGG-TIAFCAEGCQAIVDTGTSLITGPSDDIKQMQNL 296
Query: 310 IGGE---GVVSAECK 321
IG + G + EC
Sbjct: 297 IGAQPVDGEYAVECS 311
>gi|402906426|ref|XP_003916003.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 423
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 189/314 (60%), Gaps = 16/314 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PAGAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F +AV P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVTVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL 330
+ E + S+ L
Sbjct: 311 AGEYIIRCSEIPKL 324
>gi|354497676|ref|XP_003510945.1| PREDICTED: napsin-A [Cricetulus griseus]
Length = 569
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 117/258 (45%), Positives = 168/258 (65%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL FM+ QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ +
Sbjct: 62 FVPLYKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSVRCHFFSLPCWFHRRFNPKA 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YGSG ++G SQDN+ +G++ F EA E S+ F LA FDGI
Sbjct: 122 SSSFRPNGTKLAIQYGSGQLTGILSQDNLTIGEIRGVSVTFGEALWESSMVFTLAHFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AV P D MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+
Sbjct: 182 LGLGFPSLAVDGVQPPLDAMVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPP 241
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PVT YWQ + + +G +C GC I+D+GTSL+ GP+ + +N AIGG
Sbjct: 242 LTFIPVTIPAYWQVHMESVNVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGG 300
Query: 313 EGVVSAECKLVVSQYGDL 330
++ + + S+ +L
Sbjct: 301 LPFLAGQYFIQCSKTPEL 318
>gi|156039363|ref|XP_001586789.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980]
gi|154697555|gb|EDN97293.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 396
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 193/321 (60%), Gaps = 17/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR----KERYMGGAGVSGVRHRLGD 69
VLA+ LL + S G+ ++ LKK L A T ++YMG S +
Sbjct: 5 VLAAASLLGSVSAGVHKMPLKKVSLSEQLATANMDTHVKHLGQKYMGVRPQSHASEMFKE 64
Query: 70 SD------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHLEGGDHTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSEC-GSIACY 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L
Sbjct: 124 LHTKYDSSSSSTYEKNGTSFEIRYGSGSLSGFTSRDVMSIGDLEIKDQVFAEATEEPGLA 183
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + NM+ QGL+ E VF+F+L D + E +FGG
Sbjct: 184 FAFGRFDGILGLGYDTISVNQIVPPFYNMINQGLLDEPVFAFYLGDSKDEGDESEAIFGG 243
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ H++GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P+ +
Sbjct: 244 VNKDHYEGKITEIPLRRKAYWEVDLDAISFGDAKADLDNTGV--ILDTGTSLIAVPSTLA 301
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 302 ELLNKEIGAKKGWNGQYSVDC 322
>gi|321461134|gb|EFX72169.1| hypothetical protein DAPPUDRAFT_189045 [Daphnia pulex]
Length = 391
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 191/322 (59%), Gaps = 25/322 (7%)
Query: 6 LRSVFCLWVL----ASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
++ +F L+ L A+ LL S L R+ + + L + R + RY G ++
Sbjct: 1 MKKIFVLFALVGLSAAAKLL---SIPLERLPTARSSMSLVEQSMER--TRNRYSSGKILT 55
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SI 120
ED L+NF D+QYFG I +G+PPQ+F+VIFDTGS+NLWVPSS+C ++
Sbjct: 56 ----------ED---LRNFQDSQYFGPITLGTPPQDFTVIFDTGSANLWVPSSQCSEENL 102
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H++Y S S+TY G I YG+G++ GF S D + V V DQ F EA E
Sbjct: 103 ACKVHNQYNSSLSDTYKPNGTEFSIQYGTGAMDGFLSTDILGVAGAQVMDQTFAEAVNEP 162
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP-DAEEGGEI 239
+TF+ RFDGI+G+ + IAV VP++ NM+ QGLV E VFSFWLNRD D GGEI
Sbjct: 163 GVTFVAGRFDGILGMSYPNIAVQGVVPMFQNMMAQGLVDEPVFSFWLNRDASDPVNGGEI 222
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI-GNQSTGVCEGGCAAIVDSGTSLLAG 298
VFGG +P H+ G+ Y+PVT+K YWQF ++I G C+GGC I D+GTS++AG
Sbjct: 223 VFGGTNPDHYVGEINYIPVTRKAYWQFRADGLMIEGIPEYPFCDGGCEMISDTGTSVIAG 282
Query: 299 PTPVVTEINHAIGGEGVVSAEC 320
P V +N +G +++ E
Sbjct: 283 PAEEVNLLNRLLGAINIINGEA 304
>gi|47213062|emb|CAF91576.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 118/255 (46%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIG+G+PPQ F+V+FDTGSSNLWVPS C I+C H +Y S KS+T
Sbjct: 57 LTNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHRKYNSAKSST 116
Query: 136 YTEIGKSCEINYGSGSISGFFSQDN-----------VEVGDVVVKDQVFIEATREGSLTF 184
Y + G + I YGSGS+SG+ SQD +VG + V+ Q+F EA ++ + F
Sbjct: 117 YVKNGTAFAIRYGSGSLSGYLSQDTCTVRACDPCPFFQVGGLAVEKQLFGEAIKQPGIAF 176
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+G+G+ I+V PV+DN++ Q V + VFSF+LNR+P + GGE++ GG
Sbjct: 177 IAAKFDGILGMGYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPQTQPGGELLLGGT 236
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+++ G +YV VT++ YWQ + ++ +G+Q T +C+ GC AIVD+GTSLL GP+ V
Sbjct: 237 DPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEVR 295
Query: 305 EINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 296 SLQKAIGALPLIQGE 310
>gi|351702766|gb|EHB05685.1| Napsin-A [Heterocephalus glaber]
Length = 417
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 167/258 (64%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
++PL FM+ QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+FH RY +
Sbjct: 64 LVPLSKFMNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFFSVPCWFHHRYDPKA 123
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G +SG S+D + +G + F EA E SL F A FDGI
Sbjct: 124 SSSFRPNGTKFAIQYGTGRLSGILSEDKLNIGGISNASVTFGEALWEPSLVFAFASFDGI 183
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
GLGF +AV P D +VEQGL+ + +FSF+LNRD +GGE+V GG DP H+
Sbjct: 184 FGLGFPTLAVDRVPPPLDVLVEQGLLEKPIFSFYLNRDFAGADGGELVLGGADPAHYIPP 243
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+VPVT YWQ + + +G T +C GCAAIVD+GTSL+ GP+ + ++ AIGG
Sbjct: 244 LTFVPVTVPAYWQIHMERVKVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRALHRAIGG 302
Query: 313 EGVVSAECKLVVSQYGDL 330
++ E ++ S+ L
Sbjct: 303 LPWLAGEHFILCSKIPTL 320
>gi|315440805|gb|ADU20408.1| aspartic protease 2 [Clonorchis sinensis]
Length = 385
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 7/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD+QY+GEI IG+PPQ F V+FDTGSSNLWVPS++C ++ +C H RY KS+T
Sbjct: 60 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 119
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y GK I YG+GS+SG S D V V V+DQ F EA E L F++A+FDGI+GL
Sbjct: 120 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 179
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IAV + VPV+DNM+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +
Sbjct: 180 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 239
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VP++ + YWQ +L I + S +C GC AIVD+GT+L+ GPT V ++N A+G
Sbjct: 240 VPLSSETYWQIDLDGIQV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 297
Query: 313 EGVVSA-ECKLVVS 325
EG +S EC + +
Sbjct: 298 EGGLSVLECSQIYT 311
>gi|45360583|ref|NP_988964.1| cathepsin D precursor [Xenopus (Silurana) tropicalis]
gi|38174445|gb|AAH61433.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 398
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 198/316 (62%), Gaps = 17/316 (5%)
Query: 12 LWVLAS--CLLLPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVR 64
+W L + C++ P SS L RI LKK R + +A +++ E S
Sbjct: 6 VWALLALCCVMQPGSS--LVRIPLKKFTSIRRAMSETDQDALKLSGNE---AATKYSAFL 60
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCY 123
+ + E +L N++DAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C F ++C+
Sbjct: 61 NSKNPTPETLL---NYLDAQYYGEIGIGTPPQPFTVVFDTGSSNLWVPSIHCSFWDLACW 117
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H +Y S KS TY G I YGSGS++G+ S+D V +GD+ V Q F EA ++ +T
Sbjct: 118 LHHKYDSSKSTTYINNGTEFAIQYGSGSLTGYLSKDTVTIGDLAVNGQFFAEAIKQPGIT 177
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+G+ +I+V PV+D+++EQ LV +FSF+LNR+PD GGE++ GG
Sbjct: 178 FVAAKFDGILGMGYPKISVDGVPPVFDDIMEQKLVDSNIFSFYLNRNPDTLPGGELLLGG 237
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP + G Y+ VT+K YWQ + + +G++ + +C+ GC AIVD+GTSL+ GP V
Sbjct: 238 TDPAFYTGDFNYMNVTRKAYWQIHMDQLSVGDRLS-LCKDGCEAIVDTGTSLITGPVEEV 296
Query: 304 TEINHAIGGEGVVSAE 319
T + AIG ++ E
Sbjct: 297 TALQRAIGAIPLICGE 312
>gi|402906424|ref|XP_003916002.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 421
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G +L +PL N+
Sbjct: 22 PARAT-LIRIPLHRVQPERRTLNLLRGWREPAEVPKLGAPSPGDKL-----TFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E L F A FDGI+GLGF +
Sbjct: 136 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCVPGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
>gi|358333762|dbj|GAA52230.1| cathepsin D [Clonorchis sinensis]
Length = 408
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 7/254 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD+QY+GEI IG+PPQ F V+FDTGSSNLWVPS++C ++ +C H RY KS+T
Sbjct: 83 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 142
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y GK I YG+GS+SG S D V V V+DQ F EA E L F++A+FDGI+GL
Sbjct: 143 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 202
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IAV + VPV+DNM+ QGLV + +FS WL+R+ + GGEI+FGG++ +H+ G +
Sbjct: 203 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 262
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VP++ + YWQ +L I + S +C GC AIVD+GT+L+ GPT V ++N A+G
Sbjct: 263 VPLSSETYWQIDLDGIQV--TSLTLCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 320
Query: 313 EGVVSA-ECKLVVS 325
EG +S EC + +
Sbjct: 321 EGGLSVLECSQIYT 334
>gi|262073106|ref|NP_001159993.1| cathepsin D precursor [Bos taurus]
gi|296471411|tpg|DAA13526.1| TPA: cathepsin D [Bos taurus]
Length = 410
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 121/253 (47%), Positives = 174/253 (68%), Gaps = 13/253 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MD Y+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ H +Y S KS+T
Sbjct: 73 LKNYMD--YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 130
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEV---------GDVVVKDQVFIEATREGSLTFLL 186
Y + G + +I+YGSGS+SG+ SQD V V G V V+ Q F EA ++ + F+
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 190
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+G+ + I+V + +PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 250
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
K+++G + VT++ YWQ + + +G+ T VC+GGC AIVD+GTSL+ GP V E+
Sbjct: 251 KYYRGSLMFHNVTRQAYWQIHMDQLDVGSSLT-VCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 307 NHAIGGEGVVSAE 319
AIG ++ E
Sbjct: 310 QKAIGAVPLIQGE 322
>gi|397485038|ref|XP_003813670.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKTIFVPLSNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLEKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
>gi|17389633|gb|AAH17842.1| Napsin A aspartic peptidase [Homo sapiens]
gi|123982255|gb|ABM82919.1| napsin A aspartic peptidase [synthetic construct]
gi|123997015|gb|ABM86109.1| napsin A aspartic peptidase [synthetic construct]
Length = 420
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 186/309 (60%), Gaps = 9/309 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFMD 82
S L RI L + + +LN R R+ + G D+ I +PL N+ D
Sbjct: 22 SGATLIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYRD 75
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 VQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANGT 135
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF ++
Sbjct: 136 KFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPILS 195
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 VEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVP 255
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 AYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYI 314
Query: 322 LVVSQYGDL 330
++ S+ L
Sbjct: 315 ILCSEIPKL 323
>gi|426244096|ref|XP_004015868.1| PREDICTED: napsin-A [Ovis aries]
Length = 443
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 184/304 (60%), Gaps = 11/304 (3%)
Query: 30 RIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
RI L++ +LN R K E GA G + +PL N+++AQY+G
Sbjct: 28 RIPLRRVNTGFKALNPLRGWEKLAEAPRLGAPSPGNKSLF-------VPLSNYLNAQYYG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G I
Sbjct: 81 EIGLGTPPQNFSVVFDTGSSNLWVPSVRCRFFSLPCWLHHRFNPKASSSFRFNGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 141 YGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGGVQ 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +WQ
Sbjct: 201 PPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + +G T +C GCAAI+D+GTSL+ GPT + + AIG ++ E + S+
Sbjct: 261 HMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIGAVPLLMGEYYIKCSK 319
Query: 327 YGDL 330
L
Sbjct: 320 IPTL 323
>gi|198421979|ref|XP_002130758.1| PREDICTED: similar to Ctsd protein [Ciona intestinalis]
Length = 385
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 121/246 (49%), Positives = 161/246 (65%), Gaps = 2/246 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
PL N+MDAQYFGEI IG+P Q F+VIFDTGSSNLWVPS+ C + +C H++Y S S+
Sbjct: 54 PLTNYMDAQYFGEISIGTPEQTFTVIFDTGSSNLWVPSASCPSTNYACMTHNKYNSAASS 113
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G+ I YG+GS+ G+ S D V++ V Q F EA E +TF+ A+FDGI+G
Sbjct: 114 TYVADGEEFRIQYGTGSMVGYDSVDTVKIAGVPSTSQTFAEALEEPGITFVAAKFDGILG 173
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+G+ IAV PV++ M EQG V + +F+F+LNRDP+A +GGEI GGV+P + G
Sbjct: 174 MGYPNIAVNGMKPVFNQMFEQGAVDQNLFAFYLNRDPEAADGGEITLGGVNPARYVGDFN 233
Query: 255 YVPVTKKGYWQFELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y VT++GYWQ ++ + I + + T C GGC IVDSGTSL+ GP+ IN AIG
Sbjct: 234 YHDVTRQGYWQIKMDGLSIADTAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAI 293
Query: 314 GVVSAE 319
V E
Sbjct: 294 KFVQGE 299
>gi|114678580|ref|XP_524345.2| PREDICTED: napsin-A isoform 4 [Pan troglodytes]
Length = 420
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PGDKTIFVPLSNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
>gi|355756059|gb|EHH59806.1| hypothetical protein EGM_10003 [Macaca fascicularis]
Length = 423
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 194/324 (59%), Gaps = 17/324 (5%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED----ILPL 77
PA + L RI L++ L +LN R G G RLG ++PL
Sbjct: 22 PARAT-LIRIPLRRVHPGLRTLNLLR---------GWGKPAKLPRLGAPSPGDKPALVPL 71
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTY 136
F+DAQYFGEIG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH R+ S+++
Sbjct: 72 SKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSF 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLG
Sbjct: 132 QPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTISRPDGILGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++V P D +VEQGL+ + VFSF+LNRD + +GGE+V GG DP H+ T+V
Sbjct: 192 FPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDSEVADGGELVLGGSDPAHYIPPLTFV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG ++
Sbjct: 252 PVTVPAYWQIHMERVTVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHEAIGGIPLL 310
Query: 317 SAECKLVVSQYGDL-IWDLLVSGL 339
+ E + S+ L LL+ G+
Sbjct: 311 AGEYIIRCSEIPKLPTVSLLIGGV 334
>gi|109125662|ref|XP_001116026.1| PREDICTED: napsin-A-like [Macaca mulatta]
Length = 421
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L + + + +LN R R+ + G +L +PL N+
Sbjct: 22 PARAT-LIRIPLHRVQPERRNLNLLRGWREPAEVPKLGAPSPGDKL-----TFVPLSNYR 75
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFG+IG+G+PPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 DVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 135
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E L F A FDGI+GLGF +
Sbjct: 136 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGILGLGFPIL 195
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 SVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 255
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 PAYWQIHMERVKVGPGLT-LCVRGCAAILDTGTSLITGPTEEIRALHAAIGGYPLLAGEY 314
Query: 321 KLVVSQYGDL 330
++ S+ L
Sbjct: 315 IILCSEIPKL 324
>gi|347836229|emb|CCD50801.1| similar to vacuolar protease A (secreted protein) [Botryotinia
fuckeliana]
Length = 398
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 192/309 (62%), Gaps = 15/309 (4%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA--GVSGVRH------- 65
LA+ LL + S G+ ++ LKK L L A + +++G GV H
Sbjct: 6 LAAASLLGSVSAGVHKMPLKKVSLS-EQLATANMQEHAKHLGQKYMGVRPESHASEMFKE 64
Query: 66 -RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ D+ + +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHDAGDHTVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L F
Sbjct: 124 HTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDVMTIGDLKIKDQVFAEATEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + +MV+QGL+ E VF+F+L + D + E +FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDEPVFAFYLGSN-DESDPSEAIFGGV 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P +
Sbjct: 243 NKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAELENTGV--ILDTGTSLIALPADLAG 300
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 301 LLNAEIGAK 309
>gi|154309857|ref|XP_001554261.1| hypothetical protein BC1G_06849 [Botryotinia fuckeliana B05.10]
gi|38195404|gb|AAR13364.1| aspartic proteinase precursor [Botryotinia fuckeliana]
Length = 398
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 192/309 (62%), Gaps = 15/309 (4%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA--GVSGVRH------- 65
LA+ LL + S G+ ++ LKK L L A + +++G GV H
Sbjct: 6 LAAASLLGSVSAGVHKMPLKKVSLS-EQLATANMQEHAKHLGQKYMGVRPESHASEMFKE 64
Query: 66 -RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ D+ + +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHDAGDHTVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y S S+TY + G S EI YGSGS+SGF S+D + +GD+ +KDQVF EAT E L F
Sbjct: 124 HTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDVMTIGDLKIKDQVFAEATEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + +MV+QGL+ E VF+F+L + D + E +FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDEPVFAFYLGSN-DESDPSEAIFGGV 242
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK T +P+ +K YW+ +L I G+ + G I+D+GTSL+A P +
Sbjct: 243 NKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAELENTGV--ILDTGTSLIALPADLAG 300
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 301 LLNAEIGAK 309
>gi|297462061|ref|XP_001790669.2| PREDICTED: napsin-A [Bos taurus]
gi|297485858|ref|XP_002695173.1| PREDICTED: napsin-A [Bos taurus]
gi|296477597|tpg|DAA19712.1| TPA: napsin A aspartic peptidase [Bos taurus]
Length = 408
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 178/287 (62%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI L++ + +LN R K E GA G + +PL ++M+ QY
Sbjct: 26 LIRIPLRRVNIGFKALNPLRGWEKLAEPPRLGAPAPG-------NKSLFVPLSDYMNVQY 78
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 79 YGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKASSSFRSNGTKFA 138
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 139 IQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGG 198
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V+QGL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +W
Sbjct: 199 VRPPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFW 258
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q + + +G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 259 QIHMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
>gi|426389739|ref|XP_004061277.1| PREDICTED: napsin-A-like [Gorilla gorilla gorilla]
Length = 420
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 132/311 (42%), Positives = 188/311 (60%), Gaps = 10/311 (3%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNF 80
PA + L RI L + + +LN R R+ + G D+ I +PL N+
Sbjct: 21 PAGAT-LIRIPLHRVQPGRRTLNLLRGWREPAELPKLGAPS------PVDKPIFVPLLNY 73
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++
Sbjct: 74 RDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHDRFDPKASSSFQAN 133
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF
Sbjct: 134 GTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPI 193
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 194 LSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVT 253
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 254 VPAYWQIHMERVKVGPGLT-LCAQGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGE 312
Query: 320 CKLVVSQYGDL 330
++ S+ L
Sbjct: 313 YIILCSEIPKL 323
>gi|397485042|ref|XP_003813672.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 189/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L++ +LN R K + G GD ++PL F+
Sbjct: 21 PAGAT-LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPA-LVPLSKFL 74
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 75 DAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNG 134
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 135 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 194
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 195 SVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 254
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E
Sbjct: 255 PAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEY 313
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 314 IIRCSEIPKL 323
>gi|380483026|emb|CCF40872.1| vacuolar protease A [Colletotrichum higginsianum]
Length = 399
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 194/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGG-----AGVSGV 63
+L + +LL A+ ++ LKK L+ LN+ I + ++YMG A
Sbjct: 5 LLTAAVLLGAAQAEFHKLKLKKVSLE-EQLNSVPIEHQVRQLGQKYMGARPDNHADAMFK 63
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + + E +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPVQSNGEHPVPVSNFMNAQYFSEIEIGNPPQTFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY G S EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHTKYDSSASSTYKANGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + N+V Q + E VF+F+L + + E FGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
+D H++GK TY+P+ +K YW+ +L I +G+Q+ + G AI+D+GTSL P+ +
Sbjct: 243 LDDSHYEGKITYIPLRRKAYWEVDLDAISLGDQTAEL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGYNGQYSVEC 321
>gi|114678578|ref|XP_530061.2| PREDICTED: napsin-A-like [Pan troglodytes]
Length = 420
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 188/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L++ +LN R K + G GD + PL F+
Sbjct: 21 PAGAT-LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFL 74
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 75 DAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNG 134
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 135 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 194
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 195 SVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 254
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E
Sbjct: 255 PAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEY 313
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 314 IIRCSEIPKL 323
>gi|119592255|gb|EAW71849.1| napsin A aspartic peptidase, isoform CRA_c [Homo sapiens]
Length = 328
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 183/308 (59%), Gaps = 9/308 (2%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFM 81
S L RI L + + LN R R+ + G D+ I +PL N+
Sbjct: 21 PSGATLIRIPLHRVQPGRRILNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYR 74
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 75 DVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANG 134
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF +
Sbjct: 135 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPIL 194
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 195 SVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTV 254
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 255 PAYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEV 313
Query: 321 KLVVSQYG 328
+ YG
Sbjct: 314 RSQSGGYG 321
>gi|321461133|gb|EFX72168.1| hypothetical protein DAPPUDRAFT_227643 [Daphnia pulex]
Length = 394
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 181/292 (61%), Gaps = 12/292 (4%)
Query: 34 KKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
K R+ L ++++R T K G V+ R G PL N+ DAQYFG + +G+
Sbjct: 19 KGLRVPLKQMDSSRKTMKGL---GLAYEKVQRRYGSGKLISEPLTNYQDAQYFGPLTLGT 75
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ F +IFDTGS+NLWVPSS+C +++C H++Y S S+TYT G I YG+G++
Sbjct: 76 PPQEFDIIFDTGSANLWVPSSECAPTNLACRNHNQYNSSLSSTYTPNGTEFSIQYGTGAM 135
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF S D + + V DQ F EA E + F+ RFDGI+G+ + I+V VP++ NM
Sbjct: 136 TGFLSTDVLGIAGAQVIDQTFAEAVEEPGVVFVAGRFDGILGMSYPSISVQGVVPMFQNM 195
Query: 213 VEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
+ QGLV E VFSFWLNR+ + E GGEI+FGG +P H++G+ +YVPV++K YWQF + +
Sbjct: 196 MAQGLVDEPVFSFWLNRNLNNPENGGEILFGGTNPTHYEGEISYVPVSRKAYWQFSVDGV 255
Query: 272 -LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVV 316
L G C GGC I D+GTSL+ GP+ +T + IG GEG+V
Sbjct: 256 NLAGYDEYPFCNGGCEMISDTGTSLITGPSEEITLFHKLIGAQVNIVGEGIV 307
>gi|344312912|emb|CCC33063.1| cathepsin D-1 [Dermanyssus gallinae]
Length = 383
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 121/258 (46%), Positives = 162/258 (62%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I PL NF DAQY+G I IG+PPQ F VIFDTGSS+LWVPSSKC S I+C HS+Y + K
Sbjct: 54 IEPLNNFGDAQYYGPITIGTPPQTFQVIFDTGSSDLWVPSSKCPSSNIACATHSKYNAEK 113
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G I YGSGS+SG S D V V + V Q F E T E +F+ ++DGI
Sbjct: 114 SSTYVANGTKFAIQYGSGSVSGVLSTDTVSVSGITVTKQTFGEITEESGDSFIYGKYDGI 173
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+G+ EIA +PV+D MV+Q +V + +FSF+L RDP G E+V GG+DPKH+KG
Sbjct: 174 LGMGYPEIA-SSGLPVFDQMVKQKVVEKAIFSFFLTRDPQHPIGSELVLGGIDPKHYKGD 232
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TY P+T++ YWQF + + + ++ VC+ GC I D+GTSL GPT V + +
Sbjct: 233 ITYAPLTRESYWQFRVDKVTLNGKAAPVCQKGCEGIADTGTSLFVGPTADVAALASQLDA 292
Query: 313 EGVVSAECKLVVSQYGDL 330
+ + + GDL
Sbjct: 293 QETAPGLYLVDCEKAGDL 310
>gi|310796316|gb|EFQ31777.1| eukaryotic aspartyl protease [Glomerella graminicola M1.001]
Length = 399
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 193/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS--- 70
+L + +LL A+ + ++ LKK L+ LNA I + R +G + + D+
Sbjct: 5 LLTAAVLLGAAQAEVHKLKLKKVPLE-EQLNAVPIEHQVRQLGQKYMGTRPNNHADAMFN 63
Query: 71 -------DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
E +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPIQTDGEHPVPVSNFMNAQYFSEIQIGTPPQTFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY G S EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHTKYDSSASSTYKSNGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + N+V Q + E VF+F+L + + E FGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
+D H++GK TY+P+ +K YW+ +L I +G+++ + G AI+D+GTSL P+ +
Sbjct: 243 LDESHYEGKVTYIPLRRKAYWEVDLDAISLGDETADL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGYNGQYSVEC 321
>gi|410974821|ref|XP_003993838.1| PREDICTED: cathepsin D [Felis catus]
Length = 418
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 188/304 (61%), Gaps = 25/304 (8%)
Query: 36 RRLDLHSLNAARITRKERYMGGA------------GVSGVRHRLGDSDEDILPLKNFMDA 83
R+ LH + R T E +GG GV G +IL KN++DA
Sbjct: 32 ERIPLHKFTSVRRTMSE--LGGPVEDLIAKGPISKYAQGVPAVTGGPIPEIL--KNYLDA 87
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS 142
QY+GEIGIG+PPQ F+V+FDTGS+NLWVPS C I+C+ S Y + G S
Sbjct: 88 QYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWGGSVAXXXXXXXYVKNGTS 147
Query: 143 CEINYGSGSISGFFSQDNVEV-------GDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+I+YGSGS+SG+ SQD V V V V+ Q+F EA ++ +TF+ A+FDGI+G+
Sbjct: 148 FDIHYGSGSLSGYLSQDTVSVPCQTPTVAGVKVERQIFGEAIKQPGITFIAAKFDGILGM 207
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V D +PV+DN+++Q LV + +FSF+LNRDP+A+ GGE++ GG D K++KG +Y
Sbjct: 208 AYPRISVDDVLPVFDNLMKQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSY 267
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ + + +G T +C+GGC AI+D+GTSL+ GP V E+ AIG +
Sbjct: 268 LNVTRKAYWQVHMDQVDVGTSLT-LCKGGCEAILDTGTSLMVGPVDEVRELQKAIGAVPL 326
Query: 316 VSAE 319
+ E
Sbjct: 327 IQGE 330
>gi|4758754|ref|NP_004842.1| napsin-A preproprotein [Homo sapiens]
gi|6225749|sp|O96009.1|NAPSA_HUMAN RecName: Full=Napsin-A; AltName: Full=Aspartyl protease 4;
Short=ASP4; Short=Asp 4; AltName: Full=Napsin-1;
AltName: Full=TA01/TA02; Flags: Precursor
gi|4154287|gb|AAD04917.1| napsin A [Homo sapiens]
gi|4235425|gb|AAD13215.1| napsin 1 precursor [Homo sapiens]
gi|6561818|gb|AAF17081.1| aspartyl protease 4 [Homo sapiens]
gi|119592253|gb|EAW71847.1| napsin A aspartic peptidase, isoform CRA_a [Homo sapiens]
Length = 420
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 185/309 (59%), Gaps = 9/309 (2%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLKNFMD 82
S L RI L + + LN R R+ + G D+ I +PL N+ D
Sbjct: 22 SGATLIRIPLHRVQPGRRILNLLRGWREPAELPKLGAPS------PGDKPIFVPLSNYRD 75
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGK 141
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 76 VQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKASSSFQANGT 135
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YG+G + G S+D + +G + +F EA E SL F A FDGI+GLGF ++
Sbjct: 136 KFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGILGLGFPILS 195
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 196 VEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVP 255
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG +++ E
Sbjct: 256 AYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGGIPLLAGEYI 314
Query: 322 LVVSQYGDL 330
++ S+ L
Sbjct: 315 ILCSEIPKL 323
>gi|332241360|ref|XP_003269848.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 247 bits (631), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 185/306 (60%), Gaps = 12/306 (3%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N LRR+ +R +LN R K + G GD + PL F+DAQY
Sbjct: 30 NPLRRVHPGRR-----ALNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQY 79
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
FGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 80 FGEIGLGTPPQNFTVTFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPNGTKFA 139
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +AV
Sbjct: 140 IQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILAVEG 199
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +VEQGL+ + +FSF+LNRDP+ +GGE+V GG DP H+ T+VPVT YW
Sbjct: 200 VRPPLDVLVEQGLLDKPIFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYW 259
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVV 324
Q + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 260 QIHMERVKVGSGLT-LCARGCAAILDTGTPVIIGPTEEIRALHAAIGGISLLAGEYLIRC 318
Query: 325 SQYGDL 330
S+ L
Sbjct: 319 SEIPKL 324
>gi|194374823|dbj|BAG62526.1| unnamed protein product [Homo sapiens]
Length = 325
Score = 247 bits (631), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 126/295 (42%), Positives = 181/295 (61%), Gaps = 7/295 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L++ +LN R K + G GD + PL F+DAQYFG
Sbjct: 26 LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V
Sbjct: 141 YGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ
Sbjct: 201 PPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECK 321
+ + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 261 HMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEVR 314
>gi|195430468|ref|XP_002063276.1| GK21477 [Drosophila willistoni]
gi|194159361|gb|EDW74262.1| GK21477 [Drosophila willistoni]
Length = 402
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 184/295 (62%), Gaps = 11/295 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL---PLKNFMDAQYFGEIGIGS 93
R+ LH +AR R E++ +R+ + D + L PL N++DAQYFG I IG+
Sbjct: 32 RVPLHRFPSAR-RRFEQFGIRMERLRLRYSVMPRDGEKLRTEPLTNYLDAQYFGPITIGT 90
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
PPQ F VIFDTGS+NLWVPS+ C S++C HSR+ +++S +Y IG I+YGSGS+
Sbjct: 91 PPQIFKVIFDTGSANLWVPSTSCSPASVACMIHSRFHAKRSTSYYPIGAPFAIHYGSGSL 150
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
SG+ S+D V V + +++QVF EAT FL A+FDGI GLG+R I+V P + M
Sbjct: 151 SGYLSRDTVRVAGLEIENQVFAEATNMPGPIFLAAKFDGIFGLGYRSISVQRIKPPFYAM 210
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
+EQ L++ VFS +LNRD A+EGG + FGG +P+++ G TYVPV+++ YWQ +
Sbjct: 211 MEQNLLASPVFSVYLNRDVAAKEGGALFFGGSNPQYYTGNFTYVPVSRRSYWQITMDSAH 270
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
I + +CE GC I+D+GTS LA P IN +IGG G+ S C+ V
Sbjct: 271 I--KDLNLCEQGCEVIIDTGTSFLAMPYDQAMLINKSIGGTPSSYGMFSIPCEQV 323
>gi|348559312|ref|XP_003465460.1| PREDICTED: napsin-A-like [Cavia porcellus]
Length = 523
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 166/254 (65%), Gaps = 5/254 (1%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHS 126
GDS +PL F++ QYFGEIG+G+PPQNFSV+FDTGSSNLWVPS C +FS+ C+FH
Sbjct: 59 GDS-PFFVPLSKFLNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKSCRFFSLPCWFHH 117
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
RY + S+++ G I YG+G +SG SQD + +G + F EA E SL F
Sbjct: 118 RYDPKASSSFCPNGTKFAIQYGTGRLSGILSQDKLTIGGINNVSVTFGEALWEPSLVFAF 177
Query: 187 ARFDGIIGLGFREIAVGDAVPV-WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDGI GLGF +AV D VP D MVEQGL+ + VFSF+LNRD + GGE+V GG D
Sbjct: 178 ASFDGIFGLGFPALAV-DGVPTPLDVMVEQGLLDKPVFSFYLNRDFEGTHGGELVLGGSD 236
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ T+VPVT YWQ + +++G T +C GCAAIVD+GTSL+ GP+ +
Sbjct: 237 PAHYIPPLTFVPVTIPAYWQIHMDRVMVGTGLT-LCAQGCAAIVDTGTSLITGPSEEIRA 295
Query: 306 INHAIGGEGVVSAE 319
++ AIGG ++ E
Sbjct: 296 LHRAIGGLPWLAGE 309
>gi|6561816|gb|AAF17080.1| aspartyl protease 3 [Homo sapiens]
Length = 450
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 187/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L RI L++ +LN R K + G GD + PL F+
Sbjct: 21 PAGAT-LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFL 74
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIG 140
DAQYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 75 DAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSG 134
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 135 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 194
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+V P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 195 SVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 254
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E
Sbjct: 255 PAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEY 313
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 314 IIRCSEIPKL 323
>gi|307166067|gb|EFN60339.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 370
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 120/240 (50%), Positives = 165/240 (68%), Gaps = 9/240 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK---CYFSISCY---FHSRYKS 130
L ++DAQY+G I IG+PPQNF+V+FDTGSSNLWVPS K ++ +SC+ +H +Y +
Sbjct: 46 LFKYLDAQYYGVISIGTPPQNFTVLFDTGSSNLWVPSIKSEITFYKLSCWTAPYHHKYNN 105
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS TY I YGSG +SGF S D V V + V++Q F EAT E S+ F+L +FD
Sbjct: 106 SKSITYQANSAPFAIEYGSGDLSGFLSTDVVNVAGLNVRNQTFAEATHESSI-FILMQFD 164
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+G+ I+V P++ NM++Q LVS+ +FSF+LNR+P AEEGGE++ GG DP H+
Sbjct: 165 GILGMGYPTISVDGVTPIFQNMIQQRLVSQPIFSFYLNRNPSAEEGGELILGGCDPNHYV 224
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+ TYVPVT +GYWQF + ++ GN +C GC AI D+GTSL+ GP+ + IN I
Sbjct: 225 GEFTYVPVTVEGYWQFTMDSVIAGNYI--LCAQGCQAIADTGTSLIVGPSEDIDVINGYI 282
>gi|119592251|gb|EAW71845.1| hCG1733572, isoform CRA_a [Homo sapiens]
Length = 449
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 184/304 (60%), Gaps = 7/304 (2%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L RI L++ +LN R K + G GD + PL F+DAQYFG
Sbjct: 26 LIRIPLRQVHPGRRTLNLLRGWGKPAELPKLGAPSP----GDKPASV-PLSKFLDAQYFG 80
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G I
Sbjct: 81 EIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNASSSFKPSGTKFAIQ 140
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF ++V
Sbjct: 141 YGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILSVEGVR 200
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
P D +VEQGL+ + VFSF+ NRDP+ +GGE+V GG DP H+ T+VPVT YWQ
Sbjct: 201 PPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQI 260
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + +G++ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E + S+
Sbjct: 261 HMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSE 319
Query: 327 YGDL 330
L
Sbjct: 320 IPKL 323
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 198/332 (59%), Gaps = 27/332 (8%)
Query: 20 LLPASSNGLRRIGLKK-----RRLDLHSLNAAR----ITRKERYMGGAGVSGVRHRLGDS 70
LLPA + ++ L+K +L SL+ A + + + GAG +G R + D+
Sbjct: 10 LLPAVYAEVHKLQLQKIPATVGNPELESLHLAEKYGVVNEFQTPLMGAGGAGRRLK-NDA 68
Query: 71 DEDI------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
ED+ +PL NFM+AQYF EI +G+PPQNF VI DTGSSNLWVPSSKC
Sbjct: 69 GEDLFWTQEQVKGGHGVPLTNFMNAQYFTEITLGTPPQNFKVILDTGSSNLWVPSSKCT- 127
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+C+ H++Y S S+TY + G I YGSGS+ GF SQD + +GD+ + Q F EA +
Sbjct: 128 SIACFLHAKYDSSASSTYKQNGTEFSIQYGSGSMEGFVSQDVLTIGDLTIPGQDFAEAVK 187
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E LTF +FDGI+GLG+ I+V VP NM+ +GL+ E VFSF L + E+GGE
Sbjct: 188 EPGLTFAFGKFDGILGLGYDTISVNHIVPPHYNMINKGLLDEPVFSFRLGK--SEEDGGE 245
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGGVD +KG TYVPV +K YW+ EL I G++ + G A +D+GTSL+A
Sbjct: 246 AIFGGVDKSAYKGDLTYVPVRRKAYWEVELEKISFGSEELELESTGAA--IDTGTSLIAL 303
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
PT + IN IG + + + ++ S+ DL
Sbjct: 304 PTDMAEMINAEIGAKKSWNGQYQVECSKVPDL 335
>gi|361128953|gb|EHL00878.1| putative Vacuolar protease A [Glarea lozoyensis 74030]
Length = 399
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 195/323 (60%), Gaps = 20/323 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
++A+ LL + S G+ ++ LKK L L A I ++YMG + +
Sbjct: 5 LIAAASLLGSVSAGIHKMPLKKISLS-EQLAGANIDTHVKHLGQKYMGIRPEAHEQEMFK 63
Query: 69 DSD------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
DS +P+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 64 DSSLHTEKGAHPVPVSNFLNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSEC-GSIAC 122
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G EI YGSGS+SGF SQD + +GD+ +KDQ+F EAT E L
Sbjct: 123 YLHTKYDSSSSSTYKKNGSDFEIRYGSGSLSGFVSQDTMTIGDLKIKDQIFAEATEEPGL 182
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLGF I+V VP + +M+ QGL+ E VF+F+L + EE E FG
Sbjct: 183 AFAFGRFDGILGLGFDTISVNKIVPPFYSMINQGLLDEPVFAFYLGDTNNGEE-SEATFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ H+ GK T +P+ +K YW+ +L I G+ + + G I+D+GTSL+A P+ +
Sbjct: 242 GVNEDHYTGKMTTIPLRRKAYWEVDLDAITFGDATAELENTGV--ILDTGTSLIALPSTL 299
Query: 303 VTEINHAIGGE----GVVSAECK 321
+N +G + G + EC+
Sbjct: 300 AELLNKEMGAKKGYNGQYTVECE 322
>gi|119592254|gb|EAW71848.1| napsin A aspartic peptidase, isoform CRA_b [Homo sapiens]
Length = 357
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 119/258 (46%), Positives = 168/258 (65%), Gaps = 2/258 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRK 132
+PL N+ D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+ H R+ +
Sbjct: 4 FVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKA 63
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G + G S+D + +G + +F EA E SL F A FDGI
Sbjct: 64 SSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGI 123
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF ++V P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+
Sbjct: 124 LGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPP 183
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+VPVT YWQ + + +G T +C GCAAI+D+GTSL+ GPT + ++ AIGG
Sbjct: 184 LTFVPVTVPAYWQIHMERVKVGPGLT-LCAKGCAAILDTGTSLITGPTEEIRALHAAIGG 242
Query: 313 EGVVSAECKLVVSQYGDL 330
+++ E ++ S+ L
Sbjct: 243 IPLLAGEYIILCSEIPKL 260
>gi|41053329|ref|NP_956325.1| uncharacterized protein LOC336746 precursor [Danio rerio]
gi|34783813|gb|AAH56836.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 184/315 (58%), Gaps = 16/315 (5%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSL--NAARITR-------KERYMGGAGVSGVRHRLGDS 70
LL A S + RI L K R L N I K +Y G + +
Sbjct: 13 LLIADSQAIIRIPLHKMRTVRRMLADNGKTIDEIKSLAKMKAKYSDGTFTNQGSVTIPAP 72
Query: 71 DEDILP-----LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYF 124
LP L NFMDAQY+G I IG+PPQ+FSV+FDTGSSNLWVPS C F I+C+
Sbjct: 73 TTTQLPPPVEKLTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWL 132
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H RY S+KS+TY + G I YG GS+SGF SQD V + + V Q F EA ++ + F
Sbjct: 133 HRRYNSKKSSTYVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVF 192
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ARFDG++G+ + I+V PV+D + ++ + +FSF++NRDP + GGE++ GG
Sbjct: 193 AVARFDGVLGMAYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGF 252
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D ++F G YV VT+K YWQ ++ ++ +G+ T +C+ GC AIVD+GTS++ GP V
Sbjct: 253 DQQYFNGDLHYVNVTRKAYWQIKMDEVQVGSTLT-LCKSGCQAIVDTGTSMITGPVQEVR 311
Query: 305 EINHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 312 ALQKAIGAIPLLMGE 326
>gi|403299330|ref|XP_003940442.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 425
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 187/310 (60%), Gaps = 8/310 (2%)
Query: 22 PASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
PA + L I L++ +LN R K+ + G H+ G +PL F+
Sbjct: 26 PAEAT-LIHIPLRRVHPGRRTLNLLRGWGKQAKLPRLGAPSPGHKPG-----FVPLSKFL 79
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIG 140
D QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C+ S + C+FH R+ + S+++ G
Sbjct: 80 DVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSKRCHLSSVPCWFHHRFDPKASSSFQPNG 139
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +
Sbjct: 140 TKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPIL 199
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
AV P D +VEQGL+ + VFSF+LNRDP+ +GGE+V GG DP H+ T+VPVT
Sbjct: 200 AVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTV 259
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
YWQ + + +G++ T +C GCAA++D+GT ++ GP + ++ AIGG +++ E
Sbjct: 260 PAYWQIHMERVKVGSELT-LCARGCAAVLDTGTPVIIGPAEEIRALHKAIGGLPLLAGEY 318
Query: 321 KLVVSQYGDL 330
+ S+ L
Sbjct: 319 IIRCSEIPKL 328
>gi|158254091|gb|AAI54325.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 163/244 (66%), Gaps = 2/244 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
L NFMDAQY+G I IG+PPQ+FSV+FDTGSSNLWVPS C F I+C+ H RY S+KS+T
Sbjct: 84 LTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHRRYNSKKSST 143
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG GS+SGF SQD V + + V Q F EA ++ + F +ARFDG++G+
Sbjct: 144 YVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVFAVARFDGVLGM 203
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V PV+D + ++ + +FSF++NRDP + GGE++ GG D ++F G Y
Sbjct: 204 AYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHY 263
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
V VT+K YWQ ++ ++ +G+ T +C+ GC AIVD+GTS++ GP V + AIG +
Sbjct: 264 VNVTRKAYWQIKMDEVQVGSTLT-LCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPL 322
Query: 316 VSAE 319
+ E
Sbjct: 323 LMGE 326
>gi|121543617|gb|ABM55520.1| putative cathepsin D [Maconellicoccus hirsutus]
Length = 391
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 208/338 (61%), Gaps = 28/338 (8%)
Query: 6 LRSVFCLWVLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERY-MGGAGVSGV 63
L +F L+ + +C + +SS L RI L + +T +ER + G +
Sbjct: 3 LLCIFVLFSIGTCHVNSVSSSEKLFRISLSRV-----------VTPRERLRLAGTEFKLL 51
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISC 122
R + PL+N++DAQY+G I IG+PPQ F+V+FDTGSSNLWVPS +C +I+C
Sbjct: 52 NARYNGTGTP-EPLRNYLDAQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSILNIAC 110
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H++Y S+ S+TY G I+YGSGS+SGF S D V +G + ++ Q F EA +E +
Sbjct: 111 LIHNKYNSKTSSTYQANGTEFAIHYGSGSLSGFLSSDTVSIGGLDIEKQTFAEAVKEPGI 170
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F+ A+FDGI+GLG++EI+VG P + NMV+QGLV + VFSF+LNR+ A +GGEI+FG
Sbjct: 171 AFIAAKFDGILGLGYKEISVGGIPPPFYNMVDQGLVKDSVFSFYLNRNTSAADGGEIIFG 230
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP F+G TYVPV+ KGYWQF + I +G + + AI D+GTSL+AGP+
Sbjct: 231 GVDPSKFRGNFTYVPVSVKGYWQFGMEKISLGGKDIQTSQ----AIADTGTSLIAGPSED 286
Query: 303 VTEINHAI------GGEGVVSAECKLVVSQYGDLIWDL 334
+ IN AI GG+ VS E + Q D+ + +
Sbjct: 287 IAAINKAIGAVEILGGQYTVSCES---IDQLPDITFTI 321
>gi|148227998|ref|NP_001079043.1| cathepsin E-A precursor [Xenopus laevis]
gi|46395761|sp|Q805F3.1|CATEA_XENLA RecName: Full=Cathepsin E-A; Flags: Precursor
gi|28460653|dbj|BAC57453.1| cathepsin E1 [Xenopus laevis]
gi|213625998|gb|AAI69692.1| Cathepsin E1 [Xenopus laevis]
gi|213627772|gb|AAI69694.1| Cathepsin E1 [Xenopus laevis]
Length = 397
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 193/313 (61%), Gaps = 22/313 (7%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGV 60
+R + L + A+ + GL R+ LK+++ + R T KE+ G+
Sbjct: 1 MRQILVLLLFATLVY------GLIRVPLKRQK-------SIRKTLKEKGKLSHIWTQQGI 47
Query: 61 SGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
V++ S++ PL N+MD +YFGEI +G+PPQNF+VIFDTGSSNLWVPS C
Sbjct: 48 DMVQYTDSCSNDQAPSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-I 106
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H R++ + S+TY G + + YG+GS+SG D V V ++V++Q F E+
Sbjct: 107 SQACAQHDRFQPQLSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVS 166
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E TF+ A FDGI+GLG+ IAVGD PV+DNM+ Q LV +FS +++R+P++ GGE
Sbjct: 167 EPGSTFVDAEFDGILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSVYMSRNPNSAVGGE 226
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+VFGG D F G+ +VPVT +GYWQ +L ++ I N C GGC AIVD+GTSL+ G
Sbjct: 227 LVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNVQI-NGEVLFCSGGCQAIVDTGTSLITG 285
Query: 299 PTPVVTEINHAIG 311
P+ + ++ + IG
Sbjct: 286 PSSDIVQLQNIIG 298
>gi|440898030|gb|ELR49612.1| Napsin-A, partial [Bos grunniens mutus]
Length = 406
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 124/287 (43%), Positives = 177/287 (61%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
L RI L++ + +LN R K E A G + +PL ++M+ QY
Sbjct: 26 LIRIPLRRVNIGFKALNPPRGWEKLAEPPRLAAPSPG-------NKSLFVPLSDYMNVQY 78
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCE 144
+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C +FS+ C+ H R+ + S+++ G
Sbjct: 79 YGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKASSSFRSNGTKFA 138
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
I YG+G ++G S+D + +G + F EA E SL F A FDGI+GLGF +AVG
Sbjct: 139 IQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGILGLGFPVLAVGG 198
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
P D +V++GL+ + VFSF+LNR+P+A +GGE+V GG DP H+ T+VPVT +W
Sbjct: 199 VRPPLDRLVDRGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFW 258
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q + + +G T +C GCAAI+D+GTSL+ GPT + + AIG
Sbjct: 259 QIHMERVQVGTGLT-LCARGCAAILDTGTSLITGPTEEIRALQKAIG 304
>gi|406861956|gb|EKD15008.1| aspartic endopeptidase Pep2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 401
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLG- 68
++ + LL ++S G+ ++ LKK +L +++A ++YMG S
Sbjct: 5 LVTAATLLSSASAGIHKLPLKKVSLSEQLATANIDAHVKNLGQKYMGIRPQSHADEMFKE 64
Query: 69 -----DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
D + +P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TSVHEDGSDHTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G + EI YGSGS+SGF S+D + +GD+ +K+Q+F EAT+E L
Sbjct: 124 LHTKYDSSSSSTYKKNGTAFEIRYGSGSLSGFTSEDTMSIGDLKIKNQIFAEATQEPGLA 183
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V P + NMV Q L+ E VF+F+L + D E+ E +FG
Sbjct: 184 FAFGRFDGILGLGYDTISVNKIPPPFYNMVNQELLDEPVFAFYLGSTDKGEEDQSEAIFG 243
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ HF GK T +P+ +K YW+ +L I G+ + + G I+D+GTSL+A P+ +
Sbjct: 244 GVNKDHFTGKITEIPLRRKAYWEVDLDAITFGDATAELENTGV--ILDTGTSLIALPSTL 301
Query: 303 VTEINHAIGGE 313
+N +G +
Sbjct: 302 AELLNKEMGAK 312
>gi|147743007|sp|P85138.1|CARDG_CYNCA RecName: Full=Cardosin-G; Contains: RecName: Full=Cardosin-G heavy
chain; Contains: RecName: Full=Cardosin-G light chain
Length = 266
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 131/243 (53%), Positives = 153/243 (62%), Gaps = 46/243 (18%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 1 DSGSTVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSYLWVPSSKA--------HSMY 52
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY K+Q FIEAT E FL
Sbjct: 53 ESSDSSTY--------------------------------KEQDFIEATEEADNVFLNRL 80
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I +VPVW NMV QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 81 FDGILGLSFQTI----SVPVWYNMVNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 134
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 135 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 194
Query: 309 AIG 311
AIG
Sbjct: 195 AIG 197
>gi|125858582|gb|AAI29608.1| Ce1-A protein [Xenopus laevis]
Length = 394
Score = 244 bits (622), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 131/300 (43%), Positives = 187/300 (62%), Gaps = 16/300 (5%)
Query: 19 LLLPASSNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDED 73
LL GL R+ LK+++ + R T KE+ G+ V++ S++
Sbjct: 5 LLFATLVYGLIRVPLKRQK-------SIRKTLKEKGKLSHIWTQQGIDMVQYTDSCSNDQ 57
Query: 74 IL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
PL N+MD +YFGEI +G+PPQNF+VIFDTGSSNLWVPS C S +C H R++ +
Sbjct: 58 APSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-ISQACAQHDRFQPQ 116
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY G + + YG+GS+SG D V V ++V++Q F E+ E TF+ A FDG
Sbjct: 117 LSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVSEPGSTFVDAEFDG 176
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GLG+ IAVGD PV+DNM+ Q LV +FS +++R+P++ GGE+VFGG D F G
Sbjct: 177 ILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSIYMSRNPNSAVGGELVFGGFDASRFSG 236
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ +VPVT +GYWQ +L ++ I N C GGC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 237 QLNWVPVTNQGYWQIQLDNVQI-NGEVLFCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 295
>gi|157423181|gb|AAI53793.1| Cathepsin E2 [Xenopus laevis]
Length = 397
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 185/294 (62%), Gaps = 16/294 (5%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDIL--PLKN 79
GL R+ LK+++ + R T KE+ G+ V++ +++ PL N
Sbjct: 16 GLIRVPLKRQK-------SIRKTLKEKGKLSHVWTQQGIDMVQYTDSCNNDQAPSEPLIN 68
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD QYFGEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY
Sbjct: 69 YMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQLSSTYESN 127
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + + YG+GS+SG D+V V ++V++Q F E+ E TF+ A FDGI+GLG+
Sbjct: 128 GNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDASFDGILGLGYPS 187
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IAVG PV+DNM+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT
Sbjct: 188 IAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVT 247
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+GYWQ +L +I I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 248 NQGYWQIQLDNIQI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIGAS 300
>gi|16119024|gb|AAL14708.1|AF420068_1 aspartic protease [Clonorchis sinensis]
Length = 419
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 122/237 (51%), Positives = 166/237 (70%), Gaps = 11/237 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++DAQY+GEIGIG+PPQ+F V+FDTGSSNLWVPS C FSI+C+ H +Y S KS+T
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDSAKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSGS+SG S D V VG V VK+Q F EA +E + F+ A+FDGI+G+
Sbjct: 121 YMANGTEFNIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 196 GFREIAVGDAVP-VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
GF+ I+V D VP ++DNM+ QG F F L+R+ GGE++ GG DPK++KG+
Sbjct: 181 GFKTISV-DGVPTLFDNMISQG------FGFRLDRNRSDPVGGELLLGGTDPKYYKGEIL 233
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ P+T + YWQF++ + +G S +CE GC AI D+GTSL+AGP+ V ++N A+G
Sbjct: 234 WAPLTHEAYWQFKVDSMNVG--SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDALG 288
>gi|148236737|ref|NP_001079044.1| cathepsin E-B precursor [Xenopus laevis]
gi|46395760|sp|Q805F2.1|CATEB_XENLA RecName: Full=Cathepsin E-B; Flags: Precursor
gi|28460655|dbj|BAC57454.1| cathepsin E2 [Xenopus laevis]
Length = 397
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 185/292 (63%), Gaps = 16/292 (5%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDIL--PLKN 79
GL R+ LK+++ + R T KE+ G+ V++ +++ PL N
Sbjct: 16 GLIRVPLKRQK-------SIRKTPKEKGKLSHVWTQQGIDMVQYTDSCNNDQAPSEPLIN 68
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD QYFGEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY
Sbjct: 69 YMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQLSSTYESN 127
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + + YG+GS+SG D+V V ++V++Q F E+ E TF+ A FDGI+GLG+
Sbjct: 128 GNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDASFDGILGLGYPS 187
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IAVG PV+DNM+ Q LV +FS +++RDP++ GGE+VFGG D F G+ +VPVT
Sbjct: 188 IAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNWVPVT 247
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+GYWQ +L +I I N C GGC AIVD+GTS++ GP+ + ++ IG
Sbjct: 248 NQGYWQIQLDNIQI-NGEVVFCSGGCQAIVDTGTSMITGPSSDIVQLQSIIG 298
>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
Length = 392
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 189/318 (59%), Gaps = 21/318 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPLKNFMDA 83
LRR+ ++R L+L + G G +LG +PL N+ D
Sbjct: 3 LRRVHPERRTLNL--------------LKGWGKPAKLPKLGAPSPGDKPTFVPLSNYWDV 48
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS 142
QYFGEIG+G+PPQNF+V FDTGSSNLWVPS +C +FS+ C+FH R+ S+++ G
Sbjct: 49 QYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPSASSSFKPNGTK 108
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+G + G S+D + +G + +F EA E SL F ++R DGI+GLGF +AV
Sbjct: 109 FAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGILGLGFPILAV 168
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
P D +V+QGL+ + +FSF+LNRDP +GGE+V GG DP H+ T+VPVT
Sbjct: 169 EGVRPPLDVLVKQGLLDKPIFSFYLNRDPKVADGGELVLGGSDPAHYIPPLTFVPVTVPA 228
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKL 322
YWQ + + +G+ T +C GCAAI+D+GT ++ GPT + ++ AIGG +++ E +
Sbjct: 229 YWQIHMERVKVGSGLT-LCARGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYII 287
Query: 323 VVSQYGDL-IWDLLVSGL 339
S+ L LL++G+
Sbjct: 288 RCSEIPKLPAVSLLIAGV 305
>gi|395531206|ref|XP_003767673.1| PREDICTED: cathepsin E [Sarcophilus harrisii]
Length = 395
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/249 (48%), Positives = 162/249 (65%), Gaps = 5/249 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +Y+G I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 68 PLINYLDMEYYGVISIGSPPQNFTVIFDTGSSNLWVPSVYC-VSPACKNHNRFYPSQSST 126
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G S I YG+GS+SG D V V + V +Q F E+ E TF+ A FDGI+GL
Sbjct: 127 YVENGNSFSIQYGTGSLSGIIGMDQVSVEGITVANQQFGESVSEPGSTFVNAEFDGILGL 186
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AVG PV+DNM+ Q LV +FS ++ R+PD+ G E+VFGG D HF G +
Sbjct: 187 AYPSLAVGGVTPVFDNMIAQNLVDMPIFSVYMTRNPDSPTGSELVFGGYDHAHFTGSLNW 246
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 247 VPVTKQGYWQIALDNIQVGG-TIMFCAEGCQAIVDTGTSLITGPSDKIKQLQNAIGAVLT 305
Query: 313 EGVVSAECK 321
+G + EC
Sbjct: 306 DGEYAMECN 314
>gi|440633873|gb|ELR03792.1| vacuolar protease A [Geomyces destructans 20631-21]
Length = 395
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 190/321 (59%), Gaps = 19/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+ + +LL ++S G+ ++ L+K +L+ ++ ++YMG + V +
Sbjct: 5 LFTAAMLLGSASAGVHKMKLQKIPLAEQLEFANVETHVRNLGQKYMGIRPQTHVDAVFQE 64
Query: 70 SDE-----DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S ++P+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+CY
Sbjct: 65 SSSIKQGGHLVPVSNFLNAQYFSEITIGNPPQTFKVVLDTGSSNLWVPSQSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y S +S TY + G I YGSGS+SG+ SQD V +GD+V+KDQ+F EA E L F
Sbjct: 124 HSKYDSSESKTYEKNGTEFAIQYGSGSVSGYISQDQVTIGDLVIKDQLFGEAVEEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLGF I+V VP + +M++QGL+ E+VFSF+L D E VFGG+
Sbjct: 184 AFGRFDGILGLGFDTISVNKVVPPFYSMIDQGLLDEKVFSFYLADDKSQSEA---VFGGI 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H+ G TY+P+ +K YW+ + I G+ + G I+D+GTSL P+ +
Sbjct: 241 DKSHYTGDLTYIPLRRKAYWEVDFDAISFGDVKADLDNTGV--ILDTGTSLNTLPSSLAE 298
Query: 305 EINHAIGGE----GVVSAECK 321
+N IG + G + +CK
Sbjct: 299 LLNKEIGAKKGYNGQYTIDCK 319
>gi|195382956|ref|XP_002050194.1| GJ22010 [Drosophila virilis]
gi|194144991|gb|EDW61387.1| GJ22010 [Drosophila virilis]
Length = 394
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 118/253 (46%), Positives = 159/253 (62%), Gaps = 6/253 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N++DAQYFG I IG+PPQ F+VIFDTGS+NLWVPS C+ ++C HSRY SR S
Sbjct: 65 VPLSNYLDAQYFGPISIGTPPQKFNVIFDTGSANLWVPSESCHQKLACQIHSRYNSRHSR 124
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y GK +I YGSGS++G+ SQD V V + + +Q F EAT FL A+FDGI G
Sbjct: 125 SYKSDGKQFDIQYGSGSLAGYLSQDTVRVAGLEITNQTFAEATEMPGPIFLAAKFDGIFG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L +R I++ + P + ++EQ L+ VFS +LNR + +GG + FGG P++++G T
Sbjct: 185 LAYRGISIQNIKPPFYAVMEQNLLKRPVFSVYLNRIASSRQGGYLFFGGSSPRYYRGNFT 244
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPVT + YWQ +L IG +C GC I+D+GTS LA P IN +IGG
Sbjct: 245 YVPVTHRAYWQVKLEAARIG--PLQLCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTP 302
Query: 314 ---GVVSAECKLV 323
G S C+ V
Sbjct: 303 AAYGQFSVPCEQV 315
>gi|296230510|ref|XP_002760737.1| PREDICTED: renin isoform 1 [Callithrix jacchus]
gi|50401196|sp|Q9TSZ1.1|RENI_CALJA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|6687184|emb|CAB64879.1| preprorenin [Callithrix jacchus]
Length = 400
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 123/301 (40%), Positives = 187/301 (62%), Gaps = 14/301 (4%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL 75
SC LP + +RI LK+ + + R + KER + A + R L + ++
Sbjct: 19 SCTFGLPTETTTFKRISLKR-------MPSIRESLKERGVDMARLGPERMALVNITSSVI 71
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSN 134
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S+
Sbjct: 72 -LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSS 130
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G + Y +G++SGF SQD + VG + V Q F E T +L F+LA FDG++G
Sbjct: 131 SYKHNGTELTLRYSTGTVSGFLSQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVG 189
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGK 252
+GF E A+G P++DN++ QGL+ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 190 MGFSEQAIGKVTPLFDNIISQGLLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGN 249
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+ + + G WQ + + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G
Sbjct: 250 FHYINLIRTGLWQIPMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 308
Query: 313 E 313
+
Sbjct: 309 K 309
>gi|296230582|ref|XP_002760770.1| PREDICTED: cathepsin E isoform 1 [Callithrix jacchus]
Length = 396
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L DI +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDDIQVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|398396710|ref|XP_003851813.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
gi|339471693|gb|EGP86789.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
Length = 398
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 131/322 (40%), Positives = 193/322 (59%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+LAS L+ +S G+ ++ L+K +L+ +S+ ++YMG +
Sbjct: 6 LLASALVAGTASAGVHKMKLQKVPLSEQLEGYSIEEQVQHLGQKYMGIRPQGRINEMF-- 63
Query: 70 SDEDILPLK-------NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
++ P K NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C
Sbjct: 64 KEQSYKPNKGHPVGVSNFLNAQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSKDC-GSIAC 122
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y HS+Y SNTY + G I YGSGS+ G+ SQD V++GD+ +K+Q+F EAT E L
Sbjct: 123 YLHSKYNHGDSNTYKQNGSDFAIQYGSGSLEGYISQDTVQIGDLKIKNQLFAEATSEPGL 182
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V P + NM++QGL+ E+VF+F+L+ +E E +FG
Sbjct: 183 AFAFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLSSTDKGDE-SEAIFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GV+ H+ GK T +P+ +K YW+ + I +G+Q+ + G AI+D+GTSL+A P+ +
Sbjct: 242 GVNKDHYTGKMTNIPLRRKAYWEVDFDAITLGDQTAELDSTG--AILDTGTSLIALPSTM 299
Query: 303 VTEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 300 AELLNKEIGAKKGYNGQYSVEC 321
>gi|194756946|ref|XP_001960731.1| GF13504 [Drosophila ananassae]
gi|190622029|gb|EDV37553.1| GF13504 [Drosophila ananassae]
Length = 402
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 192/329 (58%), Gaps = 19/329 (5%)
Query: 6 LRSVFCLWVLASCLL--LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV 63
L V C W L S L P++ + ++G+ R+D L + + +ER G S
Sbjct: 13 LLPVTCNWELYSVPLRRFPSARHRFEKLGI---RMDRLRLKYSSESSEER-----GNSRT 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISC 122
+ + + L N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+ C + ++C
Sbjct: 65 KWNVKSTT-----LSNYLDAQYFGPITIGTPPQTFQVIFDTGSSNLWVPSATCSSTMVAC 119
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
HSRY +R+S +Y IG I+YGSGS++GF S D V V + ++DQVF EAT
Sbjct: 120 RVHSRYYARRSRSYRPIGDHFVIHYGSGSLAGFLSTDTVRVAGLEIEDQVFAEATNMPGP 179
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVF 241
FL A+FDGI GL +R I++ P + M+EQGL+ VFS +LNR + EEGG + F
Sbjct: 180 IFLAAKFDGIFGLAYRSISMQRIKPPFYAMIEQGLLPRAVFSVYLNRHLGNQEEGGVLFF 239
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG +P++++G TYVPV+++ YWQ ++ I + +C+ GC I+D+GTS LA P
Sbjct: 240 GGSNPEYYRGNFTYVPVSRRAYWQVKMDAATI--RKLELCQNGCEVIIDTGTSFLALPYD 297
Query: 302 VVTEINHAIGGEGVVSAECKLVVSQYGDL 330
IN +IGG + + Q DL
Sbjct: 298 QAILINKSIGGRPSAYGQFSVPCDQVSDL 326
>gi|301786118|ref|XP_002928474.1| PREDICTED: cathepsin E-like [Ailuropoda melanoleuca]
Length = 396
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 126/255 (49%), Positives = 164/255 (64%), Gaps = 8/255 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
D++E PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR+
Sbjct: 65 DANE---PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRF 120
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+SNTY+ +G I YG+GS+SG D V+V +VV Q F E+ E TF+ A
Sbjct: 121 YPSQSNTYSVLGSHFSIQYGTGSLSGIIGADQVDVEGLVVVGQQFGESVTEPGQTFVNAE 180
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D H
Sbjct: 181 FDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSH 240
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ V ++
Sbjct: 241 FSGNLHWVPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITGPSDKVKQLQK 299
Query: 309 AIGGE---GVVSAEC 320
AIG E G EC
Sbjct: 300 AIGAEPMDGEYGVEC 314
>gi|210109642|gb|ACJ07131.1| cathepsin D-like protein, partial [Homarus gammarus]
Length = 231
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 122/232 (52%), Positives = 157/232 (67%), Gaps = 5/232 (2%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKS 142
QY+G I IG+P Q F VIFDTGSSNLW+PS KC+ +++C H+RY S KS+TY E G +
Sbjct: 1 QYYGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLACRLHNRYDSTKSSTYIENGTA 60
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
+I YGSG++ GF S DNVE+G V Q F EAT+E L F++ +FDGI+G+ F EI+V
Sbjct: 61 FDIQYGSGALHGFLSSDNVEMGGVNAMGQTFAEATQEPGLAFIMGKFDGILGMAFTEISV 120
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEE--GGEIVFGGVDPKHFKGKHTYVPVT 259
V+D MV QG V + +FSF+LN D D E GGE+V GG DP H++G+ YVPV+
Sbjct: 121 MGIPTVFDTMVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVS 180
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
K GYWQ I +G+ TG C C AIVD+GTSL+AGP V EI H +G
Sbjct: 181 KVGYWQVTAEAIKVGDNVTGFCN-PCEAIVDTGTSLIAGPNAEVQEIVHMLG 231
>gi|384490965|gb|EIE82161.1| hypothetical protein RO3G_06866 [Rhizopus delemar RA 99-880]
Length = 403
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 163/258 (63%), Gaps = 5/258 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+GEI IG+P Q F+VIFDTGSSNLWVPS+ C S +C H RY S KS
Sbjct: 78 VPLSNYMNAQYYGEIQIGTPAQTFTVIFDTGSSNLWVPSTHC-MSFACLMHRRYSSSKST 136
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + I YGSGS+ G SQD + VG + ++DQ F E+T E LTF +ARFDGI G
Sbjct: 137 TYRKNETDFVIRYGSGSLQGINSQDTLRVGGIEIRDQGFAESTVEPGLTFAMARFDGIFG 196
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGK 252
LG+ I+V VP + NM+ + L+ +E+FSFWL+ D GGE+ FGG+D F G
Sbjct: 197 LGYDTISVQQTVPPFYNMINKKLIDQEIFSFWLSDTNDGNNNLGGELAFGGIDEARFSGN 256
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+ PVT+KGYW+ EL + +Q + G A +D+GTSLL PT V +N+ IGG
Sbjct: 257 ITWSPVTRKGYWEIELQNTKFNDQPMNM--GSIGAAIDTGTSLLIAPTAVAEFVNNQIGG 314
Query: 313 EGVVSAECKLVVSQYGDL 330
+ + + S G+L
Sbjct: 315 QADAYGQYTVDCSSVGNL 332
>gi|147743015|sp|P85139.1|CARDH_CYNCA RecName: Full=Cardosin-H; Contains: RecName: Full=Cardosin-H heavy
chain; Contains: RecName: Full=Cardosin-H light chain
Length = 265
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 130/243 (53%), Positives = 153/243 (62%), Gaps = 46/243 (18%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
DS ++ L N D YFGEIGIG+PPQ F+VIFDTGSS LWVPSSK HS Y
Sbjct: 1 DSGSAVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSMY 52
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S S+TY K+Q FIEAT E FL
Sbjct: 53 ESSGSSTY--------------------------------KEQDFIEATDETDNVFLHRL 80
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GL F+ I +VPVW NM+ QGLV FSFWLNR+ D EEGGE+VFGG+DP H
Sbjct: 81 FDGILGLSFQTI----SVPVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNH 134
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F+G HTYVPVT + YWQF +GD+LIG++STG C GC A DSGTSLL+GPT +VT+INH
Sbjct: 135 FRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINH 194
Query: 309 AIG 311
AIG
Sbjct: 195 AIG 197
>gi|426198518|gb|EKV48444.1| hypothetical protein AGABI2DRAFT_192052 [Agaricus bisporus var.
bisporus H97]
Length = 413
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 132/288 (45%), Positives = 176/288 (61%), Gaps = 24/288 (8%)
Query: 57 GAGVSGVR--HRLGDSDEDIL-------------PLKNFMDAQYFGEIGIGSPPQNFSVI 101
GAG +G R H DE +L PL NFM+AQYF EI IGSPPQ F VI
Sbjct: 59 GAGGTGRRIAHPSQQDDETLLWTQEHQVQGGHGVPLSNFMNAQYFTEIQIGSPPQTFKVI 118
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPS KC SI+C+ H++Y S +S+TY G + EI YGSG++ GF SQD +
Sbjct: 119 LDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSSTYKANGSTFEIQYGSGAMEGFVSQDQL 177
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
++GD+ +K Q F EAT+E L F +FDGI+GLG+ I+V VP + M+EQ L+ E
Sbjct: 178 QIGDLTIKGQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYKMIEQNLLDER 237
Query: 222 VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC 281
VFSF L E+GGE VFGG+D +KGK YVP+ +K YW+ +L I +G + +
Sbjct: 238 VFSFRLGS--SDEDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELE 295
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
G A +D+GTSL+A P+ + +N IG + G + +C V S
Sbjct: 296 NTGAA--IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVAS 341
>gi|367031892|ref|XP_003665229.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
gi|347012500|gb|AEO59984.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
Length = 397
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 190/313 (60%), Gaps = 20/313 (6%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
++L + +LL ++ + ++ L+K L L A I + ++G + G+R R +D
Sbjct: 5 FLLTAAVLLGSAQGAVHKMKLQKIPLS-EQLEAVPINTQLEHLGQKYM-GLRPRESQADA 62
Query: 73 -------DI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
D+ +P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI
Sbjct: 63 IFKGMVADVKGNHPIPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVEC-GSI 121
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY HS+Y S S+TY + G S EI YGSGS+SGF SQD V +GD+ ++ Q F EAT E
Sbjct: 122 ACYLHSKYDSSASSTYKKNGTSFEIRYGSGSLSGFVSQDTVSIGDITIQGQDFAEATSEP 181
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F RFDGI+GLG+ I+V VP + MVEQ L+ E VF+F+L D E+V
Sbjct: 182 GLAFAFGRFDGILGLGYDRISVNGIVPPFYKMVEQKLIDEPVFAFYL---ADTNGQSEVV 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 239 FGGVDHDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGI--ILDTGTSLIALPS 296
Query: 301 PVVTEINHAIGGE 313
+ +N IG +
Sbjct: 297 QLAEMLNAQIGAK 309
>gi|126681053|gb|ABO26561.1| cathepsin D-like aspartic protease [Ixodes ricinus]
Length = 382
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 122/242 (50%), Positives = 161/242 (66%), Gaps = 6/242 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F VIFDTGS+NLW+PSSKC + C H RY S KS+T
Sbjct: 52 PLVNLLDVEYYGPISIGTPPQDFQVIFDTGSANLWLPSSKCT-TKYCLHHHRYDSSKSST 110
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I YGSG++ GF S+D +G V Q EA G + L A FDGI+GL
Sbjct: 111 YEADGRNFTIVYGSGNVEGFISKDVCRIGSAKVSGQPLGEALVVGGESLLEAPFDGILGL 170
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ IAV VPV+DNM++QGL+ E+ VFS +LNRDP ++EGGE++FGG+D H+KG T
Sbjct: 171 AYPSIAVDGVVPVFDNMMKQGLLGEQNVFSVYLNRDPSSKEGGEVLFGGIDHDHYKGSIT 230
Query: 255 YVPVTKKGYWQFELGDILIGNQSTG----VCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
YVPVT KGYWQF + + + S +C+ GC AI D+GTSL+ GP V +N +
Sbjct: 231 YVPVTAKGYWQFHVDGVKSVSASKSAPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYL 290
Query: 311 GG 312
GG
Sbjct: 291 GG 292
>gi|336273300|ref|XP_003351405.1| hypothetical protein SMAC_03712 [Sordaria macrospora k-hell]
Length = 381
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 187/310 (60%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
+L + +LL ++ G+ + LKK L L + I + ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPL-AEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+D + +P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y+S +S+TY + G S EI YGSGS+SGF SQD + +GD+ + DQ+F EAT E L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDFDAIGYGEDIADL--EGHGVILDTGTSLIALPSQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
>gi|402857430|ref|XP_003893258.1| PREDICTED: cathepsin E [Papio anubis]
Length = 396
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 165/248 (66%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G ++
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLSW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|384498765|gb|EIE89256.1| endopeptidase [Rhizopus delemar RA 99-880]
Length = 401
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 122/256 (47%), Positives = 166/256 (64%), Gaps = 8/256 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+GEI IG+PPQ F+V+FDTGSSNLWVPS+ C SI+C+ H RY S S
Sbjct: 77 VPLSNYLNAQYYGEIEIGTPPQPFTVVFDTGSSNLWVPSTHCT-SIACFLHKRYDSASSR 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY+E G I YG+GS+ GF SQD + VG + V+DQ F E+T+E LTF A+FDGI G
Sbjct: 136 TYSENGTEFAIQYGTGSLEGFISQDTLSVGGIQVEDQGFAESTKEPGLTFAFAKFDGIFG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR-DPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V +P + +MV + LV E +FSFWLN + D + GGE++FGGVD HF+G
Sbjct: 196 LGYDTISVKHTIPPFYHMVNRDLVDEPLFSFWLNDANKDQDNGGELIFGGVDEDHFEGDI 255
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ V +KGYW+ + +I G+ + G A +D+G+SLL PT V IN +G E
Sbjct: 256 HWSDVRRKGYWEITMENIKFGDDYVDIDPVGAA--IDTGSSLLVAPTTVAALINKELGAE 313
Query: 314 ----GVVSAECKLVVS 325
G +C V S
Sbjct: 314 KNWAGQYVVDCNKVPS 329
>gi|345797646|ref|XP_545694.3| PREDICTED: cathepsin E [Canis lupus familiaris]
Length = 396
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 123/255 (48%), Positives = 164/255 (64%), Gaps = 8/255 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
D++E PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+++
Sbjct: 65 DTNE---PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHAKF 120
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+SNTY+ +G I YG+GS+SG D V V +VV Q F E+ E TF+ A
Sbjct: 121 YPSQSNTYSALGNQFSIQYGTGSLSGIIGADQVNVEGLVVVGQQFGESVTEPGQTFVNAE 180
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D H
Sbjct: 181 FDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGTGSELIFGGYDHSH 240
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ +
Sbjct: 241 FSGNLNWVPVTKQGYWQIALDAIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDEIKQLQN 299
Query: 309 AIGGE---GVVSAEC 320
AIG E G EC
Sbjct: 300 AIGAEPMDGEYGVEC 314
>gi|380092926|emb|CCC09679.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 410
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 187/310 (60%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
+L + +LL ++ G+ + LKK L L + I + ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPL-AEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+D + +P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y+S +S+TY + G S EI YGSGS+SGF SQD + +GD+ + DQ+F EAT E L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
V+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDFDAIGYGEDIADL--EGHGVILDTGTSLIALPSQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
>gi|187608619|ref|NP_001120469.1| cathepsin E precursor [Xenopus (Silurana) tropicalis]
gi|170284872|gb|AAI61297.1| LOC100145572 protein [Xenopus (Silurana) tropicalis]
Length = 397
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 125/285 (43%), Positives = 175/285 (61%), Gaps = 2/285 (0%)
Query: 27 GLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYF 86
GL R+ LK+++ L G + ++ PL N+MD +YF
Sbjct: 16 GLIRVPLKRQKSIRKKLKEKGKLSHVWTQQGIDMIQYTDSCSNNQAPSEPLINYMDVEYF 75
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
GEI IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++ + S+TY G + +
Sbjct: 76 GEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQFSSTYQSNGNNFSLQ 134
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+SG D+V V ++V+ Q F E+ E TF+ A FDGI+GLG+ IAVGD
Sbjct: 135 YGTGSLSGIIGTDSVSVEGILVQSQQFGESVSEPGSTFVDAEFDGILGLGYPSIAVGDCT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ Q LV +FS +++R+P++ GGE+VFGG D F G+ +V VT +GYWQ
Sbjct: 195 PVFDNMMTQNLVELPMFSVYMSRNPNSPVGGELVFGGFDASRFSGQLNWVSVTNQGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+L +I I N C GGC AIVD+GTSL+ GP+ + ++ IG
Sbjct: 255 QLDNIQI-NGEVVFCTGGCQAIVDTGTSLITGPSSDIVQLQSIIG 298
>gi|344276734|ref|XP_003410162.1| PREDICTED: LOW QUALITY PROTEIN: renin-like [Loxodonta africana]
Length = 409
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 193/319 (60%), Gaps = 21/319 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS------ 61
+ LW SC LPA S RRI LKK + + R + KER + A +S
Sbjct: 13 LLVLW--GSCTFGLPADSGTFRRIFLKK-------MPSVRESLKERGVDVAKLSTEWSQF 63
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
R LG+ ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPSSKC
Sbjct: 64 SKRVSLGNGTSPMI-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSSKCSPLYT 122
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H+RY S +S++Y E INYGSG + GF SQD V +G + V Q F E T
Sbjct: 123 ACETHNRYDSSESSSYVENKMEFTINYGSGKVKGFLSQDVVTMGGITVT-QTFGEVTELP 181
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+LA+FDGI+G+GF AV PV+DN++ QG++ E+VFS + +R+ GGEIV
Sbjct: 182 VIPFMLAKFDGILGMGFPAQAVSGVTPVFDNIISQGVLKEDVFSVYYSRNSHL-LGGEIV 240
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP++++G YV ++K G WQ ++ + + +T CE GCAA+VD+G S + GPT
Sbjct: 241 LGGSDPQYYQGNFHYVSLSKNGLWQIKMKGVSV-RSATLFCEEGCAAMVDTGASFITGPT 299
Query: 301 PVVTEINHAIGGEGVVSAE 319
+ + A+G + +++ E
Sbjct: 300 SSLKLLMDALGAKELITNE 318
>gi|397504905|ref|XP_003823019.1| PREDICTED: renin [Pan paniscus]
Length = 406
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGSVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|297662235|ref|XP_002809619.1| PREDICTED: renin [Pongo abelii]
Length = 406
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G YV + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYVNLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|309319873|pdb|2X0B|A Chain A, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319875|pdb|2X0B|C Chain C, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319877|pdb|2X0B|E Chain E, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319879|pdb|2X0B|G Chain G, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
Length = 383
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/302 (40%), Positives = 189/302 (62%), Gaps = 19/302 (6%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG------VRHRLGDSDEDI 74
LP + +RI LK+ + + R + KER + A + R LG++ +
Sbjct: 1 LPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLGPEWSQPMKRLTLGNTTSSV 53
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKS 133
+ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S
Sbjct: 54 I-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 112
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++
Sbjct: 113 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 171
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKG 251
G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 172 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 231
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G
Sbjct: 232 NFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 290
Query: 312 GE 313
+
Sbjct: 291 AK 292
>gi|409079719|gb|EKM80080.1| hypothetical protein AGABI1DRAFT_113304 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 413
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 175/288 (60%), Gaps = 24/288 (8%)
Query: 57 GAGVSGVR--HRLGDSDEDIL-------------PLKNFMDAQYFGEIGIGSPPQNFSVI 101
GAG +G R H DE +L PL NFM+AQYF EI IGSPPQ F VI
Sbjct: 59 GAGGTGRRIAHPSQQDDETLLWTQEHQVQGGHGVPLSNFMNAQYFTEIQIGSPPQTFKVI 118
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPS KC SI+C+ H++Y S +S+TY G + EI YGSG++ GF SQD +
Sbjct: 119 LDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSSTYKANGSTFEIQYGSGAMEGFVSQDQL 177
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
++GD+ + Q F EAT+E L F +FDGI+GLG+ I+V VP + M+EQ L+ E
Sbjct: 178 QIGDLTINGQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYKMIEQNLLDER 237
Query: 222 VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC 281
VFSF L E+GGE VFGG+D +KGK YVP+ +K YW+ +L I +G + +
Sbjct: 238 VFSFRLGS--SDEDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELE 295
Query: 282 EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
G A +D+GTSL+A P+ + +N IG + G + +C V S
Sbjct: 296 NTGAA--IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVAS 341
>gi|281207795|gb|EFA81975.1| cathepsin D [Polysphondylium pallidum PN500]
Length = 390
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 197/325 (60%), Gaps = 28/325 (8%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L F V++ +P S N R +++ ++ A R+ +G +G +
Sbjct: 7 LAVFFAFIVVSQAFTVPLSFNKASRQAIRRIPQNIQKKFAGRL------LGASGTT---- 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCYF 124
+P+ ++ DAQY+G I IG+P Q+F V+FDTGSSNLW+PS KC ++ +C
Sbjct: 57 ---------IPISDYEDAQYYGAITIGTPAQSFKVVFDTGSSNLWIPSKKCPVTVVACDL 107
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y S KS++Y G S I YGSG++SGF SQD V+VG + V++Q+F EAT E + F
Sbjct: 108 HSKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVQVGSLTVQNQLFAEATAEPGIAF 167
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
LA+FDGI+GL F+ I+V PV+ NM+ QGLV + VF+FWL++ P A GGE+ FG +
Sbjct: 168 DLAKFDGILGLAFQSISVNSIPPVFYNMMAQGLVQQPVFAFWLSKVPGA-NGGELTFGSI 226
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVV 303
D + G TYVP+T + YW+F++ D + S G C GC AI DSGTSL+AGP+ +
Sbjct: 227 DTTRYTGPITYVPLTNETYWEFKMDDFALNGNSLGYCGADGCHAICDSGTSLIAGPSAQI 286
Query: 304 TEINHAIG-----GEGVVSAECKLV 323
+N +G GEG+ ++ C ++
Sbjct: 287 NALNTKLGAVVMNGEGIFTS-CSVI 310
>gi|407924694|gb|EKG17726.1| Peptidase A1 [Macrophomina phaseolina MS6]
Length = 378
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 120/244 (49%), Positives = 165/244 (67%), Gaps = 7/244 (2%)
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
E +P+ NF++AQYF E+ +G+PPQ F VI DTGSSNLWVPSS+C SI+CY H++Y S
Sbjct: 52 EHPVPVTNFLNAQYFSEVSLGTPPQTFKVILDTGSSNLWVPSSECG-SIACYLHTKYDSS 110
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
S+TY++ G + EI YGSGS+SGF S D +GD+ VKDQ F EAT E L F RFDG
Sbjct: 111 ASSTYSKNGSTFEIRYGSGSLSGFVSNDVFTIGDLTVKDQDFAEATSEPGLAFAFGRFDG 170
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV--FGGVDPKHF 249
I+GLG+ I+V VP + NM++QGL+ E VF+F+L+ D EG E V FGG+D H+
Sbjct: 171 ILGLGYDTISVNHIVPPFYNMIDQGLLDEPVFAFYLSDTND--EGSESVATFGGIDESHY 228
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
GK T +P+ +K YW+ +L I G+ + + G AI+D+GTSL+A P+ + +N
Sbjct: 229 TGKLTKIPLRRKAYWEVDLDSITFGDATAELDNTG--AILDTGTSLIALPSTLAELLNKE 286
Query: 310 IGGE 313
IG +
Sbjct: 287 IGAK 290
>gi|4506475|ref|NP_000528.1| renin preproprotein [Homo sapiens]
gi|57114109|ref|NP_001009122.1| renin precursor [Pan troglodytes]
gi|132326|sp|P00797.1|RENI_HUMAN RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|38503275|sp|P60016.1|RENI_PANTR RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|11118368|gb|AAG30305.1|AF193456_1 renin [Pan troglodytes]
gi|190994|gb|AAA60363.1| renin [Homo sapiens]
gi|337340|gb|AAD03461.1| renin [Homo sapiens]
gi|29126911|gb|AAH47752.1| Renin [Homo sapiens]
gi|49168484|emb|CAG38737.1| REN [Homo sapiens]
gi|54311156|gb|AAH33474.1| Renin [Homo sapiens]
gi|166706825|gb|ABY87560.1| renin [Homo sapiens]
gi|208967276|dbj|BAG73652.1| renin [synthetic construct]
gi|312153236|gb|ADQ33130.1| renin [synthetic construct]
Length = 406
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENS 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|85094599|ref|XP_959917.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|59802879|sp|Q01294.2|CARP_NEUCR RecName: Full=Vacuolar protease A; Flags: Precursor
gi|28921374|gb|EAA30681.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|40804614|emb|CAF05874.1| aspartic proteinase, pepstatin-sensitive [Neurospora crassa]
gi|336467530|gb|EGO55694.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2508]
gi|350287820|gb|EGZ69056.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2509]
Length = 396
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 187/309 (60%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +LL ++ G+ + LKK +L+ ++ ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPLAEQLESVPIDVQVQHLGQKYTGLRTESHTQAMFKA 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+D + +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S +S+TY + G S +I YGSGS+SGF SQD + +GD+ + DQ+F EAT E L F
Sbjct: 124 HNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ IAV P + MVEQ LV E VFSF+L D D E E+VFGGV
Sbjct: 184 AFGRFDGILGLGYDRIAVNGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 241 NKDRYTGKITTIPLRRKAYWEVDFDAIGYGKDFAEL--EGHGVILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 MLNAQIGAK 307
>gi|109018632|ref|XP_001090284.1| PREDICTED: cathepsin E isoform 4 [Macaca mulatta]
Length = 396
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 164/248 (66%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|149707989|ref|XP_001491088.1| PREDICTED: cathepsin E [Equus caballus]
Length = 396
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SSACKTHTRFYPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSMVGSQFSIQYGTGSLSGIIGADQVSVEGLTVVGQRFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP + ++ AIG +
Sbjct: 248 VPVTKQGYWQIALDAIQVGG-TVMFCSQGCQAIVDTGTSLITGPPDKIKQLQEAIGAQPM 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
>gi|443927046|gb|ELU45582.1| endopeptidase [Rhizoctonia solani AG-1 IA]
Length = 934
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 119/255 (46%), Positives = 162/255 (63%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +GSPPQ+F V+ DTGSSNLWVP C SI+C+ H++Y S SN
Sbjct: 121 VPLHNYLNAQYYADITLGSPPQSFKVVLDTGSSNLWVPGKSCT-SIACFLHAKYDSSASN 179
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+SGF SQD + +GD+ VK Q F EAT+E L F +FDGI+G
Sbjct: 180 TYKANGTEFAIQYGSGSLSGFMSQDTLTIGDIAVKHQDFAEATKEPGLAFAFGKFDGILG 239
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F I+V AVP NM++QGL+ E +F+F + ++GGE VFGG+D H+KGK
Sbjct: 240 LAFPRISVNGAVPPVYNMIDQGLIKEPLFTFRVGS--SEQDGGEAVFGGIDESHYKGKIH 297
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV ++ YW+ EL + +G + + G A +D+GTSL+A PT + IN IG
Sbjct: 298 YVPVRRQAYWEVELSSVSLGEDTLELENTGAA--IDTGTSLIALPTDIAEMINAQIGASR 355
Query: 313 --EGVVSAECKLVVS 325
G + C V S
Sbjct: 356 SWNGQYTVPCDKVPS 370
>gi|4503145|ref|NP_001901.1| cathepsin E isoform a preproprotein [Homo sapiens]
gi|114572172|ref|XP_001163151.1| PREDICTED: cathepsin E isoform 2 [Pan troglodytes]
gi|181194|gb|AAA52130.1| cathepsin E precursor [Homo sapiens]
gi|181205|gb|AAA52300.1| cathepsin E [Homo sapiens]
gi|7339520|emb|CAB82850.1| procathepsin E [Homo sapiens]
gi|27502799|gb|AAH42537.1| Cathepsin E [Homo sapiens]
gi|61358295|gb|AAX41543.1| cathepsin E [synthetic construct]
gi|119611998|gb|EAW91592.1| cathepsin E, isoform CRA_a [Homo sapiens]
gi|158257546|dbj|BAF84746.1| unnamed protein product [Homo sapiens]
gi|325463731|gb|ADZ15636.1| cathepsin E [synthetic construct]
Length = 396
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|60816208|gb|AAX36374.1| cathepsin E [synthetic construct]
Length = 396
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|410986349|ref|XP_003999473.1| PREDICTED: cathepsin E [Felis catus]
Length = 396
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 121/248 (48%), Positives = 161/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 69 PLINYMDTEYFGSISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I YG+GS+SG D V V ++V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSALGNHFSIQYGTGSLSGIIGTDQVYVEGLLVVGQQFGESVTEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP++ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPESGVGSELIFGGYDHSHFSGTLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDVIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGAEPM 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
>gi|397504824|ref|XP_003822980.1| PREDICTED: cathepsin E [Pan paniscus]
Length = 396
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMLAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|426333516|ref|XP_004028322.1| PREDICTED: cathepsin E isoform 1 [Gorilla gorilla gorilla]
Length = 396
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGSAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|126723599|ref|NP_001075713.1| cathepsin E precursor [Oryctolagus cuniculus]
gi|1168791|sp|P43159.1|CATE_RABIT RecName: Full=Cathepsin E; Flags: Precursor
gi|402729|gb|AAC37308.1| procathepsin E [Oryctolagus cuniculus]
Length = 396
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 118/249 (47%), Positives = 161/249 (64%), Gaps = 5/249 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDT SSNLWVPS C S +C H +++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTVSSNLWVPSVYCT-SPACQMHPQFRPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YSEVGTPFSIAYGTGSLTGIIGADQVSVQGLTVVGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LVS +FS +++ +P+ G E+ FGG D HF G +
Sbjct: 188 GYPSLAAGGVTPVFDNMMAQNLVSLPMFSVYMSSNPEGGSGSELTFGGYDSSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+GYWQ L +I +G S C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 VPVTKQGYWQIALDEIQVGG-SPMFCPEGCQAIVDTGTSLITGPSDKIIQLQAAIGATPM 306
Query: 313 EGVVSAECK 321
+G + EC+
Sbjct: 307 DGEYAVECE 315
>gi|402857516|ref|XP_003893299.1| PREDICTED: renin [Papio anubis]
gi|62287423|sp|Q6DLS0.1|RENI_MACFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50346961|gb|AAT75162.1| renin [Macaca fascicularis]
Length = 406
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLALGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRDSENA 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|351710945|gb|EHB13864.1| Cathepsin E, partial [Heterocephalus glaber]
Length = 391
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 158/248 (63%), Gaps = 6/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + SNT
Sbjct: 65 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSLSNT 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 124 YSEVGNPFSIQYGTGSLTGIIGADQVSVEGLTVVGQQFGESVKEPGQTFVHAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +++ +P GGE+ FGG DP HF G +
Sbjct: 184 GYPSLAAGGVTPVFDNMMAQNLVALPLFSVYMSSNPGG-SGGELTFGGYDPSHFSGSLNW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVTK+ YWQ L IL+G+ S C GC AIVD+GTSL+ GP P + ++ A+G V
Sbjct: 243 VPVTKQAYWQIALDGILVGD-SVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGATYV 301
Query: 316 ---VSAEC 320
+ EC
Sbjct: 302 DEEYAVEC 309
>gi|1507725|gb|AAB06575.1| aspartic protease, partial [Ancylostoma caninum]
Length = 442
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 117/227 (51%), Positives = 154/227 (67%), Gaps = 5/227 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNT 135
L+N+MDAQYFG I IG+P QNF+VIFDTGSSNLWVPS K F I+C RY S S+T
Sbjct: 80 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSEKMPFHDIACMLRHRYDSGASST 139
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G+ I YG+GS+ GF S+DNV + + ++Q F EAT E LTF+ A+FDGI+G+
Sbjct: 140 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGI 199
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F EI+V PV+ +EQ V VF+ WLNR+PD+E GGEI GG+D + + T+
Sbjct: 200 TFPEISVLGVPPVFHTFIEQKKVPSPVFALWLNRNPDSELGGEITLGGMDTRRYVEPITW 259
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEG---GCAAIVDSGTSLLAGP 299
PVT++GYWQF++ D + G ++ C GC AI D+GTSL+AGP
Sbjct: 260 TPVTRRGYWQFKM-DKVQGGSTSIACPNEFSGCQAIADTGTSLIAGP 305
>gi|195997417|ref|XP_002108577.1| hypothetical protein TRIADDRAFT_19349 [Trichoplax adhaerens]
gi|190589353|gb|EDV29375.1| hypothetical protein TRIADDRAFT_19349, partial [Trichoplax
adhaerens]
Length = 370
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 119/241 (49%), Positives = 159/241 (65%), Gaps = 3/241 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N++DA+YFG I IG+PPQ+F V+FDTGSS+ WVPSS+C S +C H RY KS+TY
Sbjct: 46 LNNYLDAEYFGPITIGTPPQDFLVLFDTGSSDFWVPSSECT-SQACEMHHRYDHSKSSTY 104
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GK I YGSGS GF S D V+V + V++ F E T F A+FDGI+GLG
Sbjct: 105 RPNGKRWSIEYGSGSAEGFLSTDVVKVAGITVQNVTFGEVTNLPGPIFAAAKFDGILGLG 164
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++V ++D M++QGL+ + VFS +LNR GGE+VFGG DP ++ G +YV
Sbjct: 165 FASLSVEGVKTIFDLMLQQGLIQKPVFSVYLNRQGTQNVGGELVFGGSDPNYYTGAFSYV 224
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P++K+GYWQFEL I N+ CEGGC A++D+GTSL+ GP V +INH IG + +
Sbjct: 225 PLSKEGYWQFELDGGTIENEF--FCEGGCQAVIDTGTSLIVGPNEEVAKINHLIGADSIQ 282
Query: 317 S 317
S
Sbjct: 283 S 283
>gi|118102416|ref|XP_001235024.1| PREDICTED: cathepsin E [Gallus gallus]
Length = 397
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 123/293 (41%), Positives = 180/293 (61%), Gaps = 8/293 (2%)
Query: 26 NGLRRIGLKKRRLDLHSL-NAARITR--KERYMGGAGVSGVRHRLGDSDEDILPLKNFMD 82
NGL+R+ L + R SL + ++++ K + S G+++E PL N++D
Sbjct: 20 NGLKRVTLTRHRSLRKSLRDRGQLSQFWKAHRLDMVQYSQDCSLFGEANE---PLINYLD 76
Query: 83 AQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKS 142
+YFG+I IG+PPQNF+V+FDTGSSNLWVPS C S +C H+R++ S+TY +G
Sbjct: 77 MEYFGQISIGTPPQNFTVVFDTGSSNLWVPSIYCT-SKACTKHARFQPSHSSTYQPLGIP 135
Query: 143 CEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAV 202
I YG+GS++G D V V + V +Q F E+ E TF + FDGI+GL + +AV
Sbjct: 136 VSIQYGTGSLTGIIGSDQVTVEGMTVYNQPFAESVSEPGKTFQDSEFDGILGLAYPSLAV 195
Query: 203 GDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
PV+DNM+ Q LV +FS +++ +PD+ GGE++FGG DP F G +VPVT++G
Sbjct: 196 DGVTPVFDNMMAQDLVEMPIFSVYMSANPDSSLGGEVLFGGFDPSRFLGTLHWVPVTQQG 255
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
YWQ +L ++ +G + C GC AIVD+GTSLL GPT + E+ IG +
Sbjct: 256 YWQIQLDNVQVGG-TVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAM 307
>gi|449299914|gb|EMC95927.1| hypothetical protein BAUCODRAFT_34686 [Baudoinia compniacensis UAMH
10762]
Length = 376
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 115/249 (46%), Positives = 164/249 (65%), Gaps = 8/249 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF +I IG+PPQ+F V+ DTGSSNLWVPS C SI+CY HS+Y S+TY
Sbjct: 56 VSNFLNAQYFSDISIGTPPQDFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDHSDSSTY 114
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G +I YGSG + G+ SQD V +GD+ +K+Q+F EAT E L F RFDGI+GLG
Sbjct: 115 KKNGSDFQIRYGSGELEGYISQDTVRIGDLSIKNQLFAEATSEPGLAFAFGRFDGIMGLG 174
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ QGL+ E+VF+F+L+ D + + E FGG+D H++GK T +
Sbjct: 175 YDTISVNHIVPPFYNMINQGLIDEQVFAFYLS-DTNKGDESEATFGGIDESHYEGKMTKI 233
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ +L I G+Q+ + G AI+D+GTSL+A PT + +N IG +
Sbjct: 234 PLRRKAYWEVDLDAITFGDQTAEIDSTG--AILDTGTSLIALPTTLAELLNREIGAKKSY 291
Query: 314 -GVVSAECK 321
G + EC
Sbjct: 292 NGQYTIECN 300
>gi|426333405|ref|XP_004028268.1| PREDICTED: renin [Gorilla gorilla gorilla]
Length = 406
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 197/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWRQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENF 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|115719|sp|P00795.2|CATD_PIG RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
Length = 345
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 178/254 (70%), Gaps = 12/254 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
LKN+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S KS+T
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 66
Query: 136 YTEIGKSCEINYGSGSISGFFS-QDNVEV---------GDVVVKDQVFIEATREGSLTFL 185
Y + G + I+YGSGS+SG+ S QD V V G + V+ Q F EAT++ LTF+
Sbjct: 67 YVKNGTTFAIHYGSGSLSGYLSSQDTVSVPCNSALSGVGGIKVERQTFGEATKQPGLTFI 126
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+G+ + I+V + VPV+DN+++Q LV +++FSF+LNRDP A+ GGE++ GG+D
Sbjct: 127 AAKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKDIFSFYLNRDPGAQPGGELMLGGID 186
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
K++KG Y VT+K YWQ + + +G+ T +C+GGC AIVD+GTSL+ G V E
Sbjct: 187 SKYYKGSLDYHNVTRKAYWQIHMNQVAVGSSLT-LCKGGCEAIVDTGTSLIVGQPEEVRE 245
Query: 306 INHAIGGEGVVSAE 319
+ AIG ++ E
Sbjct: 246 LGKAIGAVPLIQGE 259
>gi|355681644|gb|AER96811.1| cathepsin E [Mustela putorius furo]
Length = 375
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 122/255 (47%), Positives = 163/255 (63%), Gaps = 8/255 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
D++E PL N++D +YFG I +GSPPQNF+VIFDTGSSNLWVPS C S +C H+R+
Sbjct: 44 DANE---PLINYLDMEYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRF 99
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S+TY+ +G I YG+GS+SG D V V +VV Q F E+ E TF+ A
Sbjct: 100 YPSQSSTYSTLGSHFSIQYGTGSLSGILGADQVNVEGLVVVGQQFGESVTEPGQTFVNAE 159
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D H
Sbjct: 160 FDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSH 219
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
F G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++
Sbjct: 220 FSGNLNWVPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITGPSDKIKQLQK 278
Query: 309 AIGGE---GVVSAEC 320
AIG E G EC
Sbjct: 279 AIGAEPMDGEYGVEC 293
>gi|403294878|ref|XP_003938389.1| PREDICTED: cathepsin E [Saimiri boliviensis boliviensis]
Length = 396
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|337347|gb|AAA60364.1| renin [Homo sapiens]
Length = 403
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 191/305 (62%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG------VRHRLGD 69
SC LP + +RI LK+ + + R + KER + A + R LG+
Sbjct: 19 SCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMASLGPEWSQPMKRLTLGN 71
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
+ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H +
Sbjct: 72 TTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLF 130
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+ S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA+
Sbjct: 131 DASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAQ 189
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR+ + GG+IV GG DP+H
Sbjct: 190 FDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRNSQS-LGGQIVLGGSDPQH 248
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++
Sbjct: 249 YEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSCIEKLME 307
Query: 309 AIGGE 313
A+G +
Sbjct: 308 ALGAK 312
>gi|346973691|gb|EGY17143.1| vacuolar protease A [Verticillium dahliae VdLs.17]
Length = 398
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 163/250 (65%), Gaps = 7/250 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SI+CY H++Y S S+
Sbjct: 74 VPVSNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQQCS-SIACYLHTKYDSSDSS 132
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS++GF SQD V +GD+ +K+Q F EAT E L F RFDGI+G
Sbjct: 133 TYKANGSEFEIHYGSGSLTGFVSQDTVTIGDIKIKNQDFAEATSEPGLAFAFGRFDGILG 192
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MV Q V E VF+F+L + + E+VFGGVD H++GK T
Sbjct: 193 LGYDTISVNKIVPPFYQMVNQKAVDEPVFAFYLGDTNEQGDESEVVFGGVDESHYEGKIT 252
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I +G+ + + G AI+D+GTSL P+ + +N+ IG +
Sbjct: 253 TIPLRRKAYWEVDLDSISLGDNTAEL--DGHGAILDTGTSLNVLPSTLADMLNNEIGAKK 310
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 311 GYNGQWSVEC 320
>gi|354478111|ref|XP_003501259.1| PREDICTED: cathepsin E-like isoform 1 [Cricetulus griseus]
Length = 396
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 69 PLINYLDVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YEEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E+ FGG DP HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L + +G+ + C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 IPVTKQGYWQIALDGVQVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGATPM 306
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 307 DGEYAVDC 314
>gi|451853159|gb|EMD66453.1| hypothetical protein COCSADRAFT_34972 [Cochliobolus sativus ND90Pr]
Length = 399
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/250 (47%), Positives = 165/250 (66%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF +I +G+PPQ+F VI DTGSSNLWVPS++C SI+CY H++Y S S+
Sbjct: 77 VPVSNFLNAQYFSDISLGTPPQSFKVILDTGSSNLWVPSTECS-SIACYLHTKYDSSASS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGSEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ QGL+ E VF+F+L D +E E FGG+D H+ GK T
Sbjct: 196 LGYDTISVNGIVPPFYNMLNQGLLDEPVFAFYLGDTKDGKE-SEATFGGIDESHYTGKLT 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ + G AI+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVDLDAITFGKETAEMENIG--AILDTGTSLIALPSAIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 313 GFNGQYSVEC 322
>gi|195029909|ref|XP_001987814.1| GH19747 [Drosophila grimshawi]
gi|193903814|gb|EDW02681.1| GH19747 [Drosophila grimshawi]
Length = 390
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 179/298 (60%), Gaps = 19/298 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRL-------GDSDEDILPLKNFMDAQYFGEI 89
R+ LH + R R +++ G+ R RL GDS + PL N++DAQYFG I
Sbjct: 22 RVPLHRFPSVR-HRFQQF----GIRMDRLRLKYSLRTRGDSLRSV-PLSNYLDAQYFGPI 75
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGS 149
IG+PPQ F+VIFDTGS+NLWVPS C+ ++C HSRY +++S +Y G +I YGS
Sbjct: 76 SIGTPPQTFNVIFDTGSANLWVPSETCHRKLACQIHSRYNAKRSRSYKSNGSQFDIQYGS 135
Query: 150 GSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVW 209
GS++G+ SQD V + + + +Q F EAT FL A+FDGI GLG++ I++ + P +
Sbjct: 136 GSLTGYLSQDTVRMAGLELLNQTFAEATDMPGPIFLAAKFDGIFGLGYQAISIKNIKPPF 195
Query: 210 DNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELG 269
++EQ L+ VFS +LNRD + +GG + FGG ++++G TYVPVT + YWQ +L
Sbjct: 196 YAVMEQSLLERPVFSVYLNRDSTSLQGGYLFFGGSSRRYYRGNFTYVPVTHRAYWQVKLE 255
Query: 270 DILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
IG +C+ GC I+D+GTS +A P IN +IGG G S C+ V
Sbjct: 256 AAYIGKLQ--MCQKGCHVIIDTGTSFIAVPYEQAILINESIGGTPAAYGQFSVPCEQV 311
>gi|198457045|ref|XP_001360531.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
gi|198135836|gb|EAL25106.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
Length = 399
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/334 (39%), Positives = 187/334 (55%), Gaps = 36/334 (10%)
Query: 12 LWVLASCLLLP-----ASSNGLRRIGLKKR------------RLDLHSLNAARITRKERY 54
+W+L L+LP + S L R+ L++ R+D L +R+ + R
Sbjct: 1 MWLLFLSLILPPLVAPSPSTELYRVPLRRFPSARNRFVQFGIRMDRFRLKYSRVDGRSRP 60
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GG V PL N++DAQYFG I IGSPPQ F VIFDTGSSNLWVPS+
Sbjct: 61 RGGWEVRSE------------PLSNYLDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPST 108
Query: 115 KCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
C + ++C HSRY +R+S+++ G I+YGSGS++G+ S D V V + +++Q F
Sbjct: 109 SCAPTMVACMVHSRYNARQSSSHRRNGVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTF 168
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T FL A+FDGI GL ++ I++ D P + ++EQ L+S VFS +LNR +
Sbjct: 169 AEVTTMPGPIFLAAKFDGIFGLAYQSISMQDVKPPFYAIMEQKLLSNPVFSVYLNRQQEH 228
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
EGG + FGG +P++++G TYVPV+ + YWQ + I + +C+ GC I+D+GT
Sbjct: 229 PEGGALFFGGSNPRYYRGNFTYVPVSHRAYWQVRMEAATIND--LRLCQHGCEVIIDTGT 286
Query: 294 SLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
S LA P IN +IGG G S C V
Sbjct: 287 SFLALPYDQAILINESIGGTPSEYGQYSVPCDQV 320
>gi|308809631|ref|XP_003082125.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
gi|116060592|emb|CAL55928.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
Length = 505
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 115/242 (47%), Positives = 153/242 (63%), Gaps = 9/242 (3%)
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S+ C H+++ S S TY G I YGSGS+SGF SQD+V VGD+ VK Q F EAT+
Sbjct: 91 SVPCDLHAKFDSAASETYEADGTPFAIQYGSGSLSGFLSQDDVTVGDITVKGQYFAEATK 150
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE---- 234
E + FL A+FDGI+GLGF I+V PV+ NM+EQ L+ + +FSFWLNR + +
Sbjct: 151 EPGIAFLFAKFDGILGLGFDTISVDKVKPVFYNMMEQKLIDKNMFSFWLNRTSNVDGTPS 210
Query: 235 -EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG--GCAAIVDS 291
GGE+VFGG DPKHF G+HTY PVT+ GYWQ ++ D + +S GVC+G GC I D+
Sbjct: 211 VTGGELVFGGSDPKHFVGEHTYAPVTRAGYWQIKMDDFKVAGRSLGVCKGENGCQVIADT 270
Query: 292 GTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGD--LIWDLLVSGLLPEKVCQQIG 349
GTSLL GP VV +IN IG ++ EC++++ QY D + E++C IG
Sbjct: 271 GTSLLTGPADVVKKINDYIGAHSMLGEECRMLIDQYADEXXXXXXXLETYTSEQICTSIG 330
Query: 350 LC 351
C
Sbjct: 331 AC 332
>gi|116203505|ref|XP_001227563.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
gi|88175764|gb|EAQ83232.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
Length = 396
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 189/318 (59%), Gaps = 32/318 (10%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +LL ++ + ++ L+K +L+ LN ++YMG VR R
Sbjct: 5 LLTAAVLLGSAQGAVHKMKLQKVPLSEQLEAVPLNTQLEQLGQKYMG------VRPRQSH 58
Query: 70 SD------------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
++ +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS +C
Sbjct: 59 ANAVFNGMVAEVKGNHPVPISNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSVEC- 117
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
SI+CY H++Y S S+TY + G + EI YGSGS+SGF SQD + +GD+ +K Q F EAT
Sbjct: 118 GSIACYLHTKYDSSASSTYKKNGTNFEIRYGSGSLSGFVSQDTMTIGDITIKGQDFAEAT 177
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
E L F RFDGI+GLG+ I+V VP + M+EQ L+ E VF+F+L A+E G
Sbjct: 178 SEPGLAFAFGRFDGILGLGYDTISVNGIVPPFYKMLEQKLIDEPVFAFYL-----ADEKG 232
Query: 238 --EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSL 295
E+VFGGVD +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL
Sbjct: 233 QSEVVFGGVDSDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGV--ILDTGTSL 290
Query: 296 LAGPTPVVTEINHAIGGE 313
+A P+ + +N IG +
Sbjct: 291 IALPSQLAEMLNAQIGAK 308
>gi|340373429|ref|XP_003385244.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 382
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 180/313 (57%), Gaps = 19/313 (6%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L+V AS L L L R+ LH R + R + ++ D
Sbjct: 6 LFVFASLLTL----------TLAFVRVPLHRHVVPRSQTRARLLAKYPSYFSSFKVNDVP 55
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKS 130
E PL N++DA+Y+G I IG+PPQNF VIFDTGSSNLW+PSSKC +C H +Y
Sbjct: 56 E---PLTNYLDAEYYGNITIGTPPQNFLVIFDTGSSNLWIPSSKCDPKDKACQTHHQYNH 112
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+TY + I YG+G+++GF S D V + ++ V Q F EA + TF+ A+FD
Sbjct: 113 DHSSTYVKNDTKFAIQYGTGNLTGFLSVDTVTIANLTVPAQKFAEAVEQPGDTFVNAQFD 172
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+G+ + I+V +P ++N+V+Q LV++ VF F+L+RD + GGE+ GG DP H+K
Sbjct: 173 GILGMAWPSISVDGVIPFFNNLVQQSLVAQPVFGFYLDRDENGTLGGELALGGTDPSHYK 232
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
YVP++ K YWQF+L I +G T +C GC AI D+GTSLL GP+ V +I I
Sbjct: 233 APINYVPLSDKTYWQFKLDKIKVG--GTTLCSNGCQAIADTGTSLLVGPSVDVQKIMKEI 290
Query: 311 GG---EGVVSAEC 320
G +GV +C
Sbjct: 291 GAKNTDGVYMIDC 303
>gi|169600915|ref|XP_001793880.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
gi|111068923|gb|EAT90043.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
Length = 347
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 118/250 (47%), Positives = 166/250 (66%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+
Sbjct: 25 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSSSS 83
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSG +SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 84 TYKKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 143
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+EQGL+ E VF+F+L D +A++ E FGG+D H+ GK
Sbjct: 144 LGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLG-DTNAQQESEATFGGIDESHYSGKLI 202
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ + + G I+D+GTSL+A P+ + +N IG +
Sbjct: 203 KLPLRRKAYWEVDLDAITFGKETAEMDDTGV--ILDTGTSLIALPSTIAELLNKEIGAKK 260
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 261 GFNGQYTVEC 270
>gi|73535294|pdb|1TZS|A Chain A, Crystal Structure Of An Activation Intermediate Of
Cathepsin E
Length = 351
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 163/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 16 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 74
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 75 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 134
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 135 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 194
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AIG
Sbjct: 195 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 253
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 254 DGEYAVEC 261
>gi|403294825|ref|XP_003938364.1| PREDICTED: renin [Saimiri boliviensis boliviensis]
Length = 400
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 185/296 (62%), Gaps = 13/296 (4%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
LP + +RI LK+ + + R + KER + A + R L + ++ L N+
Sbjct: 24 LPTDTITFKRISLKR-------MPSIRESLKERGVDMARLGPERMALVNVTSSVI-LTNY 75
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEI 139
MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++Y
Sbjct: 76 MDTQYYGEIGIGTPPQIFKVVFDTGSSNVWVPSSKCSRLYTACAYHKLFDASDSSSYKHN 135
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G + Y +G++SGF SQD + VG + V Q F E T +L F+LA FDG++G+GF E
Sbjct: 136 GTELTLRYSTGTVSGFLSQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVGMGFIE 194
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVP 257
A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G Y+
Sbjct: 195 QAIGRVTPLFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYIN 254
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + G WQ + + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 255 LIRTGLWQIPMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 309
>gi|405117936|gb|AFR92711.1| endopeptidase [Cryptococcus neoformans var. grubii H99]
Length = 438
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/255 (48%), Positives = 163/255 (63%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF + +G+P Q F V+ DTGSSNLWVPS KC SI+C+ H++Y S +S+
Sbjct: 117 VPLSNFMNAQYFATVELGTPFQTFKVVLDTGSSNLWVPSVKCT-SIACFLHNKYDSSQSS 175
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD + +GD+VVK Q F EAT+E L F +FDGI+G
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTLSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ Q L+ E VFSF L E+GGE +FGG+D + GK
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFSFRLGS--SDEDGGEAIFGGIDDSAYSGKLA 293
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+ PT V +N IG E
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTGAA--IDTGTSLIVMPTDVAELLNKEIGAEK 351
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 352 SWNGQYTVDCNTVSS 366
>gi|74136391|ref|NP_001028088.1| renin precursor [Macaca mulatta]
gi|67461396|sp|Q6DLW5.2|RENI_MACMU RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|61699710|gb|AAT74864.2| prorenin [Macaca mulatta]
Length = 406
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 196/321 (61%), Gaps = 20/321 (6%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLALGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRDSENA 235
Query: 235 E--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSG 292
+ GG+IV GG DP+H++G Y+ + K G WQ + + +G+ ST +CE GC A+VD+G
Sbjct: 236 QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIPMKGVSVGS-STLLCEDGCLALVDTG 294
Query: 293 TSLLAGPTPVVTEINHAIGGE 313
S ++G T + ++ A+G +
Sbjct: 295 ASYISGSTSSIEKLMEALGAK 315
>gi|58258949|ref|XP_566887.1| endopeptidase [Cryptococcus neoformans var. neoformans JEC21]
gi|134107071|ref|XP_777848.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260546|gb|EAL23201.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223024|gb|AAW41068.1| endopeptidase, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 438
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 163/255 (63%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF + IG+P Q F VI DTGSSNLWVPS KC SI+C+ HS+Y S +S+
Sbjct: 117 VPLSNYMNAQYFATMEIGTPFQTFKVILDTGSSNLWVPSVKCT-SIACFLHSKYDSSQSS 175
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD V +GD+VVK Q F EAT+E L F +FDGI+G
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTVSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ Q L+ E VFSF L E+GGE +FGG+D + G+
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFSFRLGS--SDEDGGEAIFGGIDDSAYSGELQ 293
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+ PT V +N IG E
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTGAA--IDTGTSLIVMPTDVAELLNKEIGAEK 351
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 352 SWNGQYTVDCSTVSS 366
>gi|1039445|gb|AAA79878.1| vacuolar protease A [Neurospora crassa]
Length = 396
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 188/309 (60%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHSLNAARITRK--ERYMGGAGVSGVRHRLGD 69
+L + +LL ++ G+ + LKK L +L S+ + ++Y G S +
Sbjct: 5 LLTAAMLLGSAQAGVHTMKLKKVPLADELESVPIDVQVQHLGQKYTGLRTESHTQAMFKA 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+D + +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 65 TDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S +S+TY + G S +I YGSGS+SGF SQD + +GD+ + DQ+F EAT E L F
Sbjct: 124 HNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ +AV P + MVEQ LV E VFSF+L D D E E+VFGGV
Sbjct: 184 AFGRFDGILGLGYDRLAVPGITPPFYKMVEQKLVDEPVFSFYL-ADQDGES--EVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ + GK T +P+ +K YW+ + I G + G I+D+GTSL+A P+ +
Sbjct: 241 NKDRYTGKITTIPLRRKAYWEVDFDAIGYGKDFAEL--EGHGVILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 MLNAQIGAK 307
>gi|353234557|emb|CCA66581.1| probable PEP4-aspartyl protease [Piriformospora indica DSM 11827]
Length = 411
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/323 (41%), Positives = 189/323 (58%), Gaps = 33/323 (10%)
Query: 14 VLASCLLLP--ASSNGLRRIGLKK--RRLDLHSLNAARITRKERYMGG----AGVSGVRH 65
VL+S LL P +++G+ R+ L K R + AA + K GG AGV G+
Sbjct: 7 VLSSLLLAPFVHAADGVHRMKLNKMPRTAPGSAEEAALLAHK---YGGQVPLAGVGGLGR 63
Query: 66 RLGDS----------DEDIL-------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSN 108
+L + +DI+ PL N+M+AQY+ +I IG+PPQ F V+ DTGSSN
Sbjct: 64 KLANPPTAGDDQMFWTQDIVANGGHGVPLNNYMNAQYYADITIGTPPQTFKVVLDTGSSN 123
Query: 109 LWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
LWVPS+ C SI+C+ H++Y S S+TY G I YGSGS+ GF SQD + +GD+ +
Sbjct: 124 LWVPSTSCT-SIACFLHTKYDSSASSTYKANGTEFAIRYGSGSLEGFVSQDTMTLGDLTI 182
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
K Q F EAT+E L F +FDGI+GL + I+V P + N ++QGL+ E+VF+F +
Sbjct: 183 KKQDFAEATKEPGLAFAFGKFDGILGLAYDTISVNHITPPFYNAIDQGLLKEKVFTFRVG 242
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
+GGE VFGG+D H+ GK TYVPV +KGYW+ EL + G+ + G A
Sbjct: 243 A--SEADGGEAVFGGIDSSHYTGKITYVPVRRKGYWEVELESVAFGDDELELENTGAA-- 298
Query: 289 VDSGTSLLAGPTPVVTEINHAIG 311
+D+GTSL+ PT + +N IG
Sbjct: 299 IDTGTSLIVMPTTIAEMLNSEIG 321
>gi|453084572|gb|EMF12616.1| aspartyl proteinase [Mycosphaerella populorum SO2202]
Length = 396
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 189/313 (60%), Gaps = 21/313 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHSLNAARITRK--ERYMGGAGVSGVRHRLGD 69
L + L+ + G+ ++ L+K L L +N ++ ++YMG + RL +
Sbjct: 4 ALMTSALVAGAQAGVHKMKLQKIPLSEQLEGMNIESQVQRLGQKYMG----IRAQGRLDE 59
Query: 70 --SDEDILP-------LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
+ + P + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI
Sbjct: 60 MFKETSVAPEAGHPVAVSNFLNAQYFSEIAVGTPPQEFKVVLDTGSSNLWVPSSEC-GSI 118
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+CY HS+Y SNTY + G I YGSGS+ G+ SQD V++GD+ +KDQ+F EAT E
Sbjct: 119 ACYLHSKYNHGDSNTYKQNGSEFAIRYGSGSLEGYVSQDTVQIGDLKIKDQLFAEATSEP 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F RFDGI+GLG+ I+V P + NM++QGL+ E+VF+F+L+ +E E +
Sbjct: 179 GLAFAFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLSSTDKGDE-SEAI 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGV+ H+ G T +P+ +K YW+ +L I G Q+ + G AI+D+GTSL+A P+
Sbjct: 238 FGGVNKDHYTGDMTKIPLRRKAYWEVDLDAITFGKQTAEIDATG--AILDTGTSLIALPS 295
Query: 301 PVVTEINHAIGGE 313
+ +N IG +
Sbjct: 296 TLAELLNKEIGAK 308
>gi|396499231|ref|XP_003845423.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
gi|21914374|gb|AAM81358.1|AF522873_1 aspartyl proteinase [Leptosphaeria maculans]
gi|312222004|emb|CBY01944.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
Length = 397
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 21/308 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHR------LGDSDEDILP 76
+ ++ LKK LD L A I + ++YM G + + + D + P
Sbjct: 19 VHKMPLKKVSLD-EQLKYASIQEQVSALSQKYMSGFKPTSHMEQVFKAPYIADGTHPV-P 76
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+TY
Sbjct: 77 VSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSASSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G S EI YGSG +SGF S D ++GD+ VK+Q F EAT E L F RFDGI+GLG
Sbjct: 136 KKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM++QGL+ E VF+F+L D + ++ E FGG+D H+ GK +
Sbjct: 196 YDTISVNHIVPPFYNMLDQGLLDEPVFAFYLG-DTNEQQESEATFGGIDESHYSGKLIKL 254
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ +L I G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 PLRRKAYWEVDLDAITFGKETAEMDNTGV--ILDTGTSLIALPSTMAELLNREIGAKKGF 312
Query: 314 -GVVSAEC 320
G S EC
Sbjct: 313 NGQYSVEC 320
>gi|166235886|ref|NP_031825.2| cathepsin E preproprotein [Mus musculus]
gi|341940308|sp|P70269.2|CATE_MOUSE RecName: Full=Cathepsin E; Flags: Precursor
gi|5748654|emb|CAA08880.2| cathepsin E protein [Mus musculus]
gi|74146932|dbj|BAE25449.1| unnamed protein product [Mus musculus]
gi|74192082|dbj|BAE34257.1| unnamed protein product [Mus musculus]
gi|74219155|dbj|BAE26716.1| unnamed protein product [Mus musculus]
gi|74222421|dbj|BAE38113.1| unnamed protein product [Mus musculus]
gi|148707758|gb|EDL39705.1| cathepsin E [Mus musculus]
Length = 397
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
>gi|390477486|ref|XP_003735302.1| PREDICTED: cathepsin E isoform 2 [Callithrix jacchus]
Length = 401
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 120/253 (47%), Positives = 163/253 (64%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +SNT
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y + G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YNQPGQSFSIQYGTGSLSGIIGADQVSAFSWQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+ YWQ L DI +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQAYWQIALDDIQVGGTAM-FCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
>gi|378731872|gb|EHY58331.1| vacuolar protease A [Exophiala dermatitidis NIH/UT8656]
Length = 398
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 179/306 (58%), Gaps = 17/306 (5%)
Query: 28 LRRIGLKKRRLDLH--------SLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-LPLK 78
+ R+ L+K L+ L A R ++ +GG RH D D +P++
Sbjct: 20 MHRMKLQKVPLEQQLSAANIGDHLRALRHKYTQKTLGGPAEDIFRHTSIDIDSPHEVPVE 79
Query: 79 NFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTE 138
NF++AQYF I +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY H +Y S S+TY +
Sbjct: 80 NFLNAQYFSTIALGTPPQEFKVVLDTGSSNLWVPSSEC-GSIACYLHQKYDSSASSTYKK 138
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
G I YGSG ++GF SQD + +GD+ +KDQ+F EAT E L F RFDGI+GLG+
Sbjct: 139 NGSEFGIRYGSGEVAGFISQDILRIGDLKIKDQLFGEATSEPGLAFAFGRFDGILGLGYD 198
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
IAV P + NM++QGL+ E VF+F+L D E E FGG+D H+ GK +P+
Sbjct: 199 TIAVNHIPPPFYNMIDQGLLDEPVFAFYLGNTNDGTE-SEATFGGIDKDHYTGKMVKIPL 257
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----G 314
+K YW+ L I G ++ + G I+D+GTSL+A P+ + +N IG + G
Sbjct: 258 RRKAYWEVNLDAITFGKETADLDNTGV--ILDTGTSLIALPSTLAELLNKEIGAKKGFNG 315
Query: 315 VVSAEC 320
+ EC
Sbjct: 316 QYTVEC 321
>gi|2288908|emb|CAA71859.1| cathepsin E [Mus musculus]
Length = 397
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
>gi|354478113|ref|XP_003501260.1| PREDICTED: cathepsin E-like isoform 2 [Cricetulus griseus]
Length = 363
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 69 PLINYLDVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 128 YEEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E+ FGG DP HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L + +G+ + C GC AIVD+GTSL+ GP+ + ++ AIG
Sbjct: 248 IPVTKQGYWQIALDGVQVGD-TVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIG 302
>gi|429860373|gb|ELA35113.1| vacuolar protease a [Colletotrichum gloeosporioides Nara gc5]
Length = 399
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 194/321 (60%), Gaps = 18/321 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGG-----AGVSGV 63
+L + +LL A+ + ++ LKK + LNA I + ++YMG A
Sbjct: 5 LLTAAVLLGAAQADVHKLKLKKVPIS-EQLNAVPIEHQVRSLGQKYMGARPQNHADAMFN 63
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + + E +P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY
Sbjct: 64 QKPIKSNGEHPVPVSNFMNAQYFSEISIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS+Y S S+TY G EI+YGSGS++GF SQD+V +GD+ +K Q F EAT E L
Sbjct: 123 LHSKYDSSSSSTYKSNGSEFEIHYGSGSLTGFVSQDDVSIGDIKIKKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V VP + MV Q + E VF+F+L D + E VFGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNKIVPPFYQMVNQKAIDEPVFAFYLGDTNDEGDESEAVFGG 242
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD H++GK TY+P+ +K YW+ +L I +G+++ + G AI+D+GTSL P+ +
Sbjct: 243 VDDSHYEGKITYIPLRRKAYWEVDLDAITLGDETADL--EGHGAILDTGTSLNVLPSALA 300
Query: 304 TEINHAIGGE----GVVSAEC 320
+N IG + G S EC
Sbjct: 301 ELLNKEIGAKKGFNGQYSVEC 321
>gi|37790800|gb|AAR03502.1| renin [Homo sapiens]
gi|119611911|gb|EAW91505.1| renin [Homo sapiens]
Length = 403
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 196/319 (61%), Gaps = 19/319 (5%)
Query: 3 QKLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+++ R L + SC LP + +RI LK+ + + R + KER + A +
Sbjct: 5 RRMPRWGLLLLLWGSCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLG 57
Query: 62 G------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
R LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSK
Sbjct: 58 PEWSQPMKRLTLGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSK 116
Query: 116 C-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C +C +H + + S++Y G + Y +G++SGF SQD + VG + V Q+F
Sbjct: 117 CSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFG 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
E T +L F+LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR+ +
Sbjct: 176 EVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRNSQS- 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
GG+IV GG DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S
Sbjct: 235 LGGQIVLGGSDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGAS 293
Query: 295 LLAGPTPVVTEINHAIGGE 313
++G T + ++ A+G +
Sbjct: 294 YISGSTSSIEKLMEALGAK 312
>gi|332247693|ref|XP_003272996.1| PREDICTED: cathepsin E [Nomascus leucogenys]
Length = 396
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 117/248 (47%), Positives = 162/248 (65%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 248 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNTIGAAPV 306
Query: 313 EGVVSAEC 320
+G + EC
Sbjct: 307 DGEYAVEC 314
>gi|325087547|gb|EGC40857.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H88]
Length = 398
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 187/301 (62%), Gaps = 12/301 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG V+ GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNVQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV++QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP + M+ + L+ E +FSF+L ++ E+VFGG++ F G+ T +P+ +K Y
Sbjct: 203 KIVPPFYEMLNKDLLDEPMFSFYLGDANIDDDQSEVVFGGMNKDRFTGELTKIPLRRKAY 262
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAE 319
W+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G + E
Sbjct: 263 WEVDLDSITFGKQTAMMTNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVE 320
Query: 320 C 320
C
Sbjct: 321 C 321
>gi|403414885|emb|CCM01585.1| predicted protein [Fibroporia radiculosa]
Length = 414
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 149/382 (39%), Positives = 200/382 (52%), Gaps = 59/382 (15%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKERY----------MGGAGVSGVRHRLGDSD- 71
A++NG+ ++ L+K L + E+Y GG G + V R D
Sbjct: 15 AAANGVHKLKLQKLPQSLGNPTLETAYLAEKYGGQAQMPLVGAGGLGRNMVLARPVHEDG 74
Query: 72 EDIL--------------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
ED+L PL NFM+AQYF EI +G+P Q+F VI DTGSSNLWVPSSKC
Sbjct: 75 EDLLWTQEEILVNGGHNVPLSNFMNAQYFAEIQLGTPAQSFKVILDTGSSNLWVPSSKCT 134
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
SI+C+ H++Y S S TY G I YGSGS+ GF SQD +++GD+ +K Q F EAT
Sbjct: 135 -SIACFLHAKYDSSSSTTYKANGSEFSIQYGSGSMEGFVSQDLLKIGDLSIKHQDFAEAT 193
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
+E L F +FDGI+GLG+ I+V P + MV Q L+ E VF+F L E+GG
Sbjct: 194 KEPGLAFAFGKFDGILGLGYDTISVNHMTPPFYEMVAQKLIDEPVFAFRLGS--SEEDGG 251
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E VFGG+D + G YVPV +K YW+ EL + +G+ + G A +D+GTSL+A
Sbjct: 252 EAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVALGDDELDLEHTGAA--IDTGTSLIA 309
Query: 298 GPTPVVTEINHAIGGE----GVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAF 353
PT + IN IG + G + +C V S LPE V F
Sbjct: 310 LPTDIAEMINTQIGAQKQWNGQYTVDCSKVPS--------------LPELV------LTF 349
Query: 354 NGAEYVRLGIPITRVLFVLNVR 375
NG Y P+ +VL V+
Sbjct: 350 NGKPY-----PLKGTDYVLEVQ 366
>gi|358057753|dbj|GAA96408.1| hypothetical protein E5Q_03075 [Mixia osmundae IAM 14324]
Length = 453
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 161/264 (60%), Gaps = 9/264 (3%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD E +PL NF++AQYF +I +G+PPQ F V+ DTGSSNLWVPS++C SI+C+ H +
Sbjct: 121 GDKVEHGVPLSNFLNAQYFADITLGTPPQEFKVVLDTGSSNLWVPSTRCS-SIACFLHKK 179
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y + S+TY E G +I YGSGS+ G S D + +GD+ +K Q F E+T+E L F
Sbjct: 180 YDASASSTYKENGTEFKIQYGSGSLEGVISNDVMTIGDITIKKQDFAESTKEPGLAFAFG 239
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE--EGGEIVFGGVD 245
+FDGI+GL + IAV P + NM+ GLV + FSFWL D E GGE V GG D
Sbjct: 240 KFDGILGLAYDRIAVQHVTPPFYNMIADGLVDKAEFSFWLGDTADGEGAPGGEFVMGGTD 299
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+KGK + PV +KGYW+ EL I G + G A +D+GTSL+A P+ +
Sbjct: 300 PAHYKGKIQWAPVRRKGYWEVELSKIKFGKDELELESTGAA--IDTGTSLIALPSDLAEL 357
Query: 306 INHAIGGE----GVVSAECKLVVS 325
+N IG + G + +C + S
Sbjct: 358 LNKEIGAKKSWNGQYTVDCAAIPS 381
>gi|1657354|emb|CAA66056.1| procathepsin E [Mus musculus]
gi|13529380|gb|AAH05432.1| Cathepsin E [Mus musculus]
gi|71059833|emb|CAJ18460.1| Ctse [Mus musculus]
Length = 397
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 156/248 (62%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IG+PPQNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 70 PLINYLDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDT 128
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YTE+G I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 129 YTEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGL 188
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 189 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 248
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+ YWQ L I +G+ + C GC AIVD+GTSL+ GP + + AIG
Sbjct: 249 IPVTKQAYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPDKIKHLQEAIGATPI 307
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 308 DGEYAVDC 315
>gi|344277046|ref|XP_003410316.1| PREDICTED: cathepsin E [Loxodonta africana]
Length = 396
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 119/248 (47%), Positives = 159/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+ D +YFG I IGSP QNF+VIFDTGSSNLWVPS C S +C H R+ +S+T
Sbjct: 69 PLINYFDTEYFGAISIGSPSQNFTVIFDTGSSNLWVPSVYCT-SQACQTHPRFYPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I+YG+GS+SG D V V + V DQ F E+ +E TF+ + FDGI+GL
Sbjct: 128 YSSLGSPFSISYGTGSLSGIIGTDQVSVEGLTVIDQQFGESVKEPGQTFVDSAFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSDPAGGMGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSNNIKQLQRAIGAEPE 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 NGEYAVEC 314
>gi|99031884|pdb|2BKS|A Chain A, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031885|pdb|2BKS|B Chain B, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031886|pdb|2BKT|A Chain A, Crystal Structure Of Renin-Pf00257567 Complex
gi|99031887|pdb|2BKT|B Chain B, Crystal Structure Of Renin-Pf00257567 Complex
gi|119390207|pdb|2IKO|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390208|pdb|2IKO|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390209|pdb|2IKU|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390210|pdb|2IKU|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390211|pdb|2IL2|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390212|pdb|2IL2|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|151568107|pdb|2V0Z|C Chain C, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568108|pdb|2V0Z|O Chain O, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568109|pdb|2V10|C Chain C, Crystal Structure Of Renin With Inhibitor 9
gi|151568110|pdb|2V10|O Chain O, Crystal Structure Of Renin With Inhibitor 9
gi|151568111|pdb|2V11|C Chain C, Crystal Structure Of Renin With Inhibitor 6
gi|151568112|pdb|2V11|O Chain O, Crystal Structure Of Renin With Inhibitor 6
gi|151568113|pdb|2V12|C Chain C, Crystal Structure Of Renin With Inhibitor 8
gi|151568114|pdb|2V12|O Chain O, Crystal Structure Of Renin With Inhibitor 8
gi|157830213|pdb|1BBS|A Chain A, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157830214|pdb|1BBS|B Chain B, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157833710|pdb|1RNE|A Chain A, The Crystal Structure Of Recombinant Glycosylated Human
Renin Alone And In Complex With A Transition State
Analog Inhibitor
gi|157836332|pdb|2REN|A Chain A, Structure Of Recombinant Human Renin, A Target For
Cardiovascular- Active Drugs, At 2.5 Angstroms
Resolution
gi|193885216|pdb|2V13|A Chain A, Crystal Structure Of Renin With Inhibitor 7
gi|193885217|pdb|2V16|C Chain C, Crystal Structure Of Renin With Inhibitor 3
gi|193885218|pdb|2V16|O Chain O, Crystal Structure Of Renin With Inhibitor 3
gi|242556522|pdb|3G72|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556523|pdb|3G72|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|308388162|pdb|3OQF|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|308388163|pdb|3OQF|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|310689956|pdb|3OOT|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689957|pdb|3OOT|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689958|pdb|3OQK|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689959|pdb|3OQK|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|342350963|pdb|3Q3T|A Chain A, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|342350964|pdb|3Q3T|B Chain B, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|345110923|pdb|3SFC|A Chain A, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|345110924|pdb|3SFC|B Chain B, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|358439749|pdb|3Q4B|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439750|pdb|3Q4B|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439751|pdb|3Q5H|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439752|pdb|3Q5H|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|400261138|pdb|3VSW|A Chain A, Human Renin In Complex With Compound 8
gi|400261139|pdb|3VSW|B Chain B, Human Renin In Complex With Compound 8
gi|400261140|pdb|3VSX|A Chain A, Human Renin In Complex With Compound 18
gi|400261141|pdb|3VSX|B Chain B, Human Renin In Complex With Compound 18
gi|430800765|pdb|3VYD|A Chain A, Human Renin In Complex With Inhibitor 6
gi|430800766|pdb|3VYD|B Chain B, Human Renin In Complex With Inhibitor 6
gi|430800767|pdb|3VYE|A Chain A, Human Renin In Complex With Inhibitor 7
gi|430800768|pdb|3VYE|B Chain B, Human Renin In Complex With Inhibitor 7
gi|430800769|pdb|3VYF|A Chain A, Human Renin In Complex With Inhibitor 9
gi|430800770|pdb|3VYF|B Chain B, Human Renin In Complex With Inhibitor 9
gi|449802496|pdb|4GJ8|A Chain A, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802497|pdb|4GJ8|B Chain B, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802498|pdb|4GJ9|A Chain A, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802499|pdb|4GJ9|B Chain B, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802500|pdb|4GJA|A Chain A, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802501|pdb|4GJA|B Chain B, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802502|pdb|4GJB|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802503|pdb|4GJB|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802504|pdb|4GJC|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802505|pdb|4GJC|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802506|pdb|4GJD|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
gi|449802507|pdb|4GJD|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
Length = 340
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 169/250 (67%), Gaps = 6/250 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGG 243
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGG 180
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T +
Sbjct: 181 SDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSI 239
Query: 304 TEINHAIGGE 313
++ A+G +
Sbjct: 240 EKLMEALGAK 249
>gi|109287596|emb|CAJ55260.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 193/338 (57%), Gaps = 27/338 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-SGVRHRLGDSDE 72
+L SC L SS+ L+RI LKK + + R T +E M A V ++HR+ DE
Sbjct: 9 LLISCFLC-FSSDALQRISLKK-------MPSIRETLQEMGMKVADVLPSLKHRISYLDE 60
Query: 73 DI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
+ L NF D QY+GEI IG+P Q F V+FDTGSSNLWVPS +C +C H
Sbjct: 61 GLHNKTASTILTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSRQCSPLYSACVSH 120
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+RY S +S+TY G + Y G I GFFSQD V V D+ + Q F EA S+ F+
Sbjct: 121 NRYDSSESSTYKPKGTKITLTYAQGYIKGFFSQDIVRVADIPII-QFFTEAIALPSIPFI 179
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
ARFDG++G+G+ + A+G +PV+DN++ + ++SE VFS + +R ++ GGEI+ GG D
Sbjct: 180 FARFDGVLGMGYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSD 239
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ G YV +++GYW +L + I N+ +C GC A +D+GTS ++GP ++
Sbjct: 240 PSHYTGDFHYVSTSREGYWHVDLKGVSIENKIV-LCHDGCTATIDTGTSFISGPASSISV 298
Query: 306 INHAIGG---EGVVSAECKL------VVSQYGDLIWDL 334
+ IG +G +CK + GD+ + L
Sbjct: 299 LMETIGATLSDGDYVIDCKKINLLPDITFHLGDMTYSL 336
>gi|2851407|sp|P16228.3|CATE_RAT RecName: Full=Cathepsin E; Flags: Precursor
gi|1113086|dbj|BAA08128.1| cathepsin E precursor [Rattus rattus]
gi|149058663|gb|EDM09820.1| cathepsin E, isoform CRA_a [Rattus norvegicus]
Length = 398
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPM 308
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 309 DGEYAVDC 316
>gi|38303893|gb|AAH62002.1| Ctse protein [Rattus norvegicus]
Length = 398
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 116/248 (46%), Positives = 158/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPM 308
Query: 313 EGVVSAEC 320
+G + +C
Sbjct: 309 DGEYAVDC 316
>gi|190613737|pdb|3D91|A Chain A, Human Renin In Complex With Remikiren
gi|190613738|pdb|3D91|B Chain B, Human Renin In Complex With Remikiren
gi|242556515|pdb|3G6Z|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556516|pdb|3G6Z|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556519|pdb|3G70|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556520|pdb|3G70|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|290560276|pdb|3K1W|A Chain A, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|290560277|pdb|3K1W|B Chain B, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|315113750|pdb|3OWN|A Chain A, Potent Macrocyclic Renin Inhibitors
gi|315113751|pdb|3OWN|B Chain B, Potent Macrocyclic Renin Inhibitors
Length = 341
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 169/250 (67%), Gaps = 6/250 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGG 243
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGG 180
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T +
Sbjct: 181 SDPQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSI 239
Query: 304 TEINHAIGGE 313
++ A+G +
Sbjct: 240 EKLMEALGAK 249
>gi|389640809|ref|XP_003718037.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|58257401|gb|AAW69322.1| vacuolar protease A-like protein [Magnaporthe grisea]
gi|351640590|gb|EHA48453.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|440487134|gb|ELQ66940.1| vacuolar protease A [Magnaporthe oryzae P131]
Length = 395
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 191/320 (59%), Gaps = 19/320 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL + G+ ++ +KK +L LNA ++Y+G S + +
Sbjct: 5 MMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFSN 64
Query: 70 -----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S +P+ NFM+AQYF EI IG+PPQNF VI DTGSSNLWVPSS C SI+CY
Sbjct: 65 DAVQASGNHPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSSSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S S+TY + G +I YGSGS+ GF S D + +GD+ +K+ F EAT+E L F
Sbjct: 124 HNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDVMTIGDLKIKNLDFAEATKEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+G+GF ++V VP + MV+Q L+ E VF+F+L D + E+VFGGV
Sbjct: 184 AFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEKSESEVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H GK T +P+ +K YW+ +L I +G++ + G I+D+GTSL+A P+ +
Sbjct: 241 NKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 299 LLNSQIGAKKGYNGQYSIDC 318
>gi|440475206|gb|ELQ43907.1| vacuolar protease A [Magnaporthe oryzae Y34]
Length = 395
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 191/320 (59%), Gaps = 19/320 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL + G+ ++ +KK +L LNA ++Y+G S + +
Sbjct: 5 MMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFSN 64
Query: 70 -----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
S +P+ NFM+AQYF EI IG+PPQNF VI DTGSSNLWVPSS C SI+CY
Sbjct: 65 DAVQASGNHPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSSSC-GSIACYL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H++Y+S S+TY + G +I YGSGS+ GF S D + +GD+ +K+ F EAT+E L F
Sbjct: 124 HNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDFMTIGDLKIKNLDFAEATKEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+G+GF ++V VP + MV+Q L+ E VF+F+L D + E+VFGGV
Sbjct: 184 AFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEKSESEVVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H GK T +P+ +K YW+ +L I +G++ + G I+D+GTSL+A P+ +
Sbjct: 241 NKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTSLIALPSQLAE 298
Query: 305 EINHAIGGE----GVVSAEC 320
+N IG + G S +C
Sbjct: 299 LLNSQIGAKKGYNGQYSIDC 318
>gi|224085770|ref|XP_002189383.1| PREDICTED: cathepsin E [Taeniopygia guttata]
Length = 435
Score = 234 bits (596), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 173/296 (58%), Gaps = 6/296 (2%)
Query: 29 RRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGE 88
RR+ L RR L ++ R + G E PL ++D +YFG+
Sbjct: 62 RRVPLSCRRY-LRTMMRERGQLSHLWRAPGGPEASSEDCAAFLESSEPLIIYLDMEYFGQ 120
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
I IG+PPQNF+V+FDTGSSNLWVPS C S +C H+R+ +S+TY IG I YG
Sbjct: 121 ISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACTEHTRFHPTQSSTYQVIGTPFSIQYG 179
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS++G D V V + V +Q F E+ E FL A FDGI+GL + +AV PV
Sbjct: 180 TGSLTGIIGSDQVAVEGLAVSNQQFAESISEPGKAFLDAEFDGILGLAYPSLAVDGVTPV 239
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+DNM+ Q LV +FS +++ +PD+ +GGE++FGG D F G +VPVT++GYWQ +L
Sbjct: 240 FDNMMAQNLVELPIFSVYMSSNPDSPQGGEVLFGGFDTSRFTGTLNWVPVTQQGYWQIQL 299
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---EGVVSAECK 321
+I +G T C GC AIVD+GTSL+ GPT + ++ + IG +G + +C
Sbjct: 300 DNIQLGGTVT-FCANGCQAIVDTGTSLITGPTKEIKKLQNLIGAVSVDGEYTVDCS 354
>gi|340966614|gb|EGS22121.1| aspartic-type endopeptidase-like protein [Chaetomium thermophilum
var. thermophilum DSM 1495]
Length = 396
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 184/312 (58%), Gaps = 20/312 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD-- 71
+L + +LL ++ + ++ L+K L L+A I + + +G + G R R SD
Sbjct: 5 LLTAAVLLGSAQGAVHKLKLQKVPLS-EQLDAVPIEIQVQQLGQKYM-GTRSRQSHSDAV 62
Query: 72 ----------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS C SI+
Sbjct: 63 WKGMMPEAMGSHPVPISNFMNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSVDC-GSIA 121
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY H++Y S S+TY G EI YGSGS+SGF SQD + +GD+ VK Q F EAT E
Sbjct: 122 CYLHTKYDSSASSTYKPNGTKFEIRYGSGSLSGFVSQDVLRIGDITVKGQDFAEATSEPG 181
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLG+ I+V VP + NM+EQ ++ E VF+F+L+ D E+ F
Sbjct: 182 LAFAFGRFDGILGLGYDTISVNRIVPPFYNMIEQKVIDEPVFAFYLS---DTSGQSEVTF 238
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GG+D +KGK T +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 239 GGIDKTKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGV--ILDTGTSLIALPSQ 296
Query: 302 VVTEINHAIGGE 313
+ +N +G +
Sbjct: 297 LAEMLNAQLGAK 308
>gi|432090679|gb|ELK24020.1| Renin [Myotis davidii]
Length = 404
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 197/334 (58%), Gaps = 16/334 (4%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGG 57
++ R L + SC+ LP + RRI LKK L ++ AR+ R E G
Sbjct: 5 RMSRWALLLLLWGSCISSLPVDTGAFRRIFLKKMPSVRESLKERGVDVARLLRAE----G 60
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
+ SG R +S ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 61 SQFSG-RPPFTNSTAPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCS 118
Query: 118 -FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+C HS Y S +S+TY E G I YGSG ++GF SQD V VG + V Q F E
Sbjct: 119 PLYTACEIHSLYDSLESSTYMENGTEFTIQYGSGKVNGFLSQDAVTVGGITVT-QTFGEV 177
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T + F+LA+FDG++G+GF AV PV+D+++ Q ++ E+VFS + +R+ G
Sbjct: 178 TELPLMPFMLAKFDGVLGMGFPAQAVAGVTPVFDHILSQRVLKEDVFSVYYSRNSHL-LG 236
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GEIV GG DP++++G YV ++K G WQ ++ + + ST +CE GC A+VD+G S +
Sbjct: 237 GEIVLGGSDPQYYQGNFHYVSISKTGSWQIKMKGVSV-RSSTLLCEEGCMAVVDTGASYI 295
Query: 297 AGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+GPT + + +G + + + E + +Q L
Sbjct: 296 SGPTSSLRLLMETLGAKELSTDEYVVSCNQVPSL 329
>gi|302899226|ref|XP_003048007.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728939|gb|EEU42294.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 396
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 164/251 (65%), Gaps = 12/251 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPISNFMNAQYFSEITIGNPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI+YGSGS+SGF S D+V +GD+ +K Q F EAT+E L F RFDGI+G
Sbjct: 135 TYKQNGSEFEIHYGSGSLSGFISNDDVSIGDLKIKGQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MV Q L+ + VF+F+L D E E+VFGGVD H++G
Sbjct: 195 LGYDTISVNHIVPPFYQMVNQKLLDDPVFAFYL---ADQEGESEVVFGGVDKSHYEGDIE 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I +G++ V E AI+D+GTSL P+ + +N IG +
Sbjct: 252 YIPLRRKAYWEVDLDAIALGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGAK 308
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 309 KGYNGQYTVEC 319
>gi|301618285|ref|XP_002938556.1| PREDICTED: cathepsin E-A-like [Xenopus (Silurana) tropicalis]
Length = 402
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 169/254 (66%), Gaps = 2/254 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+GEI +G+PPQNFSV+FDTGSSN WVPSS C S +C H R+KS +S +Y
Sbjct: 73 LVDYMNAQYYGEISVGTPPQNFSVVFDTGSSNFWVPSSYC-LSEACQVHERFKSFESTSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ I+YG+G + G +D + + ++ ++ Q F E+ E TF+LA+FDG++GLG
Sbjct: 132 EHGGRPFSIHYGTGQLVGVTGRDTLRISNMSIEGQDFGESILEPGRTFVLAQFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AV AVPV+D +V Q LV +++FSF LNRD D+E GGE++FGG+D +KG+ ++
Sbjct: 192 YPSLAVAGAVPVFDRIVNQKLVEQQLFSFHLNRDYDSEYGGELIFGGIDHSLYKGQIHWI 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T+KGYWQ L ++ + ++ C+ C IVDSGTSL+ GP + ++ +G +
Sbjct: 252 PLTEKGYWQIRLDNVKVDGEAM-FCQSSCQVIVDSGTSLITGPKAEIKKLQELLGATPTL 310
Query: 317 SAECKLVVSQYGDL 330
E L S+ L
Sbjct: 311 FGEYILDCSRVSSL 324
>gi|149725197|ref|XP_001502028.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 163/251 (64%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 63 DSEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 122 SSTYQATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLG+ I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 182 LGLGYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDSSYYTGS 239
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPVT +GYWQ + I I +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 240 LHWVPVTTEGYWQIAVDSITINGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGA 298
Query: 312 -----GEGVVS 317
GEGV+S
Sbjct: 299 RKDLLGEGVIS 309
>gi|326933745|ref|XP_003212960.1| PREDICTED: cathepsin E-like [Meleagris gallopavo]
Length = 403
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 184/319 (57%), Gaps = 26/319 (8%)
Query: 23 ASSNGLRRIGLKKRRLDLHSLNAARITRKE-RYMGGAGVSGVRHRL------------GD 69
A +GL+R L + L H R RK R G HRL G+
Sbjct: 18 APCSGLKRPALCRVTLTRH-----RSLRKSLRDRGQLSQFWKAHRLDMVQYTQDCSLFGE 72
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
++E PL N++D +YFG+I IG+PPQNF+VIFDTGSSNLWVPS C S +C H+R++
Sbjct: 73 ANE---PLINYLDMEYFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT-SKACTNHARFQ 128
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
+S+TY +G + YG+GS++G D V V + V +Q F E+ E F + F
Sbjct: 129 PSRSSTYQPLGLPISLQYGTGSLTGIIGSDQVTVEGMTVCNQPFAESVSEPGKAFQDSEF 188
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + +AV PV+DNM+ Q LV +FS +++ +PD+ GGE++FGG DP F
Sbjct: 189 DGILGLAYPSLAVDGVTPVFDNMMAQDLVELPIFSVYMSANPDSSLGGEVLFGGFDPSRF 248
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G +VPVT +GYWQ +L ++ +G + C GC AIVD+GTSLL GPT + E+
Sbjct: 249 LGTLHWVPVTVQGYWQIQLDNVQVGG-TVVFCANGCQAIVDTGTSLLTGPTKDIKEMQRY 307
Query: 310 IGG---EGVVSAECKLVVS 325
IG +G +C L+ S
Sbjct: 308 IGATPMDGEYVVDCSLLSS 326
>gi|431892878|gb|ELK03306.1| Cathepsin E [Pteropus alecto]
Length = 396
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/248 (47%), Positives = 159/248 (64%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I +GSPPQNF+VIFDTGSSNLWVPS C S +C H+R+ +S+T
Sbjct: 69 PLINYLDMEYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDT 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ +G I+YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSTVGSHFSIHYGTGSLSGIIGADQVSVEGLTVVSQQFGESVTEPGQTFVNAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ D + G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDLEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP+ + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDTIQVGG-AVIFCSEGCQAIVDTGTSLITGPSEEIKQLQKAIGAEPT 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 NGEYAVEC 314
>gi|291409618|ref|XP_002721075.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/247 (47%), Positives = 163/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GLG
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLG 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSQDLFSVYLSS--DDEKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I I N T C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSISI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
>gi|149058665|gb|EDM09822.1| cathepsin E, isoform CRA_c [Rattus norvegicus]
Length = 365
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
>gi|189211129|ref|XP_001941895.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977988|gb|EDU44614.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 399
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/250 (46%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F VI DTGSSNLWVPSS C SI+CY H++Y S S+
Sbjct: 77 VPVTNFLNAQYFSEISLGTPPQTFKVILDTGSSNLWVPSSSCN-SIACYLHTKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+EQGL+ E VF+F+L D + ++ E FGG+D + GK
Sbjct: 196 LGYDTISVKGIVPPFYNMLEQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ EL + G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 313 SFNGQYTVEC 322
>gi|46397366|sp|P14091.2|CATE_HUMAN RecName: Full=Cathepsin E; Contains: RecName: Full=Cathepsin E form
I; Contains: RecName: Full=Cathepsin E form II; Flags:
Precursor
Length = 401
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 164/253 (64%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFATQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+ YWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
>gi|6978719|ref|NP_037070.1| cathepsin E precursor [Rattus norvegicus]
gi|1113084|dbj|BAA07285.1| cathepsin E precursor [Rattus norvegicus]
Length = 365
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 153/236 (64%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG + IGSP QNF+VIFDTGSSNLWVPS C S +C H + +S+T
Sbjct: 71 PLINYLDMEYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SSACKAHPVFHPSQSST 129
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G I YG+GS++G D V V + V+ Q F E+ +E TF+ A FDGI+GL
Sbjct: 130 YMEVGNHFSIQYGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGL 189
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV+ +FS +L+ DP G E+ FGG DP HF G +
Sbjct: 190 GYPSLAVGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNW 249
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+PVTK+GYWQ L I +G+ + C GC AIVD+GTSL+ GP + ++ AIG
Sbjct: 250 IPVTKQGYWQIALDGIQVGD-TVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIG 304
>gi|392586802|gb|EIW76137.1| Asp-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 409
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 190/340 (55%), Gaps = 31/340 (9%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK--RRLDLHSLNAARITRK-------ERYMGGAGVSG 62
L +A +LLP +S G+ ++ L+K + H+ ++ K + + GAG +G
Sbjct: 3 LSAIAPLILLPFASAGVHKLKLQKLPQITPGHTHETTYLSHKYGGQVAQQVPLMGAGGAG 62
Query: 63 VRHRLGDSDEDI-------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNL 109
R D+D+ +PL NFM+AQYF EI +GSP Q F VI DTGSSNL
Sbjct: 63 RNFRPSPHDDDLFWTQEVAVEGGHTVPLSNFMNAQYFTEIELGSPAQTFKVILDTGSSNL 122
Query: 110 WVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
WVPS++C SI+C+ H++Y S S +Y G I YG+GS+ GF SQD +++GDV +
Sbjct: 123 WVPSAQCT-SIACFLHAKYDSSSSASYKANGTEFSIQYGTGSMEGFVSQDTLKIGDVSIS 181
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
Q F EAT+E LTF +FDGI+GLG+ I+V P NM+ QGL+ E +FSF L
Sbjct: 182 HQDFAEATKEPGLTFAFGKFDGILGLGYDTISVNHITPPVYNMINQGLLDEPLFSFRLGS 241
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+GGE VFGG+D + G YVPV +K YW+ EL + G + G A +
Sbjct: 242 --SESDGGEAVFGGIDHSAYTGDIEYVPVRRKAYWEVELEKVSFGGDELELESTGAA--I 297
Query: 290 DSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLVVS 325
D+GTSL+A PT V +N IG + G + +C V S
Sbjct: 298 DTGTSLIALPTDVAEMLNTQIGAKRSWNGQYTIDCSKVPS 337
>gi|355558837|gb|EHH15617.1| hypothetical protein EGK_01732 [Macaca mulatta]
Length = 401
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 165/253 (65%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLNWVPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
>gi|326523981|dbj|BAJ97001.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 160/246 (65%), Gaps = 3/246 (1%)
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD +PL ++M+AQY+ EIGIG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H R
Sbjct: 86 GDHPHHGVPLTDYMNAQYYAEIGIGTPPQPFGVVMDTGSSNLWVPSTRCS-SIACWLHRR 144
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ + KS+T+ E G I YGSGS+ G S D V +GD+ + + F E+T+E + F L
Sbjct: 145 FDATKSSTFKENGTDFAIRYGSGSLEGVISTDTVTIGDLELTETDFGESTKEPGIAFALG 204
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDP 246
+FDGI+GLG+ IAV VP + M+ Q L+ + +F+FWL + + DAE GGE+VFG +D
Sbjct: 205 KFDGIMGLGYDTIAVQQVVPPFYQMINQKLIDKPLFTFWLGDTNKDAENGGELVFGEIDK 264
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
H++G Y PV +KGYW+ + ++LI ++ G A +D+GTSL+A PT I
Sbjct: 265 DHYEGDIVYAPVVRKGYWEVKFNELLINDEPADFL-GNATAAIDTGTSLIACPTEAAETI 323
Query: 307 NHAIGG 312
N +G
Sbjct: 324 NTMLGA 329
>gi|195150257|ref|XP_002016071.1| GL10692 [Drosophila persimilis]
gi|194109918|gb|EDW31961.1| GL10692 [Drosophila persimilis]
Length = 399
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/308 (40%), Positives = 179/308 (58%), Gaps = 22/308 (7%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
P++ N + G+ R+D L +R+ + R GG V PL N+
Sbjct: 30 FPSARNRFVQFGI---RMDRFRLKYSRVDGRSRPRGGWEVRSE------------PLSNY 74
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
+DAQYFG I IGSPPQ F VIFDTGSSNLWVPS+ C + ++C HSRY +R+S+++
Sbjct: 75 LDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPSTSCAPTMVACMVHSRYNARQSSSHRRN 134
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS++G+ S D V V + +++Q F E T FL A+FDGI GL ++
Sbjct: 135 GVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTFAEVTTMPGPIFLAAKFDGIFGLAYQS 194
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I++ P + ++EQ L+S VFS +LNR+ + EGG + FGG +P++++G TYVPV+
Sbjct: 195 ISMQGVKPPFYAIMEQKLLSNPVFSVYLNREQEHPEGGALFFGGSNPRYYRGNFTYVPVS 254
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GV 315
++ YWQ + I + +C+ GC I+D+GTS LA P IN +IGG G
Sbjct: 255 RRAYWQVRMEAATINDLR--LCQHGCEVIIDTGTSFLALPYDQAILINESIGGTPSEYGQ 312
Query: 316 VSAECKLV 323
S C V
Sbjct: 313 YSVPCDQV 320
>gi|355745980|gb|EHH50605.1| hypothetical protein EGM_01462 [Macaca fascicularis]
Length = 401
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 165/253 (65%), Gaps = 10/253 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H+R++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNV-----EVGDVVVKDQVFIEATREGSLTFLLARFD 190
Y++ G+S I YG+GS+SG D V +V + V Q F E+ E TF+ A FD
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLG+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF
Sbjct: 188 GILGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G +VPVTK+GYWQ L +I +G + C GC AIVD+GTSL+ GP+ + ++ +AI
Sbjct: 248 GSLDWVPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAI 306
Query: 311 GG---EGVVSAEC 320
G +G + EC
Sbjct: 307 GAAPVDGEYAVEC 319
>gi|171679543|ref|XP_001904718.1| hypothetical protein [Podospora anserina S mat+]
gi|170939397|emb|CAP64625.1| unnamed protein product [Podospora anserina S mat+]
Length = 397
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 184/312 (58%), Gaps = 20/312 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE- 72
+ A+ LL A + G ++ LKK L L A + + +++G + G+R + ++
Sbjct: 6 LTAAVLLGAAQAGGTHKLKLKKVPL-AEQLEAVPLETQMKHLGQKYM-GIRPQQSHANAV 63
Query: 73 -----------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS C SI+
Sbjct: 64 FQGSLADPKGIHPVPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVDC-GSIA 122
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY HS+Y S S+T+ G S EI YGSGS+SG+ SQD + +GD+ +K+Q F EAT E
Sbjct: 123 CYLHSKYDSSASSTFKANGSSFEIRYGSGSLSGYVSQDTMTIGDIKIKEQDFAEATSEPG 182
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLGF I+V VP + M+EQ L+ E VF+F L D E E+ F
Sbjct: 183 LAFAFGRFDGIMGLGFDRISVNGIVPPFYKMIEQKLIDEPVFAFKLA---DTEGESEVTF 239
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD +KGK +P+ +K YW+ + I G+ + + G I+D+GTSL+A P+
Sbjct: 240 GGVDKDAYKGKLITIPLRRKAYWEVDFDAISYGDDTADLENTGI--ILDTGTSLIALPSQ 297
Query: 302 VVTEINHAIGGE 313
+ +N IG +
Sbjct: 298 LAEMLNAQIGAK 309
>gi|387915422|gb|AFK11320.1| cathepsin E-A-like protein [Callorhinchus milii]
Length = 401
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/332 (39%), Positives = 195/332 (58%), Gaps = 19/332 (5%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRR-----LDLHSLNAARITRKERYMGG 57
K+ +V L CL+ +P + R L++R L H A E+Y
Sbjct: 2 KVFVTVLLFIHLTECLIRIPLTRFKPIRKVLRERDQLKEFLRHHQFEAF----AEKYQSC 57
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
V+ G + E L N+MDAQY+GEIGIG+P Q F+V+FDTGSSNLWVPS+ C
Sbjct: 58 YPSKLVKTHEGTAFEH---LSNYMDAQYYGEIGIGTPLQKFTVVFDTGSSNLWVPSAYC- 113
Query: 118 FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEAT 177
S +C H ++KS S TY G I YG+G ++G +D V +G++ ++ Q F E+
Sbjct: 114 ISEACKMHEQFKSFHSTTYAPRGNQFSIRYGTGQLAGVLGKDMVRIGNITIRAQEFGESV 173
Query: 178 REGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGG 237
E TF +A+FDGI+GLG+ IA G A+PV+D M+ Q LV E +FS +NR+ D++ GG
Sbjct: 174 FEPGSTFAVAQFDGILGLGYPSIAEGGALPVFDRMMHQNLVVEPIFSVLINREMDSDYGG 233
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
E++ GG++ + + G +VPVT++GYWQ + ++ I T +C GCAAIVD+GTSL+
Sbjct: 234 ELLLGGINHECYTGSINWVPVTERGYWQIRMDNVKIDGMLT-LCINGCAAIVDTGTSLIT 292
Query: 298 GPTPVVTEINHAIG----GEGVVSAECKLVVS 325
GP + +++ +G G+G +CK + S
Sbjct: 293 GPEKEIRKLHKQLGAMSVGDGEYVVDCKRISS 324
>gi|110590169|pdb|2G24|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590170|pdb|2G24|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590171|pdb|2G26|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590172|pdb|2G26|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590173|pdb|2G27|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590174|pdb|2G27|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591465|pdb|2FS4|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591466|pdb|2FS4|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591524|pdb|2G1N|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591525|pdb|2G1N|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591526|pdb|2G1O|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591527|pdb|2G1O|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591528|pdb|2G1R|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591529|pdb|2G1R|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591530|pdb|2G1S|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591531|pdb|2G1S|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591532|pdb|2G1Y|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591533|pdb|2G1Y|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591534|pdb|2G20|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591535|pdb|2G20|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591536|pdb|2G21|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591537|pdb|2G21|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591538|pdb|2G22|A Chain A, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591539|pdb|2G22|B Chain B, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
Length = 333
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 5 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 64
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 65 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 123
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 124 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 184 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 242
>gi|291416270|ref|XP_002724368.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 170/264 (64%), Gaps = 13/264 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GLG
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLG 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSQDLFSVYLSS--DDEKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVSAECKLVVSQYGDLIWDL 334
GE V+S +S D+++ +
Sbjct: 303 LGENVISCSA---ISSLPDIVFTI 323
>gi|118138205|pdb|2I4Q|A Chain A, Human ReninPF02342674 COMPLEX
gi|118138206|pdb|2I4Q|B Chain B, Human ReninPF02342674 COMPLEX
Length = 336
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 8 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 67
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 68 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 126
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 127 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 186
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 187 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 245
>gi|281339451|gb|EFB15035.1| hypothetical protein PANDA_018433 [Ailuropoda melanoleuca]
Length = 388
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 164/265 (61%), Gaps = 18/265 (6%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
D++E PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR+
Sbjct: 47 DANE---PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRF 102
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVG----------DVVVKDQVFIEATR 178
+SNTY+ +G I YG+GS+SG D V+V +VV Q F E+
Sbjct: 103 YPSQSNTYSVLGSHFSIQYGTGSLSGIIGADQVDVTFFWVFSRQVEGLVVVGQQFGESVT 162
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E TF+ A FDGI+GLG+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E
Sbjct: 163 EPGQTFVNAEFDGILGLGYPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSE 222
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++FGG D HF G +VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ G
Sbjct: 223 LIFGGYDHSHFSGNLHWVPVTKQGYWQIALDAIQVGG-AVMFCSEGCQAIVDTGTSLITG 281
Query: 299 PTPVVTEINHAIGGE---GVVSAEC 320
P+ V ++ AIG E G EC
Sbjct: 282 PSDKVKQLQKAIGAEPMDGEYGVEC 306
>gi|1065326|pdb|1HRN|A Chain A, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1065327|pdb|1HRN|B Chain B, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1310896|pdb|1BIM|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310897|pdb|1BIM|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310898|pdb|1BIL|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310899|pdb|1BIL|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|241913388|pdb|3GW5|A Chain A, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|241913389|pdb|3GW5|B Chain B, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|283807203|pdb|3KM4|A Chain A, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
gi|283807204|pdb|3KM4|B Chain B, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
Length = 337
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 108/240 (45%), Positives = 163/240 (67%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H + + S++
Sbjct: 9 LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSS 68
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA FDG++G+
Sbjct: 69 YKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGM 127
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKH 253
GF E A+G P++DN++ QG++ E+VFSF+ NRD + + GG+IV GG DP+H++G
Sbjct: 128 GFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNF 187
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + ++ A+G +
Sbjct: 188 HYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAK 246
>gi|336373584|gb|EGO01922.1| hypothetical protein SERLA73DRAFT_177556 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386403|gb|EGO27549.1| hypothetical protein SERLADRAFT_461213 [Serpula lacrymans var.
lacrymans S7.9]
Length = 413
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/299 (43%), Positives = 175/299 (58%), Gaps = 25/299 (8%)
Query: 46 ARITRKERYMGGAGVSGVRHRLGDSDEDI---------------LPLKNFMDAQYFGEIG 90
A T ++ + GAG +G RH D ED +PL NFM+AQY+ EI
Sbjct: 49 AETTYQQLPLMGAGGAG-RHIRPDRPEDSDLFWTQEELVKGGHGVPLTNFMNAQYYTEIT 107
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
+GSP Q F VI DTGSSNLWVPSSKC SI+C+ H++Y S S+TY G I YGSG
Sbjct: 108 LGSPAQTFKVILDTGSSNLWVPSSKCT-SIACFLHTKYDSSSSSTYKANGTEFSIQYGSG 166
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S+ GF SQ+++++GD+ ++ Q F EAT+E L F +FDGI+GLG+ I+V P +
Sbjct: 167 SMEGFVSQESMKIGDLSIQHQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHITPPFY 226
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NM++QGL+ E +FSF L D +GGE VFGG+D + G TYVPV +K YW+ EL
Sbjct: 227 NMIDQGLLDEPLFSFRLGSSED--DGGEAVFGGIDSSAYTGSITYVPVRRKAYWEVELEK 284
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLVVS 325
+ G + G A +D+GTSL+A PT V +N IG G +C V S
Sbjct: 285 VSFGGDELDLENTGAA--IDTGTSLIALPTDVAEMLNTQIGATRSWNGQYQVDCAKVPS 341
>gi|388326405|pdb|3VCM|A Chain A, Crystal Structure Of Human Prorenin
gi|388326406|pdb|3VCM|B Chain B, Crystal Structure Of Human Prorenin
Length = 335
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 110/248 (44%), Positives = 167/248 (67%), Gaps = 7/248 (2%)
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFH 125
LG++ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H
Sbjct: 3 LGNTTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYH 61
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+
Sbjct: 62 KLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFM 120
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
LA FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NRD GG+IV GG D
Sbjct: 121 LAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRD---SLGGQIVLGGSD 177
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P+H++G Y+ + K G WQ ++ + +G+ ST +CE GC A+VD+G S ++G T + +
Sbjct: 178 PQHYEGNFHYINLIKTGVWQIQMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEK 236
Query: 306 INHAIGGE 313
+ A+G +
Sbjct: 237 LMEALGAK 244
>gi|169770745|ref|XP_001819842.1| vacuolar protease A [Aspergillus oryzae RIB40]
gi|238486794|ref|XP_002374635.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|21392388|dbj|BAC00850.1| pepsinogen [Aspergillus oryzae]
gi|83767701|dbj|BAE57840.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699514|gb|EED55853.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|391867458|gb|EIT76704.1| aspartyl protease [Aspergillus oryzae 3.042]
Length = 397
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 192/322 (59%), Gaps = 21/322 (6%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++ + +LL +S + ++ L K + +LH+++ ++YMG ++ L +
Sbjct: 5 LVTASVLLGCASAEVHKLKLNKVPVSEQFNLHNIDTHVQALGQKYMGIR--PNIKQDLLN 62
Query: 70 SD-------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D+L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 ENPINDMGRHDVL-VDNFLNAQYFSEIEIGTPPQKFKVVLDTGSSNLWVPSSEC-GSIAC 120
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VKDQ+F EAT E L
Sbjct: 121 YLHNKYDSSSSSTYQKNGSEFAIKYGSGSLSGFVSQDTLKIGDLKVKDQLFAEATSEPGL 180
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLGF I+V P + +M++QGL+ E VF+F+L + FG
Sbjct: 181 AFAFGRFDGILGLGFDTISVNKIPPPFYSMLDQGLLDEPVFAFYLGDTNKEGDDSVATFG 240
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ +P+ +K YW+ +L I +G+ + G I+D+GTSL+A PT +
Sbjct: 241 GVDKDHYTGELVKIPLRRKAYWEVDLDAIALGDSVAELDNTGV--ILDTGTSLIALPTTL 298
Query: 303 VTEINHAIGGE----GVVSAEC 320
IN IG + G S +C
Sbjct: 299 AELINKEIGAKKGFTGQYSVDC 320
>gi|283806592|ref|NP_001164549.1| pepsin II-1 precursor [Oryctolagus cuniculus]
gi|129777|sp|P28712.1|PEPA1_RABIT RecName: Full=Pepsin II-1; AltName: Full=Pepsin A; Flags: Precursor
gi|22218074|dbj|BAC07514.1| pepsinogen II-1 [Oryctolagus cuniculus]
Length = 387
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 112/247 (45%), Positives = 165/247 (66%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C+ H R+ S+T+
Sbjct: 67 LENYLDAEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACFLHKRFNPDDSSTF 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG++ +Q+F + E +TFL+A FDGI+GL
Sbjct: 126 QATSETLSITYGTGSMTGILGYDTVKVGNIEDTNQIFGLSKTEPGITFLVAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVSE++FS +L+ + E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNEGLVSEDLFSVYLSS--NGEKGSMVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I I N T C C A+VD+GTSLLAGPT +++I IG
Sbjct: 244 PVSHEGYWQITMDSITI-NGETIACADSCQAVVDTGTSLLAGPTSAISKIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE ++S
Sbjct: 303 LGENIIS 309
>gi|154284392|ref|XP_001542991.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
gi|150406632|gb|EDN02173.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
Length = 398
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 183/301 (60%), Gaps = 12/301 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG + GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV+ QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVESQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP + M+ L+ E +FSF+L + E+VFGG++ F GK T +P+ +K Y
Sbjct: 203 KIVPPFYEMLNNNLLDEPMFSFYLGDANVDSDDSEVVFGGMNEDRFTGKLTKIPLRRKAY 262
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAE 319
W+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G + E
Sbjct: 263 WEVDLDSITFGKQTALMSNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVE 320
Query: 320 C 320
C
Sbjct: 321 C 321
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 193/339 (56%), Gaps = 34/339 (10%)
Query: 14 VLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-------GAGVSGVRH 65
V A LLLP A++ G+ ++ L K + + + E+Y G GAG +G +
Sbjct: 5 VFAPLLLLPFATAAGVHKLKLHKIQRENANPYLETAYLSEKYGGDSQLPLMGAGGAGRQL 64
Query: 66 RLG-----DSDEDIL------------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSN 108
RL + E++L PL NFM+AQYF I +G+PPQ F VI DTGSSN
Sbjct: 65 RLARPSVNEEGENLLWTQEMINGGHNVPLTNFMNAQYFTTITLGTPPQEFKVILDTGSSN 124
Query: 109 LWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVV 168
LWVPS+KC SI+C+ H++Y S S+T+ + G S +I YGSGS+ GF S D + +GD+ +
Sbjct: 125 LWVPSTKCT-SIACFLHAKYDSSASSTHKKNGTSFKIEYGSGSMEGFVSNDVLSIGDLKI 183
Query: 169 KDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN 228
DQ F EAT+E L F +FDGI+GLG+ I+V P + +MV +GL+ VFSF L
Sbjct: 184 HDQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHITPPFYSMVNKGLLDAPVFSFRLG 243
Query: 229 RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
E+GGE VFGG+D + GK Y PV +K YW+ EL + G+ + G A
Sbjct: 244 S--SEEDGGEAVFGGIDESAYSGKINYAPVRRKAYWEVELPKVAFGDDVLELENTGAA-- 299
Query: 289 VDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 323
+D+GTSL+A P+ V +N IG G + +CK V
Sbjct: 300 IDTGTSLIALPSDVAEMLNAQIGATKSWNGQYTVDCKKV 338
>gi|109287598|emb|CAJ55261.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 125/307 (40%), Positives = 180/307 (58%), Gaps = 18/307 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-SGVRHRLGDSDE 72
+L SC L SS+ L+RI LKK + + R T +E M A V ++HR DE
Sbjct: 9 LLISCFLC-FSSDALQRISLKK-------MPSIRETLQEMGMKVADVLPSLKHRFSYLDE 60
Query: 73 DI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
+ L NF D QY+GEI IG+P Q F V+FDTGSSNLWVPS +C +C H
Sbjct: 61 GLHNKTASTILTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSHQCSPLYSACVSH 120
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+RY S +S+TY G + YG G I GF SQD V V D+ + Q F EA S+ F+
Sbjct: 121 NRYDSSESSTYKPKGTKITLTYGQGYIEGFLSQDIVRVADIPIT-QFFTEAIALPSIPFM 179
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDG++G+G+ + A+G +PV+DN++ + ++SE VFS + +R ++ GGEI+ GG D
Sbjct: 180 YAHFDGVLGMGYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSD 239
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P H+ G YV +++GYW +L + I N+ +C GC A +D+GTS ++GP ++
Sbjct: 240 PSHYTGDFHYVSTSREGYWHVDLKGVSIENK-IALCHDGCTATIDTGTSFISGPASSISV 298
Query: 306 INHAIGG 312
+ IG
Sbjct: 299 LMETIGA 305
>gi|444731560|gb|ELW71913.1| Cathepsin D [Tupaia chinensis]
Length = 684
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 192/347 (55%), Gaps = 61/347 (17%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGD--SDEDILP----------LKNFMDAQ 84
R+ LH + R T E MGG + + H S E P LKN+MDAQ
Sbjct: 23 RIPLHKFPSIRRTLTE--MGGPVENLIAHEPISKYSQEAPTPAATKGPVPEILKNYMDAQ 80
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEIGIG+PPQ F+VIFDTGS+NLWVPS C +C+FH +Y S+KS+TY + G S
Sbjct: 81 YYGEIGIGTPPQCFTVIFDTGSANLWVPSIHCGMLDFACWFHHKYNSKKSSTYAKNGSSF 140
Query: 144 EINYGSGS--------------------------------ISGFFSQDNVEVG------- 164
+I+Y SGS +S SQ + E
Sbjct: 141 DIHYRSGSQWLRQPLRVPEPGHRVGTDIDPVLRDQELWGNMSRGDSQPHTEPSCWKVPCH 200
Query: 165 --DVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
V V Q F EAT++ +TFL A+FDGI+G+ + I+V + VPV+DN+++Q LV + +
Sbjct: 201 TVSVRVDKQTFGEATKQPGITFLAAKFDGILGMAYPRISVDNVVPVFDNLMKQKLVEKNI 260
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
F+F+LNRDP + GGE++ GGVD K++ G Y VT+K YWQ + + +G+ T +C+
Sbjct: 261 FAFYLNRDPSGQPGGELMLGGVDTKYYTGSLDYYNVTRKAYWQIHMDKLEVGDGLT-LCQ 319
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLVVS 325
GC IVD+GTSL+ GP V E++ A+G ++ E C+ V S
Sbjct: 320 EGCEVIVDTGTSLIVGPVDEVRELHKAMGAVPLIQGEYMIPCEKVAS 366
>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
Length = 408
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 160/255 (62%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI IG+PPQ+F VI DTGSSNLWVPS KC SI+C+ H++Y S S+
Sbjct: 87 VPLSNFMNAQYFTEISIGNPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSASSS 145
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ G I+YGSGS+ GF S D + +GD+ +K Q F EA +E L F +FDGI+G
Sbjct: 146 TFKANGSEFSIHYGSGSMEGFVSNDLLSIGDITIKGQDFAEAVKEPGLAFAFGKFDGILG 205
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V +P + +M+ QGL+ VFSF L E+GGE VFGG+D +KGK T
Sbjct: 206 LGYDTISVNHIIPPFYSMINQGLIDSPVFSFRLGS--SEEDGGEAVFGGIDESAYKGKIT 263
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +K YW+ EL + GN + G A +D+GTSL+ PT + +N IG +
Sbjct: 264 YVPVRRKAYWEVELEKVSFGNDDLELESTGAA--IDTGTSLIVLPTDIAEMLNTQIGAKK 321
Query: 314 ---GVVSAECKLVVS 325
G +C V S
Sbjct: 322 SWNGQYQVDCAKVPS 336
>gi|119491657|ref|XP_001263323.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
gi|119411483|gb|EAW21426.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
Length = 398
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 189/323 (58%), Gaps = 23/323 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL ++S + ++ L K LD H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMG---IRPNVHQELL 62
Query: 67 ----LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
L D S D+L + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVP S C SI+
Sbjct: 63 EENSLNDMSRHDVL-VDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIA 120
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ H++Y S S+TY G I YGSG +SGF SQD +++GD+ V Q F EAT E
Sbjct: 121 CFLHNKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPG 180
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLG+ I+V VP + NM+EQGL+ E VF+F+L + E F
Sbjct: 181 LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLGDTNKEGDNSEASF 240
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD H+ G+ T +P+ +K YW+ + I +G+ + G I+D+GTSL+A P+
Sbjct: 241 GGVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGV--ILDTGTSLIALPST 298
Query: 302 VVTEINHAIGGE----GVVSAEC 320
+ +N IG + G S EC
Sbjct: 299 LADLLNKEIGAKKGFTGQYSIEC 321
>gi|342882947|gb|EGU83511.1| hypothetical protein FOXB_05921 [Fusarium oxysporum Fo5176]
Length = 396
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 122/251 (48%), Positives = 160/251 (63%), Gaps = 12/251 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY E G EI+YGSGS+SGF S D V +GD+ +KDQ F EAT+E L F RFDGI+G
Sbjct: 135 TYKENGTEFEIHYGSGSLSGFVSNDVVSIGDLEIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ IAV VP + MV Q L+ E VF+F+L+ D E E FGG+D F G
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYLD---DQEGESEATFGGIDKSKFTGDIE 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I G++ V E AI+D+GTSL P+ + +N IG +
Sbjct: 252 YIPLRRKAYWEVDLEAIAFGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGAK 308
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 309 KGYNGQYTIEC 319
>gi|225556537|gb|EEH04825.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus G186AR]
Length = 398
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 189/302 (62%), Gaps = 14/302 (4%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDA 83
L++I L ++ +++ ++A ++YMG + GD S LP+ NF++A
Sbjct: 25 LQKIPLSEQFANVN-IDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNA 83
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EIGIG+PPQ F V+ DTGSSNLWVPSS+C SI+CY H++Y S S+T+ + G
Sbjct: 84 QYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEF 142
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS++GF SQD + +GD+VV++QVF EAT E L F RFDGI+GLG+ I+V
Sbjct: 143 SITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVN 202
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VP + M+ + L+ E +FSF+L + + D +E E+VFGG++ F G+ T +P+ +K
Sbjct: 203 KIVPPFYEMLNKNLLDEPMFSFYLGDANVDGDE-SEVVFGGMNKNRFMGELTKIPLRRKA 261
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVVSA 318
YW+ +L I G Q+ + G I+D+GTSL+A P+ + +N IG + G +
Sbjct: 262 YWEVDLDSITFGKQTAMMANTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTI 319
Query: 319 EC 320
EC
Sbjct: 320 EC 321
>gi|195134378|ref|XP_002011614.1| GI11124 [Drosophila mojavensis]
gi|193906737|gb|EDW05604.1| GI11124 [Drosophila mojavensis]
Length = 373
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 123/293 (41%), Positives = 180/293 (61%), Gaps = 17/293 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L+S+ L V+ L +S L R+ + K + +E +
Sbjct: 1 MLKSITVLAVV-----LAVASAELHRVPILKHE--------NFVKTRENVKAEKAYLRAK 47
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
+ L ++ + L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS C +F ++C
Sbjct: 48 YNLPNARLNEEELSNSINMAYYGTISIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACM 107
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y KS+TY G+S I YGSGS+SGF S D V+V +V+K Q F EAT E +
Sbjct: 108 NHNQYDHDKSSTYEANGESFSIQYGSGSLSGFLSTDTVDVNGLVIKKQTFAEATSEPGNS 167
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F ++FDGI+G+ ++ +AV + VP + NMV QGLV E VFSF+L RD + EGGE++FGG
Sbjct: 168 FTNSKFDGILGMAYQSLAVDNVVPPFYNMVSQGLVDESVFSFYLARDGTSNEGGELIFGG 227
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
D + G+ TYVP++++GYWQF + I I Q+ +C+ C AI D+GTSLL
Sbjct: 228 SDSSLYTGELTYVPISQQGYWQFAVDSISIDGQT--LCD-NCQAIADTGTSLL 277
>gi|115396430|ref|XP_001213854.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
gi|114193423|gb|EAU35123.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
Length = 397
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 134/325 (41%), Positives = 199/325 (61%), Gaps = 26/325 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHRLGD 69
+L + +L+ +S + ++ L K LD +++A ++YMG + LGD
Sbjct: 6 LLTASVLVGCASAEVHKLKLNKLPLDEQLFTQNIDAHIHALGQKYMGVR--PNQQEPLGD 63
Query: 70 S------DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
+ + ++L + NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+CY
Sbjct: 64 NPVNDLGNHNVL-VDNFMNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACY 121
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H++Y S S+TY + G I YGSGS+SGF S+D +++GD+ +K+Q+F EAT E L
Sbjct: 122 LHNKYDSSASSTYKKNGTEFSIRYGSGSLSGFVSEDTLKIGDLTIKEQLFAEATNEPGLA 181
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA-EEGGEIV-- 240
F RFDGI+GLGF I+V P + MV QGL+ E VF+F+L DA +EG E V
Sbjct: 182 FAFGRFDGILGLGFDTISVNRIEPPFYKMVNQGLLDEPVFAFYLG---DANKEGDESVAT 238
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
FGGVD H+ G+ +P+ +K YW+ +L I +G+++ + G I+D+GTSL+A P+
Sbjct: 239 FGGVDKSHYTGELIKIPLRRKAYWEVDLDAITLGDETADLENTGV--ILDTGTSLIALPS 296
Query: 301 PVVTEINHAIGGE----GVVSAECK 321
+ IN IG + G S +C+
Sbjct: 297 NLAEMINAQIGAKKGFTGQYSVDCE 321
>gi|392568782|gb|EIW61956.1| aspartic peptidase A1 [Trametes versicolor FP-101664 SS1]
Length = 415
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 137/340 (40%), Positives = 192/340 (56%), Gaps = 33/340 (9%)
Query: 12 LWVLASCLLLP-ASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS---GV---- 63
L A LLP ++G+ R+ LKK + + E+Y GG+ V G+
Sbjct: 6 LASFAPLALLPFVVADGVHRMKLKKLPPAISNPQLESAYLAEKYGGGSQVPLGGGIGRNV 65
Query: 64 ---RHRLGDSDE-------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
R + D +E +PL NFM+AQYF EI +G+PPQ+F VI DTGSS
Sbjct: 66 RVSRPTVKDGEELFWTQDEFSTEGGHTVPLSNFMNAQYFAEITLGTPPQSFKVILDTGSS 125
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPS+KC SI+C+ H++Y S S+TY G I YGSGS+ GF S+D + +GD+
Sbjct: 126 NLWVPSTKCT-SIACFLHAKYDSSASSTYKANGSEFSIQYGSGSMEGFVSRDVLTIGDLT 184
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
VK+ F EAT+E L F +FDGI+GLG+ I+V VP + +V QGL+ VFSF L
Sbjct: 185 VKNLDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYALVNQGLLDSPVFSFRL 244
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
E+GGE +FGG+D + GK YVPV +K YW+ EL I +G++ + G A
Sbjct: 245 GD--SEEDGGEAIFGGIDDSAYSGKIEYVPVRRKAYWEVELEKIRLGDEELELENTGAA- 301
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
+D+GTSL+A P+ + +N IG + G + +C V
Sbjct: 302 -IDTGTSLIALPSDLAEMLNAQIGAKKSWNGQYTVDCAKV 340
>gi|46138535|ref|XP_390958.1| hypothetical protein FG10782.1 [Gibberella zeae PH-1]
gi|408391598|gb|EKJ70970.1| hypothetical protein FPSE_08829 [Fusarium pseudograminearum CS3096]
Length = 396
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 124/257 (48%), Positives = 163/257 (63%), Gaps = 14/257 (5%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI+YGSGS+SGF S D V +GD+ +KDQ F EAT+E L F RFDGI+G
Sbjct: 135 TYKKNGSEFEIHYGSGSLSGFVSNDVVSIGDLKIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG-GEIVFGGVDPKHFKGKH 253
LG+ IAV VP + MV Q L+ E VF+F+L D +EG E FGGVD + G
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYL----DGQEGQSEATFGGVDKSKYTGDL 250
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+P+ +K YW+ +L I G++ V E AI+D+GTSL P+ + +N IG
Sbjct: 251 EYIPLRRKAYWEVDLDAIAFGDE---VAEQENTGAILDTGTSLNVLPSALAELLNKEIGA 307
Query: 313 E----GVVSAECKLVVS 325
+ G + EC V S
Sbjct: 308 KKGYNGQYTIECDKVSS 324
>gi|452981069|gb|EME80829.1| hypothetical protein MYCFIDRAFT_89289 [Pseudocercospora fijiensis
CIRAD86]
Length = 396
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 185/309 (59%), Gaps = 13/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRL--DLHSLNAARITRK--ERYMGGAGVSGVRHRLGD 69
L + L + G+ ++ L+K L L LN R ++YMG + + +
Sbjct: 4 ALLTSALAAGAQAGVHKMKLQKISLSEQLEGLNIEDHVRHLGQKYMGVRPQNPLSEMFKE 63
Query: 70 SD---EDILPL--KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ ED P+ NF++AQYF +I IG+PPQ F V+ DTGSSNLWVPS C SI+CY
Sbjct: 64 TSVHAEDGHPVAVDNFLNAQYFSQIAIGTPPQEFKVVLDTGSSNLWVPSQDC-GSIACYL 122
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS+Y +S TY + G I YGSGS+ G+ SQD V++GD+ +K+Q+F EAT E L F
Sbjct: 123 HSKYDHGESTTYKQNGSDFAIRYGSGSLEGYVSQDTVQIGDLKIKNQLFAEATSEPGLAF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V P + NM++QGL+ E+ F+F+L+ +E E +FGGV
Sbjct: 183 AFGRFDGIMGLGYDTISVNGIPPPFYNMIDQGLLDEKKFAFYLSSTDKGDE-SEAIFGGV 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
+ H+ GK +P+ +K YW+ +L I G+Q+ + G AI+D+GTSL+A P+ +
Sbjct: 242 NEDHYTGKMINIPLRRKAYWEVDLDAITFGDQTAEIDATG--AILDTGTSLIALPSTLAE 299
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 300 LLNKEIGAK 308
>gi|449280945|gb|EMC88160.1| Cathepsin E, partial [Columba livia]
Length = 374
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 113/249 (45%), Positives = 159/249 (63%), Gaps = 5/249 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG+I IG+PPQNF+V+FDTGSSNLWVPS C S +C H++++ +S+T
Sbjct: 47 PLINYLDMEYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACAEHAKFQPSQSST 105
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y IG I YG+GS++G D V V + V +Q F E+ E FL A FDG++GL
Sbjct: 106 YQAIGTPFSIQYGTGSLTGVIGSDQVVVEGLTVNNQQFAESISEPGKAFLDAPFDGVLGL 165
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AV PV+DNM+ Q LV +FS +L+ +P++ GGE++FGG DP F G +
Sbjct: 166 AYPSLAVDGVTPVFDNMMAQNLVELPIFSVYLSTNPESSLGGELLFGGFDPSRFMGTLNW 225
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
VPVT++GYWQ +L +I + + C GC AIVD+GTSL+ GPT V + IG
Sbjct: 226 VPVTQQGYWQIQLDNIQLAG-TVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGATPV 284
Query: 313 EGVVSAECK 321
+G + EC
Sbjct: 285 DGEYAVECN 293
>gi|195399277|ref|XP_002058247.1| GJ15982 [Drosophila virilis]
gi|194150671|gb|EDW66355.1| GJ15982 [Drosophila virilis]
Length = 374
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 116/235 (49%), Positives = 160/235 (68%), Gaps = 9/235 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS C +F ++C H++Y KS+T
Sbjct: 61 LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACMNHNQYDHDKSST 120
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
YT G+S I YGSGS+SGF S D V+V +V+K Q F EAT E +F A+FDGI+G+
Sbjct: 121 YTSNGESFSIQYGSGSLSGFLSTDTVDVNGLVIKSQTFAEATSEPGTSFNNAKFDGILGM 180
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
++ +AV + VP + NMV QGLV + VFSF+L RD + +GGE++FGG D + G TY
Sbjct: 181 AYQSLAVDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTY 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
VP++++GYWQF + I QS +C+ C AI D+GTSLL VV+E + I
Sbjct: 241 VPISEQGYWQFTMAGASIDGQS--LCD-NCQAIADTGTSLL-----VVSEAAYDI 287
>gi|194218276|ref|XP_001501986.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 119/251 (47%), Positives = 162/251 (64%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 63 DSEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 122 SSTYQATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 181
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLG+ I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 182 LGLGYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDSSYYTGS 239
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPVT +GYWQ + I I +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 240 LHWVPVTTEGYWQIAVDSITINGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGA 298
Query: 312 -----GEGVVS 317
GE V+S
Sbjct: 299 RKDLLGEEVIS 309
>gi|50978946|ref|NP_001003194.1| renin precursor [Canis lupus familiaris]
gi|62287424|sp|Q6DYE7.1|RENI_CANFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50058380|gb|AAT68959.1| preprorenin [Canis lupus familiaris]
Length = 403
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 191/319 (59%), Gaps = 21/319 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG----- 62
+ LW SC LPA + RRI LKK + + R + KER + AG+
Sbjct: 12 LLVLW--GSCTFGLPADTGAFRRIFLKK-------MPSIRESLKERGVDVAGLGAEWNQF 62
Query: 63 -VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
R G+S ++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS++C
Sbjct: 63 TKRLSSGNSTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTRCSPLYT 121
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H Y S +S++Y E G + I YGSG + GF SQD V VG + V Q F E T
Sbjct: 122 ACEIHCLYDSSESSSYMENGTTFTIRYGSGKVKGFLSQDMVTVGGITVT-QTFGEVTELP 180
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + +R+ GGE+V
Sbjct: 181 LIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYSRNSHL-LGGEVV 239
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP++++G YV ++K G WQ ++ + + +T VCE GC +VD+G S ++GPT
Sbjct: 240 LGGSDPQYYQGNFHYVSISKTGSWQIKMKGVSV-RSATLVCEEGCMVVVDTGASYISGPT 298
Query: 301 PVVTEINHAIGGEGVVSAE 319
+ + +G + + + E
Sbjct: 299 SSLRLLMDTLGAQELSTNE 317
>gi|326475448|gb|EGD99457.1| aspartyl proteinase [Trichophyton tonsurans CBS 112818]
gi|326477485|gb|EGE01495.1| vacuolar protease A [Trichophyton equinum CBS 127.97]
Length = 400
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 178/296 (60%), Gaps = 18/296 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL-------GDSDEDILPLKNF 80
L+++ LK++ L+ ++ + ++YMG +H +S ++L + NF
Sbjct: 25 LKKVSLKEQ-LERADIDVQVKSLGQKYMGIRPEQHEQHMFKEQTPIEAESGHNVL-IDNF 82
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY++ G
Sbjct: 83 LNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTYSKNG 141
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
I YGSGS+ GF S+DNV++GD+ +K Q+F EAT E L F RFDGI+G+GF I
Sbjct: 142 TKFAIRYGSGSLEGFVSRDNVKIGDMTIKKQLFAEATSEPGLAFAFGRFDGIMGMGFSSI 201
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+V P + NM++QGL+ E VFSF+L N+D D + FGG D HF G T +P
Sbjct: 202 SVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQS---VVTFGGSDASHFTGDMTTIP 258
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG +
Sbjct: 259 LRRKAYWEVDFDAISLGEDTAALENTGV--ILDTGTSLIALPTTLAEMINTQIGAK 312
>gi|451992127|gb|EMD84649.1| hypothetical protein COCHEDRAFT_1189444 [Cochliobolus
heterostrophus C5]
gi|452004574|gb|EMD97030.1| hypothetical protein COCHEDRAFT_1189956 [Cochliobolus
heterostrophus C5]
Length = 399
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 114/250 (45%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ N+++AQYF EI +G+PPQ+F VI DTGSSNLWVPS++C SI+C+ H +Y S S+
Sbjct: 77 VPVSNYLNAQYFSEISLGTPPQSFKVILDTGSSNLWVPSTQCT-SIACFLHDKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+ GF S D +++GD+ VK+Q F EAT E L F +FDGI+G
Sbjct: 136 TYQKNGSDFEIRYGSGSMKGFVSNDVLQIGDLKVKNQDFAEATSEPGLAFAFGKFDGILG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM+ QGL+ E VF+F+L D ++G E FGG+D H+ GK
Sbjct: 196 LGYDTISVNHIVPPFYNMINQGLLDEPVFAFYLGDVAD-KQGSEATFGGIDESHYTGKLI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ +L I G ++ G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVDLDAITFGKETAETENVGV--ILDTGTSLIALPSAMAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G S EC
Sbjct: 313 GFNGQYSVEC 322
>gi|388579370|gb|EIM19694.1| aspartyl proteinase [Wallemia sebi CBS 633.66]
Length = 411
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 113/256 (44%), Positives = 165/256 (64%), Gaps = 4/256 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
LP+ NF++AQY+ EIG+GSP Q F+V+ DTGSSNLWVPS+KC SI+C+ H ++ +S
Sbjct: 89 LPVSNFLNAQYYAEIGLGSPEQKFNVVLDTGSSNLWVPSNKC-MSIACFLHRKFNPEESK 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G EI YGSGS+ G QD + + D+ VK+Q+F EAT E L F +FDGI+G
Sbjct: 148 SYKANGTDFEIRYGSGSLKGIVGQDTLAIDDLHVKNQLFAEATSEPGLAFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V D P + N+++QGL+ E VFSF+L + +E + VFGG+D H+KG+
Sbjct: 208 LGYDTISVNDIPPPFYNLIDQGLLDEPVFSFYLTDEQSGKE-SQAVFGGIDHDHYKGQLH 266
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVP+ +KGYW+ EL + G+ + G A +D+GTSL+A PT + +N IG +
Sbjct: 267 YVPLRRKGYWEVELEKLTFGDDEVELENTGAA--IDTGTSLIAIPTDMAEMLNKMIGAKK 324
Query: 315 VVSAECKLVVSQYGDL 330
S + + ++ DL
Sbjct: 325 SWSGQYTVDCNKVDDL 340
>gi|452840489|gb|EME42427.1| hypothetical protein DOTSEDRAFT_73302 [Dothistroma septosporum
NZE10]
Length = 398
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 111/237 (46%), Positives = 160/237 (67%), Gaps = 4/237 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+CY HS+Y +S+TY
Sbjct: 78 VDNFLNAQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDHSESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+ G+ S+D V++GD+ +KDQ+F EAT E L F RFDGI+GLG
Sbjct: 137 KKNGSDFAIRYGSGSLEGYVSKDTVQIGDLKIKDQLFAEATSEPGLAFAFGRFDGILGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V P + NM++Q L+ E+VF+F+L+ D + + E +FGGV+ H+ G+ T +
Sbjct: 197 YDTISVNGIPPPFYNMIDQDLLDEKVFAFYLS-DTNKGDESEAIFGGVNKDHYTGEMTKI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
P+ +K YW+ +L I G+Q+ + G AI+D+GTSLLA P+ + +N IG +
Sbjct: 256 PLRRKAYWEVDLDAITFGDQTAEIDSTG--AILDTGTSLLALPSTLAELLNKEIGAK 310
>gi|121705756|ref|XP_001271141.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
gi|119399287|gb|EAW09715.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
Length = 398
Score = 230 bits (587), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 133/326 (40%), Positives = 194/326 (59%), Gaps = 29/326 (8%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASALLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPNIHKELL 62
Query: 67 ----LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
D S D+L + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+
Sbjct: 63 EENSFNDMSRHDVL-VDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSEC-GSIA 120
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY H++Y S S+TY + G I YGSG +SGF SQDN+++GD+ ++ Q F EAT E
Sbjct: 121 CYLHTKYDSSASSTYKKNGTEFAIRYGSGELSGFVSQDNLKIGDLKIEKQDFAEATNEPG 180
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE--- 238
L F RFDGI+GLG+ I+V VP + NM+ QGL+ E VF+F+L DA + G+
Sbjct: 181 LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLNQGLLDEPVFAFYLG---DANKEGDSSV 237
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
FGG+D HF G+ T +P+ +K YW+ +L I +G+ + G I+D+GTSL+A
Sbjct: 238 ATFGGIDKDHFTGELTKIPLRRKAYWEVDLDAIALGDNVAELDNTGV--ILDTGTSLIAL 295
Query: 299 PTPVVTEINHAIGGE----GVVSAEC 320
P+ + +N IG + G S EC
Sbjct: 296 PSTLADLLNKEIGAKKGFTGQYSVEC 321
>gi|70999520|ref|XP_754479.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|74675969|sp|O42630.1|CARP_ASPFU RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase pep2; AltName: Full=Aspartic protease
pep2; Flags: Precursor
gi|2664292|emb|CAA75754.1| cellular aspartic protease [Aspergillus fumigatus]
gi|4200293|emb|CAA10674.1| aspartic protease [Aspergillus fumigatus]
gi|66852116|gb|EAL92441.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|159127496|gb|EDP52611.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus A1163]
Length = 398
Score = 230 bits (587), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 189/323 (58%), Gaps = 23/323 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLD----LHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL ++S + ++ L K LD H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMG---IRPNVHQELL 62
Query: 67 ----LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
L D S D+L + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVP S C SI+
Sbjct: 63 EENSLNDMSRHDVL-VDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIA 120
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ H++Y S S+TY G I YGSG +SGF SQD +++GD+ V Q F EAT E
Sbjct: 121 CFLHNKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPG 180
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L + E F
Sbjct: 181 LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDNSEASF 240
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD H+ G+ T +P+ +K YW+ + I +G+ + G I+D+GTSL+A P+
Sbjct: 241 GGVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGI--ILDTGTSLIALPST 298
Query: 302 VVTEINHAIGGE----GVVSAEC 320
+ +N IG + G S EC
Sbjct: 299 LADLLNKEIGAKKGFTGQYSIEC 321
>gi|195121164|ref|XP_002005091.1| GI20282 [Drosophila mojavensis]
gi|193910159|gb|EDW09026.1| GI20282 [Drosophila mojavensis]
Length = 392
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 112/256 (43%), Positives = 158/256 (61%), Gaps = 2/256 (0%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N++DAQYFG I IG+P Q F+VIFDTGS+NLWVPS C ++C HSR+ ++KS+
Sbjct: 63 VPLSNYLDAQYFGPISIGTPQQTFNVIFDTGSANLWVPSESCQKKLACQIHSRFNAKKSS 122
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y GK +I YGSGS++G+ S D V V + + +Q F EAT FL A+FDGI G
Sbjct: 123 SYRSNGKRFDIQYGSGSLAGYLSHDTVRVAGLEIPNQTFAEATDMPGPIFLAAKFDGIFG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+R I++ + P + ++EQ L+ VFS +LNR+ + +GG + FGG ++++G T
Sbjct: 183 LGYRGISIQNIKPPFYAIMEQNLLKRPVFSVYLNRELGSNQGGYLFFGGSSSRYYRGNFT 242
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
YVPVT + YWQ +L IG +C GC I+D+GTS LA P IN +IGG
Sbjct: 243 YVPVTHRAYWQVKLETARIGKLQ--LCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTP 300
Query: 315 VVSAECKLVVSQYGDL 330
+ + Q L
Sbjct: 301 AAYGQFSVPCDQVAHL 316
>gi|145232965|ref|XP_001399855.1| vacuolar protease A [Aspergillus niger CBS 513.88]
gi|134056777|emb|CAK37685.1| aspartic protease pepE-Aspergillus niger
Length = 398
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 191/314 (60%), Gaps = 23/314 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ D S D+L + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+
Sbjct: 63 EENPINDMSRHDVL-VDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIA 120
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E
Sbjct: 121 CYLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPG 180
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV- 240
L F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L +EG E V
Sbjct: 181 LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGD--TNKEGDESVA 238
Query: 241 -FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
FGGVD H+ G+ +P+ +K YW+ EL I +G+ + G I+D+GTSL+A P
Sbjct: 239 TFGGVDKDHYTGELIKIPLRRKAYWEVELDAIALGDDVAEMENTGV--ILDTGTSLIALP 296
Query: 300 TPVVTEINHAIGGE 313
+ IN IG +
Sbjct: 297 ADLAEMINAQIGAK 310
>gi|355558869|gb|EHH15649.1| Renin [Macaca mulatta]
gi|355746005|gb|EHH50630.1| Renin [Macaca fascicularis]
Length = 406
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 190/307 (61%), Gaps = 20/307 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG------VRHRLGD 69
SC LP + +RI LK+ + + R + KER + A + R LG+
Sbjct: 19 SCTFGLPTDTTTFKRIFLKR-------MPSIRESLKERGVDMARLGPEWSQPMKRLALGN 71
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
+ ++ L N+MD QY+GEIGIG+PPQ F V+FDTGSSN+WVPSSKC +C +H +
Sbjct: 72 TTSSVI-LTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLF 130
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+ S++Y G + Y +G++SGF SQD + VG + V Q+F E T +L F+LA
Sbjct: 131 DASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAE 189
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR-DPDAEE-GGEIVFGGVDP 246
FDG++G+GF E A+G P++DN++ QG++ E+VFSF+ NR +A+ GG+IV GG DP
Sbjct: 190 FDGVVGMGFIEQAIGRVTPIFDNILSQGVLKEDVFSFYYNRWGLNAQSLGGQIVLGGSDP 249
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+H++G Y+ + K G WQ + + +G+ ST +CE GC A+VD+G S ++G T + ++
Sbjct: 250 QHYEGNFHYINLIKTGVWQIPMKGVSVGS-STLLCEDGCLALVDTGASYISGSTSSIEKL 308
Query: 307 NHAIGGE 313
A+G +
Sbjct: 309 MEALGAK 315
>gi|330930051|ref|XP_003302872.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
gi|311321500|gb|EFQ89048.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
Length = 399
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 114/250 (45%), Positives = 162/250 (64%), Gaps = 8/250 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPS+ C SI+CY H++Y S S+
Sbjct: 77 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSTSCN-SIACYLHTKYDSSSSS 135
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G EI YGSGS+SGF S D ++GD+ VK+Q F EAT E L F RFDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + NM++QGL+ E VF+F+L D + ++ E FGG+D + GK
Sbjct: 196 LGYDTISVKGIVPPFYNMLDQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ EL + G ++ + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 314 ---GVVSAEC 320
G + EC
Sbjct: 313 SFNGQYTVEC 322
>gi|530795|gb|AAA20876.1| pepsinogen [Aspergillus niger]
gi|350634685|gb|EHA23047.1| extracellular aspartic protease [Aspergillus niger ATCC 1015]
Length = 398
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 191/314 (60%), Gaps = 23/314 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGD-SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
+ D S D+L + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+
Sbjct: 63 EENPINDMSRHDVL-VDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIA 120
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
CY H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E
Sbjct: 121 CYLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFVSQDTLKIGDLKVKGQDFAEATNEPG 180
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV- 240
L F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L +EG E V
Sbjct: 181 LAFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGD--TNKEGDESVA 238
Query: 241 -FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
FGGVD H+ G+ +P+ +K YW+ EL I +G+ + G I+D+GTSL+A P
Sbjct: 239 TFGGVDKDHYTGELIKIPLRRKAYWEVELDAIALGDDVAEMENTGV--ILDTGTSLIALP 296
Query: 300 TPVVTEINHAIGGE 313
+ IN IG +
Sbjct: 297 ADLAEMINAQIGAK 310
>gi|367047895|ref|XP_003654327.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
gi|347001590|gb|AEO67991.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
Length = 396
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 115/239 (48%), Positives = 157/239 (65%), Gaps = 6/239 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ N+M+AQYF EI +G+PPQ+F V+ DTGSSNLWVPS +C SI+CY HS+Y S S+
Sbjct: 76 VPISNYMNAQYFSEITLGTPPQSFKVVLDTGSSNLWVPSVEC-GSIACYLHSKYDSSASS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S +I YGSGS+SGF SQD + +GD+ VK Q F EAT E L F RFDGI+G
Sbjct: 135 TYKKNGTSFDIRYGSGSLSGFVSQDTLSIGDITVKGQDFAEATSEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + MVEQ LV E VF+F+L D E+VFGGVD +KGK T
Sbjct: 195 LGYDTISVNGIVPPFYKMVEQKLVDEPVFAFYL---ADTNGESEVVFGGVDKDRYKGKIT 251
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P+ +K YW+ + + G+ + G AI+D+GTSL+ P+ + +N +G +
Sbjct: 252 TIPLRRKAYWEVDFESLSYGDDTADFENTG--AILDTGTSLITLPSQLAEMLNAQLGAK 308
>gi|283806612|ref|NP_001164557.1| pepsin II-2/3 precursor [Oryctolagus cuniculus]
gi|129781|sp|P27821.1|PEPA2_RABIT RecName: Full=Pepsin II-2/3; AltName: Full=Pepsin A; Flags:
Precursor
gi|165600|gb|AAA85369.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 162/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
++N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 MENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E SLTFL A FDGI+GL
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPSLTFLFAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISSSDATPVFDNMWNEGLVSQDLFSVYLSS--DDEKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLL GPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
>gi|315051426|ref|XP_003175087.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
gi|311340402|gb|EFQ99604.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
Length = 401
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 112/236 (47%), Positives = 149/236 (63%), Gaps = 3/236 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY
Sbjct: 80 IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTY 138
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+ GF SQD+V++GD+ +KDQ+F EAT E L F RFDGI+G+G
Sbjct: 139 HKNGTKFAIRYGSGSLEGFVSQDDVKIGDMTIKDQLFAEATSEPGLAFAFGRFDGIMGMG 198
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F I+V P + M++QGL+ E VFSF+L + + FGG D HF GK T +
Sbjct: 199 FSSISVNGITPPFYKMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDESHFTGKMTTI 258
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
P+ +K YW+ E I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 259 PLRRKAYWEVEFNAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINSQIGA 312
>gi|302657131|ref|XP_003020295.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
gi|306531031|sp|D4DEN7.1|CARP_TRIVH RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291184114|gb|EFE39677.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
Length = 400
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 121/295 (41%), Positives = 177/295 (60%), Gaps = 18/295 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL----------GDSDEDILPL 77
L+++ LK++ L+ ++ + ++YMG + +H +S ++L +
Sbjct: 25 LKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPEQHEQQMFKEQTPIEAESGHNVL-I 79
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY+
Sbjct: 80 DNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTYS 138
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G I YGSGS+ GF SQD+V++GD+ +K+Q+F EAT E L F RFDGI+G+GF
Sbjct: 139 KNGTKFAIRYGSGSLEGFVSQDSVKIGDMTIKNQLFAEATSEPGLAFAFGRFDGIMGMGF 198
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V P + NM++QGL+ E VFSF+L + + FGG D KHF G T +P
Sbjct: 199 SSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDTKHFTGDMTTIP 258
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 259 LRRKAYWEVDFDAISLGEDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGA 311
>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 416
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/331 (39%), Positives = 186/331 (56%), Gaps = 36/331 (10%)
Query: 24 SSNGLRRIGLKK--RRLDLHSLNAARITRK-------ERYMGGAGVSGVRHRL---GDSD 71
S++G+ ++ LKK + L +A + K + + G+ + R R G SD
Sbjct: 17 SASGIHKLKLKKLPQVASNQHLESAYLAEKYGAQAPAQMPLAGSADAAGRMRFSRPGQSD 76
Query: 72 EDI---------------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC 116
+D+ +PL NFM+AQY+ EI IG+PPQ F VI DTGSSNLWVPSS+C
Sbjct: 77 DDLFWTQEESIIANGGHGVPLTNFMNAQYYTEIDIGTPPQTFKVILDTGSSNLWVPSSQC 136
Query: 117 YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
SI+C+ H++Y S S++Y G I YGSGS+ GF S D++ GD+ + F EA
Sbjct: 137 T-SIACFLHTKYDSSASSSYKANGTEFSIQYGSGSMEGFVSNDDIVFGDMSLSSVDFAEA 195
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T+E L F +FDGI+GL + IAV PV+ +V QG++SE VFSF L D +G
Sbjct: 196 TKEPGLAFAFGKFDGILGLAYDTIAVNHITPVFYELVNQGIISEPVFSFRLGSSED--DG 253
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GE +FGG+DP + GK Y PV +K YW+ EL + G+ + G A +D+GTSL+
Sbjct: 254 GEAIFGGIDPSAYSGKIDYAPVRRKAYWEVELEKVSFGDDDLELENTGAA--IDTGTSLI 311
Query: 297 AGPTPVVTEINHAIGGE----GVVSAECKLV 323
A PT V +N IG + G + +C V
Sbjct: 312 ALPTDVAEMLNTQIGAKKSWNGQYTVDCAKV 342
>gi|410986287|ref|XP_003999442.1| PREDICTED: renin [Felis catus]
Length = 407
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 26/339 (7%)
Query: 1 MEQ--KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGG 57
M+Q ++ R L + +SC LPA S RRI LKK + + R + KER +
Sbjct: 1 MDQGSRMPRWGLLLVLCSSCTFGLPADSGAFRRIFLKK-------MPSIRESLKERGVDV 53
Query: 58 AGVSG------VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
A + R G+S ++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWV
Sbjct: 54 ARLGAEWSQFTKRFSFGNSTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWV 112
Query: 112 PSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKD 170
PS+KC +C HS Y S +S++Y E G + I+YGSG + GF SQD V VG + V
Sbjct: 113 PSTKCSPLYTACEIHSLYDSSESSSYMENGTAFAIHYGSGKVKGFLSQDEVTVGGITVT- 171
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
Q F E T + F+LA+FDGI+G+GF AVG PV+D+++ QG++ E+VFS + +R+
Sbjct: 172 QTFGEVTELPLIPFMLAKFDGILGMGFPAQAVGGVTPVFDHILSQGVLKEDVFSVYYSRN 231
Query: 231 PDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAI 288
GGE+V GG DP++++G YV V+K G WQ ++ + + +T VCE GC +
Sbjct: 232 SKNSHLLGGEVVLGGSDPQYYQGNFHYVSVSKTGSWQIKMKGVSV-RSATVVCEEGCMVV 290
Query: 289 VDSGTSLLAGPTPVVTEINHAIGGEGVVSAE----CKLV 323
VD+G S ++GPT + + +G + + E CK V
Sbjct: 291 VDTGASYISGPTSSLRLLMETLGAKELSRNEYVVNCKQV 329
>gi|241687194|ref|XP_002412838.1| aspartyl protease, putative [Ixodes scapularis]
gi|215506640|gb|EEC16134.1| aspartyl protease, putative [Ixodes scapularis]
Length = 320
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/230 (51%), Positives = 155/230 (67%), Gaps = 3/230 (1%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
+Y+G I IG+PPQ+F VIFDTGS+NLW+PSSKC + C H RY S +S+TY G++
Sbjct: 3 EYYGPITIGTPPQDFQVIFDTGSANLWLPSSKCT-TKYCLHHHRYDSSRSSTYEADGRNF 61
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSG++ GF S+D +G V Q EA G + L A FDGI+GL + IAV
Sbjct: 62 TIVYGSGNVEGFISKDVCRIGSAKVSGQPLGEALVVGGESLLEAPFDGILGLAYPSIAVD 121
Query: 204 DAVPVWDNMVEQGLVSEE-VFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VPV+DNM++QGL+ E+ VFS +LNRDP ++EGGEI+FGG+D H+KG TYVPVT KG
Sbjct: 122 GVVPVFDNMMKQGLLGEQNVFSVYLNRDPSSKEGGEILFGGIDHDHYKGSITYVPVTAKG 181
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YWQF + D + +C+ GC AI D+GTSL+ GP V +N +GG
Sbjct: 182 YWQFHV-DGASKSVPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYLGG 230
>gi|358372259|dbj|GAA88863.1| aspartic protease (PepE) [Aspergillus kawachii IFO 4308]
Length = 398
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 184/311 (59%), Gaps = 17/311 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHR--- 66
+L + +LL +S + ++ L K +L H+++A ++YMG + H+
Sbjct: 6 LLTASVLLGCASAEVHKLKLNKVPLEEQLYTHNIDAHVRALGQKYMG---IRPSIHKELV 62
Query: 67 ----LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+ D + + NF++AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C SI+C
Sbjct: 63 EENPINDMSRHDVLVDNFLNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIAC 121
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y H++Y S S+TY + G I YGSGS+SGF SQD +++GD+ VK Q F EAT E L
Sbjct: 122 YLHNKYDSSASSTYHKNGSEFAIKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPGL 181
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F RFDGI+GLG+ I+V VP + NM++QGL+ E VF+F+L + FG
Sbjct: 182 AFAFGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDDSVATFG 241
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD H+ G+ +P+ +K YW+ +L I +G+ + G I+D+GTSL+A P +
Sbjct: 242 GVDKDHYTGELIKIPLRRKAYWEVDLDAIALGDDVAELDNTGV--ILDTGTSLIALPADL 299
Query: 303 VTEINHAIGGE 313
IN IG +
Sbjct: 300 AEMINAQIGAK 310
>gi|449481456|ref|XP_002189698.2| PREDICTED: cathepsin E-A-like [Taeniopygia guttata]
Length = 405
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 108/236 (45%), Positives = 161/236 (68%), Gaps = 2/236 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G + +G+PPQ+F+V+FDTGSSN WVPS+ C S +C H ++KS KS++Y
Sbjct: 73 LYDYMNAQYYGVVSVGTPPQSFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFKSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YGSG + G +D +++ ++ +K Q F E+ E TF+LA FDG++GLG
Sbjct: 132 EHGGEAFSLQYGSGQLLGIAGKDTLQISNISIKGQDFGESVFEPGATFVLAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQHLVEEPVFSFYLKRGEDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
PVT+K YWQ + +I I + T C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHMNNIKIQGRVT-FCSHGCEAIVDSGTSLITGPSSQIRRLQAYIGA 306
>gi|291409616|ref|XP_002721074.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 126/298 (42%), Positives = 178/298 (59%), Gaps = 22/298 (7%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL + L HS N A +Y A + E ++N+MDA+YFG I I
Sbjct: 36 GLLQDYLKTHSPNPAT-----KYFPNAAYA---------KESTEKMENYMDAEYFGTISI 81
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ+F+VIFDTGSSNLWVPS C S++C FH ++ +KS+TY K+ I YG+GS
Sbjct: 82 GTPPQDFTVIFDTGSSNLWVPSIYCS-SLACAFHKQFNPKKSSTYQATDKTVSIAYGTGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
++G D V+VG + Q+F + E TF+ A FDGI+GLG+ I+ DA PV+DN
Sbjct: 141 MTGILGYDIVKVGSIDDTHQIFGLSETEPGDTFVFAPFDGILGLGYPSISSSDATPVFDN 200
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M + LVSE++FS +L+ D ++G ++FGG+D ++KG +VPV+ +GYWQF + +
Sbjct: 201 MWDHRLVSEDLFSVYLSS--DDKKGSLVMFGGIDESYYKGSLHWVPVSYEGYWQFTMDSV 258
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI----GGEGVVSAECKLVVS 325
I N T C C AI+D+GTSLLAGPT +++I I EG +C V S
Sbjct: 259 TI-NGKTIACADSCQAIIDTGTSLLAGPTNAISKIQRHIRAYDNSEGEAIVKCSDVKS 315
>gi|407260952|ref|XP_003946102.1| PREDICTED: renin-1-like [Mus musculus]
Length = 400
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/322 (40%), Positives = 189/322 (58%), Gaps = 30/322 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 3 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 53
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 54 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 108
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 109 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 167
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 168 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRKTKG 227
Query: 234 EE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+
Sbjct: 228 SHLLGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDT 286
Query: 292 GTSLLAGPTPVVTEINHAIGGE 313
G+S ++ PT + I A+G +
Sbjct: 287 GSSFISAPTSSLKLIMQALGAK 308
>gi|390601248|gb|EIN10642.1| endopeptidase [Punctularia strigosozonata HHB-11173 SS5]
Length = 412
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 118/253 (46%), Positives = 159/253 (62%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI +G+PPQ+F VI DTGSSNLWVPS KC SI+C+ H +Y S +S+
Sbjct: 91 VPLSNFMNAQYFSEITLGTPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHQKYDSSQSS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ GF S+D + +GD+ +K Q F EAT+E L F +FDGI+G
Sbjct: 150 SYKANGSEFSIQYGSGSMEGFVSRDTLTIGDLTIKGQDFAEATKEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + +M+ L+ + VFSF L E+GGE VFGG+D ++GK T
Sbjct: 210 LGYDTISVNHITPPFYSMINAALLDDPVFSFRLGSS--EEDGGEAVFGGIDSSAYEGKIT 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV +K YW+ EL I G+ + G A +D+GTSL+A PT + +N IG
Sbjct: 268 YVPVRRKAYWEVELEKIKFGDDELELENTGAA--IDTGTSLIALPTDLAEMLNAQIGATK 325
Query: 313 --EGVVSAECKLV 323
G + EC V
Sbjct: 326 SWNGQYTVECSKV 338
>gi|30575834|gb|AAP32823.1| aspartyl proteinase [Paracoccidioides brasiliensis]
Length = 400
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/291 (42%), Positives = 176/291 (60%), Gaps = 10/291 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DILPLKNFMDA 83
L +I L ++ LD ++ ++YMG + D+ + + + NF++A
Sbjct: 26 LNKISLSQQ-LDHANIETQVKALGQKYMGVRPSQHLNEMFKDTSKASGGHSVLVDNFLNA 84
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+CY HS+Y S S+T+ + G
Sbjct: 85 QYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACYLHSKYDSSASSTHRKNGTEF 143
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSGS+SGF SQD + +GD+ V+ Q F EAT E L F RFDGI+GLG+ I+V
Sbjct: 144 AIRYGSGSLSGFVSQDVLRIGDMTVESQDFAEATSEPGLAFAFGRFDGILGLGYDTISVN 203
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKG 262
VP + MV QGL+ E VFSF+L N D D ++ E FGG+D H+ G T + + +K
Sbjct: 204 RIVPTFYLMVNQGLLDEPVFSFYLGNSDTDGDD-SEATFGGIDKDHYTGNLTMISLRRKA 262
Query: 263 YWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YW+ +L I G+++ + G I+D+GTSLLA P+ V +N IG +
Sbjct: 263 YWEVDLDAITFGSETAELENTGV--ILDTGTSLLALPSTVAEILNQKIGAK 311
>gi|425767355|gb|EKV05929.1| Vacuolar protease A [Penicillium digitatum PHI26]
gi|425779798|gb|EKV17829.1| Vacuolar protease A [Penicillium digitatum Pd1]
Length = 399
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 186/311 (59%), Gaps = 27/311 (8%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------- 76
+ R+ L K +L+ H+++A ++YMG + +H+ D + P
Sbjct: 21 VHRLKLNKVPLSEQLNTHNIDAHLHNLGQKYMG---IRPEKHQDLFHDTSLNPASGHDVL 77
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 78 VDNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G EI YGSGS+SGF S+D +++GD+ V+ Q F EAT E L F RFDGI+GLG
Sbjct: 137 QKNGTDFEIRYGSGSLSGFVSRDTLQIGDLKVEGQDFAEATNEPGLAFAFGRFDGILGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE---IVFGGVDPKHFKGKH 253
+ I+V VP + M++Q LV E VF+F+L DA + G+ FGG+D H+ G+
Sbjct: 197 YDTISVNKMVPPFYQMIKQKLVDEPVFAFYLG---DANKDGDNSVATFGGIDESHYTGEL 253
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
+PV +K YW+ EL I +GN + + G I+D+GTSL+A P+ + +N IG
Sbjct: 254 IKIPVRRKAYWEVELNSIALGNNVAELDDTGV--ILDTGTSLIALPSTMAELLNKEIGAT 311
Query: 313 ---EGVVSAEC 320
G S EC
Sbjct: 312 KGFTGQYSVEC 322
>gi|119567604|gb|ABL84270.1| aspartic protease [Musca domestica]
Length = 379
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 120/262 (45%), Positives = 167/262 (63%), Gaps = 12/262 (4%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
SDE PL+N ++ +Y+G+I IG+PPQ F V+FDTGSSNLWVPSS C+ + I+C H++Y
Sbjct: 57 SDE---PLENSLNMKYYGDITIGTPPQKFVVLFDTGSSNLWVPSSHCWIWDIACKKHNQY 113
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S+TY + G+ I+YGSGS+SGF SQD+V V + +K+QVF EA E +F A
Sbjct: 114 NHDDSSTYVKNGELISISYGSGSMSGFLSQDDVTVEGLTIKNQVFAEAMNEPGNSFTDAN 173
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI G+ ++ +A + VP + NM QGLV +FSF LNRD + +GG+++ GGVD
Sbjct: 174 FDGIFGMAYQSLAEDNVVPPFYNMFAQGLVDANMFSFLLNRDGTSTDGGQMILGGVDSSL 233
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G TYVPV+ +GYWQFE+ I QS +C+ C AI D+GTSL+ P+ +N
Sbjct: 234 YTGDITYVPVSSQGYWQFEVTSGAIKGQS--ICD-NCQAIADTGTSLIVAPSDAYNTLNA 290
Query: 309 AIGG-----EGVVSAECKLVVS 325
IG +G +C V S
Sbjct: 291 EIGATYNEDDGNYYVDCSAVDS 312
>gi|283806610|ref|NP_001164556.1| pepsin II-4 precursor [Oryctolagus cuniculus]
gi|129787|sp|P28713.1|PEPA4_RABIT RecName: Full=Pepsin II-4; AltName: Full=Pepsin A; Flags: Precursor
gi|22218076|dbj|BAC07515.1| pepsinogen II-4 [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/247 (46%), Positives = 161/247 (65%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++DA+YFG I IG+PPQ+F+VIFDTGSSNLWVPS+ C S++C H R+ S+TY
Sbjct: 67 LENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTY 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E LTFL A FDGI+GL
Sbjct: 126 QGTSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM +GLVS+++FS +L+ D E+G ++FGG+D ++ G +V
Sbjct: 186 YPSISSSDATPVFDNMWNEGLVSQDLFSVYLSS--DDEKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + + I N T C C AIVD+GTSLL GPT ++ I IG
Sbjct: 244 PVSYEGYWQITMDSVSI-NGETIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE V+S
Sbjct: 303 LGENVIS 309
>gi|409050032|gb|EKM59509.1| hypothetical protein PHACADRAFT_250062 [Phanerochaete carnosa
HHB-10118-sp]
Length = 407
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 135/334 (40%), Positives = 186/334 (55%), Gaps = 30/334 (8%)
Query: 15 LASCLLLP--ASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
LA ++LP A++ G+ + L K + S + A + M GAG +G RL
Sbjct: 6 LAPLVILPFAAAAAGVHKFKLHKLPPVSQDFAFESAHLAEKYGGQVPMLGAGGAGRNVRL 65
Query: 68 GDSDED--------------ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS 113
D LPL+NFM+AQYF I IG+PPQ+F+VI DTGSSNLWVPS
Sbjct: 66 SRPTPDDGLFRTQEEFTSGHTLPLQNFMNAQYFTTIEIGTPPQSFNVILDTGSSNLWVPS 125
Query: 114 SKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
++C SI+C+ H +Y S S+TY G I YGSGS+ GF S+D + +GD+ + Q F
Sbjct: 126 TQCT-SIACFLHKKYDSGSSSTYKPNGSEFSIQYGSGSMEGFVSRDVLTMGDITIGQQDF 184
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
EAT+E L F +FDGI+GL + IAV P NM E+GL+ + VF+F L
Sbjct: 185 AEATKEPGLAFAFGKFDGILGLAYDTIAVNHITPPHYNMFEKGLIEKPVFAFRLGS--TE 242
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
E+ GE FGG+D F+GK VPV +K YW+ EL + +G+ + + G A +D+GT
Sbjct: 243 EDAGEATFGGIDESAFEGKLHRVPVRRKAYWEVELEKVRLGDDELELEDTGAA--IDTGT 300
Query: 294 SLLAGPTPVVTEINHAIGGE----GVVSAECKLV 323
SL+A PT + IN IG + G + EC V
Sbjct: 301 SLIALPTDMAEMINAQIGAKRGWNGQYTVECSTV 334
>gi|395821502|ref|XP_003784077.1| PREDICTED: gastricsin-like [Otolemur garnettii]
Length = 390
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 113/302 (37%), Positives = 183/302 (60%), Gaps = 4/302 (1%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
L ++ +CL L S GL R+ L+K + ++ + + G ++ G+
Sbjct: 4 LVLILACLYL---SEGLERVILRKGKSIRQAMEEQGVLEEYLKNHPKGDPVAKYHFGNYA 60
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
P+ N+M++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H +
Sbjct: 61 VAYEPITNYMESFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHHVFNPS 119
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S+T++ G++ ++YGSGS++ D V + ++VV +Q F + E ++ F + FDG
Sbjct: 120 QSSTFSNNGQTYTLSYGSGSLTVVMGYDTVTIQNIVVNNQEFGLSENEPTVPFYYSAFDG 179
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+G+ + IAVG+A V +M++Q +++ +FSF+ +R P A+ GGE++ GGVD + + G
Sbjct: 180 ILGMAYPAIAVGNAPTVVQDMLQQNQLTQPIFSFYFSRQPTAQYGGELILGGVDSQLYSG 239
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+ + PVT++ YWQ + + IGNQ+TG+C GC IVD+GTSLL P ++ A G
Sbjct: 240 EIVWTPVTQEMYWQIAIQEFSIGNQATGLCSQGCQGIVDTGTSLLTVPQQYISSFVEATG 299
Query: 312 GE 313
+
Sbjct: 300 AQ 301
>gi|13676837|ref|NP_112469.1| renin-1 precursor [Mus musculus]
gi|132327|sp|P06281.1|RENI1_MOUSE RecName: Full=Renin-1; AltName: Full=Angiotensinogenase; AltName:
Full=Kidney renin; Flags: Precursor
gi|53931|emb|CAA34636.1| unnamed protein product [Mus musculus]
gi|26342875|dbj|BAC35094.1| unnamed protein product [Mus musculus]
gi|26351563|dbj|BAC39418.1| unnamed protein product [Mus musculus]
gi|38512029|gb|AAH61053.1| Renin 1 structural [Mus musculus]
gi|148707703|gb|EDL39650.1| mCG131545 [Mus musculus]
Length = 402
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 9/310 (2%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L ++ LW + C LP + RI LKK + + R R GV R
Sbjct: 8 LWALLLLW--SPCTFSLPTRTATFERIPLKKMP-SVREILEERGVDMTRLSAEWGVFTKR 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCY 123
L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 PSLTNLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACG 123
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T +
Sbjct: 124 IHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIP 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR GGE+V GG
Sbjct: 183 FMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL-LGGEVVLGG 241
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+S ++ PT +
Sbjct: 242 SDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGSSFISAPTSSL 300
Query: 304 TEINHAIGGE 313
I A+G +
Sbjct: 301 KLIMQALGAK 310
>gi|326911558|ref|XP_003202125.1| PREDICTED: cathepsin E-A-like [Meleagris gallopavo]
Length = 404
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G I +G+PPQ+F+V+FDTGSSN WVPS C S +C H R+KS S++Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D +++ ++ +K Q F E+ E +TF LA FDG++GLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 317 SAE 319
S E
Sbjct: 311 SGE 313
>gi|449549767|gb|EMD40732.1| hypothetical protein CERSUDRAFT_44393 [Ceriporiopsis subvermispora
B]
Length = 413
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 187/329 (56%), Gaps = 34/329 (10%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-------GAGVSGVRHRLGD------- 69
+++G+ R+ L K + E+Y G GAG G RLG
Sbjct: 15 AADGVHRLKLHKVPPTTSNPALESAYLAEKYGGQAQSPLMGAGGYGRNVRLGRPTHQDGE 74
Query: 70 ----SDEDIL-------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+ ED++ PL NFM+AQYF EI +G+PPQ+F V+ DTGSSNLWVPS+KC
Sbjct: 75 ELFWTQEDLVTEGGHTVPLSNFMNAQYFAEITLGTPPQSFKVVLDTGSSNLWVPSTKCT- 133
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
SI+C+ H++Y S S++Y G EI+YGSGS+ GF SQD + +GD+ + + F EAT+
Sbjct: 134 SIACFLHAKYDSSASSSYKANGTEFEIHYGSGSMEGFISQDVLSIGDISINNLDFAEATK 193
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E L F +FDGI+GL + I+V VP + +MV + L+ VFSF L E+GGE
Sbjct: 194 EPGLAFAFGKFDGILGLAYDTISVNHVVPPFYHMVNKNLIDSPVFSFRLGS--SEEDGGE 251
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+FGGVD + GK YVPV +K YW+ EL I +G+ + G A +D+GTSL+A
Sbjct: 252 AIFGGVDESAYTGKIDYVPVRRKAYWEVELQKISLGDDELELENTGAA--IDTGTSLIAL 309
Query: 299 PTPVVTEINHAIGGE----GVVSAECKLV 323
P+ + +N IG + G + EC+ V
Sbjct: 310 PSDMAEMLNTQIGAKRSWNGQYTVECEKV 338
>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
Length = 412
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 156/255 (61%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQYF EI +G+PPQ F VI DTGSSNLWVPS KC SI+C+ H++Y S S+
Sbjct: 91 VPLSNFMNAQYFAEISLGTPPQTFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSSSSS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF SQD +GD+ V F EAT+E L F +FDGI+G
Sbjct: 150 TYKANGTEFSIQYGSGSMEGFVSQDTFRIGDLTVDGLDFAEATKEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + IAV P + +++ +GLV E VFSF L D +GGE +FGGVD + GK
Sbjct: 210 LAYDTIAVNHITPPFYHLINKGLVDEPVFSFRLGSSED--DGGEAIFGGVDDSAYTGKIQ 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV +K YW+ EL + +G+ + G A +D+GTSL+A PT + IN IG
Sbjct: 268 YVPVRRKAYWEVELEKVSLGDDVLELESTGAA--IDTGTSLIALPTDIAEMINTQIGATK 325
Query: 313 --EGVVSAECKLVVS 325
G + +C V S
Sbjct: 326 SWNGQYTVDCAKVPS 340
>gi|46395759|sp|Q800A0.1|CATE_RANCA RecName: Full=Cathepsin E; Flags: Precursor
gi|29647357|dbj|BAC75398.1| cathepsin E [Rana catesbeiana]
Length = 397
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 155/236 (65%), Gaps = 2/236 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG+I IG+PPQ F+VIFDTGSSNLWVPS C S +C H+RY+ +S T
Sbjct: 65 PLMNYLDVEYFGQISIGTPPQQFTVIFDTGSSNLWVPSIYCT-SQACTKHNRYRPSESTT 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I YG+G+++G D V V + V+ Q F E+ E TF + FDGI+GL
Sbjct: 124 YVSNGEAFFIQYGTGNLTGILGIDQVTVQGITVQSQTFAESVSEPGSTFQDSNFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AV + +PV+DNM+ Q LV +F ++NRDP++ +GGE+V GG D F G+ +
Sbjct: 184 AYPNLAVDNCIPVFDNMIAQNLVELPLFGVYMNRDPNSADGGELVLGGFDTSRFSGQLNW 243
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VP+T +GYWQ ++ I + Q C GC AIVD+GTSL+ GP+ + ++ + IG
Sbjct: 244 VPITVQGYWQIQVDSIQVAGQVI-FCSDGCQAIVDTGTSLITGPSGDIEQLQNYIG 298
>gi|307175238|gb|EFN65290.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 357
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 118/256 (46%), Positives = 151/256 (58%), Gaps = 6/256 (2%)
Query: 67 LGDSDEDI--LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
L DSD+D + L N+ + Y+G I IG+PPQ F VIFDTGS+NLW+PS KC + +C
Sbjct: 21 LNDSDDDFPSVILSNYQNINYYGVITIGTPPQEFKVIFDTGSANLWIPSKKCNLT-ACLI 79
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR-EGSLT 183
H++Y S SNTY +I Y + I G S D V V V++Q F E T
Sbjct: 80 HNQYNSTASNTYIAKNALIQIKYFNSIIDGLISTDIVNVAGFNVQNQTFAELTNMSNEEL 139
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A FDGI+GL + I+ + +PV+DNMV Q LVS +FSF+LNRDP AE GE + GG
Sbjct: 140 FLPAPFDGILGLAYSYISDNNIIPVFDNMVNQNLVSSHIFSFYLNRDPSAELDGEFILGG 199
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP H+ G TYVPVT KG+WQF + I + N S +C+ C AI D+G GPT V
Sbjct: 200 SDPAHYDGNFTYVPVTHKGFWQFTMDKIEVNNIS--LCQSSCQAIADTGMGETYGPTSDV 257
Query: 304 TEINHAIGGEGVVSAE 319
IN IG + E
Sbjct: 258 KTINELIGTTNIDGME 273
>gi|321250483|ref|XP_003191823.1| endopeptidase [Cryptococcus gattii WM276]
gi|317458290|gb|ADV20036.1| Endopeptidase, putative [Cryptococcus gattii WM276]
Length = 432
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 118/255 (46%), Positives = 159/255 (62%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF +I +G+P Q F VI DTGSSNLWVPS C SI+C+ HS+Y S +S+
Sbjct: 111 VPLSNYMNAQYFAQIELGTPAQTFKVILDTGSSNLWVPSVGCT-SIACFLHSKYDSSQSS 169
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G EI+YGSGS+ GF SQD + +GD+ +K Q F EAT+E L F +FDGI+G
Sbjct: 170 TYKANGSDFEIHYGSGSLEGFISQDTLAIGDLAIKGQDFAEATKEPGLAFAFGKFDGILG 229
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + NM+ Q L+ + VFSF L + +GGE +FGG+D + G
Sbjct: 230 LAYDTISVNHIVPPFYNMLNQDLLDDPVFSFRLGSSEN--DGGEAIFGGIDKSAYSGSLH 287
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G+ + G A +D+GTSL+ PT V +N IG E
Sbjct: 288 YVPVRRKGYWEVELESISFGDDELELENTGAA--IDTGTSLIVMPTDVAEMLNKEIGAEK 345
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 346 SWNGQYTVDCNTVPS 360
>gi|358385852|gb|EHK23448.1| hypothetical protein TRIVIDRAFT_215801 [Trichoderma virens Gv29-8]
Length = 395
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK-----ERYMGGAGVSGVRHRLG 68
++A+ L+ ++ G+ ++ L+K L+ L + I + ++YMG S
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLE-QQLEGSTIESQVQHLGQKYMGVRPTSRADVMFN 63
Query: 69 DSDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
D I +P+ NFM+AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 64 DKLPKIQGGHPVPVTNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCN-SIACF 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+ Y S S+TY + G EI+YGSGS++GF S D V +GD+ ++ Q F EAT E L
Sbjct: 123 LHATYDSSSSSTYKQNGSDFEIHYGSGSLTGFISNDVVTIGDLKIQKQDFAEATSEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLG+ I+V +P + MV Q L+ E VF+F+L +EG E VFGG
Sbjct: 183 FAFGRFDGILGLGYDTISVNGIIPPFYQMVNQKLLDEPVFAFYLGS---GDEGSEAVFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD H+ GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 240 VDESHYSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 ELLNAEIGAK 307
>gi|146386352|gb|ABQ23964.1| cathepsin D [Oryctolagus cuniculus]
Length = 292
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 108/222 (48%), Positives = 157/222 (70%), Gaps = 7/222 (3%)
Query: 104 TGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVE 162
TGSSNLWVPS C I+C+ H +Y S+KS+TY + G + +I+YGSGS+SG+ SQD V
Sbjct: 1 TGSSNLWVPSVHCKLLDIACWIHHKYNSKKSSTYVKNGTTFDIHYGSGSLSGYLSQDTVS 60
Query: 163 V-----GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGL 217
V + V+ Q+F EAT++ +TF+ A+FDGI+G+ + I+V + +PV+DN+++Q L
Sbjct: 61 VPCTASSSIQVQKQIFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKL 120
Query: 218 VSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQS 277
V + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y+ VT+K YWQ + + +G+
Sbjct: 121 VEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQLNVGSGL 180
Query: 278 TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
T +CEGGC AIVD+GTSLL GP V E+ AIG ++ E
Sbjct: 181 T-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPLIQGE 221
>gi|290543422|ref|NP_001166408.1| cathepsin E precursor [Cavia porcellus]
gi|115721|sp|P25796.1|CATE_CAVPO RecName: Full=Cathepsin E; Flags: Precursor
gi|191295|gb|AAA37052.1| procathepsin E [Cavia porcellus]
gi|1246041|gb|AAB35844.1| procathepsin E [Cavia]
Length = 391
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 151/236 (63%), Gaps = 3/236 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H + S+T
Sbjct: 65 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACQTHPVFHPSLSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E+G S I YG+GS++G D V V + V Q F E+ +E TF+ A FDGI+GL
Sbjct: 124 YREVGNSFSIQYGTGSLTGIIGADQVSVEGLTVVGQQFGESVQEPGKTFVHAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A G PV+DNM+ Q LV+ +FS +++ +P G E+ FGG DP HF G +
Sbjct: 184 GYPSLAAGGVTPVFDNMMAQNLVALPMFSVYMSSNPGG-SGSELTFGGYDPSHFSGSLNW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VPVTK+ YWQ L I +G+ S C GC AIVD+GTSL+ GP + ++ A+G
Sbjct: 243 VPVTKQAYWQIALDGIQVGD-SVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALG 297
>gi|261194088|ref|XP_002623449.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239588463|gb|EEQ71106.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239606974|gb|EEQ83961.1| aspartyl proteinase [Ajellomyces dermatitidis ER-3]
gi|327354563|gb|EGE83420.1| aspartyl proteinase [Ajellomyces dermatitidis ATCC 18188]
Length = 398
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/262 (44%), Positives = 161/262 (61%), Gaps = 12/262 (4%)
Query: 52 ERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWV 111
E + G A SG L D NF++AQY+ EI IG+PPQ F V+ DTGSSNLWV
Sbjct: 61 EMFKGAAQASGGHSVLVD---------NFLNAQYYSEITIGTPPQTFKVVLDTGSSNLWV 111
Query: 112 PSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQ 171
PSS+C SI+CY H++Y S S+TY + G I YGSGS+SGF SQD V +GD+ +K Q
Sbjct: 112 PSSEC-GSIACYLHNKYDSSTSSTYQKNGSEFAIRYGSGSLSGFVSQDTVRIGDLTIKSQ 170
Query: 172 VFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP 231
+F EAT E L F RFDGI+GLG+ I+V P + MV QGL+ E VFSF+L
Sbjct: 171 LFAEATNEPGLAFAFGRFDGILGLGYDTISVNKIPPPFYEMVNQGLLDEPVFSFYLGDAN 230
Query: 232 DAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDS 291
++ E VFGG++ H+ G+ +P+ +K YW+ +L I G ++ + G I+D+
Sbjct: 231 IEDDDSEAVFGGINKDHYTGELVMIPLRRKAYWEVDLDAITFGKETAQLENTGV--ILDT 288
Query: 292 GTSLLAGPTPVVTEINHAIGGE 313
GTSL+A P+ + +N IG +
Sbjct: 289 GTSLIALPSTLAELLNKEIGAK 310
>gi|340374170|ref|XP_003385611.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 389
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 109/248 (43%), Positives = 160/248 (64%), Gaps = 3/248 (1%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSR 131
D P+K+++ AQY+G I +G+P Q+F+ +FDTGSSNLWVPS KC I+C H++Y S
Sbjct: 60 DDEPMKDYLMAQYYGPISLGTPDQDFNCMFDTGSSNLWVPSKKCGLLDIACRLHNKYDST 119
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
KS+TY G + YGSG+ SGFFS DN+++G+ + Q EAT E + F+ A+FDG
Sbjct: 120 KSSTYIANGTKFSLQYGSGATSGFFSTDNMKIGNSTITKQSIGEATHEPGVAFVAAKFDG 179
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I G+ + I+ P +DNM+ Q LV+ +F +L+ D A GG++ GG + K++ G
Sbjct: 180 ICGMAYPAISAERQTPFFDNMISQNLVNAGMFGVFLSADTSASLGGDLNLGGPNEKYYTG 239
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
YVP+T K Y+ ++ + GN S +C+GGC IVD+GTSL+AGPT VT+I AIG
Sbjct: 240 DFNYVPLTSKTYYMIKVDGMNAGNLS--LCDGGCNGIVDTGTSLIAGPTAEVTKIATAIG 297
Query: 312 GEGVVSAE 319
+ ++ E
Sbjct: 298 AKSTLAGE 305
>gi|393246119|gb|EJD53628.1| aspartic peptidase A1 [Auricularia delicata TFB-10046 SS5]
Length = 415
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 154/243 (63%), Gaps = 5/243 (2%)
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
D +PL NF +AQYF EI +GSP QNF V+ DTGSSNLWVPSS C SI+C+ H++Y S
Sbjct: 90 DGHKVPLSNFANAQYFAEISLGSPAQNFKVVLDTGSSNLWVPSSGCT-SIACFLHAKYDS 148
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+TY + G S EI+YGSGS+ GF SQD +++GD+ + Q F EA +E L F +FD
Sbjct: 149 SASSTYKKNGSSFEIHYGSGSMEGFISQDTLKIGDISIPGQDFAEAMKEPGLAFAFGKFD 208
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + IAV P + NMV + L+ + VFSF L +GG VFGGVD H+K
Sbjct: 209 GILGLAYDTIAVNHITPPFYNMVNKKLLDQPVFSFRLGA--SESDGGSAVFGGVDSSHYK 266
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G+ TYVPV +K YW+ EL I +G+ G A +D+GTSL+ P + IN I
Sbjct: 267 GQITYVPVRRKAYWEVELEGIKLGDDEVDFENTGAA--IDTGTSLIVLPVDIGEMINAQI 324
Query: 311 GGE 313
G +
Sbjct: 325 GAK 327
>gi|200688|gb|AAA40043.1| renin (Ren-1-d) [Mus musculus]
gi|148669208|gb|EDL01155.1| mCG129412 [Mus musculus]
Length = 402
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 189/320 (59%), Gaps = 29/320 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 8 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 58
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 59 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 113
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 114 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 172
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 173 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL 232
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+
Sbjct: 233 -LGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGS 290
Query: 294 SLLAGPTPVVTEINHAIGGE 313
S ++ PT + I A+G +
Sbjct: 291 SFISAPTSSLKLIMQALGAK 310
>gi|118082412|ref|XP_416090.2| PREDICTED: cathepsin E-A-like [Gallus gallus]
Length = 404
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 110/243 (45%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G I +G+PPQ+F+V+FDTGSSN WVPS C S +C H R+KS S++Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D +++ ++ +K Q F E+ E +TF LA FDG++GLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E VFSF+L R D E GGE++ GG+D +KG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 252 PVTEKSYWQIHLNNIKIQGRVV-FCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 317 SAE 319
S E
Sbjct: 311 SGE 313
>gi|110277433|gb|ABG57251.1| vacuolar protease A [Trichoderma atroviride]
gi|358394485|gb|EHK43878.1| hypothetical protein TRIATDRAFT_137844 [Trichoderma atroviride IMI
206040]
Length = 395
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 128/309 (41%), Positives = 184/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++A+ L+ ++ G+ ++ L+K ++L+ S+ A ++YMG S V D
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLEQQLEGSSIEAQVQQLGQKYMGVRPTSRVDVMFND 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ + +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 NVPKVKGGHPVPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S++Y + G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F
Sbjct: 124 HSTYDSSSSSSYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLQIKGQDFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + MV Q L+ E VF+F+L +EG FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYLGS---GDEGSVATFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H+ GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 241 DESHYSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGIAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNAEIGAK 307
>gi|338712318|ref|XP_001501960.2| PREDICTED: pepsin II-1-like [Equus caballus]
Length = 397
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/251 (45%), Positives = 160/251 (63%), Gaps = 10/251 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++CY H R+ K
Sbjct: 73 DTEPLENYLDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEK 131
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY +S I YG+GS++G D V VG + +Q+F + +E LA FDGI
Sbjct: 132 SSTYRATSESISITYGTGSMTGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGI 191
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FGG+D ++ G
Sbjct: 192 LGLAYPSISASGATPVFDNIWDQGLVSQDLFSVYLSSN--DESGSVVMFGGIDSSYYTGS 249
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
+VPV+ +GYWQ + I + +S C GGC A+VD+GTSLL GPT + I IG
Sbjct: 250 LHWVPVSHEGYWQITVDSITVNGESIA-CSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGA 308
Query: 312 -----GEGVVS 317
GE V+S
Sbjct: 309 RKDLLGEAVIS 319
>gi|74136511|ref|NP_001028152.1| gastricsin precursor [Monodelphis domestica]
gi|73621388|sp|Q689Z7.1|PEPC_MONDO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|51534970|dbj|BAD36918.1| pepsinogen C [Monodelphis domestica]
Length = 391
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 119/297 (40%), Positives = 181/297 (60%), Gaps = 11/297 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N++D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+R+ +S+T
Sbjct: 66 PITNYLDSFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNRFSPSQSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+T G++ ++YGSGS++ D V V ++VV +Q F + E + F + FDGI+G+
Sbjct: 125 FTNGGQTYTLSYGSGSLTVVLGYDTVTVQNIVVSNQEFGLSESEPTSPFYYSDFDGILGM 184
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AVG++ V M++QG +SE +FSF+ +R P + GGE++ GGVDP+ + G+ T+
Sbjct: 185 AYPAMAVGNSPTVMQGMLQQGQLSEPIFSFYFSRQPTHQYGGELILGGVDPQLYSGQITW 244
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PVT++ YWQ + + IGNQ+TG C GC AIVD+GT LLA P ++ A G +
Sbjct: 245 TPVTQEVYWQIGIEEFAIGNQATGWCSQGCQAIVDTGTFLLAVPQQYMSAFLQATGAQQA 304
Query: 316 VSAECKLVVSQYGDL-IWDLLVSG---LLPEKVCQQIGLCAFNGAEYVRLGIPITRV 368
+ + + + D+ +++G LP FN Y RLGI T +
Sbjct: 305 QNGDFMVNCNYIQDMPTITFVINGSQFPLPPSA------YVFNNNGYCRLGIEATYL 355
>gi|345568347|gb|EGX51242.1| hypothetical protein AOL_s00054g478 [Arthrobotrys oligospora ATCC
24927]
Length = 392
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 183/311 (58%), Gaps = 19/311 (6%)
Query: 24 SSNGLRRIGLKKRRLDLHSLNAARITR----KERYMGGAGVSGVRHRLGDSDED---ILP 76
+S G+ ++ LKK ++ L T+ ++Y+ AG + D + D +P
Sbjct: 16 ASAGVHKMSLKKIPVEDTMLGQNFQTQVQALAQKYINRAG--NQQAFTNDVNADGGHSVP 73
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQY+ EI +G+PPQ F V+ DTGSSNLWVPS C SI+C+ H++Y S +S+TY
Sbjct: 74 VNNFLNAQYYSEITLGTPPQTFKVVLDTGSSNLWVPSKSCS-SIACFLHTKYDSSESSTY 132
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YGSGS+ GF SQD + +GD+ +K+Q+F EAT+E L F +FDGI+GLG
Sbjct: 133 KANGTEFSIQYGSGSMEGFISQDTLTIGDLTIKNQLFAEATKEPGLAFAFGKFDGILGLG 192
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V P + M+ Q LV E VF+F+L R+ E+ E VFGG+D H+ G T+V
Sbjct: 193 YDTISVNKIPPPFYQMISQKLVDEPVFAFYLGRE---EDESEAVFGGIDKSHYTGDITWV 249
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG---- 312
V +K YW+ I G+Q+ + G A++D+GTSL+ P+ +N AIG
Sbjct: 250 DVRRKAYWEVPFDSISFGDQTAELDSWG--AVLDTGTSLITLPSDYAEMLNSAIGATKGW 307
Query: 313 EGVVSAECKLV 323
G S C+ V
Sbjct: 308 NGQYSVPCEKV 318
>gi|149245862|ref|XP_001472682.1| PREDICTED: renin-1-like isoform 1 [Mus musculus]
Length = 425
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 189/320 (59%), Gaps = 29/320 (9%)
Query: 6 LRSVFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERY 54
L ++ LW + C LP + RI LKK R +D+ L+A R
Sbjct: 31 LWALLLLW--SPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAER------- 81
Query: 55 MGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSS 114
GV R L + ++ L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+
Sbjct: 82 ----GVFTKRPSLINLTSPVV-LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPST 136
Query: 115 KC-YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF 173
KC ++C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F
Sbjct: 137 KCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTF 195
Query: 174 IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA 233
E T + F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR
Sbjct: 196 GEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL 255
Query: 234 EEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGT 293
GGE+V GG DP+H++G YV ++K WQ + + +G+ ST +CE GCA +VD+G+
Sbjct: 256 -LGGEVVLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGS 313
Query: 294 SLLAGPTPVVTEINHAIGGE 313
S ++ PT + I A+G +
Sbjct: 314 SFISAPTSSLKLIMQALGAK 333
>gi|24653643|ref|NP_610961.1| CG10104 [Drosophila melanogaster]
gi|7303185|gb|AAF58249.1| CG10104 [Drosophila melanogaster]
Length = 404
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 185/318 (58%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W+L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWLLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F
Sbjct: 116 CASTMVACRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R+ + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
+GG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 DGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
>gi|50557048|ref|XP_505932.1| YALI0F27071p [Yarrowia lipolytica]
gi|49651802|emb|CAG78744.1| YALI0F27071p [Yarrowia lipolytica CLIB122]
Length = 396
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/306 (41%), Positives = 181/306 (59%), Gaps = 20/306 (6%)
Query: 37 RLDLHSLNAARITRKER------YMGGAGVSGVRHRLGDSDE-----DIL--PLKNFMDA 83
++ ++ ++ A + KE M G G +LG+ +E D+ PL N+++A
Sbjct: 22 KVSINKMSTAELLGKENGFEDHLRMMGQKYMGKFQKLGEFNELASIQDVSNSPLTNYLNA 81
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+ EI IG+PPQ F+VI DTGSSNLWVPS +C SI+CY H +Y S S++Y G +
Sbjct: 82 QYYTEIEIGTPPQKFNVILDTGSSNLWVPSVQCN-SIACYLHQKYDSAASSSYKANGTAF 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
EI YGSGS+ GF SQD +++G +V+ +Q F EAT E L F +FDGI+GL + I+V
Sbjct: 141 EIQYGSGSMEGFVSQDTLKLGSLVLPEQDFAEATSEPGLAFAFGKFDGILGLAYDTISVN 200
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
VP N V +GL+ + FSF+L +GG FGGVD +F+GK T++PV +K Y
Sbjct: 201 KIVPPVYNAVNRGLLDKNQFSFFLGDTNKGTDGGVATFGGVDEDYFEGKITWLPVRRKAY 260
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAE 319
W+ E I +G+Q+ + G A +D+GTSLLA P+ + +N IG G + E
Sbjct: 261 WEVEFNSITLGDQTAELVNTGAA--IDTGTSLLALPSGLAEVLNSEIGATKGWSGQYTVE 318
Query: 320 CKLVVS 325
C V S
Sbjct: 319 CDKVDS 324
>gi|67524891|ref|XP_660507.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|40744298|gb|EAA63474.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|259486160|tpe|CBF83780.1| TPA: vacuolar aspartyl protease (proteinase A) (Eurofung)
[Aspergillus nidulans FGSC A4]
Length = 394
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 185/316 (58%), Gaps = 29/316 (9%)
Query: 15 LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK---------ERYMGGAGVSGVRH 65
+ + LL + G + K +L+ L ITR ++YMG +H
Sbjct: 1 MKASLLTASVLLGYASAEVHKLKLNKVPLTEQFITRNIADHANALGQKYMG----QFQQH 56
Query: 66 RLGDSD------EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
L D D+L + NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPSS+C S
Sbjct: 57 VLEDEPVNAMRGHDVL-VDNFMNAQYFSEIQLGTPPQTFKVVLDTGSSNLWVPSSECG-S 114
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
I+CY H ++ S S+TY + G I YGSGS+SGF S+DN+++GD+ VK Q F EAT E
Sbjct: 115 IACYLHQKFDSSASSTYKKNGSEFAIKYGSGSLSGFVSRDNLQIGDLKVKGQDFAEATSE 174
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEG 236
L F RFDGI+GLGF I+V VP + NM+ QGL+ E VF+F+L N+D D+
Sbjct: 175 PGLAFAFGRFDGILGLGFDTISVNRIVPPFYNMIHQGLLDEPVFAFYLGDANKDGDSSVA 234
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
FGG+D H++G+ +P+ +K YW+ +L I +G++ + G I+D+GTSL+
Sbjct: 235 ---TFGGIDKDHYEGELIKIPLRRKAYWEVDLDAIALGDEVAELENTGV--ILDTGTSLI 289
Query: 297 AGPTPVVTEINHAIGG 312
A P+ + IN IG
Sbjct: 290 ALPSNLAEMINTEIGA 305
>gi|195485971|ref|XP_002091310.1| GE13586 [Drosophila yakuba]
gi|194177411|gb|EDW91022.1| GE13586 [Drosophila yakuba]
Length = 404
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 173/293 (59%), Gaps = 19/293 (6%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
P++ + ++G++ RL L R E G+ + PL N+
Sbjct: 36 FPSARHRFEKLGIRMDRLRLKYAEEVSQFRGE---------------GNLEVKSTPLSNY 80
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
+DAQYFG I IG+PPQ+F VIFDTGSSNLWVPS+ C ++C H+RY +++S ++
Sbjct: 81 LDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSATCASRMVACRVHNRYFAKRSTSHQVR 140
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G I+YGSGS+ GF S D V V + ++DQ F EAT FL A+FDGI GLG+R
Sbjct: 141 GDRFAIHYGSGSLFGFLSTDTVRVAGLEIRDQTFAEATEMPGPIFLAAKFDGIFGLGYRS 200
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
I++ P + M+EQGL+++ +FS +L+R + +EGG I FGG +P ++ G TYV V+
Sbjct: 201 ISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHGE-KEGGAIFFGGSNPHYYTGNFTYVQVS 259
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ YWQ ++ +I N +C+ GC I+D+GTS LA P IN +IGG
Sbjct: 260 HRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTSFLALPYDQAILINESIGG 310
>gi|327296035|ref|XP_003232712.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
gi|326465023|gb|EGD90476.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
Length = 400
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 178/298 (59%), Gaps = 24/298 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL----------GDSDEDILPL 77
L+++ LK++ L+ ++ + ++YMG + +H +S ++L +
Sbjct: 25 LKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPEQHEQQMFKEQTPIEAESGHNVL-I 79
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY+
Sbjct: 80 DNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTYS 138
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G I YGSGS+ GF S+DNV++GD+ +K+Q+F EAT E L F RFDGI+G+GF
Sbjct: 139 RNGTKFAIRYGSGSLEGFVSRDNVKIGDLTIKNQLFAEATSEPGLAFAFGRFDGIMGMGF 198
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
I+V P + NM++QGL+ E VFSF+L N+D D + FGG D HF G T
Sbjct: 199 SSISVNGIPPPFYNMIDQGLLDEPVFSFYLGDTNKDGDQS---VVTFGGSDTNHFTGDMT 255
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 256 TIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGA 311
>gi|301786583|ref|XP_002928700.1| PREDICTED: pepsin A-like isoform 2 [Ailuropoda melanoleuca]
Length = 393
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 177/304 (58%), Gaps = 21/304 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL K L HS N A + + A V + PL+N+MD +YFG
Sbjct: 32 LKEHGLLKDFLKNHSPNPA----SKYFPQEAAVMATQ-----------PLENYMDMEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY ++ I Y
Sbjct: 77 TIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYEGTSQTVSIAY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + +I+ A
Sbjct: 136 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPQISSSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM QGLVS+++FS +L+ D + G ++FGG+D +F G +VPV+ +GYWQ
Sbjct: 195 PVFDNMWNQGLVSQDLFSVYLSS--DDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + I Q+ C GC AIVD+GTSLLAGPT + I IG + E + S
Sbjct: 253 TMDSVTINGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSA 311
Query: 327 YGDL 330
DL
Sbjct: 312 INDL 315
>gi|301786581|ref|XP_002928699.1| PREDICTED: pepsin A-like isoform 1 [Ailuropoda melanoleuca]
gi|281347483|gb|EFB23067.1| hypothetical protein PANDA_018738 [Ailuropoda melanoleuca]
Length = 385
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 177/304 (58%), Gaps = 21/304 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL K L HS N A + + A V + PL+N+MD +YFG
Sbjct: 32 LKEHGLLKDFLKNHSPNPA----SKYFPQEAAVMATQ-----------PLENYMDMEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY ++ I Y
Sbjct: 77 TIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYEGTSQTVSIAY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + +I+ A
Sbjct: 136 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPQISSSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM QGLVS+++FS +L+ D + G ++FGG+D +F G +VPV+ +GYWQ
Sbjct: 195 PVFDNMWNQGLVSQDLFSVYLSS--DDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
+ + I Q+ C GC AIVD+GTSLLAGPT + I IG + E + S
Sbjct: 253 TMDSVTINGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSA 311
Query: 327 YGDL 330
DL
Sbjct: 312 INDL 315
>gi|255936729|ref|XP_002559391.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584011|emb|CAP92037.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 398
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 184/311 (59%), Gaps = 27/311 (8%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILP------- 76
+ R+ L K +L+ H+++A ++YMG + +H+ D P
Sbjct: 20 VHRLKLNKVPLAEQLNTHNIDAHVHNLGQKYMG---IRPEKHQDLFHDTSFNPAAGHDVL 76
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 77 VDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G EI YGSGS+SGF S+D +++GD+ VK Q F EAT E L F RFDGI+GLG
Sbjct: 136 EKNGTEFEIRYGSGSLSGFVSRDTLQIGDLKVKGQDFAEATNEPGLAFAFGRFDGILGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE---IVFGGVDPKHFKGKH 253
+ I+V VP + +M+ Q LV E VF+F+L DA + G+ FGG+D H+ G+
Sbjct: 196 YDTISVNKMVPPFYHMINQKLVDEPVFAFYLG---DANKDGDNSVATFGGIDESHYTGEL 252
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
+P+ +K YW+ EL I +G+ + G I+D+GTSL+A P+ + +N IG
Sbjct: 253 IKIPLRRKAYWEVELNSIALGDNVAELENTGV--ILDTGTSLIALPSTMAELLNKEIGAT 310
Query: 313 ---EGVVSAEC 320
G S EC
Sbjct: 311 KGFTGQYSVEC 321
>gi|68051036|emb|CAI46901.1| nothepsin [Podarcis siculus]
Length = 414
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 186/320 (58%), Gaps = 13/320 (4%)
Query: 6 LRSVFCLWVLASCLL----LPASSNGLRRIGLKKRRLDLHSLNAARITR--KERYMGGAG 59
+R + WV CL +P + R L+KR +LH L R +RY
Sbjct: 1 MRVLLAFWVYIPCLTAVVRIPLTRFESIRGKLRKRG-ELHKLLEDRQPDIFGQRY-PHCL 58
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
S + G + E L ++M+AQY+GE+ +G+PPQ F+V+FDTGSS+ WVPS++CY S
Sbjct: 59 PSDINLSQGLATER---LYDYMNAQYYGEVSVGTPPQRFTVVFDTGSSDFWVPSARCY-S 114
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H R++S S +Y ++G+ + YG+GS+ G ++D V+ ++ ++ Q F E E
Sbjct: 115 KACSMHKRFESFMSYSYAQVGEPFYLQYGTGSLIGVTAKDTVQFSNLSIEAQDFGEVRYE 174
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
LTF A FDG++GLG+ ++V +PV+D M+ Q L+ E VFSF LNR + E GGE+
Sbjct: 175 PDLTFTFAHFDGVLGLGYPSLSVLHGLPVFDGMLRQQLIEEPVFSFILNRGGNTENGGEL 234
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
+FGG+D +KG +VPVT++ YW+ + ++ I C+ GCAAIVDSGTSL+ GP
Sbjct: 235 IFGGIDHSLYKGSIHWVPVTEQKYWKIHMDNVKIQGH-IAACKDGCAAIVDSGTSLITGP 293
Query: 300 TPVVTEINHAIGGEGVVSAE 319
+ + IG E
Sbjct: 294 PSQIIRLQQKIGAHPAPHGE 313
>gi|125984612|ref|XP_001356070.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
gi|54644388|gb|EAL33129.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
Length = 387
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 118/261 (45%), Positives = 163/261 (62%), Gaps = 8/261 (3%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
S E L+N ++ +Y+G IGIG+P Q F V+FDTGS+NLWVPS+KC +++C H++Y
Sbjct: 56 SSESTETLQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQY 115
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS++GF S+D V V + ++ Q F EA E TF+ A
Sbjct: 116 HSEQSSTYVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAP 175
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P +DNM+ QGL+ E V SF+L R A +GGE++ GGVDP
Sbjct: 176 FAGIMGLAFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSL 235
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G TYVPV+ GYWQF++ + G +C GC AI D+GTSL+ P +IN
Sbjct: 236 YTGNLTYVPVSVAGYWQFKVNSVKSGGFL--LCS-GCQAIADTGTSLIVVPEAAYAKINS 292
Query: 309 AIG----GEGVVSAECKLVVS 325
+G GEG +C V S
Sbjct: 293 LLGATDNGEGEAFVKCADVSS 313
>gi|402072590|gb|EJT68339.1| vacuolar protease A [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 396
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 183/310 (59%), Gaps = 17/310 (5%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMG------GAGVSGV 63
+L + +LL + G+ ++ +KK +L+ L A ++Y+G V
Sbjct: 5 LLTAAVLLGSVDAGVHKLKMKKVPLSEQLETVPLTAQLRGLGQKYLGLRPDSHAQAVFES 64
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
R + + P+ NFM+AQY+ EI +G+PPQ+F V+ DTGSSNLWVPS C SI+CY
Sbjct: 65 RPIRAQGNHPV-PVSNFMNAQYYSEITVGTPPQSFKVVLDTGSSNLWVPSQSC-GSIACY 122
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
HS+Y S S+TY + G EI YGSGS+SGF S D +++GD+ +K+Q F EAT+E L
Sbjct: 123 LHSKYDSSASSTYKKNGTEFEITYGSGSLSGFVSNDVMQIGDIKIKNQDFAEATKEPGLA 182
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F RFDGI+GLGF ++V VP + M++Q L+ E VF+F+L D ++ E +FGG
Sbjct: 183 FAFGRFDGILGLGFDRLSVNKMVPPFYQMIDQKLIDEPVFAFYL---ADQDDESEAIFGG 239
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
++ H GK +P+ +K YW+ + I +G++ G E I+D+GTSL PT +
Sbjct: 240 INKDHIDGKIIEIPLRRKAYWEVDFDAIALGDE-VGELE-NTGVILDTGTSLNVLPTQLA 297
Query: 304 TEINHAIGGE 313
+N IG +
Sbjct: 298 EMLNAQIGAK 307
>gi|149725292|ref|XP_001501875.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 116/249 (46%), Positives = 160/249 (64%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H+R+ S+T
Sbjct: 66 PLENYMDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSNHNRFNPEDSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y +S I YG+GS++G D V VG + +Q+F + T GS + A FDGI+G
Sbjct: 125 YEATSESVSITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DNM +QGLVS+++FS +L+ D E G ++FGG+D ++ G
Sbjct: 184 LAYPSISASGATPVFDNMWDQGLVSQDLFSVYLSSD--DESGSVVMFGGIDSSYYSGSLN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ + I + +S C GGC AIVD+GTSLLAGPT + I IG
Sbjct: 242 WVPVSNEGYWQITMDSITMNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASE 300
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 301 DSSGESVIS 309
>gi|21063965|gb|AAM29212.1| AT05209p [Drosophila melanogaster]
Length = 404
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 184/318 (57%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWPLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F
Sbjct: 116 CASTMVACRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R+ + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
+GG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 DGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
>gi|291409611|ref|XP_002721072.1| PREDICTED: pepsin-3-like isoform 2 [Oryctolagus cuniculus]
Length = 387
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTLNLAT-----KYLPKAAFDSVPTE---------SLENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TISIGTPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 VDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
>gi|195161645|ref|XP_002021673.1| GL26637 [Drosophila persimilis]
gi|194103473|gb|EDW25516.1| GL26637 [Drosophila persimilis]
Length = 387
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 119/263 (45%), Positives = 165/263 (62%), Gaps = 12/263 (4%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRY 128
S E L+N ++ +Y+G IGIG+P Q F V+FDTGS+NLWVPS+KC +++C H++Y
Sbjct: 56 SSESTETLQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQY 115
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS++GF S+D V V + ++ Q F EA E TF+ A
Sbjct: 116 HSGQSSTYVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAP 175
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P +DNM+ QGL+ E V SF+L R A +GGE++ GGVDP
Sbjct: 176 FAGIMGLAFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSL 235
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ G TYVPV+ GYWQF++ + +S G+ C GC AI D+GTSL+ P +I
Sbjct: 236 YTGNLTYVPVSVAGYWQFKVNSV----KSGGILLCS-GCQAIADTGTSLIVVPEAAYAKI 290
Query: 307 NHAIG----GEGVVSAECKLVVS 325
N +G GEG +C V S
Sbjct: 291 NSLLGATDNGEGEAFVKCADVSS 313
>gi|302497761|ref|XP_003010880.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
gi|306531030|sp|D4B385.1|CARP_ARTBC RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291174425|gb|EFE30240.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
Length = 400
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 152/239 (63%), Gaps = 9/239 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS Y S S+TY
Sbjct: 79 IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASSTY 137
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ G I YGSGS+ GF S+D+V++GD+ +K Q+F EAT E L F RFDGI+G+G
Sbjct: 138 SKNGTKFAIRYGSGSLEGFVSRDSVKIGDMTIKKQLFAEATSEPGLAFAFGRFDGIMGMG 197
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGVDPKHFKGKH 253
F I+V P + NM++QGL+ E VFSF+L N+D D + FGG D HF G
Sbjct: 198 FSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQS---VVTFGGSDTNHFTGDM 254
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T +P+ +K YW+ + I +G + + G I+D+GTSL+A PT + IN IG
Sbjct: 255 TTIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGA 311
>gi|296479430|tpg|DAA21545.1| TPA: renin [Bos taurus]
Length = 401
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 184/307 (59%), Gaps = 20/307 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 13 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 65
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 66 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 124
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 125 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 183
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDP 246
FDG++G+GF AVG PV+D+++ Q +++++VFS + +RD GGEIV GG DP
Sbjct: 184 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRDSKNSHLLGGEIVLGGSDP 243
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 244 QYYQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLL 302
Query: 307 NHAIGGE 313
A+G +
Sbjct: 303 MEALGAK 309
>gi|21629629|gb|AAM61957.1| synthetic renin 2/1d [Mus musculus]
Length = 401
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 186/313 (59%), Gaps = 12/313 (3%)
Query: 7 RSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG---V 63
R LW L LLL + G R+ L + + R +ER + +S V
Sbjct: 3 RRRMPLWAL---LLLWSPCTFSLPTGTTFERIPLKKMPSVREILEERGVDMTRLSAEWDV 59
Query: 64 RHRLGDSDEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSI 120
R + I P L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +
Sbjct: 60 RTKRSSLTNLISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYL 119
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C HS Y+S S++Y E G I+YGSG + GF SQD V VG + V Q F E T
Sbjct: 120 ACGIHSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEVTELP 178
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
+ F+LA+FDG++G+GF AVG PV+D+++ QG++ EEVFS + NR P GGE+V
Sbjct: 179 LIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGPHL-LGGEVV 237
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT
Sbjct: 238 LGGSDPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPT 296
Query: 301 PVVTEINHAIGGE 313
+ I A+G +
Sbjct: 297 SSLKLIMQALGAK 309
>gi|291409609|ref|XP_002721071.1| PREDICTED: pepsin-3-like isoform 1 [Oryctolagus cuniculus]
Length = 387
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTLNLAT-----KYLPKAAFDSVPTE---------SLENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TISIGTPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D D+ G ++FGGVD ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDDDS--GSVVMFGGVDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 VDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
>gi|327271207|ref|XP_003220379.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 119/305 (39%), Positives = 187/305 (61%), Gaps = 12/305 (3%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVR-HRLG 68
L ++ +C L S GL + LKK + S+ I + E Y+ + R +
Sbjct: 4 LMLMLACFQL---SEGLVTVPLKKGK----SIRETMIEKGVLEDYLKHHNLDPARKYHFN 56
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRY 128
+ + P+ +MDA Y+G+IGIG+P QNF V+FDTGSSNLWVPS C + +C H+R+
Sbjct: 57 EYNVAYEPMA-YMDASYYGQIGIGTPAQNFLVLFDTGSSNLWVPSIYCN-TEACTRHARF 114
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+S+TY+ G++ + YGSG+++GFF D + + ++VV +Q F + E F+ A
Sbjct: 115 NPSQSSTYSTNGQTFFLQYGSGNLAGFFGYDTLTLQNIVVTNQEFGLSKNEPGANFIYAE 174
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDGI+G+ + +AVG A + M+++ L+S+ VFSF+L+R P+++ GGE+VFGGVD +
Sbjct: 175 FDGILGMAYPSLAVGGATTALERMLQENLLSQSVFSFYLSRQPNSQYGGEVVFGGVDTRL 234
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G+ + PVT++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++
Sbjct: 235 YSGEIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSNFLS 294
Query: 309 AIGGE 313
A+G +
Sbjct: 295 AVGAQ 299
>gi|432116085|gb|ELK37212.1| Cathepsin E [Myotis davidii]
Length = 396
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 120/248 (48%), Positives = 157/248 (63%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C H R+ +S+T
Sbjct: 69 PLVNYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPRFSPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G I YG+GS+SG +D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSSPGSHFFIQYGTGSLSGVIGEDQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ DP+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
VPVTK+GYWQ L I +G + C GC AIVD+GTSL+ GP + ++ AIG E
Sbjct: 248 VPVTKQGYWQIALDTIQVGG-AVMFCSEGCQAIVDTGTSLITGPPAEIKQLQKAIGAEPV 306
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 307 DGEYAVEC 314
>gi|194762106|ref|XP_001963199.1| GF19728 [Drosophila ananassae]
gi|190616896|gb|EDV32420.1| GF19728 [Drosophila ananassae]
Length = 390
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 116/246 (47%), Positives = 155/246 (63%), Gaps = 6/246 (2%)
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRY 128
+ E L + D +Y+G + IG+P QNF+++FDTGS+NLWVPS+KC S +C H++Y
Sbjct: 60 ASEGTETLHDSADREYYGLLSIGTPKQNFNILFDTGSANLWVPSAKCSASNKACQKHNKY 119
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S+TY G+S I YG+GS+SGF S D VEV + +K Q F EAT E TF A+
Sbjct: 120 HSGESSTYVANGESFSIEYGTGSLSGFLSTDTVEVAGIQIKSQTFAEATNEPGSTFTDAK 179
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
F GI+GL F+ IAV P WDNM+EQ L+ E V SF+L A +GGE++ GG+D
Sbjct: 180 FAGILGLAFKSIAVDGVTPPWDNMIEQKLLDEPVISFYLKLKGTAVQGGEMILGGIDSSL 239
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGV-CEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+KG T+VPVTK YWQF+L I ++ GV AI D+GTSL+ P T IN
Sbjct: 240 YKGSLTWVPVTKAAYWQFKLTAI----KTKGVFISRNTQAIADTGTSLIVLPKAAYTRIN 295
Query: 308 HAIGGE 313
+ IG E
Sbjct: 296 NLIGAE 301
>gi|242781757|ref|XP_002479865.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
gi|218720012|gb|EED19431.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
Length = 395
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 165/279 (59%), Gaps = 6/279 (2%)
Query: 36 RRLDLHSLNAARITRKERYMG--GAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGS 93
+ D S+N + ++YMG GV + D+L + NF++AQYF EI IG+
Sbjct: 32 EQFDKRSMNDHMRSLGQKYMGVVPEGVYEDTSIRPEGGHDVL-VDNFLNAQYFSEITIGT 90
Query: 94 PPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSIS 153
PPQNF V+ DTGSSNLWVPS+ C SI+CY H++Y S S+TY + G I YGSGS+
Sbjct: 91 PPQNFKVVLDTGSSNLWVPSASCN-SIACYLHNKYDSSSSSTYKKNGSEFAIQYGSGSLE 149
Query: 154 GFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMV 213
GF S+D V +GD+ +KDQ F EAT E L F RFDGI+GLGF I+V VP + NM+
Sbjct: 150 GFVSRDVVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILGLGFDTISVNKIVPPFYNML 209
Query: 214 EQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILI 273
Q + E VF+F+L + E FGG+D H+ G+ +P+ +K YW+ + +
Sbjct: 210 NQKTLDEPVFAFYLGDSNKEGDNSEATFGGIDKSHYTGELVKIPLRRKAYWEVDFDAVAF 269
Query: 274 GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
G+ + G I+D+GTSL+A P+ + +N IG
Sbjct: 270 GDNVAELENTGV--ILDTGTSLIALPSTLAELLNKEIGA 306
>gi|194883084|ref|XP_001975634.1| GG20455 [Drosophila erecta]
gi|190658821|gb|EDV56034.1| GG20455 [Drosophila erecta]
Length = 404
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 182/318 (57%), Gaps = 25/318 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+W+L S L +LP L+ R+ L +AR R E+ G+ R RL
Sbjct: 1 MWLLVSLLPVLFILPVQFQPPVSCTLQLYRVPLRRFPSAR-HRFEK----LGIRMDRLRL 55
Query: 68 GDSDE------------DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
++E PL N++DAQYFG I IG+PPQ+F VIFDTGSSNLWVPS+
Sbjct: 56 KYAEEVSHFRGEWNSEVKATPLSNYLDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSAT 115
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C ++C H+RY +++S ++ G I+YGSGS+ GF S D V V + + DQ F
Sbjct: 116 CASRMVACRVHNRYFAKRSTSHQVRGDRFAIHYGSGSLFGFLSTDTVRVAGLEIHDQTFA 175
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EAT FL A+FDGI GL +R I++ P + M+EQGL+++ +FS +L+R + +
Sbjct: 176 EATEMPGPIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHGE-K 234
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGG I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS
Sbjct: 235 EGGAIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTS 292
Query: 295 LLAGPTPVVTEINHAIGG 312
LA P IN +IGG
Sbjct: 293 FLALPYDQAILINESIGG 310
>gi|164657049|ref|XP_001729651.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
gi|159103544|gb|EDP42437.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
Length = 419
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 155/255 (60%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +GSPPQ+F VI DTGS+NLWVPS C SI+C H +Y + S
Sbjct: 96 VPLTDFLNAQYFADIELGSPPQSFKVILDTGSANLWVPSESCT-SIACLLHKKYDNSLSK 154
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G +I+YGSGS+ GF S+D + +GD+ VKDQ F EA +E L F +FDGI+G
Sbjct: 155 TYQANGSEFQIHYGSGSMEGFVSRDTLRIGDLDVKDQDFAEAIKEPGLAFAFGKFDGILG 214
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + M EQ L+ + F F+L EGGE FGGVDP F+G
Sbjct: 215 LAYDTISVNKIVPPFYRMKEQNLLDQNQFGFYLGS--SESEGGEATFGGVDPSRFEGPIV 272
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y PV ++GYW+ L I GN+ + G A +D+GTSL+A PT V +N IG +
Sbjct: 273 YAPVRRRGYWEVALNKIGFGNEELVLTRTGAA--IDTGTSLIAMPTDVAEILNKEIGAKR 330
Query: 314 ---GVVSAECKLVVS 325
G S +C V S
Sbjct: 331 SWTGQYSVDCSKVPS 345
>gi|449282010|gb|EMC88940.1| Cathepsin E-B, partial [Columba livia]
Length = 387
Score = 224 bits (570), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 160/243 (65%), Gaps = 2/243 (0%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+AQY+G + +G+PPQ F+V+FDTGSSN WVPS+ C S +C H ++KS S++Y
Sbjct: 55 LYDYMNAQYYGVVSVGTPPQRFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFLSDSY 113
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YG+G + G +D +++ ++ +K Q F E+ E TF+ A FDG++GLG
Sbjct: 114 EHGGEAFSLQYGTGQLLGVAGKDTLQISNISIKGQDFGESVFEPGSTFVFAHFDGVLGLG 173
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A+PV+D+++ Q LV E +FSF+L R+ D E GGE++ GG+D +KG +V
Sbjct: 174 YPSLAVGNALPVFDSIMNQQLVEEPIFSFYLKREDDTENGGELILGGIDHSLYKGSIHWV 233
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVT+K YWQ L +I I + C GC AIVDSGTSL+ GP+ + + IG
Sbjct: 234 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSH 292
Query: 317 SAE 319
S E
Sbjct: 293 SGE 295
>gi|402226359|gb|EJU06419.1| endopeptidase [Dacryopinax sp. DJM-731 SS1]
Length = 413
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 119/256 (46%), Positives = 158/256 (61%), Gaps = 11/256 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +FM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS KC SI+C+ H +Y S S+
Sbjct: 92 VPLTDFMNAQYFAEITLGTPPQTFKVVLDTGSSNLWVPSIKCT-SIACFLHQKYDSAASS 150
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G + EI+YGSGS+ GF S D + +GD+ V+ F EAT+E L F L RFDGI+G
Sbjct: 151 TYKSNGTAFEIHYGSGSMEGFVSNDLLTIGDLQVQKLDFAEATKEPGLAFALGRFDGILG 210
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V PV+ M+ Q L+ VF+F L N D D GGE FGG+D + GK
Sbjct: 211 LAYDTISVLHMTPVFYQMINQKLLENPVFAFRLGNSDAD---GGEATFGGIDESAYTGKI 267
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG- 312
YVPV +KGYW+ EL I +G + + G A +D+GTSL+A P+ + +N IG
Sbjct: 268 DYVPVRRKGYWEIELDKISLGGEDLELESTGAA--IDTGTSLIALPSDIAEMLNKEIGAT 325
Query: 313 ---EGVVSAECKLVVS 325
+ EC V S
Sbjct: 326 KSWNNQYTVECSTVDS 341
>gi|118344572|ref|NP_001072053.1| cathepsin D2 precursor [Takifugu rubripes]
gi|55771084|dbj|BAD69802.1| cathepsin D2 [Takifugu rubripes]
Length = 386
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/294 (39%), Positives = 177/294 (60%), Gaps = 14/294 (4%)
Query: 20 LLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKN 79
LL S + I L + R L R++ +R + S D + + L N
Sbjct: 13 LLITESAAITSISLHRARSLL-----TRMSNNQRSLLRVAASST-----DPESPAVRLIN 62
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTE 138
D QYFG+I IG+PPQ F+V+FDTGSS+LWVPS C ++C H Y+S +S+TY +
Sbjct: 63 IYDLQYFGKISIGTPPQEFTVLFDTGSSDLWVPSVYCSPLYLACGLHRHYRSYRSSTYVQ 122
Query: 139 IGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFR 198
+ I Y SG +SGF S+D + +G + V Q+F EA R+ TF+ +FDGI+G+ +
Sbjct: 123 CDRGFFIEYQSGRLSGFVSKDTLSIGGLQVPGQLFGEAVRQPGETFIYTQFDGILGMAYP 182
Query: 199 EIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPV 258
I+ PV+D ++ L+ + VFSF+LNRDP+A GG+++ GG++P+H+ G+ YV V
Sbjct: 183 SIST--IAPVFDRIMAAKLLPQNVFSFYLNRDPEAAIGGQLILGGLNPEHYAGELHYVNV 240
Query: 259 TKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T+K YWQ E+ I +G+Q +C+ C IVD+GTSL+ GP+ + +++AI G
Sbjct: 241 TRKAYWQIEVNRINVGDQ-LSLCKPSCQTIVDTGTSLITGPSEEIRALHNAIPG 293
>gi|410968030|ref|XP_003990516.1| PREDICTED: pepsin B-like [Felis catus]
Length = 390
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 118/308 (38%), Positives = 179/308 (58%), Gaps = 13/308 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDA 83
S G+ RI LKK + + + R + V L ++D P N++++
Sbjct: 14 SEGVERIILKKGK-SIRQVMEERGVLQTFLKNHPKVDPAAKYLFNNDAVAYEPFTNYLNS 72
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY G++
Sbjct: 73 YYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCK-SQACSNHNTFNPSMSSTYQNNGQTY 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+ + +AVG
Sbjct: 132 TLYYGSGSLTVLLGYDTVTVQNIVIHNQEFGLSEIEPSNPFYYANFDGILGMAYPNLAVG 191
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
++ V ++M++QG ++ +FSF+ +R P E GGE++ GG++ + + G+ + PVT++ Y
Sbjct: 192 NSPTVMESMMQQGQLTSPIFSFYFSRQPTYEYGGELILGGMNSQFYSGEIVWTPVTRELY 251
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLV 323
WQ + + L+GNQ TG+C GC AIVD+GT +LA P + A G E
Sbjct: 252 WQVAIDEFLVGNQPTGLCSQGCQAIVDTGTYVLAVPQQYMNSFLQATGAE---------- 301
Query: 324 VSQYGDLI 331
VSQYGD +
Sbjct: 302 VSQYGDFV 309
>gi|169861123|ref|XP_001837196.1| endopeptidase [Coprinopsis cinerea okayama7#130]
gi|116501918|gb|EAU84813.1| endopeptidase [Coprinopsis cinerea okayama7#130]
Length = 411
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 155/255 (60%), Gaps = 9/255 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL NFM+AQY+ EI +G+PPQ F VI DTGSSNLWVPS KC SI+C+ H++Y S +S
Sbjct: 89 VPLTNFMNAQYYTEITLGTPPQTFKVILDTGSSNLWVPSIKCT-SIACFLHTKYDSSQST 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF SQD + +GD+ +K Q F EA +E L F +FDGI+G
Sbjct: 148 TYKANGTEFSIQYGSGSMEGFVSQDTLGIGDLTIKGQDFAEALKEPGLAFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + NM+ Q L+ VF+F + E+GGE FGG+D + + GK
Sbjct: 208 LAYDTISVNRIVPPFYNMINQKLIDSPVFAFRIGS--SEEDGGEATFGGIDHEAYTGKLH 265
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
YVPV +K YW+ EL I G+ + G A +D+GTSL+A PT + +N IG
Sbjct: 266 YVPVRRKAYWEVELEKISFGDDELELEHTGAA--IDTGTSLIALPTDMAEMLNTQIGARK 323
Query: 313 --EGVVSAECKLVVS 325
G +C V S
Sbjct: 324 SWNGQYQVDCNKVPS 338
>gi|320588396|gb|EFX00865.1| aspartic endopeptidase pep2 [Grosmannia clavigera kw1407]
Length = 401
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 189/312 (60%), Gaps = 18/312 (5%)
Query: 23 ASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS-----DED 73
A ++G++++ LKK ++L+ ++A ++YMG S + D
Sbjct: 16 AQASGIQKLKLKKVPLAKQLESIPIDAQIRGLGQKYMGARLGSHADEMFKTAVVETDDNH 75
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
LP+ NF++AQYF EI IG+PPQ+F V+ DTGSSNLWVPSS+C SI+CY H++Y S S
Sbjct: 76 PLPVSNFLNAQYFAEISIGTPPQSFKVVLDTGSSNLWVPSSQCG-SIACYLHTKYDSESS 134
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
++Y G + YGSGS+SGF SQD V +GD+ + Q F EAT E L F ARFDGI+
Sbjct: 135 SSYKSNGSAFAAQYGSGSLSGFVSQDTVSIGDLKIVKQDFAEATEEPGLAFAFARFDGIL 194
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGK 252
GLGF I+V VP + N++ Q L+ VF+F+L N D D ++ E VFGGVD H+ GK
Sbjct: 195 GLGFDTISVNHIVPPFYNLINQKLIDSGVFAFYLGNADSDGDD-SEAVFGGVDKAHYTGK 253
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T +P+ +K YW+ +L I +G + + G I+D+GTSL+A P+ + +N IG
Sbjct: 254 ITTIPLRRKAYWEVDLDSISLGEDTAELENTGV--ILDTGTSLIALPSSLAEMLNAQIGA 311
Query: 313 E----GVVSAEC 320
+ G S +C
Sbjct: 312 KKGYNGQYSVDC 323
>gi|212526768|ref|XP_002143541.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
gi|210072939|gb|EEA27026.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
Length = 395
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/291 (41%), Positives = 168/291 (57%), Gaps = 10/291 (3%)
Query: 28 LRRIGLKK----RRLDLHSLNAARITRKERYMG--GAGVSGVRHRLGDSDEDILPLKNFM 81
+ R+ L K + D S+N + ++YMG G + D+L + NF+
Sbjct: 20 VHRLKLDKLSLSEQFDKRSMNDHMRSLSQKYMGVVPEGTYQDTSIRPEGGHDVL-VDNFL 78
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGK 141
+AQYF EI IG+PPQNF V+ DTGSSNLWVPSS C SI+CY HS+Y S S+TY + G
Sbjct: 79 NAQYFSEITIGTPPQNFKVVLDTGSSNLWVPSSSCN-SIACYLHSKYDSSSSSTYKKNGS 137
Query: 142 SCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIA 201
I YGSGS+ GF S+D V +GD+ +KDQ F EAT E L F RFDGI+GLGF I+
Sbjct: 138 DFAIQYGSGSLEGFVSRDTVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILGLGFDTIS 197
Query: 202 VGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKK 261
V VP + NM+ Q + E VF+F+L + E FGG+D H+ G+ +P+ +K
Sbjct: 198 VNKIVPPFYNMLNQKSLDEPVFAFYLGDSNKEGDASEATFGGIDKSHYTGELVKIPLRRK 257
Query: 262 GYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YW+ + I G + G I+D+GTSL+A P+ + +N IG
Sbjct: 258 AYWEVDFDAIAFGENVAELENTGV--ILDTGTSLIALPSTLAELLNKEIGA 306
>gi|57164325|ref|NP_001009299.1| renin precursor [Ovis aries]
gi|1710090|sp|P52115.1|RENI_SHEEP RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|896318|gb|AAA69809.1| renin [Ovis aries]
Length = 400
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/306 (40%), Positives = 181/306 (59%), Gaps = 19/306 (6%)
Query: 17 SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GDS 70
S LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 14 STFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMAQLGAEWSQLTKTLSFGNR 66
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYK 129
++ L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +C HS Y
Sbjct: 67 TSPVV-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYD 125
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S +S++Y E G I YGSG + GF SQD V VG + V Q F E T F+LA+F
Sbjct: 126 SLESSSYVENGTEFTIYYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLRPFMLAKF 184
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPK 247
DG++G+GF AVG PV+D+++ Q +++E+VFS + +RD GGEIV GG DP+
Sbjct: 185 DGVLGMGFPAQAVGGVTPVFDHILAQRVLTEDVFSVYYSRDSKNSHLLGGEIVLGGSDPQ 244
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+++ YV ++K G WQ + + + +T +CE GC +VD+G S ++GPT + +
Sbjct: 245 YYQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVVVDTGASYISGPTSSLRLLM 303
Query: 308 HAIGGE 313
A+G +
Sbjct: 304 EALGAK 309
>gi|354487263|ref|XP_003505793.1| PREDICTED: renin-like [Cricetulus griseus]
Length = 403
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 186/317 (58%), Gaps = 29/317 (9%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERYMGG 57
+ LW +SC LP + RI LKK R +D+ L+A +R+ G
Sbjct: 12 LLILW--SSCAFSLPTDTAAFGRILLKKMPSVREILKERGVDMTKLSAEWGKFTKRFSFG 69
Query: 58 AGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY 117
G S V L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 70 NGTSPVI------------LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCS 117
Query: 118 -FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
+C HS Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E
Sbjct: 118 PLYSACEIHSLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDIVTVGGIIVT-QTFGEV 176
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T + F+LA+FDG++G+GF AVG PV+D+++ Q ++ EEVFS + +RD G
Sbjct: 177 TELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQRVLKEEVFSVYYSRDSHL-LG 235
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GE+V GG DP+H++G YV V++ G W+ + + +G+ +T +CE GC +VD+G S +
Sbjct: 236 GEVVLGGSDPQHYQGNFHYVSVSRTGSWEIAMKGVSVGS-ATLLCEEGCVVVVDTGASYI 294
Query: 297 AGPTPVVTEINHAIGGE 313
+GPT + I +G +
Sbjct: 295 SGPTSSLKLIMQTLGAK 311
>gi|148669271|gb|EDL01218.1| mCG6933 [Mus musculus]
Length = 401
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
>gi|132329|sp|P00796.1|RENI2_MOUSE RecName: Full=Renin-2; AltName: Full=Angiotensinogenase; AltName:
Full=Submandibular gland renin; Contains: RecName:
Full=Renin-2 heavy chain; Contains: RecName:
Full=Renin-2 light chain; Flags: Precursor
gi|15029868|gb|AAH11157.1| Ren2 protein [Mus musculus]
Length = 401
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
>gi|291223845|ref|XP_002731921.1| PREDICTED: expressed hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 959
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/235 (46%), Positives = 158/235 (67%), Gaps = 4/235 (1%)
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTY 136
++DA Y+GEIGIG+PP F V+FDTGSS LWVPS+ C S ++C FH+ Y + KS+TY
Sbjct: 634 NTYIDASYYGEIGIGTPPATFLVLFDTGSSYLWVPSAMCPESNMACAFHNSYDNLKSSTY 693
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T +S I YGSGS+SG S+D + +GDV +++Q+F E T + +LARFDGI+GLG
Sbjct: 694 TATRESFNITYGSGSVSGVISRDTIVIGDVRIENQLFGETTAWPDTSIVLARFDGILGLG 753
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ + +PV+DNM+ Q L+SE VFS ++ D + GE++ GG D H+ G+ TY+
Sbjct: 754 YPNLQTRSILPVFDNMLAQHLISEPVFSVYVRGDGNK---GELILGGSDQHHYSGEFTYL 810
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
PVT KGYWQF + I + ++ + C GC A+VD+GTS++AGP + +N IG
Sbjct: 811 PVTIKGYWQFTMDSIHVYDKPSQYCLDGCQAVVDTGTSVIAGPMEDIETLNTEIG 865
>gi|296810640|ref|XP_002845658.1| vacuolar protease A [Arthroderma otae CBS 113480]
gi|263406266|sp|C5FS55.1|CARP_NANOT RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|238843046|gb|EEQ32708.1| vacuolar protease A [Arthroderma otae CBS 113480]
Length = 395
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 175/296 (59%), Gaps = 24/296 (8%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL---------- 67
C S L+++ LK++ L+ ++ + ++YMG + +H
Sbjct: 15 CTSAKLHSLKLKKVSLKEQ-LEHADIDVQIKSLGQKYMG---IRPGQHEQQMFKEQTPIE 70
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
+S ++L + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVP C SI+C+ HS
Sbjct: 71 AESGHNVL-IDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHST 128
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S S+T+T G S I YGSGS+ GF SQDNV++GD+ +K+Q+F EAT E L F
Sbjct: 129 YDSSASSTFTRNGTSFAIRYGSGSLEGFVSQDNVQIGDMKIKNQLFAEATSEPGLAFAFG 188
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL---NRDPDAEEGGEIVFGGV 244
RFDGI+G+G+ I+V P + MVEQGLV E VFSF+L N+D D + FGG
Sbjct: 189 RFDGILGMGYDTISVNKITPPFYKMVEQGLVDEPVFSFYLGDTNKDGDQS---VVTFGGA 245
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
D H+ G T +P+ +K YW+ E I +G + + G I+D+GTSL+A PT
Sbjct: 246 DKSHYTGDITTIPLRRKAYWEVEFNAITLGKDTATLDNTGI--ILDTGTSLIALPT 299
>gi|15079273|gb|AAH11473.1| Ren2 protein [Mus musculus]
Length = 401
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
>gi|206611|gb|AAA42031.1| renin [Rattus norvegicus]
Length = 352
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 185/312 (59%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
>gi|118150650|ref|NP_112470.2| renin-2 [Mus musculus]
Length = 424
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 184/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 31 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 87
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 88 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 146
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 147 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 205
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 206 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 264
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 265 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 323
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 324 LIMQALGAK 332
>gi|291416142|ref|XP_002724306.1| PREDICTED: cathepsin D [Oryctolagus cuniculus]
Length = 377
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 112/244 (45%), Positives = 154/244 (63%), Gaps = 35/244 (14%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L+N+MDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C+ H +Y S+KS+T
Sbjct: 80 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSKKSST 139
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G + +I+YGSGS+SG+ SQD V V
Sbjct: 140 YVKNGTTFDIHYGSGSLSGYLSQDTVSXXXXXXXXNV----------------------- 176
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+PV+DN+++Q LV + VFSF+LNRDP A+ GGE++ GGVDPK+++G +Y
Sbjct: 177 ----------LPVFDNLMQQKLVEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSY 226
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+ VT+K YWQ + + +G+ T +CEGGC AIVD+GTSLL GP V E+ AIG +
Sbjct: 227 LNVTRKAYWQVHMDQLNVGSGLT-LCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPL 285
Query: 316 VSAE 319
+ E
Sbjct: 286 IQGE 289
>gi|366991455|ref|XP_003675493.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
gi|342301358|emb|CCC69126.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
Length = 406
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 163/273 (59%), Gaps = 4/273 (1%)
Query: 42 SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI 101
SL + ER S + D +PL N+++AQYF +I +G+PPQNF VI
Sbjct: 49 SLGHKYMNHFERANPEVSFSRDHPFFAEGDGHNVPLTNYLNAQYFADISVGTPPQNFKVI 108
Query: 102 FDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNV 161
DTGSSNLWVPSS+C S++C+ HS+Y S++Y G I YGSGS+ G+ SQD +
Sbjct: 109 LDTGSSNLWVPSSECN-SLACFLHSKYDHDASSSYKANGTKFAIQYGSGSLEGYISQDTL 167
Query: 162 EVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEE 221
+GD+ + Q F EAT E LTF +FDGI+GL + I+V VP + N +EQGL+ E+
Sbjct: 168 NIGDLTIPKQDFAEATSEPGLTFAFGKFDGILGLAYDTISVDKVVPPFYNAIEQGLLDEK 227
Query: 222 VFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGV 280
F+F+L + D + GGEI GG+D FKG ++PV +K YW+ + I +G+Q +
Sbjct: 228 KFAFYLGDTKKDEKNGGEITIGGIDESKFKGDIEWLPVRRKAYWEVKFEGIALGDQYAAL 287
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
G A +D+GTSL+ P+ + IN IG +
Sbjct: 288 ENHGAA--IDTGTSLITLPSGLAEIINTEIGAK 318
>gi|126310959|ref|XP_001372683.1| PREDICTED: chymosin-like [Monodelphis domestica]
Length = 383
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 121/306 (39%), Positives = 178/306 (58%), Gaps = 17/306 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVRHR 66
L++LA ++ S RRI L K + L H L + + + +Y +
Sbjct: 5 LFLLA---VIAISECAFRRIPLTKGKTLRKVLKEHGLLESFL-KSHKYSPSSKYQLYGEA 60
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
+DE PL N++D+QYFG+I IG+PPQ F+V+FDTGSSNLWVPS C S +C H
Sbjct: 61 AKVTDE---PLTNYLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN-SDACQNHH 116
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
R+ S T+ + I YG+GS+ G D V V +VV DQ+F +T+E F
Sbjct: 117 RFNPASSTTFRSTQEPLSIQYGTGSMEGVLGYDTVTVSQIVVPDQIFGLSTQEPGEIFTY 176
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+ FDGI+GLG+ +A A PV+DNM+ + LV++++FS +++RD +G ++ G +DP
Sbjct: 177 SEFDGILGLGYPSLAEDQATPVFDNMMNKNLVAQDLFSVYMSRD---SQGSMLILGAIDP 233
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++ G +VPVT++GYWQF + I + Q CEGGC AI+D+GTSLL GP+ + I
Sbjct: 234 SYYTGSLHWVPVTEQGYWQFSVDSITVNGQVVA-CEGGCQAILDTGTSLLVGPSYDIANI 292
Query: 307 NHAIGG 312
IG
Sbjct: 293 QSIIGA 298
>gi|311260416|ref|XP_003128442.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 394
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 193/331 (58%), Gaps = 28/331 (8%)
Query: 5 LLRSVFCLWVL-ASCLLLPASS-----NGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
++ ++ CL +L AS + +P ++ GL + L H + A+
Sbjct: 9 MVVALVCLQLLEASVIKVPLKKLKSIRQAMKEKGLLEEFLKTHKYDPAQ----------- 57
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
R+R GD + P+ +++A YFGEI IG+PPQNF V+FDTGSSNLWVPS C
Sbjct: 58 -----RYRFGDFSVALEPMA-YLEAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCK- 110
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S++C H+R+ KS+TY+ ++ + YGSGS++GFF D +++ + V DQ F +
Sbjct: 111 SLACTTHARFNPSKSSTYSTDRQTFSLQYGSGSLTGFFGYDTLKIQSIQVPDQEFGLSET 170
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E +FL A+FDGI+GL + +++ G A ++++ ++ VFSF+L+ +++GGE
Sbjct: 171 EPGTSFLYAQFDGIMGLAYPDLSAGGATTAMQGLLQEDALTSPVFSFYLSNQQSSQDGGE 230
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVD + G+ + PVT++ YWQ + + LIG++++G C GC AIVD+GTSLL
Sbjct: 231 LVLGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGDEASGWCSEGCQAIVDTGTSLLTV 290
Query: 299 PTPVVTEINHAIGGE----GVVSAECKLVVS 325
P ++++ A G E G +CK + S
Sbjct: 291 PQDYLSDLVQATGAEENEYGEFLVDCKDIQS 321
>gi|45643446|gb|AAS72876.1| aspartyl protease [Triatoma infestans]
Length = 387
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 108/232 (46%), Positives = 155/232 (66%), Gaps = 3/232 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N ++ QY+G + +G+PPQ +V+FDTGS+NLWVP + C S +C H+ Y ++S+TY
Sbjct: 63 LRNSLNTQYYGNVTLGTPPQELTVVFDTGSANLWVPLANCP-SFACIIHNTYDHKQSSTY 121
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
GK+ INYG+GSI+G S D +++GD+ VK+Q+F EA + + F ++ DGI+GL
Sbjct: 122 QPNGKALRINYGTGSITGEMSSDVLQIGDLQVKNQLFGEAPQVSNSPFGRSKADGILGLA 181
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTY 255
F IA G A+P + NM++QGL+ + VFS +LNR+PD E GGEI+FGGVD K F K T
Sbjct: 182 FPPIAKGQAIPPFFNMIDQGLLDKPVFSVYLNRNPDEEVGGEIIFGGVDEKRFNKESLTT 241
Query: 256 VPVTKKGYWQFELGDILI-GNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
VP+T YW F++ ++ G C+ GC A D+GTS + GPT V EI
Sbjct: 242 VPLTNPTYWMFKMDEVSTSGTNGKSWCQNGCRATADTGTSFIVGPTKEVAEI 293
>gi|330688453|ref|NP_001193438.1| renin precursor [Bos taurus]
Length = 398
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/305 (40%), Positives = 184/305 (60%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 13 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 65
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 66 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 124
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 125 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 183
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF AVG PV+D+++ Q +++++VFS + +R+ GGEIV GG DP++
Sbjct: 184 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNSHL-LGGEIVLGGSDPQY 242
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 243 YQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLLME 301
Query: 309 AIGGE 313
A+G +
Sbjct: 302 ALGAK 306
>gi|440903924|gb|ELR54511.1| Renin, partial [Bos grunniens mutus]
Length = 404
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/305 (40%), Positives = 184/305 (60%), Gaps = 19/305 (6%)
Query: 17 SCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL------GD 69
SC LPA + RRI LKK + + R + KER + A + +L G+
Sbjct: 19 SCTFSLPADTAAFRRIFLKK-------MPSVRESLKERGVDMARLGAEWSQLTKTLSFGN 71
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
++ L N++D QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC +C HS Y
Sbjct: 72 RTSPVV-LTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLY 130
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T L F+LA+
Sbjct: 131 DSLESSSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAK 189
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+GF AVG PV+D+++ Q +++++VFS + +R+ GGEIV GG DP++
Sbjct: 190 FDGVLGMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNSHL-LGGEIVLGGSDPQY 248
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
++ YV ++K G WQ + + + +T +CE GC IVD+G S ++GPT + +
Sbjct: 249 YQENFHYVSISKPGSWQIRMKGVSV-RSTTLLCEEGCMVIVDTGASYISGPTSSLRLLME 307
Query: 309 AIGGE 313
A+G +
Sbjct: 308 ALGAK 312
>gi|328860092|gb|EGG09199.1| hypothetical protein MELLADRAFT_42703 [Melampsora larici-populina
98AG31]
Length = 429
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 162/256 (63%), Gaps = 10/256 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF EI IG+PPQ+F VI DTGSSNLWVPS++C SI+C+ HS+Y S+
Sbjct: 103 VPLSNYLNAQYFSEITIGTPPQSFKVILDTGSSNLWVPSTRCT-SIACFLHSKYDCEASS 161
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G +I YGSGS+ G S D V +GD+ ++D F E+T+E L F +FDGI+G
Sbjct: 162 SYKANGTEFQIRYGSGSLEGVISNDVVRIGDLEIRDTDFAESTKEPGLAFAFGKFDGILG 221
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA---EEGGEIVFGGVDPKHFKG 251
LG+ I+V VP + M+EQGL+ E VF+F+L ++ +GGE +FGG+D H++G
Sbjct: 222 LGYDTISVLHTVPPFYEMIEQGLLDEPVFAFYLGTSHESGVDNQGGEAIFGGIDEAHYEG 281
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Y PV ++GYW+ L + G + + G A +D+GTSL+A PT IN ++G
Sbjct: 282 DIHYAPVRRRGYWEVALEGVRFGKEEMKLVNVGAA--IDTGTSLIALPTDTAEIINASLG 339
Query: 312 GE----GVVSAECKLV 323
+ G + +C +
Sbjct: 340 AKKSWSGQYTVDCDKI 355
>gi|148747255|ref|NP_036774.4| renin precursor [Rattus norvegicus]
gi|1350571|sp|P08424.2|RENI_RAT RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|30027675|gb|AAP13916.1| renin [Rattus sp.]
gi|51261221|gb|AAH78878.1| Renin [Rattus norvegicus]
gi|149058615|gb|EDM09772.1| renin 1, isoform CRA_b [Rattus norvegicus]
Length = 402
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 185/312 (59%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
>gi|296219067|ref|XP_002755720.1| PREDICTED: cathepsin D [Callithrix jacchus]
Length = 392
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 176/296 (59%), Gaps = 27/296 (9%)
Query: 37 RLDLHSLNAARITRKERYMGGA--------GVSGVRHRLGDSDEDILP--LKNFMDAQYF 86
R+ LH + R T E MGG +S + +P LKN+MDAQY+
Sbjct: 23 RIPLHKFTSIRRTMSE--MGGPVEDLIAKGPISKYSQEMPAMPGGPIPEILKNYMDAQYY 80
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKS--C 143
GEIGIG+PPQ F+V+FDTGSSNLWVPS C I+C + + S + G C
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACSALGQGGRKWSQLCLDPGPPVPC 140
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ + ++ G V V+ QVF EAT++ +TF+ A+FDGI+G+ + I+V
Sbjct: 141 RSSLSASALGG-----------VKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVN 189
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
+ +PV+DN+++Q LV + +FSF+LNRDPDA+ GGE++ GG D K++KG Y+ VT+K Y
Sbjct: 190 NVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLFYLNVTRKAY 249
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
WQ + + + + T +C+GGC AIVD+GTSL+ GP V E+ AIG ++ E
Sbjct: 250 WQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAMPLIQGE 304
>gi|195046656|ref|XP_001992194.1| GH24344 [Drosophila grimshawi]
gi|193893035|gb|EDV91901.1| GH24344 [Drosophila grimshawi]
Length = 373
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/230 (48%), Positives = 155/230 (67%), Gaps = 8/230 (3%)
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSR 127
D+DE+ L N ++ Y+G I IG+PPQ+F V+FD+GSSNLWVPSS+C+F I+C H++
Sbjct: 54 DADEE---LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSRCFFLDIACQNHNK 110
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y KS+TY G+S I YGSGS+SGF S D+V+V + +K Q F EAT E +F A
Sbjct: 111 YDHDKSSTYVANGESFSIQYGSGSLSGFLSTDDVDVSGLTIKSQTFAEATNEPGTSFNNA 170
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDP 246
+FDGI+G+ ++ I+ + VP + NMV QGLV + VFSF+L RD +GGE++FGG DP
Sbjct: 171 KFDGILGMAYQSISSDNVVPPFYNMVSQGLVDDSVFSFYLARDGTSTTDGGELIFGGSDP 230
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
+ G +YVP++++GYWQF + I Q+ G AI D+GTSLL
Sbjct: 231 AKYTGDLSYVPISEQGYWQFAVDSATIDGQTLGES---FQAIADTGTSLL 277
>gi|194218271|ref|XP_001501895.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 115/252 (45%), Positives = 160/252 (63%), Gaps = 12/252 (4%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N+MD YFG I IG+P Q F+VIFDTGSSNLWVPS C S++C H+R+
Sbjct: 63 DTQPLENYMDEAYFGTISIGTPAQEFTVIFDTGSSNLWVPSIYCS-SLACSDHNRFNPED 121
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY +S I YG+GS++G D V VG + +Q+F + T GS + A FDG
Sbjct: 122 SSTYRATSESVSITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYY-APFDG 180
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ +QGLVS+++FS +L+ D E G ++FGG+DP ++ G
Sbjct: 181 ILGLAYPSISASGATPVFDNIWDQGLVSQDLFSVYLSS--DDESGSVVMFGGIDPSYYTG 238
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPV+ +GYWQ + + + +S C GGC AIVD+GTSLLAGPT + I +G
Sbjct: 239 SLHWVPVSNEGYWQITMDSVTVNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYLG 297
Query: 312 ------GEGVVS 317
GEGV+S
Sbjct: 298 FSEDSSGEGVIS 309
>gi|332024604|gb|EGI64802.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 361
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 124/292 (42%), Positives = 170/292 (58%), Gaps = 17/292 (5%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQ 96
R+ LH ++AR + Y G S VR PL NF +AQY+G I IG+P Q
Sbjct: 3 RILLHKTSSARKSIGIDYRQGNLTSIVRE----------PLLNFRNAQYYGVISIGTPRQ 52
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F V+FDTGS+NLWVPS C I+C H +Y +R S TY G +I Y G++SG+
Sbjct: 53 RFKVLFDTGSANLWVPSVHCNLEDITCLSHRKYNNRTSRTYIPNGTLFDIQYEYGTLSGY 112
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
S D V V + + +Q F EA E + FL A+FDGI+G+G+ I++ PV+ NMV+Q
Sbjct: 113 LSTDVVNVAGLNIINQTFGEAINEPGIAFLYAKFDGILGMGYPNISILGVTPVFTNMVQQ 172
Query: 216 GLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIG 274
GLVS +FSF+LNR+ D+ G ++ GG DP + G+ TYV VT KGYWQF + I +
Sbjct: 173 GLVSSPIFSFYLNRNLLDSSAGSVLILGGSDPALYDGELTYVNVTHKGYWQFTMDKIQME 232
Query: 275 NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAECKLV 323
N++ +C GC AI D+G S LAGP + I I + GVV +C +
Sbjct: 233 NET--LCVNGCQAIADTGFSRLAGPPTDIAIITSRIAIDDFNGVVYVDCDQI 282
>gi|223468|prf||0807285A renin precursor
Length = 401
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 125/309 (40%), Positives = 183/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+G AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
>gi|327278828|ref|XP_003224162.1| PREDICTED: pepsin A-like isoform 2 [Anolis carolinensis]
Length = 386
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 171/287 (59%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L++ ++ L H L + + +G G+ + + PL+N+MD +Y G
Sbjct: 22 LKKTKSLRQNLKEHGLLEKYLQKHHHNLGSKYFPGLAN-----ENAAEPLENYMDIEYIG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F V+FDTGSSNLWVPS C S +C H+R+ + S+TY +S + Y
Sbjct: 77 TISIGTPAQQFVVLFDTGSSNLWVPSVYCS-SSACSNHNRFNPQDSSTYQATSQSVSVTY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++GF + D V+VG +VV +Q+F + T GS + + FDGI+GL F IA A
Sbjct: 136 GTGSMTGFLAYDTVQVGSIVVTNQIFGLSETEPGSFLYY-SPFDGILGLAFPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ
Sbjct: 195 PVFDNMMSEGLVSQDLFSVYLSS--DDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I + QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 253 TLDSITLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
>gi|392575952|gb|EIW69084.1| hypothetical protein TREMEDRAFT_39371 [Tremella mesenterica DSM
1558]
Length = 446
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/268 (44%), Positives = 162/268 (60%), Gaps = 15/268 (5%)
Query: 68 GDSDEDIL------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
GDS++ +L PL ++M+AQY+ I IG+PPQ F V+ DTGSSNLWVPSS C SI+
Sbjct: 112 GDSEKRVLKGGHGVPLSDYMNAQYYAPITIGTPPQEFKVVLDTGSSNLWVPSSSCT-SIA 170
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C+ HS+Y S S+TY G I YGSGS+ GF S D V + D+ +K Q F EAT+E
Sbjct: 171 CFLHSKYDSSASSTYKANGSDFAIRYGSGSLEGFVSSDTVTIADLSLKHQDFAEATKEPG 230
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
L F +FDGI+GL + I+V VP + M+ +GL+ E VFSF L D + +GGE +F
Sbjct: 231 LAFAFGKFDGIMGLAYDTISVNHIVPPFYTMLNRGLLDEPVFSFRLGSDEN--DGGECIF 288
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD + GK YVP+ +KGYW+ EL I G + + G A +D+GTSL+ P+
Sbjct: 289 GGVDDSAYTGKIQYVPIRRKGYWEVELEKIGFGEEELELENTGAA--IDTGTSLIVMPSD 346
Query: 302 VVTEINHAIGG----EGVVSAECKLVVS 325
V +N IG G + +C V S
Sbjct: 347 VAEMLNKEIGATKSWNGQYTVDCNTVPS 374
>gi|73620985|sp|P81498.2|PEPC_SUNMU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C-1; Flags:
Precursor
gi|9798662|dbj|BAB11753.1| pepsinogen C [Suncus murinus]
Length = 389
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 109/266 (40%), Positives = 165/266 (62%), Gaps = 6/266 (2%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++ GD P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 53 KYHFGDFSVAYEPMA-YMDASYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 110
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ +S+TY+ G++ + YGSGS++GFF D + V ++ V Q F + E
Sbjct: 111 GHARFNPNQSSTYSTNGQTFSLQYGSGSLTGFFGYDTMTVQNIKVPHQEFGLSQNEPGTN 170
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +A+G A M+++G ++ VFSF+L+ ++ GG ++FGG
Sbjct: 171 FIYAQFDGIMGMAYPSLAMGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGG 230
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + + LIG Q+TG C+ GC AIVD+GTSLL P +
Sbjct: 231 VDNSLYTGQIFWAPVTQELYWQIGVEEFLIGGQATGWCQQGCQAIVDTGTSLLTVPQQFM 290
Query: 304 TEINHAIGGE----GVVSAECKLVVS 325
+ + A G + G ++ C + S
Sbjct: 291 SALQQATGAQQDQYGQLAVNCNSIQS 316
>gi|327278826|ref|XP_003224161.1| PREDICTED: pepsin A-like isoform 1 [Anolis carolinensis]
Length = 387
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 171/287 (59%), Gaps = 11/287 (3%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L++ ++ L H L + + +G G+ + + PL+N+MD +Y G
Sbjct: 22 LKKTKSLRQNLKEHGLLEKYLQKHHHNLGSKYFPGLAN-----ENAAEPLENYMDIEYIG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F V+FDTGSSNLWVPS C S +C H+R+ + S+TY +S + Y
Sbjct: 77 TISIGTPAQQFVVLFDTGSSNLWVPSVYCS-SSACSNHNRFNPQDSSTYQATSQSVSVTY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++GF + D V+VG +VV +Q+F + T GS + + FDGI+GL F IA A
Sbjct: 136 GTGSMTGFLAYDTVQVGSIVVTNQIFGLSETEPGSFLYY-SPFDGILGLAFPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DNM+ +GLVS+++FS +L+ D + G ++FGGVD ++ G +VP++ + YWQ
Sbjct: 195 PVFDNMMSEGLVSQDLFSVYLSS--DDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQI 252
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I + QS C GGC AIVD+GTSLLAGP + I + IG
Sbjct: 253 TLDSITLNGQSI-ACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGAS 298
>gi|73915318|gb|AAZ92540.1| aspartyl protease 1 [Coccidioides posadasii]
gi|73915320|gb|AAZ92541.1| aspartyl protease 1 [Coccidioides posadasii]
Length = 399
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 117/278 (42%), Positives = 166/278 (59%), Gaps = 11/278 (3%)
Query: 52 ERYMGGAGVSGVRHRLGD----SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSS 107
++Y G S + L D +D + + NF++AQYF EI IG+PPQNF V+ DTGSS
Sbjct: 48 QKYFGSLPSSQQQTVLSDEYSTTDGHNVLVDNFLNAQYFSEISIGNPPQNFKVVLDTGSS 107
Query: 108 NLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVV 167
NLWVPSS+C SI+CY H++Y S S+TY + G I YGSGS+SGF SQD + +GD+
Sbjct: 108 NLWVPSSEC-GSIACYLHNKYDSSASSTYKKNGTEFAIRYGSGSLSGFVSQDTLRIGDLT 166
Query: 168 VKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL 227
++ Q F EAT E L F RFDGI+GLG+ I+V VP + NM+ +GL+ E VF F+L
Sbjct: 167 IEGQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNKIVPPFYNMINEGLIDEPVFGFYL 226
Query: 228 NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAA 287
+ FGGVD F G+ +P+ +K YW+ + I GN+ + + G
Sbjct: 227 GDTNKEGDDSYATFGGVDSSLFSGEMIKIPLRRKAYWEVDFDAIAFGNERAELEDTGI-- 284
Query: 288 IVDSGTSLLAGPTPVVTEINHAIGGE----GVVSAECK 321
I+D+GTSL+A P+ + +N IG + G + +C
Sbjct: 285 ILDTGTSLIALPSTLAELLNREIGAKKSWNGQYTVDCN 322
>gi|322708430|gb|EFZ00008.1| vacuolar protease A [Metarhizium anisopliae ARSEF 23]
Length = 395
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 121/264 (45%), Positives = 168/264 (63%), Gaps = 11/264 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI +G+PPQ F V+ DTGSSNLWVPS C SI+CY HS Y S S+
Sbjct: 75 VPVSNFMNAQYFSEITVGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACYLHSTYDSSSSS 133
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSGS+SGF SQD V +GD+ +KDQ F EAT E L F +FDGI+G
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVTIGDLKIKDQDFAEATSEPGLAFAFGKFDGILG 193
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ ++V VP + M+ Q L+ E VF+F+L +EEG E VFGG+D H+ GK
Sbjct: 194 LGYDTLSVNKIVPPFYQMINQKLLDEPVFAFYLG---SSEEGSEAVFGGIDKDHYTGKIE 250
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+P+ +K YW+ ++ I G+ + G AI+D+GTSL P+ + +N IG +
Sbjct: 251 YIPLRRKAYWEVDIHSIAFGDDVAELDRTG--AILDTGTSLNVLPSTLAELLNKEIGAKK 308
Query: 314 ---GVVSAECKLVVSQYGDLIWDL 334
G + +C + S D++++L
Sbjct: 309 SWNGQYTVDCAQIKS-LPDIVFNL 331
>gi|494607|pdb|1SMR|A Chain A, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880102|pdb|1SMR|C Chain C, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880104|pdb|1SMR|E Chain E, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880106|pdb|1SMR|G Chain G, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
Length = 335
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 162/238 (68%), Gaps = 4/238 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 9 LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 68
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD+V VG + V Q F E T+ + F+LA+FDG++G+
Sbjct: 69 YMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTQLPLIPFMLAQFDGVLGM 127
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG DP+H++G Y
Sbjct: 128 GFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGSDPQHYQGDFHY 186
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
V ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT + I A+G +
Sbjct: 187 VSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGAK 243
>gi|327279867|ref|XP_003224677.1| PREDICTED: cathepsin E-A-like [Anolis carolinensis]
Length = 406
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 104/253 (41%), Positives = 162/253 (64%), Gaps = 6/253 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L ++M+ +Y+GE+ IG+P Q F+VIFDTGS++ WVPS+ C S +C H ++K+ S +Y
Sbjct: 73 LCDYMNTEYYGEVSIGTPAQKFTVIFDTGSADFWVPSAYC-ISDACELHQKFKAFSSESY 131
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+G + G ++D V++G++ ++DQ F E+ E +TF A FDG++GLG
Sbjct: 132 AHGGQKFTLQYGTGRLMGIVAKDKVQIGNITIEDQAFGESVFEPGMTFAFAHFDGVLGLG 191
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ ++V +++PV+DN+++Q LV E +FSF LNR+ + + GG ++ GG+D F G +
Sbjct: 192 YPTLSVTNSMPVFDNIIKQHLVEEPLFSFSLNREHNVDNGGVLILGGIDHSLFTGPIHWF 251
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
PVTKKGYWQ + + I Q T C GC AIVDSGTSL+ GP + + +IG
Sbjct: 252 PVTKKGYWQIHMNSVKIQGQVTS-CISGCEAIVDSGTSLITGPLSQIVRLQQSIGAFPTA 310
Query: 317 SAE----CKLVVS 325
+ E C+ V S
Sbjct: 311 TGEFLVDCRRVSS 323
>gi|331215715|ref|XP_003320537.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
gi|309299527|gb|EFP76118.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 430
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 195/364 (53%), Gaps = 43/364 (11%)
Query: 4 KLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLD--------LHSLNAARITRKERYM 55
K +V + LA+ +P+++ G R+ L K + L++L + ++Y
Sbjct: 2 KSTSAVVAITALAAVASIPSATAGKHRMKLHKMPITSSANSQTILNNLQSQTAWVSQKYF 61
Query: 56 GGAGVSGVR-----HRLGDSDE--DI----------------LPLKNFMDAQYFGEIGIG 92
G + + H L E D+ +PL N+++AQYF EI +G
Sbjct: 62 GVDDTASEKKFRYGHALKQPKEGDDVSIQMIEEAELASAGHEVPLSNYLNAQYFSEISLG 121
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
+PPQ+F V+ DTGSSNLWVPS++C SI+C+ HS+Y S TY G +I YGSGS+
Sbjct: 122 TPPQSFKVVLDTGSSNLWVPSTRCT-SIACFLHSKYDCEASETYQANGTEFKIRYGSGSL 180
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
G S D + +GD+ V D F E+T+E L F +FDGI GLG+ I+V VP + M
Sbjct: 181 EGVISNDVLTIGDLTVPDVDFAESTKEPGLAFAFGKFDGIFGLGYDTISVLHTVPPFYKM 240
Query: 213 VEQGLVSEEVFSFWL------NRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
+E G++ + VF+F+L DP+ GGE+VFGGVD H++G+ Y PV ++GYW+
Sbjct: 241 MENGMLDDPVFAFYLGSAQGNKADPN---GGEVVFGGVDEAHYEGEIFYAPVRRRGYWEV 297
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQ 326
EL + G + + G A +D+GTSL+A PT IN IG S + + S+
Sbjct: 298 ELKSVKFGKEEMKLHNVGAA--IDTGTSLIALPTDTAEIINAEIGATKSWSGQYTVDCSR 355
Query: 327 YGDL 330
+L
Sbjct: 356 IPEL 359
>gi|307167891|gb|EFN61280.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 431
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/363 (34%), Positives = 186/363 (51%), Gaps = 61/363 (16%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
+F L+++A+ L + I + +R+ LH ++ R + + G+ +
Sbjct: 1 MFRLFLMATALFV--------LIDAQLQRIQLHKMDPIR-----KRLRKIGIDLQQINFT 47
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSR 127
S+ L N++D++Y+G I IG+PPQ F V+FDTGSSNLW+PS C + ++C H++
Sbjct: 48 KSNPSSQSLYNYLDSEYYGNITIGTPPQQFKVLFDTGSSNLWIPSILCSTANVACALHNK 107
Query: 128 YKSRKSNTYTEIGKSCEINY-------GSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
Y S KS TY C + Y SGS+SGF S D V V + V+ Q F EA E
Sbjct: 108 YDSTKSRTYKVNNTICSLQYDITSIPFNSGSVSGFLSTDVVNVAGLNVQGQTFAEAIDEL 167
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN------------ 228
L ++A FDGI+G+G+ IAV PV+ N+++Q LV + VFSF+LN
Sbjct: 168 VLALVVAEFDGILGMGYSTIAVDGVTPVFYNLIKQKLVPQPVFSFYLNRHVFSYSIFKSI 227
Query: 229 ------------------------RDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
RDP A+ GGE++ GG DP ++ G YV VTKKGYW
Sbjct: 228 SNKYIYNKKKYIYIAILKRIYNVYRDPSAKVGGELILGGSDPAYYTGHFKYVDVTKKGYW 287
Query: 265 QFELGDILIG----NQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAEC 320
QF + + I N+ +C GGC AI D+G SL+ GPT + IN IG +
Sbjct: 288 QFLMDRVRITRTKFNKGRTLCMGGCQAIADTGMSLIVGPTSEIDIINKYIGANKTTDSSG 347
Query: 321 KLV 323
++
Sbjct: 348 NII 350
>gi|291409620|ref|XP_002721076.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 172/296 (58%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C GC AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
>gi|384485237|gb|EIE77417.1| hypothetical protein RO3G_02121 [Rhizopus delemar RA 99-880]
Length = 399
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 114/261 (43%), Positives = 162/261 (62%), Gaps = 10/261 (3%)
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSR 131
E +PL N+++AQY+GEI +G+PPQ FSV+FDTGSSN WVPS++C FS++C H RY +
Sbjct: 67 EHGVPLANYLNAQYYGEISLGTPPQIFSVVFDTGSSNTWVPSTRC-FSLACLTHRRYSAS 125
Query: 132 KSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDG 191
+S+TY G I YG+G++ G SQD + VG + + +Q F E+T E LTF+ A+FDG
Sbjct: 126 RSSTYVRNGTQFSITYGTGALQGVISQDTLRVGGIQIDNQQFAESTIEPGLTFIYAQFDG 185
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR---DPDAEEGGEIVFGGVDPKH 248
I GLG+ I+V VP + NMV + L+SE VFSFW+N + + GGEI FG +D
Sbjct: 186 IFGLGYDTISVQRVVPPFYNMVNRNLISESVFSFWINDINVQAENDIGGEIAFGEIDQTR 245
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G + PV +KGYW+ + + +G + V A +D+GTSL+ PT V EI+
Sbjct: 246 YTGDLIWSPVQRKGYWEIAIDNFRVG--ADPVNPSSLTAAIDTGTSLILVPTSVSIEIHA 303
Query: 309 AIG----GEGVVSAECKLVVS 325
+G G G+ C V S
Sbjct: 304 RLGAQLSGNGLYIFSCATVSS 324
>gi|291409605|ref|XP_002721070.1| PREDICTED: pepsin II-1-like [Oryctolagus cuniculus]
Length = 387
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 108/247 (43%), Positives = 159/247 (64%), Gaps = 10/247 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N++D +YFG I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H R+ S+T+
Sbjct: 67 LENYLDTEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACILHKRFNPDDSSTF 125
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G D V+VG + +Q+F + E L L+A FDGI+GL
Sbjct: 126 QATSETLSITYGTGSMTGILGYDTVKVGSIEDTNQIFGLSKTEPGLFLLVAPFDGILGLA 185
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+ DA PV+DNM QGLVS+++FS +L+ D ++G ++FGG+D ++ G +V
Sbjct: 186 YPSISASDATPVFDNMWNQGLVSQDLFSVYLSS--DEQKGSLVMFGGIDSSYYTGSLNWV 243
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG----- 311
PV+ +GYWQ + I + + T C C A+VD+GTSLLAGPT ++ I IG
Sbjct: 244 PVSHEGYWQITVDSITMDGE-TIACADSCQAVVDTGTSLLAGPTSAISNIQSYIGASKNL 302
Query: 312 -GEGVVS 317
GE ++S
Sbjct: 303 LGENIIS 309
>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
Length = 379
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 126/300 (42%), Positives = 182/300 (60%), Gaps = 14/300 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
VL S LLL ++ L ++ + K + N R K +Y GG + +R +
Sbjct: 7 VLWSALLLAEAT--LVQVPITKVKETKSKANEIR-KLKAKY-GGTPKAEIRDLV------ 56
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRK 132
+ L N++D Y+G+I IG+P Q F V+FDTGSSNLWVP + C + +C H+ Y
Sbjct: 57 VEKLFNYVDDSYYGKITIGTPGQEFLVLFDTGSSNLWVPVAPCSADNAACENHNTYDPSA 116
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+T+ + G+S I YGSGS+SG+ +D V+V + +K QVF AT E TF+ A FDGI
Sbjct: 117 SSTHVKKGESFSIQYGSGSLSGYLVEDTVDVEGLKIKKQVFAAATNEPGETFVYAPFDGI 176
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+GF+ IAV D P W NM+ Q L+SE+VFSF+L R ++EGG +V GG D ++++G
Sbjct: 177 MGMGFKSIAVDDVTPPWYNMISQHLISEKVFSFYLARRGTSDEGGVMVVGGNDDRYYEGD 236
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
YVPV+++GYWQFE+ + + +C+ C AI D+GTSL+A PT EI IG
Sbjct: 237 FHYVPVSEQGYWQFEMAEAHV--NGVRICD-RCQAIADTGTSLIAVPTDKYEEIQKEIGA 293
>gi|200702|gb|AAA40050.1| renin [Mus musculus]
Length = 401
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/309 (40%), Positives = 183/309 (59%), Gaps = 8/309 (2%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L ++ LW + C + RI LKK + + R R V R
Sbjct: 8 LWALLLLW--SPCTFSLPTGTTFERIPLKKMP-SVREILEERGVDMTRLSAEWDVFTKRS 64
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYF 124
L D ++ L N++++QY+GEIGIG+PPQ F V+FDTGS+NLWVPS+KC ++C
Sbjct: 65 SLTDLISPVV-LTNYLNSQYYGEIGIGTPPQTFKVMFDTGSANLWVPSTKCSRLYLACGI 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y+S S++Y E G I+YGSG + GF SQD+V VG + V Q F E T + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+LA+FDG++G+G AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGS 241
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP+H++G YV ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 305 EINHAIGGE 313
I A+G +
Sbjct: 301 LIMQALGAK 309
>gi|226288833|gb|EEH44345.1| vacuolar protease A [Paracoccidioides brasiliensis Pb18]
Length = 400
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 120/300 (40%), Positives = 177/300 (59%), Gaps = 13/300 (4%)
Query: 23 ASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE----DI 74
SS + ++ L K ++LD ++ ++YMG D+ +
Sbjct: 16 TSSAKVHKLKLNKISLSQQLDHANIETQVKALGQKYMGVRPSQHFNEMFKDTSKASGGHS 75
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+ + NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H++Y S S+
Sbjct: 76 VLVDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
T+ + G I YGSGS+SGF SQD V +GD+ V +Q F EAT E L F RFDGI+G
Sbjct: 135 THRKNGTEFAIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP++ M+ Q L+ VF F+L N D D ++ E FGG+D HF G+
Sbjct: 195 LGYDTISVNHIVPLFYQMINQKLLDMPVFGFYLGNSDVDGDD-SEATFGGIDESHFTGEL 253
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T + + ++ YW+ +L I+ GN+ + G I+D+GTSLLA P+ + +N IG +
Sbjct: 254 TTISLRRRAYWEVDLDAIIFGNEMAELENTGV--ILDTGTSLLALPSTIAELLNKQIGAK 311
>gi|119187279|ref|XP_001244246.1| hypothetical protein CIMG_03687 [Coccidioides immitis RS]
gi|303317132|ref|XP_003068568.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|6760077|gb|AAF28186.1|AF162132_1 aspartyl proteinase [Coccidioides posadasii]
gi|240108249|gb|EER26423.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|392870962|gb|EAS32810.2| vacuolar protease A [Coccidioides immitis RS]
Length = 399
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 111/249 (44%), Positives = 155/249 (62%), Gaps = 7/249 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQNF V+ DTGSSNLWVPSS+C SI+CY H++Y S S+TY
Sbjct: 77 VDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSEC-GSIACYLHNKYDSSASSTY 135
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD + +GD+ ++ Q F EAT E L F RFDGI+GLG
Sbjct: 136 KKNGTEFAIRYGSGSLSGFVSQDTLRIGDLTIEGQDFAEATNEPGLAFAFGRFDGILGLG 195
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ +GL+ E VF F+L + FGGVD F G+ +
Sbjct: 196 YDTISVNKIVPPFYNMINEGLIDEPVFGFYLGDTNKEGDDSYATFGGVDSSLFSGEMIKI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ + I GN+ + + G I+D+GTSL+A P+ + +N IG +
Sbjct: 256 PLRRKAYWEVDFDAIAFGNERAELEDTGI--ILDTGTSLIALPSTLAELLNREIGAKKSW 313
Query: 314 -GVVSAECK 321
G + +C
Sbjct: 314 NGQYTVDCN 322
>gi|253762217|gb|ACT35560.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 164/251 (65%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + ++ A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAYMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
>gi|258563860|ref|XP_002582675.1| vacuolar protease A [Uncinocarpus reesii 1704]
gi|237908182|gb|EEP82583.1| vacuolar protease A [Uncinocarpus reesii 1704]
Length = 400
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 113/248 (45%), Positives = 152/248 (61%), Gaps = 7/248 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQNF V+ DTGSSNLWVPSS+C SI+C+ HS+Y S S+TY
Sbjct: 76 VDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSQCG-SIACFLHSKYDSSASSTY 134
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD + +GD+VVK+Q F EAT E L F RFDGI+GLG
Sbjct: 135 KKNGTEFSIRYGSGSLSGFVSQDTLRIGDLVVKEQDFAEATNEPGLAFAFGRFDGILGLG 194
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ I+V VP + NM+ Q L+ E VF F+L + FGGVD F +
Sbjct: 195 YDTISVNKIVPPFYNMLNQKLIDEPVFGFYLGDTNKEGDDSYATFGGVDDSLFSDDMIKI 254
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+ +K YW+ + + GN + G I+D+GTSL+A P+ + +N IG +
Sbjct: 255 PLRRKAYWEVDFDAVTFGNDRAELENTGI--ILDTGTSLIALPSTLAELLNKEIGAKKSW 312
Query: 314 -GVVSAEC 320
G + EC
Sbjct: 313 NGQYTVEC 320
>gi|430811193|emb|CCJ31368.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 411
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 135/332 (40%), Positives = 183/332 (55%), Gaps = 35/332 (10%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKK-----RRLDLHS-LNAARITRKERYMGGA 58
++ + L+VL C S GL R+ L+K R +H+ + A + RK
Sbjct: 1 MVSIAYWLYVLFVCQT--GVSRGLHRLELRKIPGDHRVNKVHNDIEAYSLARKYTLFYSY 58
Query: 59 GVSGVRHR--------LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVI-FDTGSSNL 109
G +++ LG + ++ L NF +AQ +I IG+PPQ F V+ DTGSSNL
Sbjct: 59 GRDERKNKEPIIHGKPLGTNAHEV-SLTNFFNAQCRIDITIGTPPQTFKVVVLDTGSSNL 117
Query: 110 WVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVK 169
WVPSSKC S++C HS+Y S S+TY G EI YGSGSISGF S D V D+V+
Sbjct: 118 WVPSSKCT-SLACIIHSKYDSSLSSTYIANGSKFEIRYGSGSISGFISTDKFSVSDIVLP 176
Query: 170 DQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR 229
Q F EA E TF RFDGI+GLG+ IAV +P + NMVEQ ++E VF+FW+
Sbjct: 177 AQEFAEAMSEPGFTFTFGRFDGILGLGYSSIAVNGIIPPFYNMVEQNAINEPVFAFWMGN 236
Query: 230 DPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ---------FELGDILIGNQSTGV 280
EGGE FGG+DP H++G TY+PV +K YW+ F G IG ++ G
Sbjct: 237 IEKDIEGGECTFGGIDPMHYEGDLTYIPVRRKAYWEAFCLVDLSFFAYGKDFIGMENVG- 295
Query: 281 CEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
AI+D+GTSL+ P + +N+AIG
Sbjct: 296 ------AILDTGTSLIVMPKNIADLLNNAIGA 321
>gi|195399279|ref|XP_002058248.1| GJ15983 [Drosophila virilis]
gi|194150672|gb|EDW66356.1| GJ15983 [Drosophila virilis]
Length = 372
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 112/249 (44%), Positives = 155/249 (62%), Gaps = 4/249 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N M+ Y+G I IG+PPQ+F V+FD+GSSNLWVPS C S +C H++Y S S+TY
Sbjct: 61 LSNSMNMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSKTCS-SYACEVHNQYDSSASSTY 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SG + D V V + V+ Q F EAT E F A FDGI+G+G
Sbjct: 120 QANGESFSIQYGTGSLSGILATDIVNVNGLSVESQTFAEATNEPGTNFNDANFDGILGMG 179
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
++ IA + VP + NMV QGLV + VFSF+L RD + +GGE++FGG D + G TYV
Sbjct: 180 YQSIAQDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDLTYV 239
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P++++GYWQF + I QS +C+ C AI D+GTSL+ P ++N + +
Sbjct: 240 PISEQGYWQFTMAGASIDGQS--LCD-NCQAIADTGTSLIVAPANAYMQLNDILNVDDQG 296
Query: 317 SAECKLVVS 325
+C V S
Sbjct: 297 LVDCSSVSS 305
>gi|149058614|gb|EDM09771.1| renin 1, isoform CRA_a [Rattus norvegicus]
Length = 366
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 110/236 (46%), Positives = 158/236 (66%), Gaps = 4/236 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC +C H+ Y S +S++
Sbjct: 40 LTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSESSS 99
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD V VG ++V Q F E T + F+LA+FDG++G+
Sbjct: 100 YMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVLGM 158
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AV +PV+D+++ Q ++ EEVFS + +R+ GGE+V GG DP+H++G Y
Sbjct: 159 GFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEVVLGGSDPQHYQGNFHY 217
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
V ++K G WQ + + +G +T +CE GC A+VD+GTS ++GPT + I A+G
Sbjct: 218 VSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALG 272
>gi|223891|prf||1004236A renin
Length = 336
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 161/238 (67%), Gaps = 4/238 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N++++QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 12 LTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 71
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+YGSG + GF SQD+V VG + V Q F E T + F+LA+FDG++G+
Sbjct: 72 YMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPFMLAQFDGVLGM 130
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
GF AVG PV+D+++ QG++ E+VFS + NR P GGE+V GG DP+H++G Y
Sbjct: 131 GFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGPHL-LGGEVVLGGSDPEHYQGDFGY 189
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
V ++K WQ + + +G+ ST +CE GC +VD+G+S ++ PT + I A+G +
Sbjct: 190 VSLSKTDSWQITMKGVSVGS-STLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGAK 246
>gi|206609|gb|AAA42030.1| preprorenin (EC 3.4.99.19) [Rattus norvegicus]
Length = 402
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 184/312 (58%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF AV +PV+D+++ ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQAVDGVIPVFDHILSHEVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSVG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
>gi|50294061|ref|XP_449442.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528756|emb|CAG62418.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 160/257 (62%), Gaps = 13/257 (5%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+MDAQYF +I +G+PPQ F VI DTGSSNLWVPS C S++C+ H++Y +S+
Sbjct: 81 VPLSNYMDAQYFADISLGTPPQKFKVILDTGSSNLWVPSVDC-GSLACFLHNKYDHSQSS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G+ I+YGSGSI G+ S+DN+++GD+ +++Q F E T E L F +FDGI+G
Sbjct: 140 TYIKDGRPLSISYGSGSIEGYISEDNLQIGDLTIQNQKFGETTSEPGLAFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN--RDPDAE-----EGGEIVFGGVDPK 247
L + IA D P + + ++Q L+ E FSF+L DP AE +GG GGVD
Sbjct: 200 LAYDTIAQDDITPPFYSAIQQHLLDESKFSFYLKSVNDPAAEGGSASDGGVFTLGGVDSS 259
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
FKG + V ++ YW+ L I +G+QSTG E AAI D+GTSL+ P+ + IN
Sbjct: 260 KFKGDLIPLHVRRQAYWEVPLNAIKLGDQSTGKLENTGAAI-DTGTSLITLPSDMAEIIN 318
Query: 308 HAIGGE----GVVSAEC 320
IG + G + EC
Sbjct: 319 AQIGAKKGWTGQYTLEC 335
>gi|195134380|ref|XP_002011615.1| GI11125 [Drosophila mojavensis]
gi|193906738|gb|EDW05605.1| GI11125 [Drosophila mojavensis]
Length = 371
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 112/251 (44%), Positives = 157/251 (62%), Gaps = 8/251 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N ++ Y+G I IG+PPQNF V+FD+GSSNLWVPS C S +C H++Y S S+TY
Sbjct: 60 LSNSLNMAYYGAITIGTPPQNFKVLFDSGSSNLWVPSKNCP-SYACEVHNQYDSSASSTY 118
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SGF S D V+V + +K Q F EAT E F A FDGI+G+G
Sbjct: 119 EANGESFSIQYGTGSLSGFLSTDTVDVNGLSIKKQTFAEATNEPGTNFNNANFDGILGMG 178
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
++ I+ + VP + NMV Q L+ + VFSF+L RD + +GGE++FGG D + G TYV
Sbjct: 179 YQSISQDNVVPPFYNMVSQDLIDQSVFSFYLARDGTSSQGGELIFGGSDSSLYSGDFTYV 238
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGEG 314
P++++GYWQF + + + +C+ C AI D+GTSLL P +N + EG
Sbjct: 239 PISQEGYWQFTMAGASV--EGYSLCD-NCQAIADTGTSLLVAPANAYELLNEILNVNDEG 295
Query: 315 VVSAECKLVVS 325
+V +C V S
Sbjct: 296 LV--DCSTVSS 304
>gi|348502999|ref|XP_003439054.1| PREDICTED: renin-like [Oreochromis niloticus]
Length = 396
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 178/305 (58%), Gaps = 11/305 (3%)
Query: 13 WV-LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
W+ L + L +S+ LRRI LKK +L ++ ++ A + + D++
Sbjct: 8 WIYLVALSLTVTTSHALRRIALKKMPSIRETLQELGVSVEQVMTELA-----QKSIADTN 62
Query: 72 EDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRY 128
+P L N++D QYFGEI IGSP Q F+V+FDTGS+NLWVPS C FS +C+ H+RY
Sbjct: 63 NGTVPTPLTNYLDTQYFGEISIGSPAQMFNVVFDTGSANLWVPSQSCSPFSTACFTHNRY 122
Query: 129 KSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLAR 188
+ KS TY E G I Y SG++ GF S+D V V + QVF EAT ++ F+ A+
Sbjct: 123 DASKSRTYIENGTGFSIKYASGNVRGFLSEDVVVV-GGIPVVQVFAEATALSAMPFIFAK 181
Query: 189 FDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKH 248
FDG++G+G+ +A+ PV+D ++ Q ++ EEVFS + +RDP GGE+V GG DP +
Sbjct: 182 FDGVLGMGYPNVAIDGITPVFDRIMSQHVLKEEVFSIYYSRDPKRSPGGELVLGGTDPNY 241
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G Y+ + G W+ + + +G + C GC A++D+G+S + GP V+ +
Sbjct: 242 YTGSFNYINTRQTGKWELTMKGVSVGREMM-FCAEGCTAVIDTGSSYITGPASSVSVLMK 300
Query: 309 AIGGE 313
IG +
Sbjct: 301 TIGAQ 305
>gi|432103960|gb|ELK30793.1| Gastricsin [Myotis davidii]
Length = 390
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 170/289 (58%), Gaps = 1/289 (0%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S G+ RI LKK + ++ + K ++ + P+ N++DA
Sbjct: 14 SEGVERIILKKGKSIRQTMEEKGVLEKFLKNHRKEDPAAKYHFNNDAVAYEPITNYLDAF 73
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCE 144
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+R+ S+T+ G++
Sbjct: 74 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNRFNPSLSSTFRNNGQTYT 132
Query: 145 INYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGD 204
++YGSGS+S D V V ++VV +Q F + E + F + FDGI+G+ + +AVGD
Sbjct: 133 LSYGSGSLSVVLGYDTVTVQNIVVNNQEFGLSENEPNDPFYYSDFDGILGMAYPNMAVGD 192
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
A V M++QG ++ +FSF+ +R P + GGE++ GGVD + + G+ + PVT++ YW
Sbjct: 193 APTVMQGMLQQGQLTLPIFSFYFSRQPTRQYGGELILGGVDQQLYSGQIVWAPVTQELYW 252
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Q + + IG+Q+TG C GC AIVD+GT LLA P + A G E
Sbjct: 253 QIAIQEFAIGDQATGWCSQGCQAIVDTGTFLLAVPQQYMGSFLQATGAE 301
>gi|255713834|ref|XP_002553199.1| KLTH0D11264p [Lachancea thermotolerans]
gi|238934579|emb|CAR22761.1| KLTH0D11264p [Lachancea thermotolerans CBS 6340]
Length = 417
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 180/309 (58%), Gaps = 16/309 (5%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--------PL 77
N +G+ +LH ++ ++ + A +G LG ++D+L PL
Sbjct: 36 NDGSELGVMMSVANLHQKYLSQFSKAYPEVDFASHAGSGIGLGAVEQDVLSAMGGHDVPL 95
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
N+++AQYF EI +G+PPQ+F VI DTGSSNLWVPS +C S++C+ HS+Y S++Y
Sbjct: 96 SNYLNAQYFTEITLGTPPQSFKVILDTGSSNLWVPSDEC-GSLACFLHSKYSHDASSSYK 154
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G + I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+GLG+
Sbjct: 155 ANGTNFAIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILGLGY 214
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG-GEIVFGGVDPKHFKGKHTYV 256
IAV VP + GL+ E F+F+LN D+EE GE+ FGG+D +KG T++
Sbjct: 215 DTIAVDKVVPPVYKAINDGLLDEPRFAFYLNNADDSEESTGEVTFGGIDSSKYKGNITWL 274
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
PV +K YW+ + I +G++ + G A +D+GTSL+A P+ + +N IG +
Sbjct: 275 PVRRKAYWEVKFDGIGLGDEYAEL--EGTGAAIDTGTSLIALPSGLAEVLNAEIGAKKGW 332
Query: 314 -GVVSAECK 321
G + +C+
Sbjct: 333 SGQYTVDCE 341
>gi|57046|emb|CAA30082.1| unnamed protein product [Rattus norvegicus]
Length = 402
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 184/312 (58%), Gaps = 17/312 (5%)
Query: 6 LRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG--- 62
L ++ LW S LP + RI LKK + + R +ER + +S
Sbjct: 8 LWALLLLWTSCS-FSLPTDTASFGRILLKK-------MPSVREILEERGVDMTRISAEWG 59
Query: 63 --VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFS 119
++ + + L N++D QY+GEIGIG+P Q F VIFDTGS+NLWVPS+KC
Sbjct: 60 EFIKKSSFTNVTSPVVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLY 119
Query: 120 ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATRE 179
+C H+ Y S +S++Y E G I+YGSG + GF SQD V VG ++V Q F E T
Sbjct: 120 TACEIHNLYDSSESSSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTEL 178
Query: 180 GSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEI 239
+ F+LA+FDG++G+GF V +PV+D+++ Q ++ EEVFS + +R+ GGE+
Sbjct: 179 PLIPFMLAKFDGVLGMGFPAQVVDGVIPVFDHILSQRVLKEEVFSVYYSRESHL-LGGEV 237
Query: 240 VFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
V GG DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+GTS ++GP
Sbjct: 238 VLGGSDPQHYQGNFHYVSISKAGSWQITMKGVSLG-PATLLCEEGCMAVVDTGTSYISGP 296
Query: 300 TPVVTEINHAIG 311
T + I A+G
Sbjct: 297 TSSLQLIMQALG 308
>gi|313220508|emb|CBY31359.1| unnamed protein product [Oikopleura dioica]
gi|313229843|emb|CBY07548.1| unnamed protein product [Oikopleura dioica]
Length = 397
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 191/337 (56%), Gaps = 52/337 (15%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L S LA +L+P L++ + + +LHS A + E
Sbjct: 2 MLTSALLGMALADPILIP-----LKKTKMTRGIGNLHSKYRADVPTNE------------ 44
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI---- 120
L N+ DAQYFG + IG+P QNF+VIFDTGSSNLWVPSSKC I
Sbjct: 45 ------------LTNYFDAQYFGPLTIGTPAQNFTVIFDTGSSNLWVPSSKCDPHIGTGF 92
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV----GDVVVKDQVFIEA 176
+C H++Y S S+T+TE G EI YG+GS+ GF S D++++ G ++ K F EA
Sbjct: 93 ACLNHNKYDSDLSSTWTEDGTKFEIQYGTGSMVGFQSTDDIDIAPGSGGLIAKQATFAEA 152
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDP----D 232
E +TFL A FDGI+GL + I+V A P+++ ++E+G V+ VF+F+++R+ +
Sbjct: 153 VEEPGITFLAAAFDGIMGLAYPSISVNGATPIYNQLMEEGQVN-GVFAFFVHRNSSKPGE 211
Query: 233 AEEGGEIVFGGVDPKHFKGKHT----YVPVTKKGYWQFEL------GDILIGNQSTGVCE 282
++ GGEI +GGV+P+ F+G + V+++ YWQ + GD + +Q +CE
Sbjct: 212 SDIGGEIAWGGVNPERFEGTFPDSFIWHEVSRQAYWQVNMGTVTVNGDGFVSDQPIVMCE 271
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
GGC IVDSGTSL+ GPT + +IN AIG ++ E
Sbjct: 272 GGCQGIVDSGTSLITGPTEITDQINKAIGAIEFIAGE 308
>gi|193735605|gb|ACF20292.1| vacuolar protease A [Trichoderma aureoviride]
gi|226374420|gb|ACO52389.1| vacuolar protease A [Trichoderma aureoviride]
Length = 395
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 185/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD 69
++A+ L+ ++ G+ ++ L+K ++L+ S+ A ++YMG S D
Sbjct: 5 LIAAAALVGSAQAGVHKMKLQKVSLEQQLEGSSIEAQVQQLGQKYMGVRPTSRADVMFND 64
Query: 70 SDEDI-----LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
+ I +P+ NFM+AQYF EI IGSPPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 NLPKIKGGHPVPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S+TY + G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F
Sbjct: 124 HSTYDSSSSSTYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + MV Q L+ E VF+F+L +++EG FGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYLG---NSDEGSVATFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D HF GK Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ +
Sbjct: 241 DESHFSGKIEYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGIAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNAEIGAK 307
>gi|351707611|gb|EHB10530.1| Renin [Heterocephalus glaber]
Length = 397
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 186/315 (59%), Gaps = 16/315 (5%)
Query: 5 LLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKER--YMG--GAG 59
+LR F L + S LP + RRI LKK + + R + KER MG GA
Sbjct: 1 MLRWGFLLLLWGSYTFGLPTDTAAFRRIFLKK-------MPSVRDSLKERGVDMGRLGAK 53
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-F 118
RL D+ + L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC
Sbjct: 54 WGEFAKRLSDNSTSPVVLTNYLNTQYYGEIGIGTPPQAFKVIFDTGSANLWVPSTKCSPL 113
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
+C HS Y S +S++Y E G I YGSG + GF SQD V VG + V Q F E T
Sbjct: 114 YTACEIHSLYDSAESSSYIENGTEFSIRYGSGKVKGFLSQDVVTVGGITVT-QTFGEVTE 172
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ F+LA+FDG++G+GF AVG PV+D+++ Q ++ E+VFS + +RD G
Sbjct: 173 LPLIPFMLAKFDGVLGMGFPAQAVGGITPVFDHILSQRVLKEDVFSVYYSRDSHLLGGEL 232
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ G DP+H++G YV ++K G WQ + + +G +T +CE GC A+VD+G S ++G
Sbjct: 233 LLGGS-DPQHYQGNFHYVSISKSGSWQITMKGVSVGF-ATLLCEEGCMAVVDTGASYISG 290
Query: 299 PTPVVTEINHAIGGE 313
PT + I A+G +
Sbjct: 291 PTSSLRLIMEALGAK 305
>gi|871442|emb|CAA25391.1| renin [Mus musculus]
Length = 387
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/240 (46%), Positives = 159/240 (66%), Gaps = 5/240 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
L N+++ QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC ++C HS Y+S S++
Sbjct: 58 LTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDSSS 117
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDV--VVKDQVFIEATREGSLTFLLARFDGII 193
Y E G I+YGSG + GF SQD+V V V + Q F E T + F+LA+FDG++
Sbjct: 118 YMENGSDFTIHYGSGRVKGFLSQDSVTVSRVGGITVTQTFGEVTELPLIPFMLAKFDGVL 177
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G+GF AVG PV+D+++ QG++ EEVFS + NR GGE+V GG DP+H++G
Sbjct: 178 GMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGSHL-LGGEVVLGGSDPQHYQGNF 236
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YV ++K WQ + + +G+ ST +CE GCA +VD+G+S ++ PT + I A+G +
Sbjct: 237 HYVSISKTDSWQITMKGVSVGS-STLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALGAK 295
>gi|126309849|ref|XP_001370462.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 104/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H ++ KS+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
>gi|198475392|ref|XP_001357030.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
gi|198138802|gb|EAL34096.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
Length = 401
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 181/314 (57%), Gaps = 18/314 (5%)
Query: 13 WVLASCLLLPASSNGLRRIGLKK---RRLDLHSLNAARITRK------ERYM----GGAG 59
W + C+L AS+ L+RI + K +R H R R E Y+ G
Sbjct: 4 WFVLLCVLALASAE-LQRIKIHKSEHKRSRHHVRQEVRSLRHKYQQLIENYVVYDYGQPD 62
Query: 60 VSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
+ D L N M+ Y+G+I IG+PPQ F+V+FDTGSSNLW+PS++C +
Sbjct: 63 YGNDYPSNSEPDYTTEELGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLST 122
Query: 120 -ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
++C H++Y + S+TY ++ I YG+GS++G+ + D V + + + +Q F EA
Sbjct: 123 DVACQQHNQYNASASSTYVANSQNFSIQYGTGSVTGYLAMDTVTINGLAIANQTFGEAVS 182
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
+ +F FDGI+G+G++ IAV VP + N+ EQGL+ E F F+L R+ +EEGG+
Sbjct: 183 QPGSSFTDVAFDGILGMGYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSEEGGQ 242
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++ GGVD G TYVPV+++GYWQF + + I T +C+ GC AI D+GTSLLA
Sbjct: 243 LLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN--ISWNGTVLCD-GCQAIADTGTSLLAC 299
Query: 299 PTPVVTEINHAIGG 312
P V T+IN IG
Sbjct: 300 PQAVYTQINQLIGA 313
>gi|156846613|ref|XP_001646193.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
gi|156116867|gb|EDO18335.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
Length = 402
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 154/239 (64%), Gaps = 3/239 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+P Q+F VI DTGSSNLWVPS C S++CY H++Y S+
Sbjct: 79 IPLSNYLNAQYYTDITLGTPAQSFKVILDTGSSNLWVPSVDCN-SLACYLHAKYDHSDSS 137
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G + I YGSGS+ G+ SQD +++GD+V+ Q F EAT E L F +FDGI+G
Sbjct: 138 TYKKNGTTFSIQYGSGSMEGYISQDVLQIGDLVIPGQDFAEATSEPGLAFAFGKFDGILG 197
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + IAV VP + N + + LV E +FSF+L D +E+GG++ FGG D F G T
Sbjct: 198 LAYDTIAVNRVVPPFYNAINKKLVDEPIFSFYLGDDTKSEDGGQVTFGGYDSSLFTGDIT 257
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV +K YW+ + I +GN+ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 258 WLPVRRKAYWEVKFDAIALGNEVADLVNHGAA--IDTGTSLITLPSGLAEVINSQIGAK 314
>gi|22218078|dbj|BAC07516.1| pepsinogen III [Oryctolagus cuniculus]
Length = 387
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 171/296 (57%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
>gi|225681688|gb|EEH19972.1| cathepsin D [Paracoccidioides brasiliensis Pb03]
Length = 349
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/238 (46%), Positives = 155/238 (65%), Gaps = 5/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ NF++AQYF EI IG+PPQ F V+ DTGSSNLWVPS++C SI+C+ H++Y S S+T+
Sbjct: 27 VDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSSTH 85
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ G I YGSGS+SGF SQD V +GD+ V +Q F EAT E L F RFDGI+GLG
Sbjct: 86 RKNGTEFTIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILGLG 145
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+V VP++ M+ Q L+ VF F+L N D D ++ E FGG+D HF G+ T
Sbjct: 146 YDSISVNHIVPLFYQMINQKLLDTPVFGFYLGNSDVDGDD-SEATFGGIDESHFTGELTT 204
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + ++ YW+ +L I+ GN+ + G I+D+GTSLLA P+ + +N IG +
Sbjct: 205 ISLRRRAYWEVDLDAIIFGNEMAELENTGV--ILDTGTSLLALPSTIAELLNKQIGAK 260
>gi|118344578|ref|NP_001072054.1| renin precursor [Takifugu rubripes]
gi|39540664|tpg|DAA01803.1| TPA: pro-renin [Takifugu rubripes]
gi|55771086|dbj|BAD69803.1| renin [Takifugu rubripes]
Length = 396
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 177/310 (57%), Gaps = 21/310 (6%)
Query: 13 WV-LASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD-S 70
W+ LA+ L SS LRRI LH + + R T E G V V + + S
Sbjct: 8 WMSLAALSLALTSSQALRRI-------TLHKMPSIRETLGEM---GVSVEQVLSEMAEKS 57
Query: 71 DEDIL------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCY 123
D+ PL N++D QYFGEI IGSP Q F+V+FDTGS+NLWVPS C FS +C+
Sbjct: 58 AGDVFNKTVPTPLTNYLDTQYFGEISIGSPAQMFNVVFDTGSANLWVPSQSCSPFSTACF 117
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+RY + KS T+ E G I Y SG++ GF S+D V V + QVF EAT ++
Sbjct: 118 THNRYDASKSQTHVENGTGFSIQYASGNVRGFLSEDVVVV-GGIPVIQVFAEATSLSAMP 176
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDG++G+G+ +A+ PV+D ++ Q ++ EEVFS + +RDP GGE+V GG
Sbjct: 177 FVFAKFDGVLGMGYPNMAIDGITPVFDRIMSQHVLKEEVFSIYYSRDPKHSPGGELVLGG 236
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
DP ++ G Y+ + G W+ + + +G + C GC A++D+G+S + GP V
Sbjct: 237 TDPNYYTGSFNYMGTRETGKWEITMKGVSVGMEMM-FCTEGCTAVIDTGSSYITGPASSV 295
Query: 304 TEINHAIGGE 313
+ + IG +
Sbjct: 296 SLLMKTIGAQ 305
>gi|198451348|ref|XP_001358330.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
gi|198131448|gb|EAL27468.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
Length = 393
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 174/295 (58%), Gaps = 10/295 (3%)
Query: 43 LNAARITRKERYMGGAGVSGVRHRLGDSDEDIL------PLKNFMDAQYFGEIGIGSPPQ 96
L A+ + + ++ G + ++ L +S+ PL N ++ +Y G I IG+P Q
Sbjct: 31 LQASFMATRRQHRAGKQLLYAKYNLANSEASQSSGGASEPLDNRLNLEYAGPISIGTPRQ 90
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+++FDTGS+NLWVPS++C +++C H RY + S+++ G+ I YG+GS+SG
Sbjct: 91 PFNMLFDTGSANLWVPSAECSARNVACQHHHRYNASASSSHVPDGRRFAIAYGTGSLSGR 150
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
+QD V VG +VV++Q F A E TF+ F GI+GL FR IA A P++ NM +Q
Sbjct: 151 LAQDTVSVGRLVVQNQTFGMAIHEPGSTFVDTNFAGIVGLAFRSIAEQQATPLFQNMCDQ 210
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ + +
Sbjct: 211 GLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVG 270
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 271 KTI---SQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
>gi|322700747|gb|EFY92500.1| vacuolar protease A [Metarhizium acridum CQMa 102]
Length = 395
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/264 (45%), Positives = 170/264 (64%), Gaps = 11/264 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+P+ NFM+AQYF EI IGSPPQ+F V+ DTGSSNLWVPS C SI+CY HS Y S S+
Sbjct: 75 VPVSNFMNAQYFSEITIGSPPQSFKVVLDTGSSNLWVPSQSCN-SIACYLHSTYDSSSSS 133
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G S EI YGSGS+SGF SQD V +GD+ ++ Q F EAT E L F +FDGI+G
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVSIGDLKIEHQDFAEATSEPGLAFAFGKFDGILG 193
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ ++V VP + M++Q L+ E VF+F+L EEG E VFGG+D H+ G+
Sbjct: 194 LGYDTLSVNKIVPPFYQMIDQKLLDEPVFAFYLGS---KEEGSEAVFGGIDKNHYTGELE 250
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+P+ +K YW+ ++ I +G++ + G AI+D+GTSL P+ + +N IG +
Sbjct: 251 YLPLRRKAYWEVDINSIALGDEIAELDHTG--AILDTGTSLNVLPSTLAELLNKEIGAKK 308
Query: 314 ---GVVSAECKLVVSQYGDLIWDL 334
G + +C + S D++++L
Sbjct: 309 SWNGQYTVDCDKIKS-LPDIVFNL 331
>gi|444706374|gb|ELW47716.1| Renin [Tupaia chinensis]
Length = 401
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/299 (42%), Positives = 183/299 (61%), Gaps = 20/299 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH-- 65
+ LW SC LPA +NG +RI LKK + + R + KER A + +
Sbjct: 7 LLVLW--GSCTFGLPADANGFQRIFLKK-------MPSVRESLKERGADAARLVAKWNLS 57
Query: 66 ---RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSIS 121
LG+S ++ L N++D QY+GEIGIG+P Q F V+FDTGS+NLWVPS+KC +
Sbjct: 58 KTLSLGNSTSPVV-LTNYLDTQYYGEIGIGTPAQTFKVVFDTGSANLWVPSTKCSPLYTA 116
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C HS Y S +S++Y E G I+YGSG + GF SQD V VG + V Q F E T
Sbjct: 117 CEIHSLYDSSESSSYMENGTEFAIHYGSGKVRGFLSQDVVTVGGITVT-QTFGEVTELPV 175
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+LA+FDG++G+G AVG PV+D+++ Q ++ E+VFS + +++ GGEIV
Sbjct: 176 IPFMLAKFDGVLGMGLPAQAVGGVTPVFDHILSQRVLKEDVFSVYYSKNSHV-LGGEIVL 234
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DP++++G YV V+ G WQ ++ + + +T +CE GC A+VD+GTS ++GPT
Sbjct: 235 GGSDPQYYQGHFHYVSVSSTGSWQVKMKGVSV-RSATLLCENGCMAVVDTGTSYISGPT 292
>gi|195144214|ref|XP_002013091.1| GL23572 [Drosophila persimilis]
gi|194102034|gb|EDW24077.1| GL23572 [Drosophila persimilis]
Length = 393
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/295 (40%), Positives = 174/295 (58%), Gaps = 10/295 (3%)
Query: 43 LNAARITRKERYMGGAGVSGVRHRLGDSDEDIL------PLKNFMDAQYFGEIGIGSPPQ 96
L A+ + + ++ G + ++ L +S+ PL N ++ +Y G I IG+P Q
Sbjct: 31 LQASFMATRRQHRAGKQLLYAKYNLANSEASQSSGGASEPLDNRLNLEYAGPISIGTPRQ 90
Query: 97 NFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
F+++FDTGS+NLWVPS++C +++C H RY + S+++ G+ I YG+GS+SG
Sbjct: 91 PFNMLFDTGSANLWVPSAECSARNVACQHHHRYNASASSSHVPDGRRFAIAYGTGSLSGR 150
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
+QD V VG +VV++Q F A E TF+ F GI+GL FR IA A P++ NM +Q
Sbjct: 151 LAQDTVSVGRLVVQNQTFGMAIHEPGSTFVDTNFAGIVGLAFRSIAEQHATPLFQNMCDQ 210
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GLV + VFSF+L R+ A++GGE++FGG+D F TYVP+T GYWQF++ + +
Sbjct: 211 GLVDQCVFSFYLKRNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVG 270
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
++ G AIVD+GTSLLA P IN +GG S E L S G L
Sbjct: 271 KTI---SQGRQAIVDTGTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRL 322
>gi|291409613|ref|XP_002721073.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/304 (41%), Positives = 176/304 (57%), Gaps = 26/304 (8%)
Query: 28 LRRIGLKKRRLDLHSLN-AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYF 86
L GL K L H+ N A + KE + A VS L+N++D +YF
Sbjct: 32 LIEKGLLKDYLKTHTPNLATKYFPKETF---ASVSTES------------LENYLDTEYF 76
Query: 87 GEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEIN 146
G I IG+PPQ+F+VIFDTGSSNLWVPS+ C S +C H+R+ S+T+ ++ I
Sbjct: 77 GTISIGTPPQDFTVIFDTGSSNLWVPSTYCS-SAACTVHNRFNPDDSSTFQATSETLSIT 135
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS++G D V VG + +Q+F + T GS + A FDGI+GL + I+ DA
Sbjct: 136 YGTGSMTGILGYDTVNVGSIEDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISASDA 194
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM +GLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 195 TPVFDNMWNEGLVSQDLFSVYLSS--DDESGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQ 252
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECK 321
L I + + T C GC AIVD+GTSLLAGPT ++ I IG EG + C
Sbjct: 253 ITLDSITMDGE-TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENYEGEMIVSCS 311
Query: 322 LVVS 325
+ S
Sbjct: 312 SMYS 315
>gi|195159706|ref|XP_002020719.1| GL15694 [Drosophila persimilis]
gi|194117669|gb|EDW39712.1| GL15694 [Drosophila persimilis]
Length = 401
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 185/318 (58%), Gaps = 19/318 (5%)
Query: 9 VFCLWVLASCLLLPASSNGLRRIGLKK---RRLDLHSLNAARITRK------ERYM---- 55
+F L+VL C+L AS+ L+RI + K +R H R R E Y+
Sbjct: 1 MFKLFVLL-CVLALASAE-LQRIKIHKSEHKRSRHHVRQEVRSLRHKYQQLIENYVVYDY 58
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
G + D L N M+ Y+G+I IG+PPQ F+V+FDTGSSNLW+PS++
Sbjct: 59 GQPDYGNDYPSNSEPDYTTEELGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQ 118
Query: 116 CYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFI 174
C + ++C H++Y + S+TY ++ I YG+GS++G+ + D V + + + +Q F
Sbjct: 119 CLSTDVACQQHNQYNASASSTYVANSQNFSIQYGTGSVTGYLATDTVTINGLAIANQTFG 178
Query: 175 EATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE 234
EA + +F FDGI+G+G++ IAV VP + N+ EQGL+ E F F+L R+ +E
Sbjct: 179 EAVSQPGSSFTDVAFDGILGMGYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSE 238
Query: 235 EGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTS 294
EGG+++ GGVD G TYVPV+++GYWQF + + I T +C+ GC AI D+GTS
Sbjct: 239 EGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNN--ISWNGTVLCD-GCQAIADTGTS 295
Query: 295 LLAGPTPVVTEINHAIGG 312
LLA P V T+IN IG
Sbjct: 296 LLACPQAVYTQINQLIGA 313
>gi|283806594|ref|NP_001164550.1| pepsin-3 precursor [Oryctolagus cuniculus]
gi|129783|sp|P27822.1|PEPA3_RABIT RecName: Full=Pepsin-3; AltName: Full=Pepsin A; AltName:
Full=Pepsin III; Flags: Precursor
gi|165598|gb|AAA85370.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 122/296 (41%), Positives = 171/296 (57%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y+ A V L+N++D +YFG
Sbjct: 32 LIEKGLLKDYLKTHTPNLAT-----KYLPKAAFDSVPTET---------LENYLDTEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+++ S+T+ +S I Y
Sbjct: 78 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++GF D V+VG++ +Q+F + E A FDGI+GL + I+ DA P
Sbjct: 137 GTGSMTGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM +GLVSE++FS +L+ D E G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 VFDNMWNEGLVSEDLFSVYLSSDD--ESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
L I + + T C C AIVD+GTSLLAGPT ++ I IG GE +VS
Sbjct: 255 LDSITMDGE-TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVS 309
>gi|224458280|ref|NP_001138943.1| gastricsin precursor [Pongo abelii]
gi|222425206|dbj|BAH20552.1| pepsinogen C [Pongo abelii]
Length = 388
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 116/283 (40%), Positives = 169/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + +H R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKHDPAWKYRFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
>gi|345802472|ref|XP_854465.2| PREDICTED: pepsin B-like [Canis lupus familiaris]
Length = 390
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 168/276 (60%), Gaps = 3/276 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-PLKNFMDA 83
S G+ RI LKK + + + R + V L ++D P N++++
Sbjct: 14 SEGVERIILKKGK-SIRQVMEERGVLETFLRNHPKVDPAAKYLFNNDAVAYEPFTNYLNS 72
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY G++
Sbjct: 73 YYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNTFNPSSSSTYRNNGQTY 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+ + +AVG
Sbjct: 132 TLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYANFDGILGMAYPNLAVG 191
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
D+ V +MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ + PVT++ Y
Sbjct: 192 DSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVWAPVTREMY 251
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP 299
WQ + + L+ NQ+TG+C GC AIVD+GT +LA P
Sbjct: 252 WQVAIDEFLVNNQATGLCSQGCQAIVDTGTYVLAVP 287
>gi|253762215|gb|ACT35559.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 163/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
>gi|441648777|ref|XP_003266334.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Nomascus leucogenys]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 115/283 (40%), Positives = 169/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
+ L+ + R T KE+ + G + ++ GD P+ +MDA YFGE+
Sbjct: 20 KXPLNEFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEVS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ KS+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ ARFDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFIYARFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFSF+L+ + GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSFYLSNQ-EGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
>gi|193499293|gb|ACF18589.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 114/251 (45%), Positives = 163/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
>gi|126309851|ref|XP_001370482.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H ++ +S+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSQSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
>gi|256274192|gb|EEU09100.1| Pep4p [Saccharomyces cerevisiae JAY291]
Length = 405
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 111/265 (41%), Positives = 166/265 (62%), Gaps = 9/265 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 314 ----GVVSAECKLVVSQYGDLIWDL 334
G + +C DLI++L
Sbjct: 318 KGWTGQYTLDCN-TRDNLPDLIFNL 341
>gi|410045159|ref|XP_001145764.3| PREDICTED: pepsin A-5 isoform 1 [Pan troglodytes]
Length = 434
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 178/312 (57%), Gaps = 24/312 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNFNPA-----SKYFPQWEAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY K+ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSKTVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCS- 312
Query: 323 VVSQYGDLIWDL 334
+S D+++ +
Sbjct: 313 AISSLPDIVFTI 324
>gi|12248414|dbj|BAB20092.1| pepsinogen A [Rana catesbeiana]
Length = 385
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 174/312 (55%), Gaps = 27/312 (8%)
Query: 9 VFCLWVLASCLLLPASSNG-------LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS 61
+F L VLA C ++ S L R+GL L H N A + + A S
Sbjct: 6 LFGLVVLAECGVVKVSLRKGESLRARLNRLGLLGDYLKKHHYNPA----TKYFPSLAQAS 61
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSIS 121
G PL+N+MD +YFG I IG+PPQ+F+VIFDTGSSNLWVPS C S +
Sbjct: 62 GE------------PLQNYMDIEYFGTISIGTPPQSFTVIFDTGSSNLWVPSVYCS-SPA 108
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C H + ++S+T+ I YG+GS+SGF D V+VG++ + +Q+F + E
Sbjct: 109 CTNHHMFNPQQSSTFQATNTPVSIQYGTGSMSGFLGYDTVQVGNIQITNQIFGLSQSEPG 168
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ FDGI+GL F +A A PV+DNM QGL+ +++FS +L+ + G ++F
Sbjct: 169 SFLYYSPFDGILGLAFPSLASSQATPVFDNMWNQGLIPQDLFSVYLSS--QGQSGSFVLF 226
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
GGVD ++ G +VP+T + YWQ + I IG Q C G C+AIVD+GTSLLAGP+
Sbjct: 227 GGVDTSYYTGNLNWVPLTAETYWQITVDSISIGGQVIA-CSGSCSAIVDTGTSLLAGPST 285
Query: 302 VVTEINHAIGGE 313
+ I + IG
Sbjct: 286 PIANIQYYIGAN 297
>gi|1246038|gb|AAB35842.1| pepsinogen A [turtles, Peptide, 361 aa]
Length = 361
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 107/238 (44%), Positives = 153/238 (64%), Gaps = 4/238 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MDA+YFG I IG+P Q+F+V+FDTGSSNLWVPS C S +C H+R+ S+T
Sbjct: 50 PLTNYMDAEYFGTISIGTPAQDFTVVFDTGSSNLWVPSVTCS-SAACTQHNRFNPSDSST 108
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y ++ I YG+GS++G DNV+VG +V +Q+F + E TF A DGI+GL
Sbjct: 109 YRATSQNLSIQYGTGSMTGILGYDNVQVGGLVDTNQIFGLSETEPGSTFYYAPMDGILGL 168
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ IA A PV+DNM+ +GLVS+++FS +L+ D + G ++FGG D ++ G +
Sbjct: 169 AYPSIASSGATPVFDNMMSEGLVSQDLFSVYLSSD--EQSGSFVMFGGNDTSYYSGSLNW 226
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P++ + YW+ + I + Q T C GGC AI+D+GTSLLAGP V+ IN IG
Sbjct: 227 IPLSAETYWEITMDSITMNGQ-TIACSGGCQAIIDTGTSLLAGPPSDVSNINSYIGAS 283
>gi|401881725|gb|EJT46014.1| endopeptidase [Trichosporon asahii var. asahii CBS 2479]
Length = 528
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 115/255 (45%), Positives = 161/255 (63%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 192 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 247
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +FDGI+G
Sbjct: 248 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 307
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 308 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 365
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
Y+PV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 366 YIPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 423
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 424 SWNGQYTVDCSKVPS 438
>gi|354493821|ref|XP_003509038.1| PREDICTED: gastricsin-like [Cricetulus griseus]
gi|344238302|gb|EGV94405.1| Gastricsin [Cricetulus griseus]
Length = 391
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 178/304 (58%), Gaps = 7/304 (2%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGD-SD 71
W++ + L LP L R+ LKK + ++ + K+ ++R G+ D
Sbjct: 3 WLVVALLCLPLLEAALVRVPLKKMKTIRQNMKEKGV-LKDFLKTHKYDPAQKYRFGNFGD 61
Query: 72 EDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
+L P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H RY
Sbjct: 62 FSVLYEPIA-YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SEACTTHPRYN 119
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A F
Sbjct: 120 PNKSSTYYTEGQTFSLQYGTGSLTGFFGYDTLTVQGIQVPNQEFGLSENEPGTNFVYADF 179
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GL + ++ G A ++++G +S+ +F +L GG+IVFGGVD +
Sbjct: 180 DGIMGLAYPGLSAGGATTAMQGLLQEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLY 238
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIG+Q +G C GCA IVD+GTSLL P+ ++++
Sbjct: 239 TGEITWIPVTQELYWQITIDDFLIGDQVSGWCSQGCAGIVDTGTSLLTMPSQYLSDLLQT 298
Query: 310 IGGE 313
IG +
Sbjct: 299 IGAQ 302
>gi|406701140|gb|EKD04292.1| endopeptidase [Trichosporon asahii var. asahii CBS 8904]
Length = 824
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 161/255 (63%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 226 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 281
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +FDGI+G
Sbjct: 282 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 341
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 342 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 399
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 400 YVPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 457
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 458 SWNGQYTVDCSKVPS 472
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/255 (44%), Positives = 157/255 (61%), Gaps = 12/255 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQY+ I IG+PPQ F V+ DTGSSNLWVPS +C SI+C+ +Y + +S+
Sbjct: 527 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 582
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ GF S+D +E+ + VKDQ+F EAT+E + F+ +F G
Sbjct: 583 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFTVSFG 642
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V P + NM++Q L+ E+VFSF L D +GGE +FGG D K K
Sbjct: 643 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRLGSSED--DGGECIFGGYDKKWSDEKPI 700
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
YVPV +KGYW+ EL I G++ + G A +D+GTSL+A PT + +N IG E
Sbjct: 701 YVPVRRKGYWEVELEGIKFGDEELPLENTGAA--IDTGTSLIALPTDIAEILNKEIGAEK 758
Query: 314 ---GVVSAECKLVVS 325
G + +C V S
Sbjct: 759 SWNGQYTVDCSKVPS 773
>gi|444316168|ref|XP_004178741.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
gi|387511781|emb|CCH59222.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
Length = 413
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 111/251 (44%), Positives = 160/251 (63%), Gaps = 8/251 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+M+AQYF +I IG+PPQ+F V+ DTGSSNLWVPS +C S++CY HS+Y +S+
Sbjct: 89 VPLSNYMNAQYFADIKIGTPPQSFKVVLDTGSSNLWVPSKEC-GSLACYLHSKYNHDESS 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G + I YGSGS+ G+ SQD +E+GD+ + Q F EAT E ++F +FDGI+G
Sbjct: 148 TYKANGSAFAIQYGSGSLEGYISQDVMEIGDLKITKQDFAEATSEPGISFAFGKFDGILG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + IAV VP N + QGL+ E F+F+L + + GGE VFGG+D F+G
Sbjct: 208 LAYDTIAVNRVVPPVYNAINQGLLDEPKFAFYLGDASKSKDNGGEAVFGGIDETKFEGDI 267
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ +L + +G + T + G A +D+GTSL+ P+ + IN IG +
Sbjct: 268 TWLPVRRKAYWEVKLEGLGLGEEYTELENHGAA--IDTGTSLITLPSGLAEIINSEIGAK 325
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 326 KGWTGQYTIEC 336
>gi|254583898|ref|XP_002497517.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
gi|238940410|emb|CAR28584.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
Length = 418
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 162/251 (64%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ E+ +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 95 VPLTNYLNAQYYTEVSLGTPPQNFKVILDTGSSNLWVPSTECS-SLACFLHSKYDHDSSS 153
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+G
Sbjct: 154 SYKPNGTEFAIRYGSGSLEGYISQDTLNLGDLSITKQDFAEATSEPGLQFAFGKFDGILG 213
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
LG+ I+V VP + N +QGL+ E F+F+L RD ++++GG FGGVD ++G+ T
Sbjct: 214 LGYDTISVDGVVPPFYNAWKQGLLDEPKFAFYLGRDGESQDGGVATFGGVDDSKYEGEIT 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++P+ +K YW+ + I +G + + G A +D+GTSL+A P+ + IN IG +
Sbjct: 274 WLPIRRKAYWEVKFDGIGLGEEYAELENHGAA--IDTGTSLIALPSGLAEIINAEIGAKK 331
Query: 314 ---GVVSAECK 321
G + EC+
Sbjct: 332 SWTGQYTVECE 342
>gi|194764262|ref|XP_001964249.1| GF20814 [Drosophila ananassae]
gi|190619174|gb|EDV34698.1| GF20814 [Drosophila ananassae]
Length = 405
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 113/231 (48%), Positives = 145/231 (62%), Gaps = 5/231 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N+ + QY+G I IG+P QNF V FDTGSSNLW+PSS+C S SC H+RY S +S+TY
Sbjct: 68 LSNYDNFQYYGSINIGTPGQNFQVQFDTGSSNLWIPSSQCT-SSSCMVHTRYSSYQSSTY 126
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G I YG+GS+SGF SQD V V +V+++Q F E T E FL A FDGI+GL
Sbjct: 127 KSNGSIFNITYGTGSVSGFMSQDVVSVAGLVIRNQTFGEVTSESGSNFLNASFDGILGLA 186
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F +AV P + N++ Q +V + VFSF+L N GGE++ GG DPK ++GK TY
Sbjct: 187 FPMLAVNLVTPFFQNLISQKVVQQPVFSFYLRNNGTTVTYGGELILGGSDPKLYRGKLTY 246
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
VPV+ YWQF I +GN + G AAI D+GTSLL P T+I
Sbjct: 247 VPVSYPAYWQFYTDSIQMGNT---LISTGDAAIADTGTSLLVAPQAEYTQI 294
>gi|50978822|ref|NP_001003117.1| pepsin A preproprotein [Canis lupus familiaris]
gi|73621384|sp|Q9GMY6.1|PEPA_CANFA RecName: Full=Pepsin A; Flags: Precursor
gi|9798660|dbj|BAB11752.1| pepsinogen A [Canis lupus familiaris]
Length = 386
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 108/238 (45%), Positives = 154/238 (64%), Gaps = 6/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
LKN+MD +YFG IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY
Sbjct: 66 LKNYMDMEYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQESSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL
Sbjct: 125 QGTNRPVSIAYGTGSMTGILGYDTVQVGGIADTNQIFGLSETEPGSFLYY-APFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +I+ A PV+DNM +GLVS+++FS +L+ D + G ++FGG+D ++ G +
Sbjct: 184 AYPQISASGATPVFDNMWNEGLVSQDLFSVYLSS--DDQSGSVVMFGGIDSSYYSGNLNW 241
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VPV+ +GYWQ + + + Q+ C GC AIVD+GTSLLAGPT + I IG
Sbjct: 242 VPVSVEGYWQITVDSVTMNGQAIA-CSDGCQAIVDTGTSLLAGPTNAIANIQSYIGAS 298
>gi|340518711|gb|EGR48951.1| predicted protein [Trichoderma reesei QM6a]
Length = 395
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 180/300 (60%), Gaps = 15/300 (5%)
Query: 23 ASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI---- 74
++ G+ ++ L+K ++L+ S+ A ++YMG S D +
Sbjct: 14 SAQAGIHKMKLQKVSLEQQLEGSSIEAHVQQLGQKYMGVRPTSRAEVMFNDKPPKVQGGH 73
Query: 75 -LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
+P+ NFM+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS C SI+C+ HS Y S S
Sbjct: 74 PVPVTNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQSCN-SIACFLHSTYDSSSS 132
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G EI+YGSGS++GF S D V +GD+ +K Q F EAT E L F RFDGI+
Sbjct: 133 STYKPNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAFAFGRFDGIL 192
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
GLG+ I+V VP + MV Q L+ E VF+F+L ++EG E VFGGVD H++GK
Sbjct: 193 GLGYDTISVNGIVPPFYQMVNQKLIDEPVFAFYLGS---SDEGSEAVFGGVDDAHYEGKI 249
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
Y+P+ +K YW+ +L I G++ + G AI+D+GTSL P+ + +N IG +
Sbjct: 250 EYIPLRRKAYWEVDLDSIAFGDEVAELENTG--AILDTGTSLNVLPSGLAELLNAEIGAK 307
>gi|190576563|gb|ACE79054.1| gastricsin precursor (predicted) [Sorex araneus]
Length = 389
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 192/343 (55%), Gaps = 31/343 (9%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++ GD P+ ++DA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 53 KYHFGDFSVAYEPMA-YLDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 110
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ KS+TY+ G++ + YGSGS++GFF D + + ++ V Q F + E
Sbjct: 111 GHARFNPSKSSTYSTNGQTFSLQYGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGDN 170
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
F+ A+FDGI+G+ + +A+G A M++ G + VFSF+L+ +++GG +VFGG
Sbjct: 171 FVYAQFDGIMGMAYPTLAMGGATTALQGMLQAGALDSPVFSFYLSNQQSSQDGGAVVFGG 230
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + LIG Q+TG C GC AIVD+GTSLL P +
Sbjct: 231 VDNSLYTGQIFWTPVTQELYWQIGVEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYM 290
Query: 304 TEINHAIGGEGVVSAECKLVVSQYGDLIWD-----------LLVSG----LLPEK-VCQQ 347
+ + A G + + QYG ++ + +++G LLP V
Sbjct: 291 SALQQATGAQ----------LDQYGQMVVNCNNIQNLPTLTFVINGVQFPLLPSAYVLNN 340
Query: 348 IGLCAFNGAEYVRLGIPITRVLFVL-NVRL-AMQLVYSLGSCR 388
G C G E L P + L++L +V L + VY +G+ R
Sbjct: 341 NGYCTL-GVEPTYLPSPTGQPLWILGDVFLRSYYSVYDMGNNR 382
>gi|197247086|gb|AAI65335.1| Nots protein [Danio rerio]
Length = 416
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 163/246 (66%), Gaps = 6/246 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C S +C H+++K+ +S+TY
Sbjct: 78 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VSQACALHNKFKAFESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 137 THDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D+M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++
Sbjct: 197 FPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWI 255
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCE---GGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT+KGYWQ +L + + + C GC AIVD+GTSL+ GP + + IG
Sbjct: 256 PVTQKGYWQIKLDAVKV-QGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGAT 314
Query: 314 GVVSAE 319
+ E
Sbjct: 315 PTANGE 320
>gi|56269596|gb|AAH86835.1| Nots protein [Danio rerio]
Length = 443
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 111/246 (45%), Positives = 163/246 (66%), Gaps = 6/246 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C S +C H+++K+ +S+TY
Sbjct: 105 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VSQACALHNKFKAFESSTY 163
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 164 THDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 223
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D+M+EQ ++ + VFSF+L + + GGE+VFGG+D F ++
Sbjct: 224 FPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGGMDESRFLPPINWI 282
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCE---GGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT+KGYWQ +L + + + C GC AIVD+GTSL+ GP + + IG
Sbjct: 283 PVTQKGYWQIKLDAVKV-QGALSFCYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGAT 341
Query: 314 GVVSAE 319
+ E
Sbjct: 342 PTANGE 347
>gi|327270926|ref|XP_003220239.1| PREDICTED: embryonic pepsinogen-like [Anolis carolinensis]
Length = 382
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 164/284 (57%), Gaps = 13/284 (4%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--------PLKNFMDAQYFGE 88
R+ L R T KE + + + R+ +G +L PL N++D +Y+G
Sbjct: 19 RIPLQRGKKGRNTLKENGLLDSFLKEHRYDIGSKYRPMLEAAEVAGEPLMNYLDTEYYGT 78
Query: 89 IGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
I IG+PPQ F+V+FDTGSSNLWVPS+ C C H R+ +S+T+ ++ I YG
Sbjct: 79 INIGTPPQAFTVVFDTGSSNLWVPSTYCS-DAPCQNHPRFDPSQSSTFENTQQTMSIQYG 137
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+GS+ G D + V + V Q F ++ E + F FDGI+GLG+ IAV D PV
Sbjct: 138 TGSMQGILGYDTLTVTGITVPKQEFALSSSEPGVFFTYVPFDGILGLGYPSIAVSDVTPV 197
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+DNM+ +GLV E +FS +L R G I FGG+D ++ G ++PVT++GYWQ EL
Sbjct: 198 FDNMMNEGLVQENLFSVYLGR---GGTGSIITFGGIDESYYTGSINWIPVTEQGYWQIEL 254
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
IL+ ++ C GC AIVD+GTSL+AGP ++ + +AIG
Sbjct: 255 DSILVNGEAI-ACSDGCQAIVDTGTSLVAGPPSDISNLQNAIGA 297
>gi|349581664|dbj|GAA26821.1| K7_Pep4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 405
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
>gi|444725492|gb|ELW66056.1| Gastricsin [Tupaia chinensis]
Length = 389
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 168/286 (58%), Gaps = 18/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
++ GL K L H + A+ ++ D P+ +MDA YFG
Sbjct: 33 MKEKGLLKEFLRTHKYDPAQ----------------KYHFNDFSVAYEPMA-YMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C H R+ +S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTNHPRFNPSQSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ + +++G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPALSMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G+++ VFSF+L+ +E+GG ++FGGVD + G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQEGVLTSPVFSFYLSNQQGSEDGGAVIFGGVDNSLYSGQIYWAPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 IEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSTLLQATGAQ 300
>gi|344257339|gb|EGW13443.1| Napsin-A [Cricetulus griseus]
Length = 532
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/253 (43%), Positives = 155/253 (61%), Gaps = 4/253 (1%)
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPS---SKCYFSISCYFHSRYKSRKSNTYT 137
M+ QYFG+IG+G+PPQNF+V+FDTGSSNL S S S FH R+ + S+++
Sbjct: 1 MNTQYFGDIGLGTPPQNFTVVFDTGSSNLCSVSHRLSDPILSPELGFHRRFNPKASSSFR 60
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G I YGSG ++G SQDN+ +G++ F EA E S+ F LA FDGI+GLGF
Sbjct: 61 PNGTKLAIQYGSGQLTGILSQDNLTIGEIRGVSVTFGEALWESSMVFTLAHFDGILGLGF 120
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+AV P D MVEQGL+ + +FSF+LNRD + +GGE+V GG DP H+ T++P
Sbjct: 121 PSLAVDGVQPPLDAMVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPPLTFIP 180
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
VT YWQ + + +G +C GC I+D+GTSL+ GP+ + +N AIGG ++
Sbjct: 181 VTIPAYWQVHMESVNVGT-GLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIGGLPFLA 239
Query: 318 AECKLVVSQYGDL 330
+ + S+ +L
Sbjct: 240 GQYFIQCSKTPEL 252
>gi|343425806|emb|CBQ69339.1| probable PEP4-aspartyl protease [Sporisorium reilianum SRZ2]
Length = 419
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 158/253 (62%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 98 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 156
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G SQD +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 157 SYKKNGTEFKIQYGSGSMEGIVSQDTLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 216
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP + M++QGL+ SF+L E+GGE VFGG+D H+ GK
Sbjct: 217 LAYDTISVNGIVPPFYQMIDQGLLDSPQVSFYLGS--SEEDGGEAVFGGIDESHYSGKIH 274
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + E G AAI D+GTSL+A T +N IG
Sbjct: 275 WAPVKRKGYWEVALDKLALGDEELEL-ENGSAAI-DTGTSLIAMATDTAEILNAEIGATK 332
Query: 313 --EGVVSAECKLV 323
G S +C V
Sbjct: 333 SWNGQYSVDCDKV 345
>gi|6325103|ref|NP_015171.1| Pep4p [Saccharomyces cerevisiae S288c]
gi|115643|sp|P07267.1|CARP_YEAST RecName: Full=Saccharopepsin; AltName: Full=Aspartate protease;
Short=PrA; Short=Proteinase A; AltName:
Full=Carboxypeptidase Y-deficient protein 4; AltName:
Full=Proteinase YSCA; Flags: Precursor
gi|172122|gb|AAB63975.1| vacuolar proteinase A precursor [Saccharomyces cerevisiae]
gi|1370328|emb|CAA97859.1| PEP4 [Saccharomyces cerevisiae]
gi|1403555|emb|CAA65567.1| P2585 protein [Saccharomyces cerevisiae]
gi|151942645|gb|EDN60991.1| vacuolar proteinase A [Saccharomyces cerevisiae YJM789]
gi|190407806|gb|EDV11071.1| vacuolar proteinase A [Saccharomyces cerevisiae RM11-1a]
gi|259150002|emb|CAY86805.1| Pep4p [Saccharomyces cerevisiae EC1118]
gi|285815388|tpg|DAA11280.1| TPA: Pep4p [Saccharomyces cerevisiae S288c]
gi|323302701|gb|EGA56507.1| Pep4p [Saccharomyces cerevisiae FostersB]
gi|323331178|gb|EGA72596.1| Pep4p [Saccharomyces cerevisiae AWRI796]
gi|323346153|gb|EGA80443.1| Pep4p [Saccharomyces cerevisiae Lalvin QA23]
gi|323351977|gb|EGA84516.1| Pep4p [Saccharomyces cerevisiae VL3]
gi|365762755|gb|EHN04288.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295854|gb|EIW06957.1| Pep4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 405
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
>gi|222425180|dbj|BAH20539.1| pepsinogen A-43 [Pongo abelii]
Length = 388
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++ I Y
Sbjct: 79 SIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|401623301|gb|EJS41405.1| pep4p [Saccharomyces arboricola H-6]
Length = 405
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 157/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDSISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEFAELENHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
>gi|301030231|gb|ADK47877.1| cathepsin D [Triatoma infestans]
Length = 390
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/239 (45%), Positives = 154/239 (64%), Gaps = 3/239 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L+N + QY+G I +G+PPQ F+VIFDTGSSNLW+PS+ C S++C H+ Y +S+TY
Sbjct: 63 LRNSFNTQYYGNITLGTPPQEFTVIFDTGSSNLWIPSAVCS-SVACRVHNTYDHDRSSTY 121
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+ + YG+GSI+G S D +++GD+ VK+Q+F EA + F A+ DGI+GL
Sbjct: 122 QPDGRILRLTYGTGSIAGIMSSDVLQIGDLQVKNQLFGEALQVSDSPFARAKPDGILGLA 181
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF-KGKHTY 255
F IA AVP + NM++Q L+ + VFS +LNR+PD E GGEI+FGGVD + + K T
Sbjct: 182 FPSIAQDHAVPPFFNMIKQELLDKPVFSVYLNRNPDEEVGGEIIFGGVDEELYNKESMTT 241
Query: 256 VPVTKKGYWQFELGDILIGNQS-TGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VP+T YW F++ I + T C+ GC I D+GTS + GP+ V EI +G E
Sbjct: 242 VPLTSTSYWMFQMDGISTSAEDGTSWCQNGCPGIADTGTSFIVGPSSDVDEIMELVGAE 300
>gi|89111566|dbj|BAE80442.1| pepsinogen B isozyme [Canis lupus familiaris]
Length = 374
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 152/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P N++++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+T
Sbjct: 49 PFTNYLNSYYFGEISIGTPPQNFLVVFDTGSSNLWVPSTYCQ-SQACSNHNTFNPSSSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ + YGSGS++ D V V ++V+ +Q F + E S F A FDGI+G+
Sbjct: 108 YRNNGQTYTLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYANFDGILGM 167
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +AVGD+ V +MV+QG +++ +FSF+ +R P E GGE++ GGVD + + G+ +
Sbjct: 168 AYPNLAVGDSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQFYSGEIVW 227
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 228 APVTREMYWQVAIDEFLIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 285
>gi|73621385|sp|Q9GMY7.1|PEPA_RHIFE RecName: Full=Pepsin A; Flags: Precursor
gi|9798658|dbj|BAB11751.1| pepsinogen A [Rhinolophus ferrumequinum]
Length = 386
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 176/300 (58%), Gaps = 25/300 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL + L HS+N A KE A + + PL+N+MD +YFG
Sbjct: 32 LMEQGLLQDYLKTHSINPASKYLKE----AASMMATQ-----------PLENYMDMEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY + + Y
Sbjct: 77 TIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYQGTNQKLSVAY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + IA A
Sbjct: 136 GTGSMTGILGYDTVQVGGITDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSIASSGAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-FGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DN+ QGLVS+++FS +L+ + ++GG +V FGG+D +F G +VP++ + YWQ
Sbjct: 195 PVFDNIWNQGLVSQDLFSVYLSSN---DQGGSVVMFGGIDSSYFTGNLNWVPLSSETYWQ 251
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
+ I + Q C G C AIVD+GTSLL+GPT + I IG +A ++VVS
Sbjct: 252 ITVDSITMNGQVI-ACSGSCQAIVDTGTSLLSGPTNAIASIQGYIGASQ--NANGEMVVS 308
>gi|290974880|ref|XP_002670172.1| predicted protein [Naegleria gruberi]
gi|284083728|gb|EFC37428.1| predicted protein [Naegleria gruberi]
Length = 388
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 152/240 (63%), Gaps = 6/240 (2%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKS 133
I+PLK++ D +Y+GEI IG+P Q F V+FDTGSSNLWVPS C +SC H+RY KS
Sbjct: 66 IVPLKDYDDVEYYGEITIGTPAQTFKVVFDTGSSNLWVPSVACK-DLSCVRHARYNHTKS 124
Query: 134 NTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGII 193
+TY G+S I YG+G++ G S D V VG + +K QVF E T E + TFL A+ DGI
Sbjct: 125 STYVPNGQSFNITYGTGAVKGILSSDTVVVGGLAIKGQVFGETTNEYTDTFLNAKIDGIC 184
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKH 253
G F IAV PV++N+++Q LV + +FSF++++ ++ GG++ K++ G
Sbjct: 185 GFAFPNIAVDGVTPVFNNLMKQRLVDKNIFSFYMSKKA-GSGASAMILGGINSKYYTGSF 243
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGP----TPVVTEINHA 309
+YVP+ + YW L DI + Q +C GC AIVD+GTSL+AG P++ ++N A
Sbjct: 244 SYVPLIQHNYWSIALDDIAMNGQGQSLCGFGCMAIVDTGTSLIAGTPDVMQPIINQLNVA 303
>gi|156843876|ref|XP_001645003.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
gi|156115658|gb|EDO17145.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
Length = 399
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 160/251 (63%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ ++ IG+PPQ F VI DTGSSNLWVPS C S++CY HS+Y S+
Sbjct: 76 VPLDNYLNAQYYTDVSIGTPPQKFKVILDTGSSNLWVPSVGCS-SLACYLHSKYDHSLSS 134
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ G+ SQD + +GD+++ Q F EAT E L F +FDGI+G
Sbjct: 135 TYRSNGSDFVIQYGSGSLKGYISQDTLTIGDLIIPQQDFAEATAEPGLAFAFGKFDGILG 194
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V AVP N + +GL+ + +F+F+L + ++ GGE FGG DP F+G+
Sbjct: 195 LAYDSISVNKAVPPLYNAIHRGLLDKPMFAFYLGDEKSSKNGGEATFGGYDPSRFEGEIK 254
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + +N+ IG +
Sbjct: 255 WLPVRRKAYWEVQFDGIKLGDKFMKL-EGHGAAI-DTGTSLITLPSQIADFLNNEIGAKK 312
Query: 314 ---GVVSAECK 321
G + +CK
Sbjct: 313 SWNGQYTIDCK 323
>gi|224458278|ref|NP_001138942.1| pepsinogen A precursor [Pongo abelii]
gi|222425178|dbj|BAH20538.1| pepsinogen A-75 [Pongo abelii]
Length = 388
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|130484814|ref|NP_001076103.1| gastricsin precursor [Oryctolagus cuniculus]
gi|73621389|sp|Q9GMY2.1|PEPC_RABIT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798668|dbj|BAB11756.1| pepsinogen C [Oryctolagus cuniculus]
Length = 388
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/376 (34%), Positives = 199/376 (52%), Gaps = 46/376 (12%)
Query: 5 LLRSVFCLWVLASCLL------LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGA 58
LL ++ CL +L + ++ + L+ GL K L+ H + A
Sbjct: 4 LLVALVCLHLLEAAVIKVPLRKFKSIRETLKEKGLLKEFLNTHKYDPA------------ 51
Query: 59 GVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
+++R GD P+ +++DA YFGEI IG+P QNF V+FDTGSSNLWVPS C
Sbjct: 52 ----LKYRFGDFSVTYEPM-DYLDAAYFGEISIGTPSQNFLVLFDTGSSNLWVPSVYCQ- 105
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+R+ KS+T+ ++ + YGSGS++GFF D + ++ V +Q F +
Sbjct: 106 SEACTTHNRFNPSKSSTFYTYDQTFSLEYGSGSLTGFFGYDTFTIQNIEVPNQEFGLSET 165
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E FL A FDGI+GL + ++VGDA P MV+ G +S VFSF+L+ +GG
Sbjct: 166 EPGTNFLYAEFDGIMGLAYPSLSVGDATPALQGMVQDGTISSSVFSFYLSSQ-QGTDGGA 224
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
+V GGVD + G + PVT++ YWQ + + LI ++++G C GC AIVD+GTSLL
Sbjct: 225 LVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEFLISSEASGWCSQGCQAIVDTGTSLLTV 284
Query: 299 PTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEY 358
P ++++ A G + ++YG+ + D + LP NG E+
Sbjct: 285 PQEYMSDLLEATGAQE----------NEYGEFLVDCDSTESLPTFT------FVINGVEF 328
Query: 359 VRLGIPITRVLFVLNV 374
P++ ++LN
Sbjct: 329 -----PLSPSAYILNT 339
>gi|344246136|gb|EGW02240.1| Renin [Cricetulus griseus]
Length = 720
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 175/285 (61%), Gaps = 19/285 (6%)
Query: 33 LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIG 92
LK+R +D+ L+A +R+ G G S V L N++D QY+GEIGIG
Sbjct: 8 LKERGVDMTKLSAEWGKFTKRFSFGNGTSPVI------------LTNYLDTQYYGEIGIG 55
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
+PPQ F VIFDTGS+NLWVPS+KC +C HS Y S +S++Y E G I+YGSG
Sbjct: 56 TPPQTFKVIFDTGSANLWVPSTKCSPLYSACEIHSLYDSSESSSYMENGTEFTIHYGSGK 115
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF SQD V VG ++V Q F E T + F+LA+FDG++G+GF AVG PV+D+
Sbjct: 116 VKGFLSQDIVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDH 174
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE---L 268
++ Q ++ EEVFS + +RD GGE+V GG DP+H++G YV V++ G W+ L
Sbjct: 175 ILSQRVLKEEVFSVYYSRDSHL-LGGEVVLGGSDPQHYQGNFHYVSVSRTGSWEIAMKGL 233
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ +G+ +T +CE GC +VD+G S ++GPT + I +G +
Sbjct: 234 RRVSVGS-ATLLCEEGCVVVVDTGASYISGPTSSLKLIMQTLGAK 277
>gi|194900440|ref|XP_001979765.1| GG22202 [Drosophila erecta]
gi|190651468|gb|EDV48723.1| GG22202 [Drosophila erecta]
Length = 395
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 181/325 (55%), Gaps = 17/325 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-----KERYMGGAGVSGVRHR 66
LWVL CL L RI ++ + + S R R K +GG V+ R
Sbjct: 11 LWVL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKYNVVGGQEVTS---R 65
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFH 125
G + E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H
Sbjct: 66 NGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSLKSVACHHH 122
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY + S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 123 HRYNASASSTFVPDGRRFSIAYGTGSLSGILAQDTVAIGQLVVRNQTFAMATHEPGPTFV 182
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
F GI+GLGFR IA P++++M +Q LV E VFSF+L R+ GGE++FGGVD
Sbjct: 183 DTNFAGIVGLGFRPIAEQRIKPLFESMCDQQLVDECVFSFYLKRNGSERMGGELLFGGVD 242
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
F G TYVP+T GYWQF L I +G + AI D+GTSLLA P
Sbjct: 243 KTKFSGSLTYVPLTHAGYWQFPLDGIELGGTTISRHR---QAIADTGTSLLAAPPREYLI 299
Query: 306 INHAIGGEGVVSAECKLVVSQYGDL 330
IN +GG + E L S+ L
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSL 324
>gi|395535589|ref|XP_003769805.1| PREDICTED: chymosin-like [Sarcophilus harrisii]
Length = 382
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 172/312 (55%), Gaps = 19/312 (6%)
Query: 4 KLLRSVFCLWVLASCLL-LPASSNGLRRIGLKKRRL--DLHSLNAARITRKERYMGGAGV 60
+ L + L+ C++ LP R LKK L D N ++ K R G A
Sbjct: 2 RCLLVFLAIIALSDCMIRLPLMKGNTLRHKLKKHGLLADFLEENKYSLSSKYRRYGEAAK 61
Query: 61 SGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI 120
PL NF+D+QYFG+I IG+PPQ F+V+FDTGSSNLWVPS C S
Sbjct: 62 VASE-----------PLTNFLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN-ST 109
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C H R+ +S+T+ + I YG+GS+ G D V V +V DQ+F +T+E
Sbjct: 110 ACENHHRFSPSESSTFNSTEEPLSIQYGTGSMEGVLGYDTVIVSSIVDPDQIFGLSTQEP 169
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
F + FDGI+GLG+ +AV A PV+DNM+ + LV++ +FS ++NR G +
Sbjct: 170 GNIFTYSEFDGILGLGYPSLAVDQATPVFDNMMNKHLVAQNLFSVYMNRH---GPGSMLT 226
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
G +D ++ G +VP+T +GYWQF + I + Q C+GGC AI+D+GTSLL GP+
Sbjct: 227 LGAIDSSYYTGSLHWVPITVQGYWQFSVDRITVNGQVVA-CDGGCQAILDTGTSLLVGPS 285
Query: 301 PVVTEINHAIGG 312
++ I IG
Sbjct: 286 YDISNIQSVIGA 297
>gi|253762219|gb|ACT35561.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/251 (45%), Positives = 162/251 (64%), Gaps = 13/251 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IGSPPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++ T
Sbjct: 60 PMTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQPTT 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G+ + D VEVG + V +QVF I T + + A DGI+G
Sbjct: 119 FKWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILG 176
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ IA + VPV+DNMV+QGLVS+ +FS +L+ ++E+G E+VFGG+D H+ G+ T
Sbjct: 177 LAFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSS--NSEQGSEVVFGGIDSSHYTGQIT 234
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
++P++ YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 235 WIPLSSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAST 293
Query: 312 ---GEGVVSAE 319
GE VVS +
Sbjct: 294 NQYGEAVVSCQ 304
>gi|222425184|dbj|BAH20541.1| pepsinogen A-14 [Pongo abelii]
Length = 388
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 118/287 (41%), Positives = 168/287 (58%), Gaps = 19/287 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ I + N T C GC AIVD+GTSLL GPT + I IG
Sbjct: 255 TVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
>gi|365986877|ref|XP_003670270.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
gi|343769040|emb|CCD25027.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
Length = 408
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 110/251 (43%), Positives = 161/251 (64%), Gaps = 8/251 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF +I +G+PPQ+F VI DTGSSNLWVPS +C S++CY HS+Y KS+
Sbjct: 84 IPLSNYLNAQYFADITLGTPPQSFKVILDTGSSNLWVPSVECG-SLACYLHSKYDHDKSS 142
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 143 SYKPNGTDFAIRYGTGSLEGYISQDTLNIGDLNIPKQDFAEATSEPGLTFAFGKFDGILG 202
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V VP + N +EQ L+ E+ F+F+L + + +E+GGEI GG+D FKG
Sbjct: 203 LAYDSISVNKVVPPFYNAIEQELLDEKKFAFYLGDANKKSEDGGEITIGGIDKTKFKGDI 262
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV +K YW+ + I +G+Q + G A +D+GTSL+A P+ + IN IG +
Sbjct: 263 DWLPVRRKAYWEVKFEGIGLGDQFAELENHGAA--IDTGTSLIALPSGLAEIINTEIGAK 320
Query: 314 ----GVVSAEC 320
G + EC
Sbjct: 321 KGWTGQYTVEC 331
>gi|301784222|ref|XP_002927531.1| PREDICTED: pepsin B-like [Ailuropoda melanoleuca]
Length = 390
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 112/297 (37%), Positives = 173/297 (58%), Gaps = 6/297 (2%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILP 76
CL L S G+ RI LKK + + + R + V G ++ + P
Sbjct: 10 CLHL---SEGVERIVLKKGK-SIRQVMEERGVLETFLKNHPKVDPGAKYLYSNDAVAYEP 65
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
N++++ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C H+ + S+TY
Sbjct: 66 FTNYLNSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACTNHNMFNPSSSSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G++ + YGSGS++ D V V ++++ +Q F + E + F A FDGI+G+
Sbjct: 125 RNNGQTYTLYYGSGSLTVLLGYDTVNVQNIIINNQEFGLSEIEPNNPFYYANFDGILGMA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AVG+A V +MV+Q +++ +FSF+ +R P E GGE++ GGVD + + G+ +
Sbjct: 185 YPNLAVGNAPTVTQSMVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWT 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + L+ NQ+TG+C GC AIVD+GT +LA P + G +
Sbjct: 245 PVTREMYWQIAIDEFLVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGAQ 301
>gi|195046637|ref|XP_001992191.1| GH24623 [Drosophila grimshawi]
gi|193893032|gb|EDV91898.1| GH24623 [Drosophila grimshawi]
Length = 374
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 156/252 (61%), Gaps = 9/252 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L N M+ Y+G I IG+PPQ+F V+FD+GSSNLWVPS+ C S +C H++Y S S+TY
Sbjct: 62 LSNSMNMAYYGAITIGTPPQSFEVLFDSGSSNLWVPSNTCT-STACEVHNQYDSSASSTY 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
G+S I YG+GS+SGF S D V++ + V Q F EAT E F A FDGI+G+G
Sbjct: 121 QSNGESFSIQYGTGSLSGFLSTDTVDINGLSVTSQTFAEATDEPGTNFNNANFDGILGMG 180
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHTY 255
++ I+ D VPV+ NMV QGLV + VFSF+L R +GGE++FGG D + G TY
Sbjct: 181 YQTISQDDVVPVFYNMVSQGLVDQSVFSFYLARAGTSTTDGGELIFGGSDSSLYSGDLTY 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH--AIGGE 313
VPV+++GYWQF + S +C+ C AI D+GTSL+ P +N + E
Sbjct: 241 VPVSQEGYWQFTMDSATADGNS--LCD-DCQAIADTGTSLIVAPANAYELLNEILNVDDE 297
Query: 314 GVVSAECKLVVS 325
G+V +C + S
Sbjct: 298 GLV--DCSTISS 307
>gi|355706340|gb|AES02605.1| napsin A aspartic peptidase [Mustela putorius furo]
Length = 258
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 136/196 (69%), Gaps = 1/196 (0%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRK 132
+PL N+++AQY+GEIG+G+PPQNFSV+FDTGSSNLWVPS +C+F S+ C+FH R+ S+
Sbjct: 63 FVPLSNYLNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFHHRFNSKA 122
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+++ G I YG+G + G S+D + +G + +F EA E SL F A FDG+
Sbjct: 123 SSSFQPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGV 182
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GLGF +AVG P D +V++GL+ + +FSF+LNRDP A +GGE+V GG DP H+
Sbjct: 183 LGLGFPILAVGGVRPPLDTLVDEGLLDKPIFSFYLNRDPKAADGGELVLGGSDPAHYIPP 242
Query: 253 HTYVPVTKKGYWQFEL 268
T++PVT YWQ +
Sbjct: 243 LTFLPVTIPAYWQIHM 258
>gi|207340638|gb|EDZ68928.1| YPL154Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 385
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
>gi|73621391|sp|Q9GMY4.1|PEPC_SORUN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798664|dbj|BAB11754.1| pepsinogen C [Sorex unguiculatus]
Length = 389
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 113/286 (39%), Positives = 165/286 (57%), Gaps = 18/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H + A+ ++ GD P+ ++DA YFG
Sbjct: 33 LREQGLLGEFLRTHPYDPAQ----------------KYHFGDFSVAYEPMA-YLDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ KS+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPSKSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + + ++ V Q F + E F+ A+FDGI+G+ + +A+G A
Sbjct: 135 GSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGENFVYAQFDGIMGMAYPTLAMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M++ G + VFSF+L+ +++GG +VFGGVD + G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQAGALDSPVFSFYLSNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ LIG Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 VEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYLSALQQATGAQ 300
>gi|73620984|sp|P81497.2|PEPA_SUNMU RecName: Full=Pepsin A; Flags: Precursor
gi|9798654|dbj|BAB11749.1| pepsinogen A [Suncus murinus]
Length = 387
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/295 (40%), Positives = 174/295 (58%), Gaps = 22/295 (7%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL K L H++N A +Y + L D PL N+MD +YFG IGI
Sbjct: 36 GLLKDFLAKHNVNPA-----SKYFPTEAAT----ELADQ-----PLVNYMDMEYFGTIGI 81
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ +KS+T+ ++ I YG+GS
Sbjct: 82 GTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQKSSTFQSTSQTLSIAYGTGS 140
Query: 152 ISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
++G D V+V + +Q+F + T GS + + FDGI+GL + IA A PV+D
Sbjct: 141 MTGVLGYDTVQVAGIADTNQIFGLSQTEPGSFLYY-SPFDGILGLAYPNIASSGATPVFD 199
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NM QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ +
Sbjct: 200 NMWNQGLVSQDLFSVYLSS--NDQSGSVVIFGGIDSSYYTGNLNWVPLSSEGYWQITVDS 257
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVS 325
I + Q+ C G C AIVD+GTSLL+GP + I +IG +A ++VVS
Sbjct: 258 ITMNGQAIA-CSGSCQAIVDTGTSLLSGPNNAIANIQKSIGASQ--NANGQMVVS 309
>gi|158257160|dbj|BAF84553.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 175/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT +T I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPITNIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|323335315|gb|EGA76604.1| Pep4p [Saccharomyces cerevisiae Vin13]
Length = 368
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 44 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 102
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 103 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 162
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 163 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 222
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 223 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 280
>gi|332267172|ref|XP_003282561.1| PREDICTED: pepsin A-5 [Nomascus leucogenys]
Length = 372
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 16 LSEHGLLKDFLKKHNLNPAR-----KYF--------PQLEAPTLVDEQPLENYLDMEYFG 62
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 63 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAY 121
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 122 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 180
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 181 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYSGSLNWVPVTVEGYWQI 238
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 239 TVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 297
Query: 323 VVS 325
+ S
Sbjct: 298 ISS 300
>gi|126309845|ref|XP_001370435.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/238 (43%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H ++ KS+T
Sbjct: 64 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASIYCQ-SQACTNHPQFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E F+ A+FDGI+GL
Sbjct: 123 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V +++ L++ VF+F+L+ + ++ GGE+VFGGVD + G +
Sbjct: 183 AYPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GGC AIVD+GTSLL P + +E+ IG +
Sbjct: 243 APVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGAQ 300
>gi|401838744|gb|EJT42213.1| PEP4-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 405
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 159/240 (66%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + IN +G +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAEL-EGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
>gi|14278413|pdb|1G0V|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant,
Mvv
Length = 329
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
>gi|328771090|gb|EGF81130.1| hypothetical protein BATDEDRAFT_16209 [Batrachochytrium
dendrobatidis JAM81]
Length = 400
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 193/355 (54%), Gaps = 26/355 (7%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNA----ARITRKERYMGGAGVSGVRHRL 67
+W++A+ ++ A I LKKR +LNA + R + S ++ L
Sbjct: 6 VWLVAAASVVSAHKG--NTIKLKKRPHTQDTLNALFSNVQSVYSNRLAFQSETSEDQYIL 63
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
G E +PL +F +AQYFGEI IG+PPQ F+VIFDTGSSNLWVPS++C SI+C+ H R
Sbjct: 64 GGGAEHSVPLTDFANAQYFGEIQIGTPPQPFTVIFDTGSSNLWVPSTRCS-SIACWMHRR 122
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y + +S+TY G I YG+G++ G SQD V +G + +++Q F E+ +E +TF +
Sbjct: 123 YDASESSTYVNNGTEFAIQYGTGALEGVISQDTVTIGGLTIENQGFGESVKEPGITFAVG 182
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
RFDGI+GLGF I+V VP N++ + +F WL EEGGEIVFG V+
Sbjct: 183 RFDGILGLGFDTISVQKVVPPMYNLINNHQLDTPLFGVWLGSS-SGEEGGEIVFGAVNHD 241
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
HFKG T+VPV +K YW+ EL + IG + + A +D+G+SL A P IN
Sbjct: 242 HFKGAVTWVPVVRKAYWEVELEGVTIGGKKLAIKS--SRAAIDTGSSLFALPVAEADAIN 299
Query: 308 HAIGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLG 362
+GG+ + G I D LPE Q F G ++V G
Sbjct: 300 GILGGKK----------NWNGQFIVDCATIDSLPELTLQ------FGGQKFVITG 338
>gi|254596794|gb|ACT75642.1| pepsinogen A [Channa argus]
Length = 361
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 112/267 (41%), Positives = 164/267 (61%), Gaps = 14/267 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G I IG+PPQ+FSVIFD+GSSNLWVPS C S +C H+++ ++S++
Sbjct: 44 PMTNDADMSYYGVISIGTPPQSFSVIFDSGSSNLWVPSVYCSSSQACQNHNKFNPQQSSS 103
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G+S I YG+GS++G+ D V VG V V +QVF + E + + DGI+GL
Sbjct: 104 FQWNGESLSIQYGTGSMTGYLGADTVGVGGVSVANQVFGLSQSEAPFMAHM-QADGILGL 162
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++NMV QGLVS+ +FS +L+ ++ +G E+VFGGVD H+ G+ +
Sbjct: 163 AFQSIASDNVVPVFNNMVSQGLVSQPMFSVYLSS--NSAQGSEVVFGGVDSNHYTGQIAW 220
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
+P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT ++ IN +G
Sbjct: 221 IPLTSATYWQIKMDSVSINGQ-TVACSGGCQAIIDTGTSLIVGPTSDISNINSWVGAS-- 277
Query: 316 VSAECKLVVSQYGDLIWDLLVSGLLPE 342
QYGD + +PE
Sbjct: 278 --------TDQYGDATVNCQNIQSMPE 296
>gi|414887123|tpg|DAA63137.1| TPA: hypothetical protein ZEAMMB73_794362 [Zea mays]
Length = 608
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 98/153 (64%), Positives = 117/153 (76%)
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD
Sbjct: 309 NMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGD 368
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+L+ +STG C GGCAA+ DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG
Sbjct: 369 VLVDGKSTGFCAGGCAAVADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQ 428
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 429 ILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 461
>gi|114607413|ref|XP_518465.2| PREDICTED: gastricsin isoform 2 [Pan troglodytes]
Length = 388
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 178/307 (57%), Gaps = 16/307 (5%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV------SGVRHR 66
W++ + L S + ++ LKK + + R T KE+ + G + ++R
Sbjct: 3 WMVVVLVCLQLSEAAVVKVPLKKFK-------SIRETMKEKGLLGEFLRTHKYDPAWKYR 55
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHS 126
GD P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C HS
Sbjct: 56 FGDLSVTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHS 113
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
R+ +S+TY+ G++ + YGSGS++GFF D + V + V +Q F + E F+
Sbjct: 114 RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVY 173
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
A+FDGI+GL + ++V +A MV++G ++ VFS +L+ GG +VFGGVD
Sbjct: 174 AQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDS 232
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ G+ + PVT++ YWQ + + LIG Q++G C GC AIVD+GTSLL P ++ +
Sbjct: 233 SLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAL 292
Query: 307 NHAIGGE 313
A G +
Sbjct: 293 LEATGAQ 299
>gi|129786|sp|P27678.1|PEPA4_MACFU RecName: Full=Pepsin A-4; AltName: Full=Pepsin I/II; Flags:
Precursor
gi|38071|emb|CAA42425.1| prepropepsin A [Macaca fuscata]
Length = 388
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 177/313 (56%), Gaps = 26/313 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPA-----SKYFPQAEAPTLI--------DEQPLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P QNF+V+FDTGSSNLWVPS CY S++C H+ + + S+TY K+ I Y
Sbjct: 79 TIGIGTPAQNFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYRATSKTVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V+VG + +Q+F + E A FDGI+GL + I+ A P
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSETEPGFFLYFAPFDGILGLAYPSISSSGATP 197
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ Q LVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 198 VFDNIWNQRLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQIS 255
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAECK 321
+ I + N T C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 256 VDSITM-NGKTIACAKGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSA- 313
Query: 322 LVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 314 --ISSLPDIVFTI 324
>gi|365758066|gb|EHM99929.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 405
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/240 (44%), Positives = 159/240 (66%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLTIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D+E GGE FGG+D FKG
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + EG AAI D+GTSL+ P+ + IN +G +
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAEL-EGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
>gi|426368715|ref|XP_004051348.1| PREDICTED: pepsin A-5-like isoform 1 [Gorilla gorilla gorilla]
Length = 388
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|7766834|pdb|1DP5|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant
Inhibitor
gi|7766836|pdb|1DPJ|A Chain A, The Structure Of Proteinase A Complexed With Ia3 Peptide
Inhibitor
gi|22218637|pdb|1FMU|A Chain A, Structure Of Native Proteinase A In P3221 Space Group.
gi|22218638|pdb|1FMX|A Chain A, Structure Of Native Proteinase A In The Space Group P21
gi|22218639|pdb|1FMX|B Chain B, Structure Of Native Proteinase A In The Space Group P21
gi|225346|prf||1301217A proteinase A,Asp
Length = 329
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
>gi|426368717|ref|XP_004051349.1| PREDICTED: pepsin A-5-like isoform 2 [Gorilla gorilla gorilla]
Length = 388
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|2624629|pdb|2JXR|A Chain A, Structure Of Yeast Proteinase A
gi|10835733|pdb|1FQ4|A Chain A, Crystal Structure Of A Complex Between Hydroxyethylene
Inhibitor Cp- 108,420 And Yeast Aspartic Proteinase A
gi|10835734|pdb|1FQ5|A Chain A, X-Ray Struture Of A Cyclic Statine Inhibitor Pd-129,541
Bound To Yeast Proteinase A
gi|10835735|pdb|1FQ6|A Chain A, X-Ray Structure Of Glycol Inhibitor Pd-133,450 Bound To
Saccharopepsin
gi|10835736|pdb|1FQ7|A Chain A, X-Ray Structure Of Inhibitor Cp-72,647 Bound To
Saccharopepsin
gi|10835737|pdb|1FQ8|A Chain A, X-Ray Structure Of Difluorostatine Inhibitor Cp81,198
Bound To Saccharopepsin
Length = 329
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 105/240 (43%), Positives = 156/240 (65%), Gaps = 4/240 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQNF VI DTGSSNLWVPS++C S++C+ HS+Y S+
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YG+GS+ G+ SQD + +GD+ + Q F EAT E LTF +FDGI+G
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
LG+ I+V VP + N ++Q L+ E+ F+F+L + D E GGE FGG+D FKG
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
>gi|432943847|ref|XP_004083297.1| PREDICTED: cathepsin E-A-like [Oryzias latipes]
Length = 412
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/333 (37%), Positives = 184/333 (55%), Gaps = 17/333 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKR-RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDS 70
+W ++ L +P N R ++ LD + T RY RLG S
Sbjct: 12 IWTASALLRVPLRRNPTIRTQMRAEGLLDQFLKDNQPDTFNRRYAQCFPPGTQSLRLGRS 71
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
E I NFMDAQY+GEI +G+P QNFSVIFDTGSS+LWVPSS C S +C FH +K+
Sbjct: 72 SEKIY---NFMDAQYYGEIRLGTPEQNFSVIFDTGSSDLWVPSSYC-VSQACAFHRHFKA 127
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+++ G++ I+YGSG + G +D + +G++ V +Q F E+ E TF+ A+FD
Sbjct: 128 FKSSSFHHDGRTFGIHYGSGHLLGVMGKDTLRIGNLTVLNQEFGESVYEPGSTFVTAKFD 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAE-EGGEIVFGGVDPKHF 249
G++GL + +A PV+DNM+ Q ++ E +FSF+L+R G+++ GG D +
Sbjct: 188 GVLGLAYPSLAEIIGKPVFDNMLAQKILDEPIFSFYLSRSKSKSVPEGQLLLGGTDESLY 247
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G +VPVT KGYWQ + + + S+ +C GC AIVD+GTSL+AGP + ++
Sbjct: 248 SGPINWVPVTIKGYWQIRMDSVSVQGVSS-LCRRGCEAIVDTGTSLIAGPPREILRLHQL 306
Query: 310 IGGEGVVSAECKLVVSQYGDLIWDLLVSGLLPE 342
IG + +GD + D LP
Sbjct: 307 IGA----------TPTHFGDFVVDCARLSSLPH 329
>gi|73621390|sp|Q9GMY3.1|PEPC_RHIFE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798666|dbj|BAB11755.1| pepsinogen C [Rhinolophus ferrumequinum]
Length = 389
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/286 (38%), Positives = 171/286 (59%), Gaps = 8/286 (2%)
Query: 34 KKRRLDLHSLNAARITRKERYM------GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
K ++ L L + R T KE+ + ++R D P+ +MDA YFG
Sbjct: 17 KVVKVPLKKLKSLRETMKEKGLLEEFLKNHKYDPAQKYRYTDFSVAYEPMA-YMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C + +C H+R+ +S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-TQACTGHTRFNPSQSSTYSTNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ + +A+G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPSLAMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G ++ VFSF+L+ ++ GG ++FGGVD ++G+ + PVT++ YWQ
Sbjct: 195 ALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYQGQIYWAPVTQELYWQIG 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 255 IEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 300
>gi|363743175|ref|XP_003642787.1| PREDICTED: renin-like [Gallus gallus]
Length = 451
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 155/252 (61%), Gaps = 7/252 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N++D QY+GEI IG+PPQ F V+FDTGS+NLWVPS KC +C HSRY S KS T
Sbjct: 124 LTNYLDTQYYGEISIGTPPQTFKVVFDTGSANLWVPSCKCSPLYSACISHSRYDSSKSRT 183
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YG+GS+ GF SQD V V D+ + QVF EAT + F+ ARFDG++G+
Sbjct: 184 YIANGTGFAIRYGTGSVKGFLSQDVVMVSDIPII-QVFAEATVLPAFPFIFARFDGVLGM 242
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ A+ PV+D ++ Q ++ E+VFS + +R+ + GGEI+ GG DP ++ G Y
Sbjct: 243 GYPSQAIDGITPVFDRILSQQILKEDVFSVYYSRNSPLKPGGEIILGGTDPAYYTGDFHY 302
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+ +++ GYWQ + + +G + C+ GC+ +D+G S + GP V+ + AIG
Sbjct: 303 LSISRSGYWQISMKGVSVGAEML-FCKEGCSVAIDTGASYITGPAGPVSVLMKAIGAAEM 361
Query: 313 -EGVVSAECKLV 323
EG +C+ V
Sbjct: 362 TEGEYVVDCEKV 373
>gi|344295434|ref|XP_003419417.1| PREDICTED: pepsin A-2/A-3-like [Loxodonta africana]
Length = 384
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/286 (41%), Positives = 170/286 (59%), Gaps = 22/286 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL L H LN A +Y S + D L+N++D +YFG
Sbjct: 31 LKEHGLLDDFLKTHRLNPAS-----KYFPKEASSLL---------DTQTLENYLDVEYFG 76
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY ++ I Y
Sbjct: 77 TIGIGTPAQEFTVIFDTGSSNLWVPSTYCS-SLACTNHNRFNPDDSSTYRSTSETVSITY 135
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + + FDGI+GL + I+ DA
Sbjct: 136 GTGSMTGILGYDTVKVGGISDTNQIFGLSETEPGSFLYY-SPFDGILGLAYPSISSSDAT 194
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV-FGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DN+ +QGLVS+++FS +L+ D EEGG +V FGG+D ++ G +VPV+ +GYWQ
Sbjct: 195 PVFDNIWDQGLVSQDLFSVYLSSD---EEGGSVVIFGGIDSSYYTGSLNWVPVSYEGYWQ 251
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
L + I +S C C AI+D+GTSLLAGPT + I +G
Sbjct: 252 ITLDSVSIDGESVA-CSDTCQAIIDTGTSLLAGPTTAIANIQEYLG 296
>gi|2687645|gb|AAB88862.1| cathepsin D [Sparus aurata]
Length = 399
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/271 (42%), Positives = 161/271 (59%), Gaps = 14/271 (5%)
Query: 77 LKNFMDAQYFGEIGIGSP-PQNFSVIFDTGSSNLWVPSSKCYF-SISCYFHSRYKSRKSN 134
L NFMDAQY+G I IG+P ++F+V+FDTGSSNLWVPS C F I+C Y S+KS
Sbjct: 69 LTNFMDAQYYGVISIGTPVHRDFTVLFDTGSSNLWVPSIHCSFLDIACCASPSYNSKKST 128
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY + G I YG GS+SGF S +V V + V Q F EA ++ +TF +ARFDG +G
Sbjct: 129 TYVQNGTEFSIRYGRGSLSGFISGSDVSVAGLPVPRQQFGEAVKQPGITFAVARFDGSLG 188
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK-GKH 253
+ + + + VPV+D + L+ + +FSF+L RDP A GGE+ GG DP G
Sbjct: 189 MAYPFHIIANVVPVFDTAMAAKLLPQNIFSFYLTRDPKAAVGGELTLGGTDPHVLTLGDL 248
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
YV VT+K YW + + +GNQ + +C+ GC AIVD+GTSL+ GP V ++ AIG
Sbjct: 249 HYVNVTRKAYWHIGMDGLQVGNQLS-LCKAGCEAIVDTGTSLIVGPVEEVRALHKAIGAL 307
Query: 314 GVVSAE----------CKLVVSQYGDLIWDL 334
++ E C L +S G +++L
Sbjct: 308 PLIDGEYGLDCSGSHRCLLSLSTLGGRMFNL 338
>gi|222425186|dbj|BAH20542.1| pepsinogen A-35 [Pongo abelii]
Length = 388
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 173/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S+ C H+ + + S+TY ++ I Y
Sbjct: 79 SIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLVCMDHNLFNPQDSSTYKSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGKTIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|195433875|ref|XP_002064932.1| GK15196 [Drosophila willistoni]
gi|194161017|gb|EDW75918.1| GK15196 [Drosophila willistoni]
Length = 415
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 189/351 (53%), Gaps = 32/351 (9%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRL---------------DLHSLNAARITRKERYMG 56
LW L + +L+ SS R+ + R+ DL SL+ + + +
Sbjct: 8 LWFLLALVLIAFSSAEARKRKHNRVRVGRHNNPSSSHYNVKHDLKSLSIKHKLKLSKAIV 67
Query: 57 GAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC 116
VS + + L N + +Y+ + IG+PPQ F ++ DTGS+NLWVPSSKC
Sbjct: 68 DNAVSASTKTGTTAATNAASLGNAYNTEYYITVHIGTPPQEFRLLIDTGSANLWVPSSKC 127
Query: 117 YFSI-SCYFHSRYKSRKSNTYTEIGKSCEINYGSGS-----ISGFFSQDNVEVGDVVVKD 170
++ +C H RY S S+TY + +I Y S + + GF SQD V +GD+ +K+
Sbjct: 128 PSTVKACAAHQRYNSSASSTYKANNTAFQIEYASNTAGGVALDGFLSQDTVAIGDLAIKN 187
Query: 171 QVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD 230
QVF E T E TFL + FDG+IGL + I++ +P N++ QGL+ E +FS +LNR+
Sbjct: 188 QVFAEMTNEPDGTFLTSPFDGMIGLAYASISINGVIPPLYNLISQGLIPEPIFSIYLNRN 247
Query: 231 -PDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIV 289
+A GGE++ GG+DP + G TYVPV+++GYWQFE+ + +Q C+ C AI+
Sbjct: 248 GTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFEMTSATLNDQE--FCD-NCQAIL 304
Query: 290 DSGTSLLAGPTPVVTEINHAIG------GEGVVSAECKLVVSQYGDLIWDL 334
D GTSL+ P + EIN +G G +C +S+ D+I+ +
Sbjct: 305 DVGTSLIVVPNSEIKEINQILGVTNPNATSGAFLVDCA-TISKLPDIIFTI 354
>gi|222425182|dbj|BAH20540.1| pepsinogen A-15 [Pongo abelii]
Length = 388
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 168/287 (58%), Gaps = 19/287 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + H PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPA-----SKYFPQGKAPTLLHEQ--------PLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS CY S++C H+ + + S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYKSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + + GS F A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFF-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGAS 300
>gi|221048011|gb|ACL98113.1| pepsinogen [Epinephelus coioides]
Length = 311
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 116/267 (43%), Positives = 166/267 (62%), Gaps = 17/267 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G + DNVEVG + V++QVF I T + + A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILGL 177
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+
Sbjct: 178 AFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSS--HGEQGSEVVFGGIDSSHYTGQVTW 235
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 236 VPLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS-- 292
Query: 316 VSAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 --------TNQYGESTVNCQNVGSMPE 311
>gi|194210206|ref|XP_001488754.2| PREDICTED: renin-like [Equus caballus]
Length = 391
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 176/291 (60%), Gaps = 10/291 (3%)
Query: 37 RLDLHSLNAARITRKERYMG----GAGVSGVRHRLG-DSDEDILPLKNFMDAQYFGEIGI 91
R+ L + + R + +ER + GA S RL D+ + L N++D QY+GEIGI
Sbjct: 17 RIFLRKMPSVRESLRERGVDVSRIGAEWSQFTKRLSRDNSTSPVVLTNYLDTQYYGEIGI 76
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
G+PPQ F VIFDTGS+NLWVPS+KC +C HS Y S +S++Y E G I YGSG
Sbjct: 77 GTPPQTFKVIFDTGSANLWVPSTKCSPLYAACEIHSLYDSSESSSYMENGTEFTIRYGSG 136
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
+ GF SQD V VG + V Q F E T + F+LA+FDG++G+GF AVG PV+D
Sbjct: 137 KVKGFLSQDMVTVGGITVT-QTFAEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFD 195
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEE--GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+++ Q ++ E+VFS + +R+ GGEIV GG DP++++G YV V+K WQ ++
Sbjct: 196 HILSQRVLKEDVFSVYYSRNSKNSHLLGGEIVLGGSDPQYYQGNFHYVSVSKTDSWQIKM 255
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + +T +CE GC +VD+G S ++GPT + + +G + + S E
Sbjct: 256 KGVSV-RSATLLCEEGCMVVVDTGASYISGPTSSLRLLMETLGAKELSSDE 305
>gi|51534964|dbj|BAD36915.1| pepsinogen C [Myocastor coypus]
Length = 393
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 176/305 (57%), Gaps = 7/305 (2%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR---KERYMGGAGVSGVRHRLGD 69
W + + L LP + RI L+K + ++ + + K+ A +H GD
Sbjct: 3 WAIVALLCLPLLEAAVLRIPLRKSKSIREAMKENGLLKQYLKDHKQDPAQKFFGKH-FGD 61
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
+ P+ +MDA YFGEI +G+PPQ+F V+FDTGSSNLWV S C S++C HSR+
Sbjct: 62 YSVLLEPM-TYMDASYFGEISLGTPPQSFQVLFDTGSSNLWVASIYCK-SLACTTHSRFN 119
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
KS+TYT G++ + YGSGS++G F D + + D V Q F + +E +FL A F
Sbjct: 120 PNKSSTYTSAGQTFSLQYGSGSLTGLFGYDTLTIQDTQVPKQEFGLSEQEPGGSFLYAAF 179
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDA-EEGGEIVFGGVDPKH 248
DGI+GL + ++ GDA ++ +G +S+ +FS +L DA EGG ++ GGVD
Sbjct: 180 DGIMGLAYPGLSAGDATTAMQGLLREGALSQSLFSVYLGSQQDATNEGGALILGGVDESL 239
Query: 249 FKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINH 308
+ G ++ PVT++ YWQ + D L+ +++G C GC AIVD+GTSLL P ++ +
Sbjct: 240 YSGAISWTPVTQELYWQIGIEDFLLDGEASGWCSEGCQAIVDTGTSLLTVPQQYLSTLIE 299
Query: 309 AIGGE 313
AIG E
Sbjct: 300 AIGAE 304
>gi|410974069|ref|XP_003993470.1| PREDICTED: pepsin A-like [Felis catus]
Length = 387
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 153/239 (64%), Gaps = 6/239 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG IGIG+PPQ F+VIFDTGSSNLWVPS C S +C H R+ ++S+T
Sbjct: 66 PLENYMDMEYFGTIGIGTPPQQFTVIFDTGSSNLWVPSVYCK-SPACTNHKRFNPQESST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y I YG+GS++G D V+VG V +Q+F + T GS + A FDGI+G
Sbjct: 125 YQATNNPVSIAYGTGSMTGILGYDTVQVGGVSDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + +I+ A PV+DNM +GLVS+++FS +L+ + + G ++FGG+D ++ G
Sbjct: 184 LAYPQISASGATPVFDNMWNEGLVSQDLFSVYLSG--NDQSGSVVMFGGIDSSYYTGNLN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++PV+ +GYWQ + I + QS C GGC AIVD+GTSLL GP+ + I IG
Sbjct: 242 WIPVSVEGYWQISVDSITMNGQSI-ACNGGCQAIVDTGTSLLTGPSNAIANIQSDIGAS 299
>gi|229576947|ref|NP_001153272.1| pepsinogen A precursor [Pongo abelii]
gi|222425188|dbj|BAH20543.1| pepsinogen A-19 [Pongo abelii]
gi|222425190|dbj|BAH20544.1| pepsinogen A-13 [Pongo abelii]
gi|222425204|dbj|BAH20551.1| pepsinogen A-41 [Pongo abelii]
Length = 388
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKTHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|217038345|gb|ACJ76637.1| pepsinogen C (predicted) [Oryctolagus cuniculus]
Length = 391
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 187/347 (53%), Gaps = 40/347 (11%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+ GL K L+ H + A +++R GD P+ +++DA YFG
Sbjct: 36 LKEKGLLKEFLNTHKYDPA----------------LKYRFGDFSVTYEPM-DYLDAAYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+P QNF V+FDTGSSNLWVPS C S +C H+R+ KS+T+ ++ + Y
Sbjct: 79 EISIGTPSQNFLVLFDTGSSNLWVPSVYCQ-SEACTTHNRFNPSKSSTFYTYDQTFSLEY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + ++ V +Q F + E FL A FDGI+GL + ++VGDA P
Sbjct: 138 GSGSLTGFFGYDTFTIQNIEVPNQEFGLSETEPGTNFLYAEFDGIMGLAYPSLSVGDATP 197
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
MV+ G +S VFSF+L+ +GG +V GGVD + G + PVT++ YWQ
Sbjct: 198 ALQGMVQDGTISSSVFSFYLSSQ-QGTDGGALVLGGVDSSLYTGDIYWAPVTRELYWQIG 256
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+ + LI ++++G C GC AIVD+GTSLL P ++++ A G + ++Y
Sbjct: 257 IDEFLISSEASGWCSQGCQAIVDTGTSLLTVPQEYMSDLLEATGAQ----------ENEY 306
Query: 328 GDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPITRVLFVLNV 374
G+ + D + LP NG E+ P++ ++LN
Sbjct: 307 GEFLVDCDSTESLPTFT------FVINGVEF-----PLSPSAYILNT 342
>gi|23943854|ref|NP_055039.1| pepsin A-5 preproprotein [Homo sapiens]
gi|378522017|sp|P0DJD9.1|PEPA5_HUMAN RecName: Full=Pepsin A-5; AltName: Full=Pepsinogen-5; Flags:
Precursor
gi|20810074|gb|AAH29055.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|119594334|gb|EAW73928.1| pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|219520836|gb|AAI71889.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|223461673|gb|AAI47000.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|407726061|dbj|BAM46128.1| pepsinogen C [Cynops pyrrhogaster]
Length = 383
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 173/321 (53%), Gaps = 24/321 (7%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
+ L+ ++ CL + +P L + ++ + H + A R+ +Y
Sbjct: 2 KNLILALVCLQFAEGLVRIP-----LHKFKPMRQVMAEHGVKAPRVDPATKY-------- 48
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
R + PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C
Sbjct: 49 ---RFNNFAVGYEPLSNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS-SSAC 104
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+TYT + I YG+GS++G D V + + + Q F + E
Sbjct: 105 TNHATFNPSQSSTYTSNNQKFSIQYGTGSLTGILGYDTVSIQGITITQQEFALSVNEPGT 164
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F+ A+FDGI+GL + IA A V + M+ QGL+S+ +F F+L + ++ GGE+VFG
Sbjct: 165 NFVYAQFDGILGLAYPSIAADGATTVMEGMMNQGLLSQNIFGFYLGQQ-GSQSGGELVFG 223
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVD ++ G+ T+ PVT++ YWQ + + Q TG C GC IVD+GTSLL P
Sbjct: 224 GVDSNYYTGQITWTPVTQQMYWQIGISGFGVNGQPTGWCGQGCQGIVDTGTSLLTAPGQY 283
Query: 303 VTEINHAIG------GEGVVS 317
+ + IG GE VVS
Sbjct: 284 IAALMQEIGATQDSNGEYVVS 304
>gi|413917603|gb|AFW57535.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917604|gb|AFW57536.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 294
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 97/153 (63%), Positives = 122/153 (79%), Gaps = 1/153 (0%)
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
M EQ L++E+VFSFWLNR PDA GGE+VFGGVDP HF G HTYVPV++KGYWQF++GD
Sbjct: 1 MQEQELLAEDVFSFWLNRSPDAAAAGGELVFGGVDPAHFSGNHTYVPVSRKGYWQFDMGD 60
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+LI STG C GCAAIVDSGTSLLAGPT ++ ++N AIG +G++S ECK VVSQYG++
Sbjct: 61 LLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIIAQVNEAIGADGIISTECKEVVSQYGEM 120
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
I D+L++ P++VC Q+GLC F+GA V GI
Sbjct: 121 ILDMLIAQTDPQRVCSQVGLCVFDGARSVSEGI 153
>gi|73621386|sp|Q9GMY8.1|PEPA_SORUN RecName: Full=Pepsin A; Flags: Precursor
gi|9798656|dbj|BAB11750.1| pepsinogen A [Sorex unguiculatus]
Length = 387
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 116/300 (38%), Positives = 171/300 (57%), Gaps = 22/300 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL + L HSLN A +Y + ++ PL N+MD +YFG
Sbjct: 32 LWENGLLEDFLKTHSLNPA-----SKYFPTEATTLSANQ---------PLVNYMDMEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ F+VIFDTGSSNLWVPS C S +C H+R+ +KS+T+ ++ I Y
Sbjct: 78 TISIGTPPQEFTVIFDTGSSNLWVPSIYCS-SPACSNHNRFDPQKSSTFKPTSQTVSIAY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V+V + +Q+F + E + FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVQVAGIADTNQIFGLSQSEPGSFLYYSPFDGILGLAYPSISSSGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DNM QGLVS+++FS +L+ + + G ++FGG+D ++ G +VP++ +GYWQ
Sbjct: 197 VFDNMWNQGLVSQDLFSVYLSS--NDQSGSVVMFGGIDSSYYTGSLNWVPLSSEGYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKLV 323
+ I + QS C GGC AIVD+GTSLL+GPT + I IG +G ++ C +
Sbjct: 255 VDSITMNGQSI-ACNGGCQAIVDTGTSLLSGPTNAIANIQSKIGASQNSQGQMAVSCSSI 313
>gi|219521036|gb|AAI71897.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + N T C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITM-NGETIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|149725191|ref|XP_001501954.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 121/296 (40%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H N A + A G L+N+MD +YFG
Sbjct: 32 LRENGLLADFLKQHPRNPASKYFPKEAATLAATEG--------------LENYMDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPAQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESVSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISASGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSSD--DESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + +S C GGC AIVD+GTSLLAGPT + I IG GE V+S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDSSGEAVIS 309
>gi|119372298|ref|NP_001073275.1| pepsin A preproprotein [Homo sapiens]
gi|378521956|sp|P0DJD8.1|PEPA3_HUMAN RecName: Full=Pepsin A-3; AltName: Full=Pepsinogen-3; Flags:
Precursor
gi|182887917|gb|AAI60184.1| Pepsinogen 3, group I (pepsinogen A) [synthetic construct]
Length = 388
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|194762104|ref|XP_001963198.1| GF19727 [Drosophila ananassae]
gi|190616895|gb|EDV32419.1| GF19727 [Drosophila ananassae]
Length = 449
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 173/298 (58%), Gaps = 19/298 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L+RI ++ R L H+L + + K +Y+ A S ++ E ++ NF Y+G
Sbjct: 20 LKRIEIRPRNL-THNLQSEILLLKAKYLSSADESV------EAKEILVNAANFA---YYG 69
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEIN 146
EI IG+PPQNFSV+FDTGSSN WVPSS C S ++C H++YKS S+TY +G + I
Sbjct: 70 EISIGTPPQNFSVLFDTGSSNTWVPSSLCPASDVACQSHNQYKSSASSTYVPVGTNISIV 129
Query: 147 YGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
YG+GS+ GF S D V + + V +Q F EAT E F FDGI+GLGF ++ G
Sbjct: 130 YGTGSMEGFLSNDTVRIAGLNVTNQTFAEATAEPDGFFDSQPFDGILGLGFNTLSNGINT 189
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV DNM+ QGL+ + FS +L R+ + GGEI++GG DP + G TYVPV+ YWQF
Sbjct: 190 PV-DNMIAQGLLDKPEFSVYLRRNGSSLIGGEIIWGGTDPSIYHGSITYVPVSVPQYWQF 248
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI----GGEGVVSAEC 320
+ I Q +C GC AI D+GTSL+ P T IN + G+G S C
Sbjct: 249 TVDTGTINGQI--LCR-GCQAIADTGTSLIIVPKRAFTAINKQLNATDNGDGTASIPC 303
>gi|426251840|ref|XP_004019629.1| PREDICTED: pepsin A-like [Ovis aries]
Length = 386
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 110/245 (44%), Positives = 158/245 (64%), Gaps = 6/245 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 65 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D VEVG + +Q+F + T GS + A FDGI+G
Sbjct: 124 YEATSETLSITYGTGSMTGILGYDTVEVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FGG+D ++ G
Sbjct: 183 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSSN--EESGSVVMFGGIDSSYYSGSLN 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 241 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 299
Query: 315 VVSAE 319
S E
Sbjct: 300 DSSGE 304
>gi|195471992|ref|XP_002088286.1| GE18491 [Drosophila yakuba]
gi|194174387|gb|EDW87998.1| GE18491 [Drosophila yakuba]
Length = 392
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 152/254 (59%), Gaps = 12/254 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPSS C S I+C H++Y S S+T
Sbjct: 68 LHNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSSSCPASNIACKKHNKYNSAASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SG S D V + + ++DQ F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGILSTDTVTIAGISIQDQTFGEALNEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNMV QGL+ E V SF+L R A GGE++ GG+D +KG TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYKGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAIG-- 311
VPV+ YWQF + I ++ G+ C GC AI D+GTSL+ P +IN +G
Sbjct: 248 VPVSVPAYWQFAVNTI----KTNGIVLCN-GCQAIADTGTSLIVAPLAAYRKINRQLGAT 302
Query: 312 --GEGVVSAECKLV 323
G+G C V
Sbjct: 303 DNGDGEAFVSCSRV 316
>gi|222425194|dbj|BAH20546.1| pepsinogen A-28 [Pongo abelii]
gi|222425196|dbj|BAH20547.1| pepsinogen A-17 [Pongo abelii]
gi|222425202|dbj|BAH20550.1| pepsinogen A-71 [Pongo abelii]
Length = 388
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 SIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|403217759|emb|CCK72252.1| hypothetical protein KNAG_0J01710 [Kazachstania naganishii CBS
8797]
Length = 415
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 110/265 (41%), Positives = 164/265 (61%), Gaps = 9/265 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+PPQ F VI DTGSSNLWVPSS+C S++C+ H +Y S+
Sbjct: 91 VPLSNYLNAQYYTDITLGTPPQQFKVILDTGSSNLWVPSSEC-GSLACFLHEKYDHSASS 149
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y G I YGSGS+ G+ SQD + +GD+ + Q F EAT E L F +FDGI+G
Sbjct: 150 SYKANGTDFSIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILG 209
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKH 253
L + I+V VP + N +EQ L+ E F+F+L + + DAE+GGE +FGGVD + G
Sbjct: 210 LAYDTISVDKVVPPFYNALEQDLLDEAKFAFYLGDTNKDAEDGGEAIFGGVDKSKYTGDV 269
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
T++PV +K YW+ +L + +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 TWLPVRRKAYWEVKLEGLGLGDEYAELESHGAA--IDTGTSLITLPSGLAEIINSEIGAK 327
Query: 314 ----GVVSAECKLVVSQYGDLIWDL 334
G + EC Q DL ++
Sbjct: 328 KGWTGQYTLECN-TRDQLPDLTFNF 351
>gi|540097|gb|AAB08492.1| preprochymosin, partial [Sus scrofa]
Length = 380
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 111/258 (43%), Positives = 157/258 (60%), Gaps = 15/258 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D QYFG+I IG+PPQ F+V+FDTGSS LWVPS C S +C H R+ KS+T
Sbjct: 64 PLTNYLDTQYFGKIYIGTPPQEFTVVFDTGSSELWVPSVYCK-SDACQNHHRFNPSKSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + K I YG+GSI GF D V V +V Q +T+E S F + FDGI+GL
Sbjct: 123 FQNLDKPLSIQYGTGSIQGFLGYDTVMVAGIVDAHQTVGLSTQEPSDIFTYSEFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ E+A VPV+DNM+ + LV++++F+ +++R+ +EG + G +DP ++ G +
Sbjct: 183 GYPELASEYTVPVFDNMMHRHLVAQDLFAVYMSRN---DEGSMLTLGAIDPSYYTGSLHW 239
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVT + YWQF + + I N C GGC AI+D+GTS+LAGP+ + I AIG
Sbjct: 240 VPVTMQLYWQFTVDSVTI-NGVVVACNGGCQAILDTGTSMLAGPSSDILNIQMAIGA--- 295
Query: 316 VSAECKLVVSQYGDLIWD 333
SQYG+ D
Sbjct: 296 -------TESQYGEFDID 306
>gi|4505757|ref|NP_002621.1| gastricsin isoform 1 preproprotein [Homo sapiens]
gi|129796|sp|P20142.1|PEPC_HUMAN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|387015|gb|AAA60063.1| pepsinogen C [Homo sapiens]
gi|551176|gb|AAA60074.1| pepsinogen [Homo sapiens]
gi|1658286|gb|AAB18273.1| gastricsin [Homo sapiens]
gi|49522219|gb|AAH73740.1| Progastricsin (pepsinogen C) [Homo sapiens]
gi|119624464|gb|EAX04059.1| progastricsin (pepsinogen C) [Homo sapiens]
Length = 388
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
>gi|413942271|gb|AFW74920.1| hypothetical protein ZEAMMB73_522985 [Zea mays]
Length = 468
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 99/153 (64%), Positives = 117/153 (76%)
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
NMV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD
Sbjct: 169 NMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGD 228
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
+L+ +STG C GGCAAI DSGTSLLAGPT ++TEIN IG GVVS ECK VVSQYG
Sbjct: 229 VLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQQ 288
Query: 331 IWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
I DLL++ P K+C Q+GLC F+G V GI
Sbjct: 289 ILDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 321
>gi|126309843|ref|XP_001370404.1| PREDICTED: gastricsin-like isoform 2 [Monodelphis domestica]
Length = 389
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 152/234 (64%), Gaps = 1/234 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD+ Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ +S+TY+
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
>gi|222425192|dbj|BAH20545.1| pepsinogen A-59 [Pongo abelii]
Length = 388
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 SIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|166361871|gb|ABY87034.1| pepsinogen A1 [Epinephelus coioides]
gi|166361875|gb|ABY87036.1| pepsinogen A1 [Epinephelus coioides]
Length = 376
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 116/267 (43%), Positives = 166/267 (62%), Gaps = 17/267 (6%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+ I YG+GS++G + DNVEVG + V++QVF I T + + A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILGL 177
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ E+G E+VFGG+D H+ G+ T+
Sbjct: 178 AFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSS--HGEQGSEVVFGGIDSSHYTGQVTW 235
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VP+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 236 VPLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS-- 292
Query: 316 VSAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 --------TNQYGESTVNCQNVGSMPE 311
>gi|119372302|ref|NP_001073276.1| pepsin A preproprotein [Homo sapiens]
gi|378521995|sp|P0DJD7.1|PEPA4_HUMAN RecName: Full=Pepsin A-4; AltName: Full=Pepsinogen-4; Flags:
Precursor
gi|387012|gb|AAA98529.1| pepsinogen [Homo sapiens]
gi|157170280|gb|AAI52845.1| Pepsinogen 4, group I (pepsinogen A) [synthetic construct]
gi|219520853|gb|AAI71920.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|219521176|gb|AAI71910.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|223462201|gb|AAI50660.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|261860840|dbj|BAI46942.1| pepsinogen 4, group I [synthetic construct]
Length = 388
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|126309841|ref|XP_001370380.1| PREDICTED: gastricsin-like isoform 1 [Monodelphis domestica]
Length = 388
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 152/234 (64%), Gaps = 1/234 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MD+ Y+GEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R+ +S+TY+
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +Q F + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L+ ++ GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A GG+
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGGQ 299
>gi|189066533|dbj|BAG35783.1| unnamed protein product [Homo sapiens]
gi|193785072|dbj|BAG54225.1| unnamed protein product [Homo sapiens]
gi|219521010|gb|AAI71815.1| Pepsinogen 3, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|387014|gb|AAA60062.1| pepsinogen [Homo sapiens]
Length = 385
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 17 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 75
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 76 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 134
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 135 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 194
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 195 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 253
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 254 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 296
>gi|45382395|ref|NP_990208.1| gastricsin precursor [Gallus gallus]
gi|4589840|dbj|BAA76893.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 173/312 (55%), Gaps = 13/312 (4%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
++L+ +V CL + L +P R +K+ + LH A Y + +
Sbjct: 2 KRLILTVLCLHLCEGILRVPLKKGKSIREAMKESGV-LHDYLANHRHYDPAYKFFSNFAT 60
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C
Sbjct: 61 AYE----------PLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQAC 109
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+T++ + + YGSGS++G F D V + + + +Q F + E
Sbjct: 110 ANHNEFDPNESSTFSTQDEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGT 169
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
+FL + FDGI+GL F I+ G A V M+++ L+ VFSF+L+ + +GGE+VFG
Sbjct: 170 SFLYSPFDGILGLAFPSISAGGATTVMQKMLQENLLDFPVFSFYLSGQ-EGSQGGELVFG 228
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP + G+ T+ PVT+ YWQ + D +G QS+G C GC IVD+GTSLL P V
Sbjct: 229 GVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQV 288
Query: 303 VTEINHAIGGEG 314
TE+ IG +
Sbjct: 289 FTELMQYIGAQA 300
>gi|148691635|gb|EDL23582.1| progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 174/304 (57%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ LKK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPLKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L +GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSDGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
>gi|387013|gb|AAA60061.1| pepsinogen A [Homo sapiens]
Length = 388
Score = 214 bits (544), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWKAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|219521691|gb|AAI71808.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|219520803|gb|AAI71814.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 174/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|426250269|ref|XP_004018860.1| PREDICTED: gastricsin [Ovis aries]
Length = 431
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 110/283 (38%), Positives = 168/283 (59%), Gaps = 8/283 (2%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + + +H GD P+ ++MDA YFGEI
Sbjct: 21 KIPLKKFKSIRETMKEKGLLEDFLRTYKHDPAQKYHFGDFSVATEPM-DYMDAAYFGEIS 79
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C H R+ S+TY+ ++ + YGSG
Sbjct: 80 IGTPPQNFLVLFDTGSSNLWVPSLYCQ-SQACTSHPRFNPSLSSTYSSNEQTFSLQYGSG 138
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++G D + V + V +Q F + E FL A+FDGI+G+ + ++V A V
Sbjct: 139 SLTGLLGYDTLTVQGIQVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYPSLSVDGATTVLQ 198
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FSF+L+ +++GG ++FGGVD + + G+ + PVT++ YWQ + +
Sbjct: 199 GMVQEGALTSPIFSFYLSSQQGSQDGGAVIFGGVDSRLYTGQIYWAPVTQELYWQIGIEE 258
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG+Q+TG C GC AIVD+GTSLL P ++ + A G +
Sbjct: 259 FLIGDQATGWCSAGCQAIVDTGTSLLTVPQQFLSALLQATGAQ 301
>gi|395534115|ref|XP_003769093.1| PREDICTED: gastricsin-like [Sarcophilus harrisii]
Length = 392
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 100/238 (42%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H ++ +S+T
Sbjct: 66 PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVSSIYCQ-SQACTNHPQFNPNQSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y+ G++ + YG+GS++G F D V + + + +Q F + E +F+ A+FDGI+GL
Sbjct: 125 YSSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTSFVYAQFDGILGL 184
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ G A V ++++ L++ VF+F+L+ + ++ GGE+ FGGVD F G +
Sbjct: 185 AYPSISSGGATTVMQGLLQENLINAPVFAFYLSGNENSNNGGEVTFGGVDTSMFTGDIYW 244
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + IG Q+TG C GC A+VD+GTSLL P + +E+ IG +
Sbjct: 245 APVTQEAYWQIAINGFSIGGQATGWCSEGCQAVVDTGTSLLTAPQQIFSELMQYIGAQ 302
>gi|195034430|ref|XP_001988894.1| GH11416 [Drosophila grimshawi]
gi|193904894|gb|EDW03761.1| GH11416 [Drosophila grimshawi]
Length = 400
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 179/314 (57%), Gaps = 19/314 (6%)
Query: 28 LRRIGLKK---RRLDLHSLNAARITRK------ERYMG-GAGVSGVRHRLGDSDEDILP- 76
L RI + K ++ H +AAR R+ E Y+ GA + + + DS+ D
Sbjct: 19 LHRIPIHKHQQKKTRQHMKSAARHLRQKYHKQSELYVDYGAPNNDLSGSVEDSNADYTTE 78
Query: 77 -LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSN 134
L N + Y+GEI IG+PPQ F V+FDTGSSNLWVPS C + ++C H++Y S S+
Sbjct: 79 ELSNNQNMDYYGEIAIGTPPQYFKVVFDTGSSNLWVPSVNCLPTDLACQTHNQYNSSASS 138
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G+S I YG+GS++G+ S D V + + + +Q F EAT + + +F FDGI+G
Sbjct: 139 TYVANGESFSIQYGTGSLTGYLSSDTVSISGLSIVNQSFAEATSQPNSSFTGVPFDGILG 198
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ + IA VP + N+ QGL+ + F F+L + AE GGE++ GGVD F+G T
Sbjct: 199 MAYSSIAEDSVVPPFYNLWNQGLIDKPTFGFYLTHNGSAELGGELILGGVDNTLFEGNLT 258
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
VPV++ GYWQF + + + N V C AI D+GTSLLA P +T IN+ IG
Sbjct: 259 SVPVSQMGYWQFAMAVVAMDN---NVICSDCQAIADTGTSLLAVPANQLTYINNIIGAYQ 315
Query: 313 -EGVVSAECKLVVS 325
+G +C LV S
Sbjct: 316 MDGDYFVDCSLVNS 329
>gi|449542760|gb|EMD33738.1| hypothetical protein CERSUDRAFT_56642 [Ceriporiopsis subvermispora
B]
Length = 395
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/271 (45%), Positives = 165/271 (60%), Gaps = 11/271 (4%)
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
GG G + H G ++L L N+ +AQYF E+ +G+PPQNF VI DTGSSNLWVPS
Sbjct: 57 GGLGRNTEVHHSGPG-HNVL-LSNYANAQYFTEVSLGTPPQNFKVILDTGSSNLWVPSVH 114
Query: 116 CYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIE 175
C SI+C+ HS+Y S KS++Y G S EI YGSGS+ G SQD + +GD+ + +Q F E
Sbjct: 115 C-MSIACFMHSKYDSSKSSSYNANGSSFEIQYGSGSMQGIVSQDTLSIGDLNITNQDFAE 173
Query: 176 ATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE 235
AT+E L+F +FDGI+GL + I+V P + NMVEQGL+ +FSF L DA
Sbjct: 174 ATKEPGLSFTFGKFDGILGLAYNSISVNYITPPFYNMVEQGLLDNPIFSFKLG---DAPL 230
Query: 236 GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSL 295
GGE +FGG D + G+ Y PV ++ YW+ EL + +G+Q + G A +D+GTSL
Sbjct: 231 GGEAIFGGTDESAYTGEIIYAPVRRQAYWEVELDKVTLGDQVFEFQDTGAA--IDTGTSL 288
Query: 296 LAGPTPVVTEINHAIGG---EGVVSAECKLV 323
+A PT T IN IG G EC +
Sbjct: 289 IAVPTAQATAINKLIGATSKSGTYVVECSTI 319
>gi|402893203|ref|XP_003909790.1| PREDICTED: pepsin A-2/A-3-like [Papio anubis]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 180/316 (56%), Gaps = 28/316 (8%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
+ L GL K L H+ N AR +Y A + D PL+N++D +Y
Sbjct: 30 HNLSEHGLLKDFLKKHNFNPAR-----KYFPQAEAPTLI--------DEQPLENYLDMEY 76
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
FG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H R+ + S+TY + I
Sbjct: 77 FGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHKRFNPQDSSTYQSTSGTLSI 135
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGD 204
YG+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+
Sbjct: 136 TYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSG 194
Query: 205 AVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYW 264
A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYW
Sbjct: 195 ATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYW 252
Query: 265 QFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSA 318
Q + I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 253 QISVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSC 311
Query: 319 ECKLVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 312 SA---ISSLPDIVFTI 324
>gi|397526910|ref|XP_003833357.1| PREDICTED: gastricsin [Pan paniscus]
Length = 388
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 168/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++R GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLEATGAQ 299
>gi|346322842|gb|EGX92440.1| vacuolar protease A precursor [Cordyceps militaris CM01]
Length = 395
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 183/309 (59%), Gaps = 15/309 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKK----RRLDLHSLNAARITRKERYMGGAGVSGV----RH 65
++A+ +L ++ G+ ++ L+K +L S A ++Y+G S
Sbjct: 5 LIAAAVLAGSAHAGIHKMKLQKIPLAEQLVGASFEAQAQQLGQKYLGARPASRADIIFNA 64
Query: 66 RLGDSDE-DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
++ +S ++P+ NF +AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+
Sbjct: 65 KVAESKNGHLVPVSNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFL 123
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
HS Y S S+TY + G EI+YGSGS++G+ S D V +GD+ +K+ F EAT E L F
Sbjct: 124 HSTYDSSSSSTYKKNGSDFEIHYGSGSLTGYVSNDVVRIGDLTIKNTDFAEATNEPGLAF 183
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
RFDGI+GLG+ I+V VP + M++Q L+ E VF+F+L + EEG E VFGGV
Sbjct: 184 AFGRFDGILGLGYDTISVNHMVPPFYQMIKQKLLDEPVFAFYLGSE---EEGSEAVFGGV 240
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D H++GK Y+P+ +K YW+ + I G + + G I+D+GTSL P+ +
Sbjct: 241 DKNHYEGKIEYLPLRRKAYWEVDFDAIAFGKEVAELENTGV--ILDTGTSLNTLPSDLAE 298
Query: 305 EINHAIGGE 313
+N IG +
Sbjct: 299 LLNKEIGAK 307
>gi|380865655|gb|AFF19538.1| pepsin F, partial [Camelus dromedarius]
Length = 354
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 171/287 (59%), Gaps = 9/287 (3%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED-----ILPLKNFMDAQYFGEIGIG 92
+ L + R +E+ M + +RL D+ PL+N++D Y +I IG
Sbjct: 16 IPLTKVKPMRENLREKNMLKDFLEQYTYRLSDNTAPAKRVYTQPLRNYLDLVYIADISIG 75
Query: 93 SPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSI 152
+PPQNF V+FDTGS+NLWVPS C S +C HS + +S T++ G+S EI YG+G I
Sbjct: 76 TPPQNFKVVFDTGSANLWVPSIYCD-SKACANHSVFNPPRSTTFSLEGRSFEITYGTGKI 134
Query: 153 SGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNM 212
+GF D V +G++V+ Q F + +E + A FDGI+GLG+ +++ PV+DN+
Sbjct: 135 AGFLGYDTVRIGNLVIGSQAFGMSQKEPGIFLEHAVFDGILGLGYPALSIVGTTPVFDNL 194
Query: 213 VEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDIL 272
+Q L+ E +F+F+L+ E G ++FGG+D ++KG+ +VPV+++ YWQ + I
Sbjct: 195 KKQRLLKEPIFAFYLST--KKENGSVVMFGGLDHSYYKGELKWVPVSQRLYWQISMDSIT 252
Query: 273 IGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+ + G C+GGC AIVD+GT++L GPT VVT I AI + E
Sbjct: 253 MNGKILG-CKGGCQAIVDTGTAVLVGPTNVVTNIQKAINARPLTGYE 298
>gi|385301236|gb|EIF45441.1| proteinase a [Dekkera bruxellensis AWRI1499]
Length = 429
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/238 (44%), Positives = 148/238 (62%), Gaps = 3/238 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+M+AQYF EI +G+P Q F VI DTGSSNLWVPSS C S++CY H++Y +S+T
Sbjct: 107 PLTNYMNAQYFSEIELGTPGQKFKVILDTGSSNLWVPSSDCA-SLACYLHTKYDHEQSST 165
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YGSGS+ G+ SQD +++ D+ + +Q F EAT E L F +FDGI+GL
Sbjct: 166 YKKNGSEFSIQYGSGSMKGYISQDTLKISDLEITNQDFAEATEEPGLAFAFGKFDGILGL 225
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ I+V VP N + GL+ FSF+L E+GG FGG+D F GK T+
Sbjct: 226 GYDTISVNHIVPPVYNAINSGLLDNPQFSFYLGDTSKTEDGGVCTFGGIDDSKFTGKITW 285
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + +N IG E
Sbjct: 286 LPVRRKAYWEVKFEGIGLGDEYAELQSHGAA--IDTGTSLIVLPSQLAEILNSEIGAE 341
>gi|444724642|gb|ELW65241.1| Chymosin [Tupaia chinensis]
Length = 381
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 103/237 (43%), Positives = 152/237 (64%), Gaps = 5/237 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D QYFG+I IG+PPQ F+V+FDTGSS+LWVPS C S +C H R+ KS+T
Sbjct: 65 PLTNYLDTQYFGKITIGTPPQEFTVVFDTGSSDLWVPSVYCD-SAACQNHQRFDPSKSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + K I YG+GS+ GF D V V D+V Q +T+E F A FDGI+GL
Sbjct: 124 FQNLDKPLSIQYGTGSMQGFLGYDTVTVSDIVDTHQTVGLSTQEPGNVFTYAEFDGILGL 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +A +VPV+DNM+++ LV++++FS +++R+ ++G + G +D ++ G +
Sbjct: 184 AYPSLAAEYSVPVFDNMMQKHLVAKDLFSVYMSRN---DQGSMLTLGAIDSSYYTGSLHW 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
VPVT + YWQF + + I N C+GGC AI+D+GTSL+AGP+ + I AIG
Sbjct: 241 VPVTMQDYWQFTMDSVTI-NGVVVACDGGCQAILDTGTSLVAGPSSDILNIQQAIGA 296
>gi|29244579|ref|NP_080249.2| gastricsin precursor [Mus musculus]
gi|73921722|sp|Q9D7R7.1|PEPC_MOUSE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|12843461|dbj|BAB25990.1| unnamed protein product [Mus musculus]
gi|68534888|gb|AAH99409.1| Progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 117/304 (38%), Positives = 173/304 (56%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ LKK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPLKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
>gi|50978660|ref|NP_001003028.1| pepsin B precursor [Canis lupus familiaris]
gi|73621387|sp|Q8SQ41.1|PEPB_CANFA RecName: Full=Pepsin B; Flags: Precursor
gi|19911571|dbj|BAB86888.1| pepsinogen B [Canis lupus familiaris]
Length = 390
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 111/297 (37%), Positives = 171/297 (57%), Gaps = 6/297 (2%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL-P 76
CL L S G+ RI LKK + + + R + V L ++D P
Sbjct: 10 CLHL---SEGVERIILKKGK-SIRQVMEERGVLETFLRNHPKVDPAAKYLFNNDAVAYEP 65
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
N++D+ YFGEI IG+PPQNF ++FDTGSSNLWVPS+ C S +C H+R+ +S+TY
Sbjct: 66 FTNYLDSYYFGEISIGTPPQNFLILFDTGSSNLWVPSTYCQ-SQACSNHNRFNPSRSSTY 124
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ + YG GS++ D V V ++V+ +Q+F + E + F + FDGI+G+
Sbjct: 125 QSSEQTYTLAYGFGSLTVLLGYDTVTVQNIVIHNQLFGMSENEPNYPFYYSYFDGILGMA 184
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
+ +AV + V NM++QG +++ +FSF+ + P E GGE++ GGVD + + G+ +
Sbjct: 185 YSNLAVDNGPTVLQNMMQQGQLTQPIFSFYFSPQPTYEYGGELILGGVDTQFYSGEIVWA 244
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
PVT++ YWQ + + LIGNQ+TG+C GC IVD+GT L P + A G +
Sbjct: 245 PVTREMYWQVAIDEFLIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFVKATGAQ 301
>gi|459426|emb|CAA54478.1| aspartic protease [Brassica oleracea]
Length = 292
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 100/147 (68%), Positives = 116/147 (78%), Gaps = 2/147 (1%)
Query: 217 LVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQ 276
LVSE FSFWLNR+ D EEGGE+VFGGVDPKHFKG+H YVPVT+KGYWQF++GD+LIG
Sbjct: 2 LVSE--FSFWLNRNADDEEGGELVFGGVDPKHFKGQHIYVPVTQKGYWQFDMGDVLIGGA 59
Query: 277 STGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLIWDLLV 336
TG CE GC+AI DSGTSLLAGPT ++T INHAIG GV S +CK VV QYG I DLL+
Sbjct: 60 PTGYCESGCSAIADSGTSLLAGPTTIITMINHAIGASGVASQQCKTVVDQYGQTILDLLL 119
Query: 337 SGLLPEKVCQQIGLCAFNGAEYVRLGI 363
S P+K+C QIGLC F+G V +GI
Sbjct: 120 SETQPKKICSQIGLCTFDGKRGVSMGI 146
>gi|444513055|gb|ELV10247.1| Pepsin A [Tupaia chinensis]
Length = 396
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 174/313 (55%), Gaps = 31/313 (9%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL + L H+LN A +Y + V + PL+N++D +YFG IGI
Sbjct: 30 GLLEEYLKKHTLNPAS-----KYFPKEAATMVSTQ---------PLENYLDMEYFGTIGI 75
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+P Q F+VIFDTGSSNLWVPS C S +C H+R+ ++S+TY ++ I YG+GS
Sbjct: 76 GTPAQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYQATSQTVSIAYGTGS 134
Query: 152 ISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
++G D V+VG + +Q+F + T GS + + FDGI+GL + IA A PV+D
Sbjct: 135 MTGILGYDTVQVGGIADTNQIFGLSETEPGSFLYY-SPFDGILGLAYPNIASSGATPVFD 193
Query: 211 NMVEQGLVSEEVFSFWLNR--DPDA-----------EEGGEIVFGGVDPKHFKGKHTYVP 257
NM QGLVS+++FS +L+ PD E G ++FGG+D ++ G +VP
Sbjct: 194 NMWNQGLVSQDLFSVYLSSMGTPDILTSCITFHSNDESGSVVIFGGIDSSYYTGSLNWVP 253
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVS 317
++ +GYWQ + I + Q C G C AIVD+GTSLL+GPT + I IG +
Sbjct: 254 LSAEGYWQITVDSITMNGQPIA-CSGSCQAIVDTGTSLLSGPTNAIANIQSYIGASQNSN 312
Query: 318 AECKLVVSQYGDL 330
E + S +L
Sbjct: 313 GEMVISCSAINNL 325
>gi|4589842|dbj|BAA76892.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 173/312 (55%), Gaps = 13/312 (4%)
Query: 3 QKLLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSG 62
++L+ ++ CL + L +P R +K+ + LH A Y + +
Sbjct: 2 KRLILTMLCLHLCEGILRVPLKKGKSIREAMKESGV-LHDYLANHRHYDPAYKFFSNFAT 60
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C S +C
Sbjct: 61 AYE----------PLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQAC 109
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+ + +S+T++ + + YGSGS++G F D V + + + +Q F + E
Sbjct: 110 ANHNEFDPNESSTFSTQDEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGT 169
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
+FL + FDGI+GL F I+ G A V M+++ L+ VFSF+L+ + +GGE+VFG
Sbjct: 170 SFLYSPFDGILGLAFPSISAGGATTVMQKMLQENLLDFPVFSFYLSGQ-EGSQGGELVFG 228
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP + G+ T+ PVT+ YWQ + D +G QS+G C GC IVD+GTSLL P V
Sbjct: 229 GVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQV 288
Query: 303 VTEINHAIGGEG 314
TE+ IG +
Sbjct: 289 FTELMQYIGAQA 300
>gi|335287195|ref|XP_003355296.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 391
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 179/313 (57%), Gaps = 9/313 (2%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
CL L S G+ RI L+K + ++ + K ++ + P
Sbjct: 10 CLYL---SEGMERIILRKGKSIREAMEEQGVLEKFLKNRPKIDPAAKYHFNNDAVAYEPF 66
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
N++D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C + +C H R+ +S+T+
Sbjct: 67 TNYLDSFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-TQACSDHRRFNPDQSSTFR 125
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
G++ ++YGSGS+S D V V ++V+ +Q F + E S F + FDGI+G+ +
Sbjct: 126 INGQTYTLSYGSGSLSVVLGYDTVTVQNIVIDNQEFGLSESEPSDPFYYSYFDGILGMAY 185
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
+AVG++ V +M++Q +++ +FSF+ +R P E GGE++ GGVD + + G+ + P
Sbjct: 186 PNMAVGNSPTVMQSMLQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDTQLYSGQIVWTP 245
Query: 258 VTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
VT++ YWQ + + IG+Q+TG C GC AIVD+GT LLA P + A G +
Sbjct: 246 VTRELYWQIAIQEFAIGDQATGWCFSQGCQAIVDTGTFLLAVPQQYLASFLQATGAQEAQ 305
Query: 314 -GVVSAECKLVVS 325
G +C LV S
Sbjct: 306 NGDFVVDCDLVQS 318
>gi|195339961|ref|XP_002036585.1| GM18746 [Drosophila sechellia]
gi|194130465|gb|EDW52508.1| GM18746 [Drosophila sechellia]
Length = 392
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/238 (45%), Positives = 149/238 (62%), Gaps = 8/238 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L+N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+T
Sbjct: 68 LQNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSASCPASNTACQRHNKYNSAASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SGF S D V + + ++DQ F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSTDTVTIAGISIQDQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNMV QGL+ E V SF+L R A GGE++ GG+D ++G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
VPV+ YWQF + I ++ G+ C GC AI D+GTSL+A P +IN +G
Sbjct: 248 VPVSVPAYWQFTVNTI----KTNGILLCN-GCQAIADTGTSLIAVPLAAYRKINRQLG 300
>gi|348514690|ref|XP_003444873.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 377
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/238 (44%), Positives = 154/238 (64%), Gaps = 7/238 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H+++ +S+T+
Sbjct: 62 MTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSSTF 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGL 195
+S I YG+GS++GF D VEVG + V +QVF + T +T++ A DGI+GL
Sbjct: 121 QWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILGL 178
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++ M+ +GLVSE +FS +L+ ++E+G E+VFGG D H+ G T+
Sbjct: 179 AFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSG--NSEQGSEVVFGGTDSTHYTGTITW 236
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 237 IPLSSATYWQINMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS 293
>gi|448115983|ref|XP_004202951.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359383819|emb|CCE79735.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 107/250 (42%), Positives = 157/250 (62%), Gaps = 8/250 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL ++++AQY+ IG+GSP Q F VI DTGSSNLWVPS+ C S++C+ HS+Y +S++
Sbjct: 91 PLVDYLNAQYYTTIGLGSPAQEFKVILDTGSSNLWVPSTDCS-SLACFLHSKYYHDESSS 149
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YG+GS+ G+ SQD + + + ++ Q F EAT E LTF A+FDGI+GL
Sbjct: 150 YKQNGSDFSIQYGTGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLTFAFAKFDGILGL 209
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V + VP N ++QGL+ E F+F+L ++D D EGG FGGVD KH+KG
Sbjct: 210 AYDSISVDNIVPPIYNAIDQGLLDEPKFAFYLGDKDKDENEGGVATFGGVDTKHYKGDII 269
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+PV +K YW+ I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 ELPVRRKAYWEVSFDGIGLGDEYAELTSTGAA--IDTGTSLITLPSSLAEIINAKIGAKK 327
Query: 314 ---GVVSAEC 320
G S +C
Sbjct: 328 SWSGQYSVDC 337
>gi|50306705|ref|XP_453326.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642460|emb|CAH00422.1| KLLA0D05929p [Kluyveromyces lactis]
Length = 409
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/251 (43%), Positives = 161/251 (64%), Gaps = 7/251 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF EI +GSPPQ+F VI DTGSSNLWVPS++C S++C+ H++Y S+
Sbjct: 86 VPLTNYLNAQYFTEITLGSPPQSFKVILDTGSSNLWVPSAEC-GSLACFLHTKYDHEASS 144
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSGS+ G+ S+D + +GD+V+ DQ F EAT E L F +FDGI+G
Sbjct: 145 TYKANGSEFAIQYGSGSLEGYVSRDLLTIGDLVIPDQDFAEATSEPGLAFAFGKFDGILG 204
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP N ++ L+ + VF+F+L +E+GGE FGG+D + + G+ T
Sbjct: 205 LAYDSISVNRIVPPVYNAIKNKLLDDPVFAFYLGDSDKSEDGGEASFGGIDEEKYTGEIT 264
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
++PV +K YW+ + I +G + EG AAI D+GTSL+A P+ + +N IG +
Sbjct: 265 WLPVRRKAYWEVKFEGIGLGEE-YATLEGHGAAI-DTGTSLIALPSGLAEILNAEIGAKK 322
Query: 314 ---GVVSAECK 321
G S +C+
Sbjct: 323 GWSGQYSVDCE 333
>gi|348578169|ref|XP_003474856.1| PREDICTED: renin-like [Cavia porcellus]
Length = 404
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 184/312 (58%), Gaps = 19/312 (6%)
Query: 9 VFCLWVLASCLL-LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRL 67
+ LW SC LP + RRI LKK + + R + KER + A +S +
Sbjct: 13 LLVLW--GSCTFSLPMDTAAFRRIILKK-------MPSIRDSLKERGVDMARLSAKWGQF 63
Query: 68 GDS---DEDILP--LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSIS 121
S D P L N++D QY+GEIGIG+PPQ F VIFDTGS+NLWVPS+KC +
Sbjct: 64 SKSLSLDNSTFPVVLTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTA 123
Query: 122 CYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS 181
C HS Y S +S++Y E G I YGSG + GF SQD V VG + V Q F E T
Sbjct: 124 CEIHSLYDSSESSSYMENGTEFTIRYGSGKVKGFLSQDVVTVGGITVT-QTFGEVTELPL 182
Query: 182 LTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVF 241
+ F+LA+FDG++G+GF AVG PV+D+++ Q ++ E+VFS + +R+ G ++
Sbjct: 183 IPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQRVLKEDVFSVYYSRNSHLLGGELLLG 242
Query: 242 GGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTP 301
G DP+H++G YV ++K G WQ + + +G+ +T +CE GC A+VD+G S ++GPT
Sbjct: 243 GN-DPQHYQGNFHYVRISKTGSWQIMMKGVSVGS-ATLLCEEGCMAVVDTGASYISGPTS 300
Query: 302 VVTEINHAIGGE 313
+ I A+G +
Sbjct: 301 SLRLIMEALGAK 312
>gi|112950081|gb|ABI26643.1| aspartic proteinase [Cucumis sativus]
Length = 399
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/322 (37%), Positives = 180/322 (55%), Gaps = 25/322 (7%)
Query: 14 VLASCLLLPASSNGLRRIGLKKR---RLDLHSLNAARITRKERYMGGA---GVSGVRHRL 67
V+A ++ +++ + RI L+++ +L +++ AA++ + +Y + G SG +L
Sbjct: 4 VIAFLAIVALAASEMHRIPLQRQENFKLTKNNIQAAKVHLRNKYNVKSNLLGRSGTTEQL 63
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHS 126
+ ++Y+G IGIG+P Q F+V+FD+GSSNLWVPS+KC S +C H
Sbjct: 64 TQGQ---------LTSEYYGTIGIGTPAQEFTVVFDSGSSNLWVPSAKCSSSDQACKNH- 113
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
S S+TY G+ I YG+GS++GF S D V V + ++ Q F EAT E TF+
Sbjct: 114 --NSAASSTYVPNGEQFSIQYGTGSLTGFLSTDTVTVNGLTIQSQTFAEATNEPGSTFVD 171
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
+ FDGI+GL + I+ + VP + NMV Q LVS VFS + R A GE++FGG D
Sbjct: 172 STFDGILGLAYETISQDNVVPPFYNMVSQSLVSNPVFSVYFGRSKAANNNGEVIFGGSDS 231
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
++G YVPVT++GYWQF + + + Q AI D+GTSLLA PT +
Sbjct: 232 TVYQGPINYVPVTQQGYWQFTMDGVYVNGQQ---VISSAQAIADTGTSLLAAPTSAFYTL 288
Query: 307 NHAIGG---EGVVSAECKLVVS 325
N AIG EG +C V S
Sbjct: 289 NEAIGATYQEGDYFVDCSSVSS 310
>gi|335955136|gb|AEH76574.1| pepsinogen [Epinephelus bruneus]
Length = 375
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 114/266 (42%), Positives = 165/266 (62%), Gaps = 15/266 (5%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLWVPS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVISIGTPPQSFTVIFDTGSSNLWVPSVYCN-SQACQNHRKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ I YG+GS++G + DNVEVG + V++QVF + E +A DGI+GL
Sbjct: 120 KWGDQPLSIQYGTGSMTGRLAIDNVEVGGITVQNQVFGISQTEAPFMAHMAA-DGILGLA 178
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+DNMV+QGLVS+ +FS +L+ D +G E+VFGG+D H+ G+ T+V
Sbjct: 179 FQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSSHGD--QGSEVVFGGIDNSHYTGQVTWV 236
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVV 316
P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 237 PLTSATYWQIKMDGVKINGQ-TVACAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS--- 292
Query: 317 SAECKLVVSQYGDLIWDLLVSGLLPE 342
+QYG+ + G +PE
Sbjct: 293 -------TNQYGESTVNCQNVGSMPE 311
>gi|297688536|ref|XP_002821738.1| PREDICTED: pepsin A-4 [Pongo abelii]
Length = 388
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 173/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKTHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|194218273|ref|XP_001501915.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 164/296 (55%), Gaps = 24/296 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL L H N A A G L+N+MD +YFG
Sbjct: 32 LRENGLLADFLKQHPRNPASKYFPREAATLAATEG--------------LENYMDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+P Q F+VIFDTGSSNLWVPS C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPAQEFTVIFDTGSSNLWVPSVYCS-SLACSDHNRFNPEDSSTYEATSESVSITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G D V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISASGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++FGG+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSSD--DESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVS 317
+ I + +S C GGC AIVD+GTSLLAGP + I IG GEG +S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDSSGEGAIS 309
>gi|296198131|ref|XP_002746573.1| PREDICTED: gastricsin [Callithrix jacchus]
gi|18203304|sp|Q9N2D3.1|PEPC_CALJA RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|7008023|dbj|BAA90872.1| pepsinogen C [Callithrix jacchus]
Length = 388
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 112/286 (39%), Positives = 165/286 (57%), Gaps = 19/286 (6%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
++ GL L H + AR ++R+ D P+ ++MDA YFG
Sbjct: 33 MKEKGLLWEFLKTHKHDPAR----------------KYRVSDLSVSYEPM-DYMDAAYFG 75
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
EI IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ S+TY+ G++ + Y
Sbjct: 76 EISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSASSTYSSNGQTFSLQY 134
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
GSGS++GFF D + V + V +Q F + E F+ A+FDGI+GL + +++G A
Sbjct: 135 GSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSMGGATT 194
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
M+++G ++ VFSF+L+ GG ++FGGVD + G+ + PVT++ YWQ
Sbjct: 195 AMQGMLQEGALTSPVFSFYLSNQ-QGSSGGAVIFGGVDSSLYTGQIYWAPVTQELYWQIG 253
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + LIG Q++G C GC AIVD+GTSLL P ++ A G +
Sbjct: 254 IEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGAQ 299
>gi|129797|sp|P03955.2|PEPC_MACFU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|38073|emb|CAA42426.1| pepsinogen C [Macaca fuscata]
Length = 377
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 9 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 67
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 68 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 126
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 127 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 186
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 187 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 245
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 246 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 288
>gi|73620983|sp|P00792.2|PEPA_BOVIN RecName: Full=Pepsin A; Flags: Precursor
gi|24415088|emb|CAD55693.1| pepsinogen A [synthetic construct]
gi|37622272|gb|AAQ95219.1| pepsinogen A [Bos taurus]
Length = 372
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 51 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 109
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 110 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 168
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 169 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 226
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 227 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 285
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 286 DSSGEVVISCSSIDSL 301
>gi|194862073|ref|XP_001969914.1| GG23678 [Drosophila erecta]
gi|190661781|gb|EDV58973.1| GG23678 [Drosophila erecta]
Length = 392
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/306 (40%), Positives = 171/306 (55%), Gaps = 21/306 (6%)
Query: 24 SSNGLRRIGLKKRR--LDLH-SLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
S+ L R+ L + R H S+ A + +Y + S GD++ L+N
Sbjct: 16 SAGKLNRVQLHRNRNFKKTHGSVKAEKTVLASKYSVVSETSFSTSSAGDTES----LQNS 71
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEI 139
M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+TY
Sbjct: 72 MNNEYYGVITIGTPQQRFNILFDTGSANLWVPSASCPASNTACQRHNKYNSTASSTYVAN 131
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G+ I YG+GS+SGF S D V + V ++DQ F EA E TF+ A F GI+GL F
Sbjct: 132 GEEFAIEYGTGSLSGFLSTDTVAIAGVTIRDQTFGEALSEPGTTFVDAPFAGILGLAFST 191
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
IA P +DNM+ QG++ E V SF+L R A GGE++ GG+D +KG TYVPV+
Sbjct: 192 IADDGVTPPFDNMISQGVLDEPVISFYLKRQGTAVLGGELILGGIDSSLYKGSLTYVPVS 251
Query: 260 KKGYWQFELGDILIGNQSTGV--CEGGCAAIVDSGTSLLAGPTPVVTEINHAI------G 311
YWQF + I ++ GV C GC AI D+GTSL+ P IN + G
Sbjct: 252 VPAYWQFTVNTI----KTNGVLLCS-GCQAIADTGTSLIVAPLAAYKRINRQLGATDNGG 306
Query: 312 GEGVVS 317
GE VS
Sbjct: 307 GEAFVS 312
>gi|301622166|ref|XP_002940408.1| PREDICTED: renin-like [Xenopus (Silurana) tropicalis]
Length = 371
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 112/266 (42%), Positives = 159/266 (59%), Gaps = 11/266 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N+MD QYFGEI IGSPPQ F V+FDTGS+NLWVPS +C +C H+RY S KS T
Sbjct: 41 LTNYMDTQYFGEISIGSPPQTFKVVFDTGSANLWVPSQRCSPLYSACVSHNRYDSTKSQT 100
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I YGSG + GF SQD V V + V QVF EAT + F+ ARFDG++G+
Sbjct: 101 YMENGAGFSIQYGSGGVKGFLSQDVVVVAGIPVI-QVFAEATALPAFPFIFARFDGVLGM 159
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN---RDPDAEEGGEIVFGGVDPKHFKGK 252
GF A+ PV+D ++ + ++ E+VFS + + RD + GGEI+ GG DP ++ G
Sbjct: 160 GFPGQAIDGITPVFDRIISEQVLQEDVFSVYYSRSYRDSHLKPGGEIILGGSDPSYYTGS 219
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG- 311
Y+ + K+GYW + + IG + C+ GC+ +D+G + + GP V+ + AIG
Sbjct: 220 FQYLNLEKEGYWHIRMKGVSIGAEIL-FCKDGCSVAIDTGAAYITGPASSVSVLMKAIGA 278
Query: 312 ---GEGVVSAECKLVVSQYGDLIWDL 334
EG + +C +SQ D+ + +
Sbjct: 279 TELAEGEYTVDCDK-ISQLPDVSFHM 303
>gi|374431137|gb|AEZ51819.1| pepsin, partial [Oreochromis niloticus]
Length = 339
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 155/236 (65%), Gaps = 7/236 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLWVPS C S +C H+++ +S+T+
Sbjct: 24 MTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSSTF 82
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGS-LTFLLARFDGIIGL 195
+S I YG+GS++GF D VEVG + V +QVF + E S +T++ A DGI+GL
Sbjct: 83 QWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILGL 140
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV++ M+ +GLVSE +FS +L+ ++E+G E+VFGG D H+ G T+
Sbjct: 141 AFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSG--NSEQGSEVVFGGTDSTHYTGTITW 198
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 199 IPLSSATYWQINMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTTDINNLNSWVG 253
>gi|18859121|ref|NP_571879.1| nothepsin [Danio rerio]
gi|12053847|emb|CAC20112.1| nothepsin [Danio rerio]
Length = 416
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/249 (44%), Positives = 161/249 (64%), Gaps = 12/249 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
L NFMDAQ+FG+I +G P QNF+V+FDTGSS+LWVPSS C + +C H+++K+ +S+TY
Sbjct: 78 LYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC-VTQACALHNKFKAFESSTY 136
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
T G+ I+YGSG + G ++D ++VG V V++QVF EA E +F+LA+FDG++GLG
Sbjct: 137 THDGRVFGIHYGSGHLLGVMARDELKVGSVRVQNQVFGEAVYEPGFSFVLAQFDGVLGLG 196
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F ++A PV+D M+EQ ++ + VFSF+L + + GGE+VFG D F ++
Sbjct: 197 FPQLAEEKGSPVFDTMMEQNMLDQPVFSFYLTNN-GSGFGGELVFGANDESRFLPPINWI 255
Query: 257 PVTKKGYWQFEL------GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
PVT+KGYWQ +L G + ++S GC AIVD+GTSL+ GP + + I
Sbjct: 256 PVTQKGYWQIKLDAVKVQGALSFSDRSV----QGCQAIVDTGTSLIGGPARDILILQQFI 311
Query: 311 GGEGVVSAE 319
G + E
Sbjct: 312 GATPTANGE 320
>gi|400598686|gb|EJP66395.1| vacuolar protease A [Beauveria bassiana ARSEF 2860]
Length = 395
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 138/365 (37%), Positives = 193/365 (52%), Gaps = 51/365 (13%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGD--------SDEDIL--------------PL 77
+H + +I E+ +G A H+LG S DI+ P+
Sbjct: 19 IHKMKLQKIPLAEQLVG-ASFEAQAHQLGQKYLGARPASRADIMFNNQVAESKDGHPVPV 77
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
NF +AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SI+C+ HS Y S S+TY
Sbjct: 78 TNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFLHSTYDSSSSSTYK 136
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ G EI+YGSGS++GF S D V +GD+ +K+ F EAT E L F RFDGI+GLG+
Sbjct: 137 KNGSDFEIHYGSGSLTGFVSNDVVSIGDLTIKNTDFAEATSEPGLAFAFGRFDGILGLGY 196
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
I+V VP + M+ Q L+ E VF+F+L + + G E +FGGVD H++GK Y+P
Sbjct: 197 DTISVNKMVPPFYQMINQKLIDEPVFAFYLGSE---DSGSEAIFGGVDKDHYEGKIEYIP 253
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---- 313
+ +K YW+ + I G++ + G I+D+GTSL PT + +N IG +
Sbjct: 254 LRRKAYWEVDFDAIAFGDEVAELENTGV--ILDTGTSLNTLPTDLAELLNKEIGAKKGFG 311
Query: 314 GVVSAECK-----------LVVSQY----GDLIWDL---LVSGLLPEKVCQQIGLCAFNG 355
G S +CK L S Y D I +L VS P + + +G A G
Sbjct: 312 GQYSIDCKARDSLPDITFTLAGSNYTLPASDYILELGGSCVSTFTPLDMPEPVGPIAILG 371
Query: 356 AEYVR 360
++R
Sbjct: 372 DAFLR 376
>gi|151553998|gb|AAI49645.1| PGA5 protein [Bos taurus]
Length = 381
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 60 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 118
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 119 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 177
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 178 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 235
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 236 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 294
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 295 DSSGEVVISCSSIDSL 310
>gi|56971217|gb|AAH88066.1| pga5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 382
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 161/288 (55%), Gaps = 20/288 (6%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N L+R+GL L + N A S L S ++L +N+MD +Y
Sbjct: 27 NRLQRLGLLGDYLKKYPYNPA--------------SKYFPTLAQSSAEVL--QNYMDIEY 70
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
+G I IG+PPQ F+VIFDTGS+NLWVPS C S +C H+R+ ++S T+ I
Sbjct: 71 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 129
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS+SGF D ++VG++ + +Q+F + E + FDGI+GL F IA A
Sbjct: 130 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 189
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM QGL+ + +FS +L+ D + G ++FGGVD ++ G +VP+T + YWQ
Sbjct: 190 TPVFDNMWSQGLIPQNLFSVYLSS--DGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 247
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I I Q C C AIVD+GTSL+ GPT + I + IG
Sbjct: 248 ITLDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 294
>gi|355561685|gb|EHH18317.1| hypothetical protein EGK_14890 [Macaca mulatta]
gi|355748551|gb|EHH53034.1| hypothetical protein EGM_13592 [Macaca fascicularis]
Length = 388
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
>gi|292658855|ref|NP_001001600.2| pepsin A preproprotein [Bos taurus]
Length = 386
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 65 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 124 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 182
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 183 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 240
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 241 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 299
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 300 DSSGEVVISCSSIDSL 315
>gi|129780|sp|P27677.1|PEPA2_MACFU RecName: Full=Pepsin A-2/A-3; AltName: Full=Pepsin III-2/III-1;
Flags: Precursor
gi|38069|emb|CAA42427.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 179/314 (57%), Gaps = 28/314 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+ N A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNFNPAS-----KYFPQAEAPTLI--------DEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ + S+TY + I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPQDSSTYQSTSGTVSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAEC 320
+ I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 255 SVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSA 313
Query: 321 KLVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 314 ---ISSLPDIVFTI 324
>gi|296471634|tpg|DAA13749.1| TPA: pepsin A precursor [Bos taurus]
Length = 367
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 51 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 109
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 110 YEATSETLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 168
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 169 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 226
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 227 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 285
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 286 DSSGEVVISCSSIDSL 301
>gi|281183192|ref|NP_001162218.1| gastricsin precursor [Papio anubis]
gi|157939796|gb|ABW05535.1| progastricsin (predicted) [Papio anubis]
Length = 388
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPTLSVDGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ +FS +L+ D GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSALLQATGAQ 299
>gi|431910409|gb|ELK13482.1| Pepsin A [Pteropus alecto]
Length = 386
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/282 (39%), Positives = 165/282 (58%), Gaps = 21/282 (7%)
Query: 32 GLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGI 91
GL L H LN A KE S D L+N++D +YFG IGI
Sbjct: 36 GLLADYLKTHKLNPASKYLKE---------------AASFTDTETLENYLDMEYFGTIGI 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+P Q F+VIFDTGSSNLWVPS C S++CY H+ + S+T+ ++ I YG+GS
Sbjct: 81 GTPAQEFTVIFDTGSSNLWVPSVYCS-SLACYNHNVFNPEDSSTFEATSETVSITYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A PV+D
Sbjct: 140 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISASGATPVFD 198
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
N+ +QGLVS+++FS +L+ D D+ G ++FGG+D ++ G +VP++ + YWQ +
Sbjct: 199 NLWDQGLVSQDLFSVYLSSDDDS--GSVVIFGGIDSSYYSGSLNWVPLSSETYWQITVDS 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+++ ++ C C AIVD+GTSLLAGPT ++ I IG
Sbjct: 257 VILDGEAIA-CSATCQAIVDTGTSLLAGPTTAISSIQKYIGA 297
>gi|222425198|dbj|BAH20548.1| pepsinogen A-36 [Pongo abelii]
Length = 388
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 173/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++ I Y
Sbjct: 79 SIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|335281744|ref|XP_003122705.2| PREDICTED: pregnancy-associated glycoprotein 2-like [Sus scrofa]
Length = 388
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 174/307 (56%), Gaps = 22/307 (7%)
Query: 12 LWVLASCLL------LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH 65
L L+ CL+ + + LR G K LD H + R E +
Sbjct: 9 LVTLSECLVTIPLRKVKSIRENLREKGFLKNFLDEHPHDMIRSRLTE------------N 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
LPL+N++D Y G I IG+PPQ FSV+FDTGSS+ WVPS C S++C H
Sbjct: 57 SAPQKKNTTLPLRNYLDVIYVGNISIGTPPQQFSVVFDTGSSDTWVPSIYCQ-SMACVTH 115
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
+ + +S T+ G E+ Y +G+++GF D ++VGD+++KDQ F + E + F
Sbjct: 116 NTFDPFQSTTFRFPGFIVELQYATGAVTGFLGYDTIQVGDLIIKDQAFAISQSEDDVVFE 175
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDGI+GL F +A+ P++D+++ Q L+++ VF+F+L+ +A+EG ++FGGVD
Sbjct: 176 NAAFDGIVGLSFPSMAIEGTTPIFDSLMNQSLIAQTVFAFYLSS--NAQEGSVVMFGGVD 233
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
K++KG +VP+++ YWQ L I I S+ C+ GC I+D+GTSLL GP V +
Sbjct: 234 KKYYKGDLKWVPLSQPHYWQIPLDKITI-RGSSAACKNGCQGILDTGTSLLMGPKNQVYK 292
Query: 306 INHAIGG 312
++ + G
Sbjct: 293 LHKRLPG 299
>gi|296474377|tpg|DAA16492.1| TPA: progastricsin (pepsinogen C) [Bos taurus]
Length = 421
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 105/250 (42%), Positives = 157/250 (62%), Gaps = 2/250 (0%)
Query: 64 RHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCY 123
++R GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C
Sbjct: 54 KYRFGDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACT 111
Query: 124 FHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLT 183
H+R+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E
Sbjct: 112 SHTRFNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTN 171
Query: 184 FLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGG 243
FL A+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGG
Sbjct: 172 FLYAKFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGG 231
Query: 244 VDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVV 303
VD + G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P +
Sbjct: 232 VDNCLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFL 291
Query: 304 TEINHAIGGE 313
+ + A G +
Sbjct: 292 SALLQATGAQ 301
>gi|149725185|ref|XP_001501907.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 167/303 (55%), Gaps = 18/303 (5%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
LR GL + L H N A + A G L+N+ D +YFG
Sbjct: 32 LRENGLLEDFLKQHPRNPASKYFPKEAATLAATEG--------------LENYKDEEYFG 77
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
I IG+PPQ F+VIFDTGSSNLWVPS+ C S++C H+R+ S+TY +S I Y
Sbjct: 78 TISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESISITY 136
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVP 207
G+GS++G + V VG + +Q+F + E S A FDGI+GL + I+ A P
Sbjct: 137 GTGSMTGVLRYNTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISSSGATP 196
Query: 208 VWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFE 267
V+DN+ +QGLVS+++FS +L+ D E G ++F G+D ++ G +VPV+++ YWQ
Sbjct: 197 VFDNIWDQGLVSQDLFSVYLSS--DDESGSMVIFSGIDSSYYSGSLCWVPVSEEAYWQIT 254
Query: 268 LGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQY 327
+ I + +S C GGC AIVD+GTSLLAGP + I IG S+E + S
Sbjct: 255 VDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDYSSEAVISCSSI 313
Query: 328 GDL 330
L
Sbjct: 314 DSL 316
>gi|327271277|ref|XP_003220414.1| PREDICTED: renin-like [Anolis carolinensis]
Length = 398
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 178/320 (55%), Gaps = 13/320 (4%)
Query: 13 WVLA---SCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMG-GAGVSGVRHRLG 68
WV A SC L SS+ +RI LKK +L I + + G+ +
Sbjct: 5 WVFAVVTSCFL-SFSSDAFQRIPLKKMPSIRETLQKMGIKVADFFPSLKHGIYFLNDGFY 63
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSR 127
+ + L N++D QY+GEI IG+P Q F V+FDTGS+NLWVPS +C +C H+R
Sbjct: 64 NGTAPTI-LTNYLDMQYYGEISIGTPAQIFKVVFDTGSANLWVPSQQCSPLYSACVSHNR 122
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S +S+TY G I YG G + GF SQD V V D+ V Q+F EA + F+ A
Sbjct: 123 YDSSRSSTYKPNGTEIAIQYGQGYVKGFLSQDIVRVADIPVV-QLFAEAIALPNKPFIYA 181
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
RFDG++G+G+ A+ +PV+D ++ + ++SEEVFS + +R+ + GGEI+ GG DP
Sbjct: 182 RFDGVLGMGYPSQAIDGVIPVFDKIISERVLSEEVFSVYYSRNSEMNTGGEIILGGSDPS 241
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
++ G YV ++ GYW +L + +G++ C GC A VD+G+S + GP V+ +
Sbjct: 242 YYTGDFHYVSISTPGYWHIDLKGVSLGSEML-FCHEGCTAAVDTGSSFITGPASAVSILM 300
Query: 308 HAIGG----EGVVSAECKLV 323
+IG E ECK +
Sbjct: 301 KSIGATLLEERDYVVECKKI 320
>gi|340506705|gb|EGR32788.1| hypothetical protein IMG5_070700 [Ichthyophthirius multifiliis]
Length = 389
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 102/259 (39%), Positives = 168/259 (64%), Gaps = 7/259 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNT 135
+ NFMDAQY+GE+ IG+PPQ+F VIFDTGSSNLWVPSS+C SI+C H+RY KS+T
Sbjct: 68 INNFMDAQYYGEVQIGTPPQSFQVIFDTGSSNLWVPSSECGILSIACRLHTRYDKTKSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G +I YGSG +SG ++Q+ + +G + ++ EAT L+FL+++FDGI+GL
Sbjct: 128 YGKNGTHFDIKYGSGGVSGHWTQETIILGGLTAQNVTIGEATSMKGLSFLVSKFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ +I+V +A PV+ ++EQG V + F+F+L + +EG ++ GG DP++ Y
Sbjct: 188 AYPKISVDNATPVFMKLIEQGKVQDGSFAFFLT-NKAGQEGSRLILGGFDPQYAATPFKY 246
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
PV+ + +W ++ + +GN + + + AIVD+GTS++ GP V+ E+ + +G
Sbjct: 247 YPVSLEAWWVIDVDRVALGNTTYQIQK----AIVDTGTSVMVGPKSVIEEMKKQLPNQGK 302
Query: 316 VSAECKLVVSQYGDLIWDL 334
+C +S++ +L +++
Sbjct: 303 QKVDCS-TISEFPNLTFNI 320
>gi|195114666|ref|XP_002001888.1| GI14567 [Drosophila mojavensis]
gi|193912463|gb|EDW11330.1| GI14567 [Drosophila mojavensis]
Length = 402
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 115/262 (43%), Positives = 161/262 (61%), Gaps = 8/262 (3%)
Query: 69 DSDEDIL-PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHS 126
DS+E ++ L N + Y+G IGIG+PPQ F+V+FDTGSSNLWVPS +C + ++C H+
Sbjct: 73 DSNEYVIETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSSNLWVPSVQCLSTDVACQNHN 132
Query: 127 RYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLL 186
+Y S S+TY G+S I YG+GS++GF S D V + + + Q F EA + + +F
Sbjct: 133 QYNSSASSTYVPNGESFSIQYGTGSLTGFLSTDTVTINGLSIASQTFGEAISQPNGSFTG 192
Query: 187 ARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDP 246
FDGI+G+G+ IAV + VP + N+ EQ L+ E F F+L RD A+ GG++V GG+D
Sbjct: 193 VPFDGILGMGYMSIAVDNVVPPFYNLYEQRLIDEPTFGFYLARDGSAQAGGQLVLGGIDS 252
Query: 247 KHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEI 306
+ F G TYV V ++GYWQF + +G VC C AI D+GTSLLA P T +
Sbjct: 253 QLFSGNLTYVSVVQQGYWQFVVNSAEMGGYV--VCY-NCQAIADTGTSLLACPGSAYTML 309
Query: 307 NHAIGG---EGVVSAECKLVVS 325
N IGG +G +C V S
Sbjct: 310 NQLIGGYLMDGDYYVDCSTVSS 331
>gi|124514108|gb|ABN13683.1| preprochymosin [Capra hircus]
Length = 381
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSHGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVANV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDTQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R+ +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRHLVAQDLFSVYMDRN---GQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
>gi|426353119|ref|XP_004044046.1| PREDICTED: gastricsin [Gorilla gorilla gorilla]
Length = 388
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 167/283 (59%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGV------SGVRHRLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + G + ++ GD P+ +MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVTYEPMA-YMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+ G++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V +Q F + E F+ A+FDGI+GL + ++V +A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDNSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 299
>gi|440893605|gb|ELR46308.1| Pepsin A, partial [Bos grunniens mutus]
Length = 388
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 110/256 (42%), Positives = 160/256 (62%), Gaps = 6/256 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 67 PLQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 126 YEATSETLSITYGTGSMTGVLGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 184
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + E G ++FG +D ++ G
Sbjct: 185 LAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSS--NEESGSVVIFGDIDSSYYSGSLN 242
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 243 WVPVSVEGYWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASE 301
Query: 315 VVSAECKLVVSQYGDL 330
S E + S L
Sbjct: 302 DSSGEVVISCSSIDSL 317
>gi|147905812|ref|NP_001079036.1| gastricsin precursor [Xenopus laevis]
gi|12082174|dbj|BAB20797.1| pepsinogen C [Xenopus laevis]
gi|213625030|gb|AAI69665.1| Pepsinogen C [Xenopus laevis]
gi|213626584|gb|AAI69663.1| Pepsinogen C [Xenopus laevis]
Length = 383
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 113/296 (38%), Positives = 165/296 (55%), Gaps = 12/296 (4%)
Query: 18 CLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPL 77
CL L S G+ R+ LKK + + R +E + V PL
Sbjct: 10 CLQL---SEGIIRVPLKKFK-------SMREVMRENGIKAPLVDPATKYYNQYATAYEPL 59
Query: 78 KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYT 137
N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C H + +S+TY+
Sbjct: 60 SNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCQ-SQACTNHPLFNPSQSSTYS 118
Query: 138 EIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGF 197
+ + YG+GS++G D V + +V + Q F + E F+ A+FDGI+GL +
Sbjct: 119 SNQQQFSLQYGTGSLTGILGYDTVTIQNVAISQQEFGLSETEPGTNFVYAQFDGILGLAY 178
Query: 198 REIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVP 257
IAVG A V M++Q L+++ +F F+L+ ++ GGE+ FGGVD ++ G+ + P
Sbjct: 179 PSIAVGGATTVMQGMMQQNLLNQPIFGFYLSGQ-SSQNGGEVAFGGVDQNYYTGQIYWTP 237
Query: 258 VTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
VT + YWQ + I Q+TG C GC AIVD+GTSLL P V + + +IG +
Sbjct: 238 VTSETYWQIGIQGFSINGQATGWCSQGCQAIVDTGTSLLTAPQSVFSSLIQSIGAQ 293
>gi|38640718|gb|AAR25994.1| prochymosin [Capra hircus]
Length = 381
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSHGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVASV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDTQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R+ +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRRLVAQDLFSVYMDRN---GQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
>gi|71021685|ref|XP_761073.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
gi|46100637|gb|EAK85870.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
Length = 418
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 109/253 (43%), Positives = 155/253 (61%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q+F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQDFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M+ QGL+ SF+L E+GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGS--SEEDGGEAVFGGIDDSHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WSPVKRKGYWEVALDKLALGDEELELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|129776|sp|P03954.2|PEPA1_MACFU RecName: Full=Pepsin A-1; AltName: Full=Pepsin III-3; Flags:
Precursor
gi|38075|emb|CAA42424.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 180/314 (57%), Gaps = 28/314 (8%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y A + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAS-----KYFPQAEAPTLI--------DEQPLENYLDVEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + + S+TY + I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPQDSSTYQSTSGTLSITY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ +QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPV+ +GYWQ
Sbjct: 197 PVFDNIWDQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG------GEGVVSAEC 320
+ I + ++ C GC AIVD+GTSLL GPT + I IG GE VVS
Sbjct: 255 SVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSA 313
Query: 321 KLVVSQYGDLIWDL 334
+S D+++ +
Sbjct: 314 ---ISSLPDIVFTI 324
>gi|395534129|ref|XP_003769100.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Sarcophilus
harrisii]
Length = 391
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 170/294 (57%), Gaps = 13/294 (4%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKER-----YMGGAGVSGVRHRLGDSDEDILPLKN 79
S G RI LKK + + R T KE+ ++ ++ L L +
Sbjct: 14 SEGFFRIPLKKGK-------SIRDTMKEKGVLEDFLKTHKYDPAKNYHFKDFSVALHLPS 66
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
++DA Y+GEI IG+PPQNF V+FDTG SNLWVPS C S +C H+++ +S+TY+
Sbjct: 67 YLDAAYYGEISIGTPPQNFLVLFDTGFSNLWVPSIYCQ-SQACSGHAQFSPSQSSTYSTN 125
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +QVF + E F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTITVQGIKVPNQVFGLSENEPGTNFVHAQFDGIMGMAYPA 185
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
+AVG A M++Q +++ +FSF+L + GGE++FGGVD + G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNILTNPIFSFYLGNQQSSXNGGEVIFGGVDNNLYTGQIYWAPVT 245
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + IG Q+TG C GC AIVD+GTSLL P ++ A G +
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGAQ 299
>gi|355329699|dbj|BAL14143.1| pepsinogen 2 [Pagrus major]
Length = 377
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 105/253 (41%), Positives = 162/253 (64%), Gaps = 9/253 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G + IG+PPQ+F+VIFDTGSSNLW+PS C S +C H ++ ++S+T+
Sbjct: 62 MTNDADLSYYGVVSIGTPPQSFTVIFDTGSSNLWIPSVYCN-SQACQNHKKFNPQQSSTF 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
++ I YG+GS++G+ + D VEVG + V +QVF + E + +A DGI+GL
Sbjct: 121 KWGNEALSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISQTEAAFMASMAA-DGILGLA 179
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+DNM++QGLVS+ +FS +L+ ++E+G E+VFGG D H+ G+ T++
Sbjct: 180 FQSIASDNVVPVFDNMIKQGLVSQPMFSVYLSG--NSEQGSEVVFGGTDSNHYTGQITWI 237
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 238 PLSSATYWQISMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQ 296
Query: 314 -GVVSAECKLVVS 325
G + C+ + S
Sbjct: 297 YGEATVNCQNIQS 309
>gi|114637856|ref|XP_001145457.1| PREDICTED: pepsin A-5 isoform 6 [Pan troglodytes]
Length = 388
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 173/303 (57%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN A +Y + D PL+N++D +YFG
Sbjct: 32 LSERGLLKDFLKKHNLNPAS-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+R+ S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + +Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|326933881|ref|XP_003213026.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 389
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 174/316 (55%), Gaps = 32/316 (10%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGV------RHR 66
W++ + L L GL R+ LKK + + R KE SGV HR
Sbjct: 3 WLIFTVLCLHLC-EGLLRVPLKKGK-------SIREVMKE--------SGVLHDYLANHR 46
Query: 67 LGDSDEDIL--------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
D PL N MD Y+GEI IG+PPQNF V+FDTGSSNLWVPS+ C
Sbjct: 47 YYDPAYKFFSNFATAYEPLANSMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ- 105
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+ + +S+T++ + + YGSGS++G F D V + + + +Q F +
Sbjct: 106 SQACANHNEFNPNESSTFSTQNEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSET 165
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E FL + FDGI+GL F I+ G A V M+++ L+ +FSF+L+ + +GGE
Sbjct: 166 EPGTNFLYSPFDGILGLAFPAISAGGATTVMQQMLQENLLDSPIFSFYLSGQ-EGSQGGE 224
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAG 298
++FGGV+P + G+ ++ PVT+ YWQ + D +G QS+G C GC AIVD+GTSLL
Sbjct: 225 LIFGGVNPNLYTGQISWTPVTQTTYWQIGIEDFTVGGQSSGWCSQGCQAIVDTGTSLLTV 284
Query: 299 PTPVVTEINHAIGGEG 314
P V +E+ IG +
Sbjct: 285 PNQVFSELMQYIGAQA 300
>gi|351707910|gb|EHB10829.1| Gastricsin [Heterocephalus glaber]
Length = 391
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 178/308 (57%), Gaps = 15/308 (4%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH----RLG 68
W++ + L LP +I LKK + + R T +++ + G + + +L
Sbjct: 3 WMVVALLCLPLLEATKLKIPLKKFK-------SIRETMRDKGLLGDFLKTHKQDHIRKLS 55
Query: 69 DSDEDILPL---KNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
++ + L +++DA YFGEI +G+PPQ+F V+FDTGSSNLWVPS C S++C H
Sbjct: 56 NNFDHFSVLFEPMSYLDAAYFGEISLGTPPQSFQVLFDTGSSNLWVPSVYCQ-SLACTTH 114
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
R+ KS+TYT G+S + YGSGS++G F D + + V Q F + +E TF+
Sbjct: 115 PRFNPSKSSTYTSTGQSFSLQYGSGSLTGVFGYDTMTIQGTQVPKQEFGLSEQEPGTTFV 174
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A+FDGI+GLG+ +A G A ++ +G +S+ +FS +L + +GG ++ GGVD
Sbjct: 175 YAQFDGIMGLGYPGLAAGGATTALQGLIREGALSQPLFSVYLGSQQGSSDGGALILGGVD 234
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
+ G+ ++ PVT++ YWQ + D+ + NQ+ G C GC IVD+GTSLL P +T
Sbjct: 235 ESLYNGQISWTPVTQELYWQIGIEDVQLDNQALGWCSQGCQGIVDTGTSLLTLPQQYLTT 294
Query: 306 INHAIGGE 313
+ AIG +
Sbjct: 295 LIQAIGAQ 302
>gi|169731523|gb|ACA64894.1| progastricsin (predicted) [Callicebus moloch]
Length = 388
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 114/283 (40%), Positives = 164/283 (57%), Gaps = 9/283 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRH------RLGDSDEDILPLKNFMDAQYFGEIG 90
++ L + R T KE+ + + +H D P+ ++MDA YFGEI
Sbjct: 20 KVPLKKFKSIRETMKEKGLLREFLKTHKHDPAWKYHFSDLRVSYEPM-DYMDAAYFGEIS 78
Query: 91 IGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSG 150
IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ KS+TY+ ++ + YGSG
Sbjct: 79 IGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSSNEQTFSLQYGSG 137
Query: 151 SISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWD 210
S++GFF D + V + V Q F + E F+ A+FDGI+GL + ++VG A
Sbjct: 138 SLTGFFGYDTLTVQSIQVPKQEFGLSENEPGTNFIYAKFDGIMGLAYPALSVGGATTAMQ 197
Query: 211 NMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGD 270
M+++G ++ VFSF+L+ GG +VFGGVD + G+ + PVT++ YWQ + +
Sbjct: 198 GMLQEGALTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEE 256
Query: 271 ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
LIG Q++G C GC AIVD+GTSLL P ++ A G E
Sbjct: 257 FLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSAFLEATGAE 299
>gi|57526769|ref|NP_001009804.1| chymosin precursor [Ovis aries]
gi|116405|sp|P18276.1|CHYM_SHEEP RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|1374|emb|CAA37209.1| preprochymosin [Ovis aries]
gi|229045|prf||1817165A prepro-chymosin
Length = 381
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 173/308 (56%), Gaps = 17/308 (5%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
+L +VF L A +P R LK+R L L + Y G V+ V
Sbjct: 6 VLLAVFALSQGAEITRIPLYKGKPLRKALKERGLLEDFLQKQQYGVSSEYSGFGEVASV- 64
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C
Sbjct: 65 -----------PLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKN 112
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 113 HQRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVF 172
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
A FDGI+G+ + +A +VPV+DNM+++ LV++++FS +++R + +G + G +
Sbjct: 173 TYAEFDGILGMAYPSLASEYSVPVFDNMMDRRLVAQDLFSVYMDR---SGQGSMLTLGAI 229
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
DP ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 230 DPSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDIL 288
Query: 305 EINHAIGG 312
I AIG
Sbjct: 289 NIQQAIGA 296
>gi|301625941|ref|XP_002942158.1| PREDICTED: pepsin A [Xenopus (Silurana) tropicalis]
Length = 384
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 113/288 (39%), Positives = 161/288 (55%), Gaps = 20/288 (6%)
Query: 26 NGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQY 85
N L+R+GL L + N A S L S ++L +N+MD +Y
Sbjct: 29 NRLQRLGLLGDYLKKYPYNPA--------------SKYFPTLAQSSAEVL--QNYMDIEY 72
Query: 86 FGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEI 145
+G I IG+PPQ F+VIFDTGS+NLWVPS C S +C H+R+ ++S T+ I
Sbjct: 73 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 131
Query: 146 NYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDA 205
YG+GS+SGF D ++VG++ + +Q+F + E + FDGI+GL F IA A
Sbjct: 132 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 191
Query: 206 VPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQ 265
PV+DNM QGL+ + +FS +L+ D + G ++FGGVD ++ G +VP+T + YWQ
Sbjct: 192 TPVFDNMWSQGLIPQNLFSVYLSS--DGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 249
Query: 266 FELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
L I I Q C C AIVD+GTSL+ GPT + I + IG
Sbjct: 250 IILDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 296
>gi|1065259|pdb|1PSO|E Chain E, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|5542461|pdb|1QRP|E Chain E, Human Pepsin 3a In Complex With A Phosphonate Inhibitor
Iva-Val-Val- Leu(P)-(O)phe-Ala-Ala-Ome
gi|157833570|pdb|1PSN|A Chain A, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|361132440|pdb|3UTL|A Chain A, Human Pepsin 3b
Length = 326
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 160/258 (62%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
>gi|12843350|dbj|BAB25952.1| unnamed protein product [Mus musculus]
Length = 396
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 172/304 (56%), Gaps = 6/304 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRK--ERYMGGAGVSGVRHRLGDS 70
W++ + L LP L R+ KK + ++ + + + + G + GD
Sbjct: 3 WMVVALLCLPLLEAALIRVPPKKMKSIRETMKEQGVLKDFLKNHKYDPGQKYHFGKFGDY 62
Query: 71 DEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKS 130
P+ +MDA Y+GEI IG+PPQNF V+FDTGSSNLWV S C S +C H+RY
Sbjct: 63 SVLYEPMA-YMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACTTHTRYNP 120
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
KS+TY G++ + YG+GS++GFF D + V + V +Q F + E F+ A+FD
Sbjct: 121 SKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTNFVYAQFD 180
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GL + ++ G A M+ +G +S+ +F +L GG+IVFGGVD +
Sbjct: 181 GIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQIVFGGVDENLYT 239
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ T++PVT++ YWQ + D LIGNQ++G C GC IVD+GTSLL P + E+
Sbjct: 240 GELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQYLNELLQT 299
Query: 310 IGGE 313
IG +
Sbjct: 300 IGAQ 303
>gi|16974928|pdb|1FLH|A Chain A, Crystal Structure Of Human Uropepsin At 2.45 A Resolution
Length = 326
Score = 210 bits (535), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 160/258 (62%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
>gi|2832610|emb|CAA11580.1| cathepsin [Chionodraco hamatus]
Length = 402
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 181/317 (57%), Gaps = 19/317 (5%)
Query: 6 LRSVF---CLWVLASCLLL-------PASSNGLRRIGLKKRRLDLHSLNAARITRKERYM 55
+RSV C+W S L+ P + LR GL + L + + +R+
Sbjct: 1 MRSVLLLLCIWTCRSSALIRVPLRKVPTIRSQLRSEGLLQDFLVENRPDM--FSRRYAQC 58
Query: 56 GGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSK 115
AG +R LG S E I NFMDAQY+G+I +G+P QNFSV+FDTGSS+LWVPS+
Sbjct: 59 FPAGTPSLR--LGRSSEKIY---NFMDAQYYGDIALGTPEQNFSVVFDTGSSDLWVPSAY 113
Query: 116 CYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIE 175
C + +C R+K+ KS ++ G+ INYGSG + G +D + V ++VK Q F E
Sbjct: 114 C-VTEACALPKRFKAFKSTSFLHDGRQFGINYGSGHLLGVMGRDYLMVAGMMVKRQEFRE 172
Query: 176 ATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE 235
+ E FL ARFDG++GLG+ +A PV+DNM+ Q L+ + +FSF+L+R +
Sbjct: 173 SVYEPGTAFLKARFDGVLGLGYPALAEILGNPVFDNMLAQNLLDKPIFSFYLSRKLNGSP 232
Query: 236 GGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSL 295
GE++ GG D + + ++PVT K YWQ ++ +++ + C GC AIVD+GTSL
Sbjct: 233 EGELLLGGTDERLYDLPINWLPVTAKAYWQIKIDSVVVQGVNP-FCPHGCQAIVDTGTSL 291
Query: 296 LAGPTPVVTEINHAIGG 312
+ GPT + +I IG
Sbjct: 292 ITGPTDDILDIQQLIGA 308
>gi|222425200|dbj|BAH20549.1| pepsinogen A-50 [Pongo abelii]
Length = 388
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 172/303 (56%), Gaps = 23/303 (7%)
Query: 28 LRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFG 87
L GL K L H+LN AR +Y + D PL+N++D +YFG
Sbjct: 32 LSEHGLLKDFLKKHNLNPAR-----KYF--------PQWEAPTLVDEQPLENYLDMEYFG 78
Query: 88 EIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINY 147
IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + S+TY ++ I Y
Sbjct: 79 TIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPEDSSTYQSTSETVSIAY 137
Query: 148 GSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIGLGFREIAVGDAV 206
G+GS++G D V+VG + Q+F + T GS + A FDGI+GL + I+ A
Sbjct: 138 GTGSMTGILGYDTVQVGGISDTSQIFGLSETEPGSFLYY-APFDGILGLAYPSISSSGAT 196
Query: 207 PVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQF 266
PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G +VPVT +GYWQ
Sbjct: 197 PVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQI 254
Query: 267 ELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG----EGVVSAECKL 322
+ I + ++ C GC AIVD+GTSLL GPT + I IG +G + C
Sbjct: 255 TVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSA 313
Query: 323 VVS 325
+ S
Sbjct: 314 ISS 316
>gi|388856266|emb|CCF50075.1| probable PEP4-aspartyl protease [Ustilago hordei]
Length = 418
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 108/253 (42%), Positives = 155/253 (61%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSNKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M+ QGL+ SF+L ++GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGS--SEQDGGEAVFGGIDESHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G+++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WAPVKRKGYWEVALDKLALGDEALELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|440905526|gb|ELR55898.1| Gastricsin [Bos grunniens mutus]
Length = 391
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 176/306 (57%), Gaps = 15/306 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH------RL 67
VLA L + L +I LKK + + R KE+ + + +H R
Sbjct: 5 VLALVCLQALEAAALVKIPLKKFK-------SIREIMKEKGLLEDFLRTYKHDPAQKYRF 57
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R
Sbjct: 58 GDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTR 115
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E FL A
Sbjct: 116 FNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYA 175
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGGVD
Sbjct: 176 KFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDSC 235
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+ G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P ++ +
Sbjct: 236 LYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALL 295
Query: 308 HAIGGE 313
A G +
Sbjct: 296 QATGAQ 301
>gi|19921120|ref|NP_609458.1| CG17134 [Drosophila melanogaster]
gi|7297766|gb|AAF53016.1| CG17134 [Drosophila melanogaster]
gi|17944939|gb|AAL48533.1| RE02351p [Drosophila melanogaster]
gi|220947772|gb|ACL86429.1| CG17134-PA [synthetic construct]
gi|220957078|gb|ACL91082.1| CG17134-PA [synthetic construct]
Length = 391
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 147/237 (62%), Gaps = 4/237 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L N M+ +Y+G I IG+P Q F+++FDTGS+NLWVPS+ C S +C H++Y S S+T
Sbjct: 68 LHNSMNNEYYGVIAIGTPEQRFNILFDTGSANLWVPSASCPASNTACQRHNKYDSSASST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ I YG+GS+SGF S D V + + +++Q F EA E TF+ A F GI+GL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSNDIVTIAGISIQNQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F IAV P +DNM+ QGL+ E V SF+L R A GGE++ GG+D ++G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMISQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
VPV+ YWQF++ I T +C GC AI D+GTSL+A P +IN +G
Sbjct: 248 VPVSVPAYWQFKVNT--IKTNGTLLCN-GCQAIADTGTSLIAVPLAAYRKINRQLGA 301
>gi|195391510|ref|XP_002054403.1| GJ24430 [Drosophila virilis]
gi|194152489|gb|EDW67923.1| GJ24430 [Drosophila virilis]
Length = 376
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 110/274 (40%), Positives = 160/274 (58%), Gaps = 4/274 (1%)
Query: 40 LHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFS 99
L++ AR+ R + S + + L L N + +Y+G I +G+PPQ F
Sbjct: 16 LYTFAKARMLRVPLEVQRKPASQLSQSFLATQSLQLMLDNRDNVEYYGRIAMGTPPQLFR 75
Query: 100 VIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQ 158
VIFDTGS+N W+PSS C S I+C HSRYK+ KS +Y + G++ + YG+G +SG+ SQ
Sbjct: 76 VIFDTGSANTWLPSSNCPDSNIACQQHSRYKAHKSKSYVKNGRNFSLAYGNGHVSGYLSQ 135
Query: 159 DNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLV 218
D + + DVVV D +F E TF+ FDGI+GLGFR+IA ++ P + +Q LV
Sbjct: 136 DTLRIADVVVPDLIFGETLSHHQATFIPTSFDGIVGLGFRQIAWKNSTPFLELFCQQHLV 195
Query: 219 SEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQST 278
+FS +L R GGEI FGG+D +KG YVP++K GYWQF + + +GN+
Sbjct: 196 KRCLFSVYLRRMAGELYGGEITFGGIDHSRYKGALDYVPLSKVGYWQFVMSGVSVGNKK- 254
Query: 279 GVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+G AI+D+GTSL+ P + ++ AIG
Sbjct: 255 --IDGRVNAILDTGTSLVLMPRRIFEQLQQAIGA 286
>gi|448113357|ref|XP_004202330.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359465319|emb|CCE89024.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/250 (41%), Positives = 156/250 (62%), Gaps = 8/250 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+++AQY+ IG+GSP Q F V+ DTGSSNLWVPS+ C S++C+ H++Y +S++
Sbjct: 91 PLENYLNAQYYTTIGLGSPVQEFKVVLDTGSSNLWVPSTDCS-SLACFLHTKYDHSESSS 149
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G I YGSGS+ G+ SQD + + + ++ Q F EAT E L F A+FDGI+GL
Sbjct: 150 YKQNGSEFAIRYGSGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLAFAFAKFDGILGL 209
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V + VP N + QGL+ E F+F+L ++D D +GG FGGVD KH+KG
Sbjct: 210 AYDTISVNNIVPPIYNAINQGLLDEPKFAFYLGDKDKDENDGGVATFGGVDTKHYKGDIV 269
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+P+ +K YW+ I +G++ + G A +D+GTSL+ P+ + IN IG +
Sbjct: 270 ELPIRRKAYWEVSFDGIGLGDEYAELTSTGAA--IDTGTSLITLPSSLAEIINAKIGAKK 327
Query: 314 ---GVVSAEC 320
G S +C
Sbjct: 328 SWSGQYSVDC 337
>gi|114572170|ref|XP_001163076.1| PREDICTED: cathepsin E isoform 1 [Pan troglodytes]
Length = 363
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/255 (42%), Positives = 149/255 (58%), Gaps = 12/255 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVTK+ YWQ L ++L + C + + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPLTESPIPSAQLPTPYWTSW--------- 298
Query: 316 VSAECKLVVSQYGDL 330
EC + + DL
Sbjct: 299 --MECSSAAAAFKDL 311
>gi|23110952|ref|NP_683865.1| cathepsin E isoform b preproprotein [Homo sapiens]
gi|7339518|emb|CAB82849.1| cathepsin E, alternative [Homo sapiens]
gi|119611999|gb|EAW91593.1| cathepsin E, isoform CRA_b [Homo sapiens]
Length = 363
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 1/229 (0%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
VPVTK+ YWQ L ++L + C + + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPLTESPIPSAQLPTPYWT 296
>gi|444706401|gb|ELW47743.1| Cathepsin E [Tupaia chinensis]
Length = 396
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/248 (43%), Positives = 155/248 (62%), Gaps = 5/248 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N + QY+G + IGSP QNFSV+FDTGSS+ WV S C S +C H+++ S +SNT
Sbjct: 67 PLTNSFNMQYYGTVSIGSPLQNFSVLFDTGSSDFWVTSVYC-ISPACEKHTKFFSSRSNT 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G + I YGSGS+SG D V VG + V DQ F E+ E F+ A FDGI+GL
Sbjct: 126 YSKKGSNFFIEYGSGSLSGITGVDRVSVGGLTVVDQEFGESVTEPGQHFVYAAFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ ++V A PV+DNM+ +V++ +FS +++ D + G E++FGG D HF G +
Sbjct: 186 GYPSLSVTGATPVFDNMIVHNMVAQPMFSVYMSSDIENGTGSELIFGGYDCSHFSGSLNW 245
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+PVTK+G+WQ L + +G+ + C GC AIVD+GTS + GP + ++ AIG
Sbjct: 246 IPVTKQGFWQIALDGVQVGD-TMMFCSKGCQAIVDTGTSRIIGPLNKIERLHRAIGATLV 304
Query: 313 EGVVSAEC 320
G+ EC
Sbjct: 305 NGIYFVEC 312
>gi|329665035|ref|NP_001192720.1| gastricsin precursor [Bos taurus]
Length = 391
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 176/306 (57%), Gaps = 15/306 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRH------RL 67
VLA L + L +I LKK + + R KE+ + + +H R
Sbjct: 5 VLALVCLQALEAAALVKIPLKKFK-------SIREIMKEKGLLEDFLRTYKHDPAQKYRF 57
Query: 68 GDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSR 127
GD P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C H+R
Sbjct: 58 GDFIVATEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTR 115
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
+ S+TY+ ++ + YGSGS++G D + V + V +Q F + E FL A
Sbjct: 116 FNHSLSSTYSTNEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYA 175
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
+FDGI+G+ + ++V A V M+++G ++ VFSF+L+ +++GG ++FGGVD
Sbjct: 176 KFDGIMGMAYPSLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNC 235
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
+ G+ + PVT++ YWQ + LIG+Q+TG C GC AIVD+GTSLL P ++ +
Sbjct: 236 LYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALL 295
Query: 308 HAIGGE 313
A G +
Sbjct: 296 QATGAQ 301
>gi|354497176|ref|XP_003510697.1| PREDICTED: chymosin-like [Cricetulus griseus]
gi|344243543|gb|EGV99646.1| Chymosin [Cricetulus griseus]
Length = 379
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 175/313 (55%), Gaps = 20/313 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE + +S + + D + PL N++D++YFG I I
Sbjct: 21 RIPLHKGTSLRNTLKEHGLLEDFLSRHQSEFSEKDSNTGMVANEPLTNYLDSEYFGTIYI 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F+V+FDTGSS LWVPS C S C H R+ KS T+ + K + YG+G
Sbjct: 81 GTPPQEFTVVFDTGSSELWVPSVYCS-SRVCQNHHRFDPSKSFTFQNLSKPLFVQYGTGR 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF D V + D+VV Q +T+E F+ + FDGI+GL + +A +VP++DN
Sbjct: 140 MQGFLGYDTVTISDIVVPHQTVGLSTQEPGEIFIYSPFDGILGLSYPSLASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFVGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 331
I N C+GGC A++D+GT+LLAGP + I AIG V QYG
Sbjct: 257 TI-NDEVVACQGGCTAVLDTGTALLAGPGRDILNIQQAIGA----------VQGQYGQFK 305
Query: 332 WDLLVSGLLPEKV 344
+ G++P V
Sbjct: 306 INCWRLGIMPTIV 318
>gi|18959216|ref|NP_579818.1| gastricsin precursor [Rattus norvegicus]
gi|129798|sp|P04073.1|PEPC_RAT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|56881|emb|CAA28305.1| unnamed protein product [Rattus norvegicus]
gi|206083|gb|AAA41827.1| pepsinogen [Rattus norvegicus]
gi|149069457|gb|EDM18898.1| progastricsin (pepsinogen C) [Rattus norvegicus]
Length = 392
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 180/316 (56%), Gaps = 30/316 (9%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGV-----------S 61
W++ + L LP L R+ L+K + + R T KE+ GV
Sbjct: 3 WMVVALLCLPLLEASLLRVPLRKMK-------SIRETMKEQ-----GVLKDFLKTHKYDP 50
Query: 62 GVRHRLGD-SDEDIL--PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYF 118
G ++ G+ D +L P+ +MDA YFGEI IG+PPQNF V+FDTGSSNLWV S C
Sbjct: 51 GQKYHFGNFGDYSVLYEPMA-YMDASYFGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ- 108
Query: 119 SISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
S +C H+R+ KS+TY G++ + YG+GS++GFF D + V + V +Q F +
Sbjct: 109 SEACTTHARFNPSKSSTYYTEGQTFSLQYGTGSLTGFFGYDTLTVQSIQVPNQEFGLSEN 168
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGE 238
E F+ A+FDGI+GL + ++ G A M+ +G +S+ +F +L GG+
Sbjct: 169 EPGTNFVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQ-QGSNGGQ 227
Query: 239 IVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG-GCAAIVDSGTSLLA 297
IVFGGVD + G+ T+VPVT++ YWQ + D LIG+Q++G C GC IVD+GTSLL
Sbjct: 228 IVFGGVDKNLYTGEITWVPVTQELYWQITIDDFLIGDQASGWCSSQGCQGIVDTGTSLLV 287
Query: 298 GPTPVVTEINHAIGGE 313
P ++E+ IG +
Sbjct: 288 MPAQYLSELLQTIGAQ 303
>gi|157837066|pdb|5PEP|A Chain A, X-Ray Analyses Of Aspartic Proteases. Ii.
Three-Dimensional Structure Of The Hexagonal Crystal
Form Of Porcine Pepsin At 2.3 Angstroms Resolution
Length = 326
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/257 (43%), Positives = 162/257 (63%), Gaps = 14/257 (5%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVSAECKLVVS 325
GE V+S C + S
Sbjct: 240 NSDGEMVIS--CSSIAS 254
>gi|431896476|gb|ELK05888.1| Chymosin [Pteropus alecto]
Length = 348
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 115/315 (36%), Positives = 180/315 (57%), Gaps = 21/315 (6%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R KER + + R+ + + + PL N++D+QYFG+I I
Sbjct: 21 RVPLHKGKSLRKALKERGLLEDFLRTHRYAISKENSGVGKVAREPLVNYLDSQYFGKISI 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ+F+V+FDTGSS+LWVPS C S +C H R+ S +S+T+ ++G+ I YG+GS
Sbjct: 81 GTPPQDFTVVFDTGSSDLWVPSVYCK-SDACKNHRRFNSSESSTFQKLGQPLSIQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ G D V V ++V Q +T+E F FDGI+GL + +A D+VPV+DN
Sbjct: 140 MEGILGSDTVTVSNIVDSRQTVGLSTQEPGDVFTYFEFDGILGLAYPSLAAKDSVPVFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M++ LV++++FS +++R+ ++G + G +D +++G +VPVT + YWQF + +
Sbjct: 200 MMKHHLVAQDLFSVYMSRN---DQGSMLTLGAIDSSYYRGSLHWVPVTVREYWQFTVDSV 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 331
+ + C+GGC AI+D+GTS+L GP+ + I AIG QYG L
Sbjct: 257 TV-DGVVVACDGGCQAILDTGTSMLVGPSSDILNIQQAIGA----------TEDQYG-LD 304
Query: 332 WDLLVSGLLPEKVCQ 346
D SG E Q
Sbjct: 305 MDFCTSGFQGEDDSQ 319
>gi|157836875|pdb|3PSG|A Chain A, The High Resolution Crystal Structure Of Porcine
Pepsinogen
Length = 370
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 49 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 108 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 166
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 167 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 224
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 225 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 283
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 284 NSDGEMVIS 292
>gi|360431|prf||1403354A pepsinogen
Length = 383
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 155/254 (61%), Gaps = 9/254 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F+V+FDTGSSNLWVPS C S +C H + +S+T
Sbjct: 67 PLLNTLDMEYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT-SPACQSHQMFNPSQSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I+YG+G + G D V V ++ +Q+F +T E F+ +FDGI+GL
Sbjct: 126 YKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMDTNQLFGLSTSEPGQFFVYVKFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A PV+DNMV + L+ + +FS +L+R+P G +VFGG+D +F G +
Sbjct: 186 GYPSLAADGITPVFDNMVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
+PV+ +GYWQ + I++ Q C GC AI+D+GTSL+AGP + +I A+G
Sbjct: 243 IPVSYQGYWQISMDSIIVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQN 301
Query: 314 --GVVSAECKLVVS 325
G S C +++
Sbjct: 302 TYGEYSVNCSHILA 315
>gi|45384244|ref|NP_990385.1| embryonic pepsinogen precursor [Gallus gallus]
gi|129801|sp|P16476.1|PEPE_CHICK RecName: Full=Embryonic pepsinogen; Flags: Precursor
gi|222853|dbj|BAA00153.1| pepsinogen [Gallus gallus]
Length = 383
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 155/254 (61%), Gaps = 9/254 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N +D +Y+G I IG+PPQ+F+V+FDTGSSNLWVPS C S +C H + +S+T
Sbjct: 67 PLLNTLDMEYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT-SPACQSHQMFNPSQSST 125
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G++ I+YG+G + G D V V ++ +Q+F +T E F+ +FDGI+GL
Sbjct: 126 YKSTGQNLSIHYGTGDMEGTVGCDTVTVASLMDTNQLFGLSTSEPGQFFVYVKFDGILGL 185
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +A PV+DNMV + L+ + +FS +L+R+P G +VFGG+D +F G +
Sbjct: 186 GYPSLAADGITPVFDNMVNESLLEQNLFSVYLSREP---MGSMVVFGGIDESYFTGSINW 242
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE-- 313
+PV+ +GYWQ + I++ Q C GC AI+D+GTSL+AGP + +I A+G
Sbjct: 243 IPVSYQGYWQISMDSIIVNKQEIA-CSSGCQAIIDTGTSLVAGPASDINDIQSAVGANQN 301
Query: 314 --GVVSAECKLVVS 325
G S C +++
Sbjct: 302 TYGEYSVNCSHILA 315
>gi|118572685|sp|P00791.3|PEPA_PIG RecName: Full=Pepsin A; Flags: Precursor
Length = 385
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 64 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 123 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 182 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 240 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 298
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 299 NSDGEMVIS 307
>gi|164604|gb|AAA31096.1| pepsinogen A precursor [Sus scrofa]
Length = 385
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 64 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 123 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 181
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 182 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 239
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 240 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 298
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 299 NSYGEMVIS 307
>gi|414871124|tpg|DAA49681.1| TPA: hypothetical protein ZEAMMB73_239621 [Zea mays]
Length = 299
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 97/152 (63%), Positives = 115/152 (75%)
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
MV+QGL+S+ VFSFW NR D EGGEIVFGG+D H+KG HT+VPVT+KGYWQF +GD+
Sbjct: 1 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 60
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDLI 331
L+ +STG C GGCAAI DSGTSLLAGP ++TEIN IG GVVS ECK VVSQYG I
Sbjct: 61 LVDGKSTGFCAGGCAAIADSGTSLLAGPIAIITEINEKIGAAGVVSQECKTVVSQYGQQI 120
Query: 332 WDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGI 363
DLL++ P K+C Q+GLC F+G V GI
Sbjct: 121 LDLLLAETQPAKICSQVGLCTFDGTHGVSAGI 152
>gi|195583376|ref|XP_002081498.1| GD11051 [Drosophila simulans]
gi|194193507|gb|EDX07083.1| GD11051 [Drosophila simulans]
Length = 399
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 175/315 (55%), Gaps = 24/315 (7%)
Query: 12 LWVLASCL----LLPASSNGLRRIGLKKRRLDLHSLNAARI------TRKERYMGGAGVS 61
+W+L S L +LP L+ R+ L +AR R +R +
Sbjct: 1 MWLLVSLLPVLFILPVQFQHPVSCKLQLYRVPLRRFPSARHRFEKLGIRMDR-LRLKYAE 59
Query: 62 GVRHRLGDSDEDI--LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS 119
V H GD + PL N++DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+ C +
Sbjct: 60 EVSHFRGDWSSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSATCAST 119
Query: 120 -ISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATR 178
++C H+RY +++S ++ G I+YGSGS+SGF S D V V + ++DQ F EAT
Sbjct: 120 MVACRVHNRYFAKRSKSHQARGDRFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFAEATE 179
Query: 179 EGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFW-LNRDPDAEEGG 237
FL A+FDGI GL + I++ P + M+EQGL+++ +F+ + +P
Sbjct: 180 MPGPIFLAAKFDGIFGLAYHSISMQRIKPPFYAMMEQGLLTKPIFNMARMMVEP------ 233
Query: 238 EIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLA 297
I FGG +P ++ G TYV V+ + YWQ ++ +I N +C+ GC I+D+GTS LA
Sbjct: 234 -IFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVIRNLE--LCQQGCEVIIDTGTSFLA 290
Query: 298 GPTPVVTEINHAIGG 312
P IN +IGG
Sbjct: 291 LPYDQAILINESIGG 305
>gi|395838792|ref|XP_003792290.1| PREDICTED: renin [Otolemur garnettii]
Length = 404
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/285 (42%), Positives = 171/285 (60%), Gaps = 16/285 (5%)
Query: 30 RIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEI 89
R LK+R +D+ L+A R G S V L N++D QY+GEI
Sbjct: 43 REKLKERGVDMARLSAEWSQFTRRLSSGNSTSSVV------------LTNYLDTQYYGEI 90
Query: 90 GIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYG 148
GIG+PPQ F VIFDTGS+NLWVPS+KC +C HS Y S S++Y E G I YG
Sbjct: 91 GIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSSDSSSYMENGTEFTIQYG 150
Query: 149 SGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPV 208
+G + GF SQD V VG + V Q F E T + F+LA+FDG++G+GF AVG PV
Sbjct: 151 TGKVKGFLSQDVVTVGGLTVT-QGFGEVTELPLMPFMLAKFDGVLGMGFPAQAVGGITPV 209
Query: 209 WDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFEL 268
+DN++ Q ++ E+VFS + +R+ GGEIV GG DP++++G YV ++K G WQ ++
Sbjct: 210 FDNILSQRVLKEDVFSVYYSRNSHL-LGGEIVLGGSDPQYYQGNFHYVSISKTGSWQIKM 268
Query: 269 GDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ + +T +CE GC A+VD+G S ++GPT + + A+G +
Sbjct: 269 KGVSV-RSTTLLCEDGCMAVVDTGASYISGPTSSLRLLMKALGAQ 312
>gi|253723303|pdb|2PSG|A Chain A, Refined Structure Of Porcine Pepsinogen At 1.8 Angstroms
Resolution
Length = 370
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 49 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 107
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 108 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 166
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 167 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 224
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 225 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 283
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 284 NSDGEMVIS 292
>gi|157836865|pdb|3PEP|A Chain A, Revised 2.3 Angstroms Structure Of Porcine Pepsin.
Evidence For A Flexible Subdomain
Length = 326
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
>gi|494476|pdb|1PSA|A Chain A, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|494478|pdb|1PSA|B Chain B, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|67463919|pdb|1YX9|A Chain A, Effect Of Dimethyl Sulphoxide On The Crystal Structure Of
Porcine Pepsin
Length = 326
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
>gi|24647683|ref|NP_650623.1| CG5863 [Drosophila melanogaster]
gi|7300255|gb|AAF55418.1| CG5863 [Drosophila melanogaster]
Length = 395
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/325 (40%), Positives = 180/325 (55%), Gaps = 17/325 (5%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITR-----KERYMGGAGVSGVRHR 66
LW+L CL L RI ++ + + S R R K +GG V+ R
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKYNVVGGQEVTS---R 65
Query: 67 LGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFH 125
G + E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H
Sbjct: 66 NGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHH 122
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
RY + S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 123 HRYNASASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVQNQTFGMATHEPGPTFV 182
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
F GI+GLGFR IA P++++M +Q LV E VFSF+L R+ +GGE++FGGVD
Sbjct: 183 DTNFAGIVGLGFRPIAELGIKPLFESMCDQQLVDECVFSFYLKRNGSERKGGELLFGGVD 242
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
F G TYVP+T GYWQF L I + AI D+GTSLLA P
Sbjct: 243 KTKFSGSLTYVPLTHAGYWQFPLDVIEVAGTRINQNR---QAIADTGTSLLAAPPREYLI 299
Query: 306 INHAIGGEGVVSAECKLVVSQYGDL 330
IN +GG + E L S+ L
Sbjct: 300 INSLLGGLPTSNNEYLLNCSEIDSL 324
>gi|407726059|dbj|BAM46127.1| pepsinogen C [Cynops pyrrhogaster]
Length = 385
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 107/276 (38%), Positives = 158/276 (57%), Gaps = 3/276 (1%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPP 95
R+ LH + R E + V G ++RL + PL N+MD Y+GEI IG+PP
Sbjct: 19 RVPLHKFKSMRQVMIEHGLKVPWVDPGTKYRLNNFAVASEPLTNYMDMSYYGEISIGTPP 78
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
QNF V+FDTGSSNLWV S+ C S +C H + +S+TY+ + I YG+GS++G
Sbjct: 79 QNFLVLFDTGSSNLWVASTYCS-SSACTNHPLFNPSQSSTYSTENQQFSIQYGTGSLTGI 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V + + + Q F + E F+ A+FDGI+GL + IA A V + M+ Q
Sbjct: 138 LGYDTVSIQGLSITQQEFALSINEPGSNFVYAQFDGILGLAYPSIAADGATTVMEGMMNQ 197
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
GL+S+ +F F+++ + + GGE++FGGVD ++ G+ T+ PVT++ YWQ + +
Sbjct: 198 GLLSQNIFGFYMSEE-GTQPGGELIFGGVDSNYYTGEITWTPVTQQMYWQIGIQGFAVNG 256
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
Q TG C GC IVD+GTSLL P + + IG
Sbjct: 257 QETGWCSQGCQGIVDTGTSLLTAPGQYMAALMQDIG 292
>gi|195501958|ref|XP_002098019.1| GE10129 [Drosophila yakuba]
gi|194184120|gb|EDW97731.1| GE10129 [Drosophila yakuba]
Length = 396
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 164/287 (57%), Gaps = 7/287 (2%)
Query: 45 AARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDT 104
A R + +Y G R G + E L N ++ +Y G I IGSP Q F+++FDT
Sbjct: 45 AGRSSLLAKYNVAGGQEAATLRNGGATET---LDNRLNLEYAGPISIGSPGQPFNMLFDT 101
Query: 105 GSSNLWVPSSKCY-FSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEV 163
GS+NLWVPS++C S++C+ H RY + S+T+ G+ I YG+GS+SG +QD V +
Sbjct: 102 GSANLWVPSAECSPKSVACHRHHRYNASASSTFVPDGRRFSIAYGTGSLSGILAQDMVTI 161
Query: 164 GDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVF 223
G +VV++Q F AT E TF+ F GI+GLGFR +A P++++M EQ LV E VF
Sbjct: 162 GQLVVRNQTFAMATHEPGPTFVDTNFAGIVGLGFRPLAEQRIKPLFESMCEQQLVDECVF 221
Query: 224 SFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEG 283
SF+L R+ GGE++FGG+D F G TYVP+T YWQF L I +G +
Sbjct: 222 SFYLKRNGSERMGGELLFGGLDKTKFSGTLTYVPLTHAAYWQFPLDAIEVGGTAISHHR- 280
Query: 284 GCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAECKLVVSQYGDL 330
AI D+GTSLLA P IN +GG + E L S+ L
Sbjct: 281 --QAIADTGTSLLAAPPREYLIINSLLGGLPTANNEYLLNCSEIDSL 325
>gi|290993274|ref|XP_002679258.1| predicted protein [Naegleria gruberi]
gi|284092874|gb|EFC46514.1| predicted protein [Naegleria gruberi]
Length = 316
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 7/231 (3%)
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
QY+G + +G+P QNF VIFDTGSSN+WVPS C+ SI+C H+RY KS+TY G+
Sbjct: 1 QYYGFVSLGTPQQNFKVIFDTGSSNVWVPSESCW-SITCLLHNRYDHTKSSTYVANGQKF 59
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I YGSG ++GF SQD + G + VK QVF E E L FL + DGI+G+ F I+V
Sbjct: 60 NITYGSGGVNGFLSQDALSCGGIPVKGQVFGEVMSEQGLAFLFGKSDGIVGMAFPSISVD 119
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
P+++NM+ Q LV + +FSF+L++ ++ GG+D K++ G TYVP+ + Y
Sbjct: 120 GVTPMFNNMMNQKLVDKNLFSFYLSKT-SGSTASAMILGGIDTKYYTGPLTYVPLANRTY 178
Query: 264 WQFELGDILIGNQSTGVC-EGGCAAIVDSGTSLLAGPT----PVVTEINHA 309
W + D+ +G GVC GGC A VD+GTSL+AGP P++ +N A
Sbjct: 179 WAIRINDVGVGGDYKGVCPPGGCLAAVDTGTSLIAGPALKIGPIIESLNIA 229
>gi|395852554|ref|XP_003798803.1| PREDICTED: pepsin A-like [Otolemur garnettii]
Length = 387
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 153/239 (64%), Gaps = 6/239 (2%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N+MD +YFG IGIG+P Q F+VIFDTGSSNLWVPS C S +C H+R+ + S+T
Sbjct: 66 PLENYMDTEYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQSSST 124
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
Y ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 125 YQATSQTVSIAYGTGSMTGILGYDTVQVGGITDTNQIFGLSETEPGSFLYY-APFDGILG 183
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DNM QGLVS+++FS +L+ + + G ++FGG+D ++ G+
Sbjct: 184 LAYPSISSSGATPVFDNMWNQGLVSQDLFSVFLSS--NDQSGSVVMFGGIDSSYYTGELN 241
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++P++ +GYWQ + I + + C GC AIVD+GTSLL+GPT + I IG
Sbjct: 242 WIPLSSEGYWQITVDSITMNGEPI-ACSQGCQAIVDTGTSLLSGPTSPIANIQSYIGAS 299
>gi|443894057|dbj|GAC71407.1| aspartyl protease [Pseudozyma antarctica T-34]
Length = 418
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/253 (42%), Positives = 154/253 (60%), Gaps = 9/253 (3%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F++AQYF +I +G+P Q F VI DTGSSNLWVPS+KC SI+C+ H +Y S S+
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+Y + G +I YGSGS+ G S D +++GD+ +K Q F EAT E L F +FDGI+G
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+V VP M++QGL+ SF+L +GGE VFGG+D H+ GK
Sbjct: 216 LAYDTISVNGIVPPMYQMIDQGLLDAPQVSFYLGS--SEADGGEAVFGGIDDSHYTGKIH 273
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
+ PV +KGYW+ L + +G++ + G A +D+GTSL+A T +N IG
Sbjct: 274 WAPVKRKGYWEVALDKLALGDEELELDNGSAA--IDTGTSLIAMATDTAEILNAEIGATK 331
Query: 313 --EGVVSAECKLV 323
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|195386060|ref|XP_002051722.1| GJ17077 [Drosophila virilis]
gi|194148179|gb|EDW63877.1| GJ17077 [Drosophila virilis]
Length = 404
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/256 (44%), Positives = 159/256 (62%), Gaps = 7/256 (2%)
Query: 74 ILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRK 132
I L N + Y+G IGIG+PPQ F+V+FDTGS+NLWVPS +C + ++C H++Y S
Sbjct: 81 IETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSANLWVPSVQCLPTDVACQNHNQYNSSA 140
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY G+S I YG+GS++GF S D V + + + Q F EA + + +F FDGI
Sbjct: 141 SSTYVANGQSFSIQYGTGSLTGFLSTDTVTINGLSIACQTFGEAISQPNGSFTGVPFDGI 200
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+G+G+ IAV VP + N+ EQGL+ E F F+L R A++GG++V GGVD + F G
Sbjct: 201 LGMGYSTIAVDQVVPPFYNLYEQGLIDEPSFGFYLARTGSAQDGGQLVLGGVDYQLFSGN 260
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
TYVPV+++GYWQF + ++ VC C AI D+GTSLLA P T++N IGG
Sbjct: 261 LTYVPVSQEGYWQFVVTSAVM--NGFVVCS-NCQAIADTGTSLLACPGSSYTQLNQLIGG 317
Query: 313 ---EGVVSAECKLVVS 325
+G +C V S
Sbjct: 318 YLMDGDYYVDCSTVDS 333
>gi|386371114|gb|AFJ11376.1| pregnancy-associated glycoprotein 1, partial [Bison bison]
Length = 367
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 163/278 (58%), Gaps = 8/278 (2%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPP 95
L L + R T +E+ + + +RL +D I PL+N++D Y G I IG+PP
Sbjct: 16 LPLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITVHPLRNYLDTAYVGNITIGTPP 75
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
Q F V+FDTGS+NLWVP C S +CY H + + S+++ E+G I YGSG I GF
Sbjct: 76 QEFRVVFDTGSANLWVPCITCT-SPACYTHKTFNPQNSSSFREVGSPITIFYGSGIIQGF 134
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V +G++V +Q F + E L FDGI+GL F + + D +P++DN+
Sbjct: 135 LGSDTVRIGNLVSPEQSFGLSLEEYGFDSL--PFDGILGLAFPAMGIEDTIPIFDNLWSH 192
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G SE VF+F+LN + EG ++FGGVD +++KG+ ++PV++ +WQ + +I + N
Sbjct: 193 GAFSEPVFAFYLNT--NKPEGSVVMFGGVDHRYYKGELNWIPVSQTSHWQISMNNISM-N 249
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ C GC A+VD+GTSL+ GPT +VT I+ +
Sbjct: 250 GTVTACSCGCEALVDTGTSLIYGPTKLVTNIHKLMNAR 287
>gi|444724657|gb|ELW65256.1| Gastricsin [Tupaia chinensis]
Length = 403
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/311 (38%), Positives = 175/311 (56%), Gaps = 30/311 (9%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N+MD+ YFGEI IG+PPQNF V+FDTGSS+LWVPS+ C S +C H+R+ S+T
Sbjct: 65 PITNYMDSFYFGEISIGTPPQNFLVLFDTGSSDLWVPSTYCQ-SQACSNHNRFNPSLSST 123
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G++ ++YGSGS+S D V V ++V+ +Q F + E S F + FDGI+G+
Sbjct: 124 FRNNGQTYTLSYGSGSLSVVLGYDTVTVQNIVINNQEFGLSENEPSNPFYYSDFDGILGM 183
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ ++AVG+A V M++QG +++ +FSF+ +R P + GGE++ GGVD + + G+ +
Sbjct: 184 AYPDMAVGNAPTVMQGMLQQGQLTQPIFSFYFSRQPTRQYGGELILGGVDTQLYSGQIVW 243
Query: 256 VPVTKKGYWQFELGD-------------ILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
PVT++ YWQ + D +I +STG+C GC AIVD+GT LLA P
Sbjct: 244 TPVTRELYWQIAIQDHWLLLPGRAGYRAFIIETKSTGLCSQGCQAIVDTGTFLLAIPQQF 303
Query: 303 VTEINHAIGGE----GVVSAECKLVVSQYGDLIWDLLVSG---LLPEKVCQQIGLCAFNG 355
+ A G + G +C V S ++SG LP FN
Sbjct: 304 MGSFLQATGAQQAQNGDFVVDCNYVQSM---PTITFIISGSQFPLPPSA------YVFNN 354
Query: 356 AEYVRLGIPIT 366
Y RLGI T
Sbjct: 355 NGYCRLGIEAT 365
>gi|345318884|ref|XP_001520972.2| PREDICTED: renin-like [Ornithorhynchus anatinus]
Length = 388
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/266 (40%), Positives = 168/266 (63%), Gaps = 11/266 (4%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKSRKSNT 135
L N++DAQYFGEIGIGSP Q F VIFDTGS+NLWVPS C +C H+ Y + +S T
Sbjct: 58 LTNYLDAQYFGEIGIGSPAQTFKVIFDTGSANLWVPSINCKPIHSACETHNLYDASQSQT 117
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y E G I+Y SG++ GF SQD V +G + V Q+F E T + +F+ A+FDG++G+
Sbjct: 118 YMENGTQIAISYVSGTVKGFLSQDLVTIGGIPVI-QMFAEITTLPTSSFMYAKFDGVLGM 176
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE---GGEIVFGGVDPKHFKGK 252
G+ A+G PV+D+++ Q ++ E+VFS + +R+ + GGEI+ GG DP +++G
Sbjct: 177 GYPAQAIGGITPVFDHILTQHVLKEDVFSVYYSRNSKNDHMVPGGEIILGGRDPTYYQGD 236
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
Y+ V+KKG+WQ + + + +++ C+ GCAA+VD+G +L+ GP V + +G
Sbjct: 237 FYYLDVSKKGFWQVNMKGVSV-DRTLQFCQEGCAAMVDTGATLITGPVKDVKHMMDILGA 295
Query: 313 EGV----VSAECKLVVSQYGDLIWDL 334
+ + + +CK V+Q D+ + L
Sbjct: 296 QKIGGNMYAVDCK-EVAQLPDISFHL 320
>gi|9910338|ref|NP_064476.1| embryonic pepsinogen precursor [Rattus norvegicus]
gi|7106000|emb|CAB75983.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/281 (38%), Positives = 166/281 (59%), Gaps = 10/281 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE+ + + ++ + + +I PL N++D++YFG I +
Sbjct: 21 RIPLHKGKSLRNTLKEQGLLEDFLRRHQYEFSEKNSNIGMVASEPLTNYLDSEYFGLIYV 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F V+FDTGSS LWVPS C + C H+R+ KS T+ + K + YG+GS
Sbjct: 81 GTPPQEFKVVFDTGSSELWVPSVYCSSKV-CRNHNRFDPSKSFTFQNLSKPLFVQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF + D V V D+VV Q +T E F + FDGI+GL + A +VP++DN
Sbjct: 140 VEGFLAYDTVTVSDIVVPHQTVGLSTEEPGDIFTYSPFDGILGLAYPTFASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
>gi|327271205|ref|XP_003220378.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 167/290 (57%), Gaps = 4/290 (1%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVS-GVRHRLGDSDEDILPLKNFMDA 83
S GL R+ LK+ + ++ + E ++ V +++ + + P+ N++++
Sbjct: 14 SEGLERVILKRGKSIRENMKEKGVL--EEFLKKNHVDPALKYHFNEYNVAYEPITNYLNS 71
Query: 84 QYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSC 143
YFGEI IG+PPQNF V+ D+GSSNLWVPS C + +C H+R+ S+TY+ G++
Sbjct: 72 YYFGEISIGTPPQNFLVVMDSGSSNLWVPSVYC-DTAACAKHNRFSPSASSTYSNSGQTY 130
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
+ YG+G ++ D V V ++VV +Q F + E F A FDGI+G+ + +AVG
Sbjct: 131 TLYYGAGDLTVMLGYDTVMVQNIVVTNQEFGLSENEPMTPFYYASFDGIMGMAYPSLAVG 190
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
V M+ QG +SE +FSF+ +R P + GGE++ GGVD + F G ++ PVT++ Y
Sbjct: 191 GTATVMQQMLNQGQLSEPIFSFYFSRQPTVQYGGELILGGVDTQLFSGDVSWAPVTREVY 250
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
WQ + + IGN++TG C GC AIVD+GT L P A+G E
Sbjct: 251 WQIGVEEFAIGNEATGWCSEGCQAIVDTGTCQLTIPRQYFDTFLQAVGAE 300
>gi|13096225|pdb|1F34|A Chain A, Crystal Structure Of Ascaris Pepsin Inhibitor-3 Bound To
Porcine Pepsin
Length = 326
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
>gi|149025623|gb|EDL81866.1| prochymosin [Rattus norvegicus]
Length = 379
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 108/281 (38%), Positives = 166/281 (59%), Gaps = 10/281 (3%)
Query: 37 RLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDI-----LPLKNFMDAQYFGEIGI 91
R+ LH + R T KE+ + + ++ + + +I PL N++D++YFG I +
Sbjct: 21 RIPLHKGKSLRNTLKEQGLLEDFLRRHQYEFSEKNSNIGVVASEPLTNYLDSEYFGLIYV 80
Query: 92 GSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGS 151
G+PPQ F V+FDTGSS LWVPS C + C H+R+ KS T+ + K + YG+GS
Sbjct: 81 GTPPQEFKVVFDTGSSELWVPSVYCSSKV-CRNHNRFDPSKSFTFQNLSKPLFVQYGTGS 139
Query: 152 ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDN 211
+ GF + D V V D+VV Q +T E F + FDGI+GL + A +VP++DN
Sbjct: 140 VEGFLAYDTVTVSDIVVPHQTVGLSTEEPGDIFTYSPFDGILGLAYPTFASKYSVPIFDN 199
Query: 212 MVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDI 271
M+ + LV++++FS +++R+ ++G + G +D +F G +VPVT +GYWQF + I
Sbjct: 200 MMNRHLVAQDLFSVYMSRN---DQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRI 256
Query: 272 LIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
I N C+GGC A++D+GT+LL GP + I HAIG
Sbjct: 257 TI-NDEVVACQGGCPAVLDTGTALLTGPGRDILNIQHAIGA 296
>gi|326933879|ref|XP_003213025.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 390
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 175/304 (57%), Gaps = 8/304 (2%)
Query: 11 CLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR-HRLGD 69
CL + CL L + G+ RI LKK + + A + E Y+ V+ +
Sbjct: 3 CLVLAVLCLQL---TEGMVRIKLKKGKSIREKMREAGVL--EEYLKKIKHDPVKKYNFSK 57
Query: 70 SDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
++ P+ + +D+ YFGEI IG+PPQNF V+FDTGSSNLWVPS+ C +C H+++K
Sbjct: 58 NNVVYEPMASHLDSSYFGEISIGTPPQNFLVLFDTGSSNLWVPSTLCNMP-ACGNHAKFK 116
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
R S+T+ G+ ++YGSG+++ D + + + V++Q F + E + F A+F
Sbjct: 117 PRASSTFINNGQKVTLSYGSGTLTVVLGYDTLRIQTISVRNQEFGLSRDEPTQPFYYAQF 176
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+G+ + +AVG A P+ M++Q + + +FSF+ +R+P GGE+V GGVD + F
Sbjct: 177 DGIMGMAYPALAVGGATPL-QGMLQQNQLKQPIFSFYFSRNPTYNYGGELVLGGVDSRLF 235
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G + PVT++ YWQ + + IG G C GC AIVD+GT LL P ++ + A
Sbjct: 236 TGDIVWAPVTQELYWQVAIDEFAIGQSVMGWCSQGCQAIVDTGTFLLTVPQQYLSRLLKA 295
Query: 310 IGGE 313
+G +
Sbjct: 296 VGAQ 299
>gi|298706992|emb|CBJ29800.1| aspartyl protease [Ectocarpus siliculosus]
Length = 410
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 187/337 (55%), Gaps = 31/337 (9%)
Query: 5 LLRSVFCLWVLASCLLLPASSNG---LRRIGLKKR--------RLD----LHSLNAARIT 49
+ R+ L VL S LL N + R+ L KR +LD H A +
Sbjct: 1 MARASSVLTVLGSLLLASTCHNASAAVHRVKLSKRPDKEFVNSKLDKAHHRHHEGADEPS 60
Query: 50 RKERYMGGAGVSG-----VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDT 104
R + + A + G + L S E + +K++ +AQY+G++ IG+PPQ+F VIFDT
Sbjct: 61 RHDEGVLQANLRGAVEQVLMSELEASGEGKVIVKDYQNAQYYGQVEIGTPPQSFEVIFDT 120
Query: 105 GSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVG 164
GS+NLWV SKC +SC HSRY + KS+T+ E G+ EI Y SG +SG S D V G
Sbjct: 121 GSANLWVAGSKC--GLSCGLHSRYAASKSSTHAEDGRDFEITYASGPVSGSLSADTVTWG 178
Query: 165 DVVVKDQVFIEA--TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEV 222
+ +KDQ F E + L F+L +FDGI+GL F EI+V + +VE+G + + V
Sbjct: 179 GIQLKDQTFAEVQDAKGLGLAFILGKFDGIMGLAFDEISVEGVPTPFGRLVEEGELDDAV 238
Query: 223 FSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCE 282
F+F+L ++ GE++ GG DP H+ + YVPVTKKGYWQ ++ ++ + S +
Sbjct: 239 FAFYLGN----QKEGELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDNVDVSGSSVTSVK 294
Query: 283 GGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGVVSAE 319
+AI+DSGTSLL GP V +I +G ++ E
Sbjct: 295 ---SAILDSGTSLLVGPKEDVKKIASKVGAISFMNGE 328
>gi|395537495|ref|XP_003770734.1| PREDICTED: renin [Sarcophilus harrisii]
Length = 413
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 174/311 (55%), Gaps = 26/311 (8%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKK----------RRLDLHSLNAARITRKERYMGGAGVS 61
L V+ S S+ L+RI LKK + DL N ++ ++ +S
Sbjct: 7 LLVVWSTCFFSLPSDALQRIVLKKMPSIQENMKLKGKDLGKFNMEWLSYTKQLTLFNVMS 66
Query: 62 GVRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSI 120
VR L NF D QY+GEI IG+P Q F V+FDTGS++ WVPSSKC
Sbjct: 67 PVR------------LTNFEDTQYYGEISIGNPSQTFQVVFDTGSADFWVPSSKCSPLYT 114
Query: 121 SCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREG 180
+C FH +Y S KS+TY E G +I Y SG + GF S+D V VG + + Q F E T
Sbjct: 115 ACVFHHQYDSTKSSTYKENGTEFKIQYASGQVMGFLSEDTVTVGGIKMT-QSFGEVTVLP 173
Query: 181 SLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIV 240
L F LA+FDG++GLGF +++ VP +DN++ QG++ +EVFS + +R+ GGEI+
Sbjct: 174 LLPFGLAKFDGVLGLGFPALSMSKIVPFFDNIISQGMLKKEVFSVYYSRNSHV-PGGEII 232
Query: 241 FGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPT 300
GG DPK+++G Y+ ++ G+WQ ++ + + + C+ GC A VD+G S + GPT
Sbjct: 233 LGGSDPKYYRGTFHYINISHPGFWQIQMNGVSV-ESNVLACQDGCIASVDTGASFITGPT 291
Query: 301 PVVTEINHAIG 311
+ ++ +G
Sbjct: 292 SSMRKVMKMLG 302
>gi|253723333|pdb|4PEP|A Chain A, The Molecular And Crystal Structures Of Monoclinic Porcine
Pepsin Refined At 1.8 Angstroms Resolution
Length = 326
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 159/249 (63%), Gaps = 12/249 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 5 PLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 63
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDGIIG 194
+ + I YG+GS++G D V+VG + +Q+F + T GS + A FDGI+G
Sbjct: 64 FEATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDGILG 122
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L + I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G
Sbjct: 123 LAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLN 180
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
+VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 181 WVPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASE 239
Query: 312 ---GEGVVS 317
GE V+S
Sbjct: 240 NSDGEMVIS 248
>gi|440894789|gb|ELR47149.1| Pregnancy-associated glycoprotein 2, partial [Bos grunniens mutus]
Length = 397
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 106/278 (38%), Positives = 163/278 (58%), Gaps = 8/278 (2%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPP 95
L L + R T +E+ + + +RL +D I PL+N++D Y G I IG+PP
Sbjct: 40 LPLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITIHPLRNYLDTAYVGNITIGTPP 99
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
Q F V+FDTGS+NLWVP C S +CY H + + S+++ E+G I YGSG I GF
Sbjct: 100 QEFRVVFDTGSANLWVPCITCT-SPACYTHKTFNPQNSSSFREVGSPITIFYGSGIIQGF 158
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V +G++V +Q F + E L FDGI+GL F + + D +P++DN+
Sbjct: 159 LGSDTVRIGNLVSPEQSFGLSLEEYGFDSL--PFDGILGLAFPPMGIEDTIPIFDNLWSH 216
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G SE VF+F+LN + EG ++FGGVD +++KG+ ++PV++ +WQ + +I + N
Sbjct: 217 GAFSEPVFAFYLNT--NKPEGSVVMFGGVDHRYYKGELNWIPVSQTSHWQISMNNISM-N 273
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ C GC A++D+GTSL+ GPT +VT I+ +
Sbjct: 274 GTVTACSCGCEALLDTGTSLIYGPTKLVTNIHKLMNAR 311
>gi|1585064|prf||2124254A pepsin:ISOTYPE=3a
gi|1585065|prf||2124254B pepsin:ISOTYPE=3b
Length = 326
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 159/258 (61%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLATPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
>gi|426333518|ref|XP_004028323.1| PREDICTED: cathepsin E isoform 2 [Gorilla gorilla gorilla]
Length = 363
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/256 (42%), Positives = 148/256 (57%), Gaps = 12/256 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+VIFDTGSSNLWVPS C S +C HSR++ +S+T
Sbjct: 69 PLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSST 127
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y++ G+S I YG+GS+SG D V V + V Q F E+ E TF+ A FDGI+GL
Sbjct: 128 YSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 187
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+ +AVG PV+DNM+ Q LV +FS +++ +P+ G E++FGG D HF G +
Sbjct: 188 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 247
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEGV 315
VPVTK+ YWQ L ++L + C + +S PTP T
Sbjct: 248 VPVTKQAYWQIALDNMLWSVPTLTSCRMSPSPSTESPIPSAQLPTPYWTSW--------- 298
Query: 316 VSAECKLVVSQYGDLI 331
EC + + DL
Sbjct: 299 --MECSSAAAAFKDLT 312
>gi|403261257|ref|XP_003923041.1| PREDICTED: gastricsin [Saimiri boliviensis boliviensis]
Length = 388
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/301 (37%), Positives = 173/301 (57%), Gaps = 4/301 (1%)
Query: 13 WVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDE 72
W++ + + L + ++ LKK + ++ + R+ +G ++ D
Sbjct: 3 WMVVAFVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLREFLKTHKRDPAG-KYHFSDLSV 61
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
P+ ++MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+
Sbjct: 62 SYEPM-DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSA 119
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGI 192
S+TY+ G++ + YGSGS++G F D + V + V +Q F + E F+ A+FDGI
Sbjct: 120 SSTYSSNGQTFSLQYGSGSLTGLFGYDTLTVQSIQVPNQEFGLSENEPGTNFIYAQFDGI 179
Query: 193 IGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGK 252
+GL + ++VG A M+++ +++ VFSF+L+ GG +VFGGVD + G+
Sbjct: 180 MGLAYPALSVGGATTAMQGMLQEDVLTSPVFSFYLSNQ-QGSSGGAVVFGGVDSSLYTGQ 238
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ PVT++ YWQ + + LIG Q++G C GC AIVD+GTSLL P ++ A G
Sbjct: 239 IYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGA 298
Query: 313 E 313
+
Sbjct: 299 Q 299
>gi|195570151|ref|XP_002103072.1| GD19155 [Drosophila simulans]
gi|194198999|gb|EDX12575.1| GD19155 [Drosophila simulans]
Length = 395
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 178/323 (55%), Gaps = 13/323 (4%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKE---RYMGGAGVSGVRHRLG 68
LW+L CL L RI ++ + + S R R +Y G V R G
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSSLLAKY-NVVGEQEVTSRNG 67
Query: 69 DSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSR 127
+ E L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H R
Sbjct: 68 GATET---LDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHHHR 124
Query: 128 YKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLA 187
Y S S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+
Sbjct: 125 YNSSASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVRNQTFGMATHEPGPTFVDT 184
Query: 188 RFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPK 247
F GI+GLGFR IA P++++M +Q LV + VFSF+L R+ +GGE++FGGVD
Sbjct: 185 NFAGIVGLGFRPIAELGIKPLFESMCDQQLVDDCVFSFYLKRNGSERKGGELLFGGVDKT 244
Query: 248 HFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEIN 307
F G TYVP+T GYWQF L I + AI D+GTSLLA P IN
Sbjct: 245 KFSGSLTYVPLTHAGYWQFPLDAIEVAGTRITQHR---QAIADTGTSLLAAPPREYLIIN 301
Query: 308 HAIGGEGVVSAECKLVVSQYGDL 330
+GG + E L S+ L
Sbjct: 302 SLLGGLPTSNNEYLLNCSEIDSL 324
>gi|367000932|ref|XP_003685201.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
gi|357523499|emb|CCE62767.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
Length = 419
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/241 (43%), Positives = 153/241 (63%), Gaps = 5/241 (2%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQY+ +I +G+P QNF VI DTGSSNLWVPS C S++CY HS+Y +S
Sbjct: 94 VPLSNYLNAQYYTDISLGTPKQNFKVILDTGSSNLWVPSKDCT-SLACYLHSKYDHDEST 152
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVG-DVVVKDQVFIEATREGSLTFLLARFDGII 193
TY + G I YGSGS+ G+ S+D + +G D+V+ +Q F EAT E L F +FDGI+
Sbjct: 153 TYEKNGTKFTIQYGSGSMDGYISRDTLIIGDDLVIPEQDFAEATSEPGLAFAFGKFDGIL 212
Query: 194 GLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGK 252
GL + IAV VP + N ++QG++ E F+F+L + + D + GGE FGG D F G
Sbjct: 213 GLAYDTIAVNKVVPPFYNAIKQGILDENKFAFYLGDTNKDNKSGGEATFGGYDKSKFTGD 272
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
T++PV +K YW+ + I +G++ + G A +D+GTSL+ P+ + IN IG
Sbjct: 273 ITWLPVRRKAYWEVKFDSIALGDEVASL--DGYGAAIDTGTSLITLPSGLAEVINTQIGA 330
Query: 313 E 313
+
Sbjct: 331 K 331
>gi|123431419|ref|XP_001308165.1| Clan AA, family A1, cathepsin D-like aspartic peptidase
[Trichomonas vaginalis G3]
gi|121889831|gb|EAX95235.1| Clan AA, family A1, cathepsin D-like aspartic peptidase
[Trichomonas vaginalis G3]
Length = 370
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/249 (44%), Positives = 152/249 (61%), Gaps = 10/249 (4%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL +F DAQY+ EI IG+P Q F V DTGSSNLWVPS KC SI+C+ H+RY S KS+
Sbjct: 47 VPLHDFSDAQYYTEITIGTPAQKFKVCPDTGSSNLWVPSKKCN-SIACWLHTRYDSSKSS 105
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TYT G+ +I YGSGS GF SQD V++ + K F E EGS++F+ A+FDGI+G
Sbjct: 106 TYTADGREVDIQYGSGSCKGFASQDEVQIAGITDK-MTFAEMKEEGSISFIAAKFDGILG 164
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F+ I+V P + E G + + +F L R + E GE+ GG +P F G+ T
Sbjct: 165 LAFQNISVQGIPPPLQILYEHGEIEDYTVAFKLGR--TSGEDGEMTIGGYNPDAFSGEIT 222
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGG--CAAIVDSGTSLLAGPTPVVTEINHAIGG 312
+ V K+ +W FE D+L+ + S GVC G CAAI+D+GTS+L GP + I I
Sbjct: 223 WFNVAKELWWYFEFDDVLVNDVSAGVCPAGGKCAAILDTGTSMLIGPVSAMDVIMKNID- 281
Query: 313 EGVVSAECK 321
+ A C+
Sbjct: 282 ---IDARCQ 287
>gi|1585066|prf||2124254C pepsin:ISOTYPE=3c
Length = 326
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 109/258 (42%), Positives = 160/258 (62%), Gaps = 10/258 (3%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+V+FDTGSSNLWVPS C S++C H+R+
Sbjct: 2 DEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 60
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 61 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 119
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL I+ A PV+DN+ QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 120 ILGLATPSISSSGATPVFDNIWNQGLVSQDLFSVYLSA--DDKSGSVVIFGGIDSSYYTG 177
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPVT +GYWQ + I + ++ C GC AIVD+GTSLL GPT + +I IG
Sbjct: 178 SLNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIAKIQSDIG 236
Query: 312 G----EGVVSAECKLVVS 325
+G + C + S
Sbjct: 237 ASENSDGDMVVSCSAISS 254
>gi|11990128|emb|CAC19555.1| pepsin A [Camelus dromedarius]
Length = 390
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 173/301 (57%), Gaps = 27/301 (8%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG I IG+P QNF+VIFDTGSSNLWVPS C S +C H+R+ +
Sbjct: 65 DEQPLENYLDTEYFGTISIGTPAQNFTVIFDTGSSNLWVPSIYCS-SSACTNHNRFNPEE 123
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY ++ I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 124 SSTYQGTDETLSITYGTGSMTGILGYDTVQVGGISDVNQIFGLSETEPGSFLYY-APFDG 182
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ PV+DN+ ++GL+SE++FS +L+ + E G ++FGG+D ++ G
Sbjct: 183 ILGLAYPSISSSGGTPVFDNIWDEGLISEDLFSVYLSSND--ESGSVVIFGGIDSSYYTG 240
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPV+ +GYWQ + I + +S C GC AIVD+GTSLLAGPT ++ I IG
Sbjct: 241 SLNWVPVSVEGYWQITVDSITMEGESIA-CSSGCQAIVDTGTSLLAGPTDAISNIQSYIG 299
Query: 312 GEGVVSAECKLVVSQYGDLIWDLLVSGLLPEKVCQQIGLCAFNGAEYVRLGIPITRVLFV 371
YGD++ LP V NG +Y P++ ++
Sbjct: 300 ASE----------DSYGDMVVSCSSISSLPNIV------FTINGVQY-----PLSPSAYI 338
Query: 372 L 372
L
Sbjct: 339 L 339
>gi|18152941|gb|AAB68519.2| proteinase A [Ogataea angusta]
gi|320580237|gb|EFW94460.1| proteinase A [Ogataea parapolymorpha DL-1]
Length = 413
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/248 (42%), Positives = 153/248 (61%), Gaps = 4/248 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+++AQYF EI +G+P Q+F VI DTGSSNLWVPSS C S++CY H++Y +S+T
Sbjct: 90 PLTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNLWVPSSDC-TSLACYLHTKYDHDESST 148
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y + G S I YGSGS+ G+ SQD + +GD+V+ Q F EAT E L F +FDGI+GL
Sbjct: 149 YQKNGSSFAIQYGSGSLEGYVSQDTLTIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGL 208
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEE-GGEIVFGGVDPKHFKGKHT 254
+ I+V VP N + GL+ F F+L +E+ GGE FGG D + G T
Sbjct: 209 AYDTISVNRIVPPIYNAINLGLLDTPQFGFYLGDTSKSEQDGGEATFGGYDVSKYTGDIT 268
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
++PV +K YW+ + I +G++ + G A +D+GTSL+A P+ + +N IG E
Sbjct: 269 WLPVRRKAYWEVKFSGIALGDEYAPLENTGAA--IDTGTSLIALPSQLAEILNSQIGAEK 326
Query: 315 VVSAECKL 322
S + ++
Sbjct: 327 SWSGQYQI 334
>gi|34740274|dbj|BAC87742.1| pepsinogen [Paralichthys olivaceus]
Length = 377
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/253 (41%), Positives = 160/253 (63%), Gaps = 9/253 (3%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G I IG+PPQ+FSVIFDTGSSNLW+PS C S +C H R+ ++S+T+
Sbjct: 62 MTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHKRFNPQQSSTF 120
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ I YG+GS++G+ + D VEVG + V +QVF + E + + DGI+GL
Sbjct: 121 HWGNRPLSIQYGTGSMTGYLASDTVEVGGISVANQVFGISQSEAPFMAHM-KADGILGLA 179
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+DNM++Q LVS+ +FS +L+ + ++G E+VFGG+D H+ G+ +++
Sbjct: 180 FQSIASDNVVPVFDNMIKQNLVSQPLFSVYLSS--NNQQGSEVVFGGIDGNHYTGQVSWI 237
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE--- 313
P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 238 PLTSATYWQIKMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTNDINNMNSWVGASTNQ 296
Query: 314 -GVVSAECKLVVS 325
G + C+ + S
Sbjct: 297 YGEATVNCQNIQS 309
>gi|344234771|gb|EGV66639.1| Asp-domain-containing protein [Candida tenuis ATCC 10573]
Length = 425
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 110/264 (41%), Positives = 158/264 (59%), Gaps = 9/264 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+ +AQYF EI +G+P Q F VI DTGSSNLW+PS C S++CY HS+Y S+T
Sbjct: 102 PLSNYANAQYFTEIEVGTPGQPFKVILDTGSSNLWIPSQDCS-SLACYLHSKYDHDASST 160
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G I YGSG++ G+ S D + +GD+++K+Q F EAT E L F +FDGI+GL
Sbjct: 161 YKANGSEFAIQYGSGAMEGYVSTDALRIGDLLIKNQDFAEATSEPGLAFAFGKFDGILGL 220
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V VP N + QGL+ E+ F+F+L + + D E+GG FGG D F GK T
Sbjct: 221 AYDTISVNKIVPPVYNAINQGLLDEKSFAFYLGDTNKDEEDGGVATFGGYDESKFTGKIT 280
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG-- 312
++PV +K YW+ L + +G++ + G A +D+GTSL+ P+ + IN IG
Sbjct: 281 WLPVRRKAYWEVSLEGLGLGDEFAELKSTGAA--IDTGTSLITLPSSLAEIINAKIGAVK 338
Query: 313 --EGVVSAECKLVVSQYGDLIWDL 334
G + EC + DL ++L
Sbjct: 339 SWSGQYTVECD-ARANLPDLTFNL 361
>gi|391867010|gb|EIT76268.1| aspartyl protease [Aspergillus oryzae 3.042]
Length = 390
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 178/303 (58%), Gaps = 13/303 (4%)
Query: 14 VLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDED 73
+LA LLL ++ + R+ L+K L S + T +RY+G + + L D D
Sbjct: 5 LLAVPLLLSYTAAEIHRVPLEKELLVFGSDDDDTRTSSQRYIGS---NTHQKALQDHGPD 61
Query: 74 IL----PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYK 129
IL P+KN + QYF I IG+PPQ F V+ DTGS+NLWVPSSKC +ISC H +YK
Sbjct: 62 ILGHDIPVKNHRNTQYFSTIRIGTPPQKFKVVLDTGSANLWVPSSKCK-TISCKKHKKYK 120
Query: 130 SRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARF 189
S S+TY G EI YGSG ++G S+D +GD+ V++Q+F EAT+ + + A
Sbjct: 121 SALSDTYHNNGSEFEIYYGSGGMTGHVSEDIFTIGDLKVQEQLFGEATKVSGFSNVKA-- 178
Query: 190 DGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHF 249
DGI+GLGF I+V P + NM++Q L+ E VF+F+L+ D EI FGGVD +H+
Sbjct: 179 DGILGLGFASISVNSIPPPFYNMLDQNLLDEPVFAFYLS-DTYKGRTSEITFGGVDEQHY 237
Query: 250 KGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHA 309
G+ +P+ +K YW+ E + G+ V + G AI+D+G+SL+ P+ + +N
Sbjct: 238 SGEIVKIPLRRKAYWEVEFSGLFFGDHFADVEDTG--AILDTGSSLIGLPSGLFETVNKE 295
Query: 310 IGG 312
IG
Sbjct: 296 IGA 298
>gi|189011689|ref|NP_001098064.1| pepsin A precursor [Macaca mulatta]
gi|129793|sp|P11489.1|PEPA_MACMU RecName: Full=Pepsin A; Flags: Precursor
gi|342275|gb|AAA36902.1| pepsinogen A precursor (EC 3.4.23.1) [Macaca mulatta]
Length = 388
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/269 (41%), Positives = 166/269 (61%), Gaps = 15/269 (5%)
Query: 73 DILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRK 132
D PL+N++D +YFG IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+ + +
Sbjct: 64 DEQPLENYLDVEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPQD 122
Query: 133 SNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVF-IEATREGSLTFLLARFDG 191
S+TY + I YG+GS++G D V+VG + +Q+F + T GS + A FDG
Sbjct: 123 SSTYQSTSGTLSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYY-APFDG 181
Query: 192 IIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKG 251
I+GL + I+ A PV+DN+ +QGLVS+++FS +L+ D + G ++FGG+D ++ G
Sbjct: 182 ILGLAYPSISSSGATPVFDNIWDQGLVSQDLFSVYLSA--DDQSGSVVIFGGIDSSYYTG 239
Query: 252 KHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG 311
+VPV+ +GYWQ + I + ++ C GC AIVD+GTSLL GPT + I IG
Sbjct: 240 SLNWVPVSVEGYWQISVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIG 298
Query: 312 ------GEGVVSAECKLVVSQYGDLIWDL 334
GE VVS +S D+++ +
Sbjct: 299 ASENSDGEMVVSCSA---ISSLPDIVFTI 324
>gi|292658825|ref|NP_999038.2| pepsin A preproprotein [Sus scrofa]
gi|121073319|gb|ABM47074.1| pepsinogen A [Sus scrofa]
Length = 385
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 106/248 (42%), Positives = 156/248 (62%), Gaps = 10/248 (4%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL+N++D +Y G IGIG+P Q+F+VIFDTGSSNLWVPS C S++C H+++ S+T
Sbjct: 64 PLENYLDTEYLGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSST 122
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + I YG+GS++G D V+VG + +Q+F + E + A FDGI+GL
Sbjct: 123 FEATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSSLYYAPFDGILGL 182
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
+ I+ A PV+DN+ +QGLVS+++FS +L+ + D+ G ++ GG+D ++ G +
Sbjct: 183 AYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDS--GSVVLLGGIDSSYYTGSLNW 240
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG---- 311
VPV+ +GYWQ L I + + T C GGC AIVD+GTSLL GPT + I IG
Sbjct: 241 VPVSVEGYWQITLDSITMDGE-TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASEN 299
Query: 312 --GEGVVS 317
GE V+S
Sbjct: 300 SDGEMVIS 307
>gi|402855684|ref|XP_003892446.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Papio anubis]
Length = 377
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 101/250 (40%), Positives = 154/250 (61%), Gaps = 1/250 (0%)
Query: 63 VRHRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
++R + P+ N+M + YFGEI IG+PPQNF ++FDTGSSNLWVPS C S +C
Sbjct: 39 AKYRFNNDAVAYEPITNYMXSFYFGEISIGTPPQNFLLLFDTGSSNLWVPSIYCQ-SQAC 97
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
H+R+ S+T+ G++ ++YGSG++S F D V V +++V +Q F + E S
Sbjct: 98 SNHNRFNPSLSSTFRNNGQTYTLSYGSGNLSVFLGYDTVTVQNIIVNNQEFGLSENELSD 157
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F + FDGI+G+ + +AVG++ V M++QG +++ FSF+ P + GGE++ G
Sbjct: 158 PFYYSDFDGILGMAYPSMAVGNSPTVMQGMLQQGQITQPDFSFYFTHQPTRQYGGELILG 217
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
GVDP+ + G+ PVT++ YWQ + + +GNQ+TG+C GC AIV +GT LLA P
Sbjct: 218 GVDPQLYSGQIIXTPVTRELYWQIPIEEFAVGNQATGLCSEGCQAIVVTGTFLLAVPQQY 277
Query: 303 VTEINHAIGG 312
+ A G
Sbjct: 278 MGSFLQATGA 287
>gi|118344566|ref|NP_001072055.1| nothepsin precursor [Takifugu rubripes]
gi|55771088|dbj|BAD69804.1| nothepsin [Takifugu rubripes]
Length = 414
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 117/300 (39%), Positives = 175/300 (58%), Gaps = 21/300 (7%)
Query: 21 LPASSNGLRRIG-----LKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL 75
+P+ + LR G L++RR DL + RY +G R+ E
Sbjct: 30 MPSMRSQLRADGQLSAFLQERRPDLF---------QRRYFQCFPATGPSLRVERFSET-- 78
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
L N+MD Q++GEI +G+P QNFSV+FDTGSS+LWVPS C H R+K+ +S +
Sbjct: 79 -LYNYMDVQFYGEIELGTPGQNFSVVFDTGSSDLWVPSVYCVSQTCGTVHRRFKAFESTS 137
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G+ EI+YGSG + G ++D ++V +V V++Q F E+ E + F++A FDGI+G+
Sbjct: 138 YRHDGRVFEIHYGSGHMLGIMARDTLKVNNVTVQNQEFGESVYEPGVAFVMAHFDGILGM 197
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLN---RDPDAEEGGEIVFGGVDPKHFKGK 252
G+ +A PV+DNM+ Q +V E +FSF+L+ R ++ GE++ GG+D F G
Sbjct: 198 GYPSLAQILGNPVFDNMLAQQMVEEPIFSFYLSKYERFSGSKLQGELLLGGMDQDLFTGP 257
Query: 253 HTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
++PVT KGYWQ ++ + + T C GC AIVD+GTSL+AGPT + + IG
Sbjct: 258 INWLPVTTKGYWQIKVDSVAVQGVDT-FCPEGCQAIVDTGTSLIAGPTRDILRLQQLIGA 316
>gi|162423778|gb|ABX89619.1| pepsinogen [Diplodus sargus]
Length = 376
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 103/237 (43%), Positives = 154/237 (64%), Gaps = 5/237 (2%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTY 136
+ N D Y+G + IG+PPQ+FSVIFDTGSSNLW+PS C S +C H ++ ++S+T+
Sbjct: 61 MTNDADLSYYGVVSIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACQNHKKFNPQQSSTF 119
Query: 137 TEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLG 196
+ I YG+GS++G+ + D VEVG + V +QVF + E + +A DGI+GL
Sbjct: 120 KWGNQQLSIQYGTGSMTGYLASDVVEVGGISVANQVFGISQTEAAFMASMAA-DGILGLA 178
Query: 197 FREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYV 256
F+ IA + VPV+ NMV+QGLVS+ +FS +L+ ++E+G E+VFGG D H+ G+ T++
Sbjct: 179 FQSIASDNVVPVFYNMVKQGLVSQPMFSVYLSG--NSEQGSEVVFGGTDSSHYTGQITWI 236
Query: 257 PVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
P++ YWQ + + I Q T C GGC AI+D+GTSL+ GPT + +N +G
Sbjct: 237 PLSSATYWQISMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPTSDINNMNSWVGAS 292
>gi|195159708|ref|XP_002020720.1| GL15705 [Drosophila persimilis]
gi|194117670|gb|EDW39713.1| GL15705 [Drosophila persimilis]
Length = 408
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 171/317 (53%), Gaps = 38/317 (11%)
Query: 26 NGLRRIGLKKRRLDLHSLNAA----RITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFM 81
N L+ + +K H LNA G SG R L N
Sbjct: 44 NELKSLSIK------HKLNATTTAPETASAPETTKDPGSSGTR------------LGNAF 85
Query: 82 DAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSI-SCYFHSRYKSRKSNTYTEIG 140
+ +Y+ + IG+PPQ F ++ DTGSSNLWVPSSKC ++ SC H++Y S+ S++Y G
Sbjct: 86 NTEYYLPVTIGTPPQEFILLIDTGSSNLWVPSSKCPATVKSCVSHNQYDSKSSSSYVANG 145
Query: 141 KSCEINYGSGS-----ISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ I Y S S +SG SQD V + ++ ++ QVF E T E TFL + FDG+ GL
Sbjct: 146 TAFTIEYASKSEGGVALSGILSQDTVTIAELAIQRQVFAEITDEPEATFLSSPFDGMFGL 205
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRD-PDAEEGGEIVFGGVDPKHFKGKHT 254
G+ I++G P + N+V QGL+ VFS +LNR+ +A +GGE+V GG+D F G T
Sbjct: 206 GYASISIGGVTPPFYNLVAQGLIKHPVFSIYLNRNGTNATDGGELVLGGIDATLFSGCLT 265
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIG--- 311
YVPV+++GYWQF + ++G ++ C C AI+D GTSLL PT + +IN +
Sbjct: 266 YVPVSQQGYWQFVMTSAVLGGKT--FCT-HCQAILDVGTSLLVAPTAAIKKINQLLAVLN 322
Query: 312 ---GEGVVSAECKLVVS 325
GV C + S
Sbjct: 323 PKDASGVFLVNCSTIAS 339
>gi|56971213|gb|AAH88063.1| LOC496913 protein, partial [Xenopus (Silurana) tropicalis]
Length = 380
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 170/309 (55%), Gaps = 19/309 (6%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L+ ++ CL + + +P L+R + + H + A + +Y
Sbjct: 1 LILALVCLQLSEGIIKVP-----LKRFKSMREVMREHGIKAPIVDPASKY---------Y 46
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
++ + E PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C
Sbjct: 47 NQYATAFE---PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTNCQ-SQACTN 102
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H + +S+TY+ + + YG+GS++G D V + ++ + Q F + E F
Sbjct: 103 HPLFNPSQSSTYSSNQQQFSLQYGTGSLTGILGYDTVTIQNIAISQQEFGLSVTEPGTNF 162
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GL + IAVG A V M++Q L++E VF F+L+ + + + GGE+ FGGV
Sbjct: 163 VYAQFDGILGLAYPSIAVGGATTVMQGMLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGV 221
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D ++ G+ + PVT + YWQ + I Q++G C GC IVD+GTSLL P +
Sbjct: 222 DQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQGIVDTGTSLLTAPQSIFA 281
Query: 305 EINHAIGGE 313
+ IG +
Sbjct: 282 SLMQDIGAQ 290
>gi|395838962|ref|XP_003792373.1| PREDICTED: cathepsin E [Otolemur garnettii]
Length = 394
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 104/237 (43%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N++D +YFG I IGSPPQNF+V FDTGS + WVPS+ C S +C H+++ +SNT
Sbjct: 69 PLTNYLDVEYFGNISIGSPPQNFTVCFDTGSPDFWVPSAFCK-SRACKKHAKFCPSQSNT 127
Query: 136 YTEI-GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
+T + GK+ I YG+GS SG D V VG + V +Q F EA +E F +FDGI+G
Sbjct: 128 HTRLEGKTFSIQYGTGSCSGIIGVDRVSVGGLTVPNQPFGEALKEPGKVFAHVQFDGIMG 187
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNR-DPDAEEGGEIVFGGVDPKHFKGKH 253
L + +A PV+DNM+ Q LV + +FS +++ + +G E++FGG D HF G+
Sbjct: 188 LSYPSLAEDGMTPVFDNMITQKLVDQPIFSIYMSSTNQKGGKGSELIFGGYDHSHFTGRL 247
Query: 254 TYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
+VPV+K+ YWQ ++ I +G +S +C GC AIVD+GTS + GP+ + ++ AI
Sbjct: 248 NWVPVSKQEYWQIKVDKIRVG-RSVMLCSKGCQAIVDTGTSSITGPSDDIRQLQKAI 303
>gi|301606848|ref|XP_002933026.1| PREDICTED: gastricsin isoform 2 [Xenopus (Silurana) tropicalis]
Length = 382
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 170/309 (55%), Gaps = 19/309 (6%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L+ ++ CL + + +P L+R + + H + A + +Y
Sbjct: 4 LILALVCLQLSEGIIKVP-----LKRFKSMREVMREHGIKAPIVDPASKY---------Y 49
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
++ + E PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C
Sbjct: 50 NQYATAFE---PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTNCQ-SQACTN 105
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H + +S+TY+ + + YG+GS++G D V + ++ + Q F + E F
Sbjct: 106 HPLFNPSQSSTYSSNQQQFSLQYGTGSLTGILGYDTVTIQNIAISQQEFGLSVTEPGTNF 165
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GL + IAVG A V M++Q L++E VF F+L+ + + + GGE+ FGGV
Sbjct: 166 VYAQFDGILGLAYPSIAVGGATTVMQGMLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGV 224
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D ++ G+ + PVT + YWQ + I Q++G C GC IVD+GTSLL P +
Sbjct: 225 DQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQGIVDTGTSLLTAPQSIFA 284
Query: 305 EINHAIGGE 313
+ IG +
Sbjct: 285 SLMQDIGAQ 293
>gi|410082415|ref|XP_003958786.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
gi|372465375|emb|CCF59651.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
Length = 416
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/256 (42%), Positives = 150/256 (58%), Gaps = 3/256 (1%)
Query: 75 LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSN 134
+PL N+++AQYF +I IGSP Q F VI DTGSSNLWVPS C S++C+ H++Y R S+
Sbjct: 89 VPLNNYLNAQYFADISIGSPGQTFRVIMDTGSSNLWVPSVDCN-SLACFLHNKYDHRVSS 147
Query: 135 TYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIG 194
TY G I YGSG++ G+ S D V VGD+ + Q F EAT E L F +FDGI G
Sbjct: 148 TYVRNGTRFAIRYGSGALEGYMSNDTVTVGDLQIPKQDFAEATSEPGLAFAFGKFDGIFG 207
Query: 195 LGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHT 254
L F I+V AVP + N V +GL+ F+F+L +EGGE+ FGG D F G T
Sbjct: 208 LAFDTISVNRAVPPFYNAVNRGLLDAPQFAFYLGDKRLRKEGGEVTFGGYDETRFTGNIT 267
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGEG 314
++PV ++ YW+ + I G+Q + G A +D+GTSL+ P+ + +N IG
Sbjct: 268 WLPVRREAYWEVDFNGISFGSQYAPLTATGAA--IDTGTSLITLPSGLAEILNAQIGARK 325
Query: 315 VVSAECKLVVSQYGDL 330
S + L S+ L
Sbjct: 326 NWSGQYVLDCSRRSTL 341
>gi|193499297|gb|ACF18591.1| pepsinogen C precursor [Siniperca chuatsi]
gi|253762213|gb|ACT35558.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/354 (33%), Positives = 196/354 (55%), Gaps = 28/354 (7%)
Query: 21 LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNF 80
+P + R L+++ ++L + A + + + G A ++ + N+
Sbjct: 20 IPLRKHKSMREALREKGIELPYQDPALKYQADEFAGSANMN---------------INNY 64
Query: 81 MDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIG 140
D Y+G I IG+PPQ+F V+FDTGS+NLWV S C + +C H+++ ++S+T+T G
Sbjct: 65 ADTTYYGAISIGTPPQSFQVLFDTGSANLWVDSVYCN-TEACNAHTKFNPQQSSTFTAKG 123
Query: 141 KSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREI 200
+S + YG+GS+ G F D V VG +V+ +Q +T E TF +A+FDGI+GL + I
Sbjct: 124 QSFYLPYGAGSLYGVFGYDTVNVGGIVITNQEIGLSTNEPGETFAVAQFDGILGLSYPTI 183
Query: 201 AVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTK 260
+ G A PV DNM+ Q L++ ++F+F+L+ ++G E+ FGGVD ++G+ + PVT
Sbjct: 184 SAGGATPVMDNMISQNLLNADIFAFYLSS--GEQQGSELSFGGVDSSMYQGQIYWTPVTS 241
Query: 261 KGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE----GVV 316
+ YWQ + I Q +G C GC +IVD+GTS+L P+ ++ I AIG + G+
Sbjct: 242 ETYWQIGVQGFQINGQESGWCSQGCQSIVDTGTSMLTAPSQLLGYIMQAIGAQQNQYGMY 301
Query: 317 SAECKLVVSQYGDL-IWDLLVSGL-LPEKVCQQIGLCAFNGAEYVRLGIPITRV 368
+C SQ +L ++SG+ P I NG +Y +GI T +
Sbjct: 302 MVDC----SQVNNLPTLTFVISGVSFPLPPSAYIIEHTQNGYQYCSVGITPTYL 351
>gi|301606846|ref|XP_002933025.1| PREDICTED: gastricsin isoform 1 [Xenopus (Silurana) tropicalis]
Length = 383
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 109/309 (35%), Positives = 170/309 (55%), Gaps = 19/309 (6%)
Query: 5 LLRSVFCLWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVR 64
L+ ++ CL + + +P L+R + + H + A + +Y
Sbjct: 4 LILALVCLQLSEGIIKVP-----LKRFKSMREVMREHGIKAPIVDPASKY---------Y 49
Query: 65 HRLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYF 124
++ + E PL N+MD Y+GEI IG+PPQNF V+FDTGSSNLWV S+ C S +C
Sbjct: 50 NQYATAFE---PLANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTNCQ-SQACTN 105
Query: 125 HSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTF 184
H + +S+TY+ + + YG+GS++G D V + ++ + Q F + E F
Sbjct: 106 HPLFNPSQSSTYSSNQQQFSLQYGTGSLTGILGYDTVTIQNIAISQQEFGLSVTEPGTNF 165
Query: 185 LLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGV 244
+ A+FDGI+GL + IAVG A V M++Q L++E VF F+L+ + + + GGE+ FGGV
Sbjct: 166 VYAQFDGILGLAYPSIAVGGATTVMQGMLQQNLLNEPVFGFYLSGE-NTQSGGEVAFGGV 224
Query: 245 DPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVT 304
D ++ G+ + PVT + YWQ + I Q++G C GC IVD+GTSLL P +
Sbjct: 225 DQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQGIVDTGTSLLTAPQSIFA 284
Query: 305 EINHAIGGE 313
+ IG +
Sbjct: 285 SLMQDIGAQ 293
>gi|195349117|ref|XP_002041093.1| GM15229 [Drosophila sechellia]
gi|194122698|gb|EDW44741.1| GM15229 [Drosophila sechellia]
Length = 395
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 175/320 (54%), Gaps = 7/320 (2%)
Query: 12 LWVLASCLLLPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSD 71
LW+L CL L RI ++ + + S R R + V G + +
Sbjct: 11 LWIL--CLFWAKCQGQLIRIPMQFQASFMASRRQHRAGRSS-LLAKYNVVGEQELTSRNG 67
Query: 72 EDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCY-FSISCYFHSRYKS 130
L N ++ +Y G I IGSP Q F+++FDTGS+NLWVPS++C S++C+ H RY +
Sbjct: 68 GATETLDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACHHHHRYNA 127
Query: 131 RKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFD 190
S+T+ G+ I YG+GS+SG +QD V +G +VV++Q F AT E TF+ F
Sbjct: 128 SASSTFVPDGRRFSIAYGTGSLSGRLAQDTVAIGQLVVRNQTFGMATHEPGPTFVDTNFA 187
Query: 191 GIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFK 250
GI+GLGFR IA P++++M +Q LV + VFSF+L R+ +GGE++FGGVD F
Sbjct: 188 GIVGLGFRPIAEQGIKPLFESMCDQKLVDDCVFSFYLKRNGSDRKGGELLFGGVDKTKFS 247
Query: 251 GKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAI 310
G TYVP+T GYWQF L I + AI D+GTSLLA P IN +
Sbjct: 248 GSLTYVPLTHAGYWQFPLDAIEVAGTRISQHR---QAIADTGTSLLAAPPREYLIINSLL 304
Query: 311 GGEGVVSAECKLVVSQYGDL 330
GG + E L S+ L
Sbjct: 305 GGLPTSNNEYLLNCSEIDSL 324
>gi|1246039|gb|AAB35843.1| pepsinogen 2 [tuna, Peptide, 360 aa]
Length = 360
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/254 (41%), Positives = 160/254 (62%), Gaps = 9/254 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
P+ N D Y+G + IG+PPQ+F VIFDTGSSNLWVPS C S +C H ++ ++S+T
Sbjct: 44 PMTNDADLSYYGVVSIGTPPQSFKVIFDTGSSNLWVPSVYCS-SQACQNHKKFNPQQSST 102
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ + I YG+GS++G + D VEVG + V +QVF + E + + DGI+GL
Sbjct: 103 FKWGDQPLSIQYGTGSMTGRLASDIVEVGGISVNNQVFGISQSEAPFMAYM-KADGILGL 161
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
F+ IA + VPV+DNMV QGLVS+ +FS +L+ ++++G E+VFGG+D H+ G+ T+
Sbjct: 162 AFQSIASDNVVPVFDNMVSQGLVSQPLFSVYLSS--NSQQGSEVVFGGIDSSHYTGQITW 219
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG--- 312
+P+T YWQ ++ + I Q T C GGC AI+D+GTSL+ GP+ + +N +G
Sbjct: 220 IPLTSATYWQIQMDSVTINGQ-TVACSGGCQAIIDTGTSLIVGPSRDIYNMNAWVGASTT 278
Query: 313 -EGVVSAECKLVVS 325
G + C+ + S
Sbjct: 279 QNGDATVNCQNIQS 292
>gi|406608071|emb|CCH40505.1| Saccharopepsin [Wickerhamomyces ciferrii]
Length = 401
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/323 (37%), Positives = 176/323 (54%), Gaps = 16/323 (4%)
Query: 11 CLWVLASCLL--LPASSNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLG 68
L+ LA LL + + + KR +D L A R MG + +
Sbjct: 5 ALYSLAVALLAFTETTDAKVHNAKIHKRPID-DQLKDATFEEHVRQMGQKYMGLYQKAYP 63
Query: 69 DSDEDIL------PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISC 122
+S+ + PL N+++AQY+ EI IG+P Q F VI DTGSSNLWVPS+ C S++C
Sbjct: 64 ESNVPFIDGTHETPLTNYLNAQYYTEIQIGTPGQPFKVILDTGSSNLWVPSTDCS-SLAC 122
Query: 123 YFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSL 182
Y HS+Y S+TY G I YGSGS+ G+ SQD +++GD+V+ Q F EAT E L
Sbjct: 123 YLHSKYDHEASSTYKANGSDFAIRYGSGSLEGYVSQDTLQLGDLVIPKQDFAEATSEPGL 182
Query: 183 TFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFG 242
F +FDGI+GL + I+V VP + GL+ E F+F+L E+GG FG
Sbjct: 183 AFAFGKFDGILGLAYDTISVNKIVPPVYKALNSGLLDEPKFAFYLGDADKTEDGGVATFG 242
Query: 243 GVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPV 302
G+D + GK T++PV +K YW+ + I +G++ + G A +D+GTSL+A P+ +
Sbjct: 243 GIDESKYTGKITWLPVRRKAYWEVKFNGIGLGDEFAELENTGAA--IDTGTSLIALPSGL 300
Query: 303 VTEINHAIGGE----GVVSAECK 321
+N IG + G S +C+
Sbjct: 301 AEILNSEIGAKKGWSGQYSVDCE 323
>gi|999902|pdb|1HTR|B Chain B, Crystal And Molecular Structures Of Human Progastricsin At
1.62 Angstroms Resolution
gi|2982065|pdb|1AVF|A Chain A, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
gi|2982067|pdb|1AVF|J Chain J, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
Length = 329
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 149/234 (63%), Gaps = 2/234 (0%)
Query: 80 FMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEI 139
+MDA YFGEI IG+PPQNF V+FDTGSSNLWVPS C S +C HSR+ +S+TY+
Sbjct: 9 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 67
Query: 140 GKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFRE 199
G++ + YGSGS++GFF D + V + V +Q F + E F+ A+FDGI+GL +
Sbjct: 68 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 127
Query: 200 IAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVT 259
++V +A MV++G ++ VFS +L+ GG +VFGGVD + G+ + PVT
Sbjct: 128 LSVDEATTAMQGMVQEGALTSPVFSVYLSNQ-QGSSGGAVVFGGVDSSLYTGQIYWAPVT 186
Query: 260 KKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
++ YWQ + + LIG Q++G C GC AIVD+GTSLL P ++ + A G +
Sbjct: 187 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ 240
>gi|393214080|gb|EJC99573.1| endopeptidase [Fomitiporia mediterranea MF3/22]
Length = 400
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 125/331 (37%), Positives = 181/331 (54%), Gaps = 26/331 (7%)
Query: 12 LWVLASCLLLPAS--SNGLRRIGLKK--RRLDLHSLNAARITRKERYMGGA-----GVSG 62
L A +LLP + S+G+ ++ L K R + + +A ++ E+Y G A G G
Sbjct: 3 LSAFAVIILLPIAIASSGVHKLKLHKVPRGIVDSVIESAYLS--EKYRGQAQLPLTGTDG 60
Query: 63 VRHRLGDSDEDI------LPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC 116
H+ G + I +PL +FM+ QYF + +GSPPQ F VI DTGSSNLWVPS+KC
Sbjct: 61 PSHQPGPISDKIANGGHKVPLSDFMNVQYFTNVTLGSPPQEFRVILDTGSSNLWVPSTKC 120
Query: 117 YFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEA 176
S C H++Y S S+TY E G I YGSGS+ GF S+D V +GD+ + Q F EA
Sbjct: 121 R-SFGCSMHAKYNSSASSTYQENGTDIHITYGSGSMEGFVSKDVVTIGDLKIDGQDFAEA 179
Query: 177 TREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEG 236
T++ F +FDGI GLG+ I++ P + +MV QGL+ +FSF D +G
Sbjct: 180 TKDPGPAFAFGKFDGIFGLGYDTISINHITPPFYSMVNQGLLGAPIFSFRFGSSED--DG 237
Query: 237 GEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLL 296
GE FGG+D + G+ Y PV + +W+ EL G+ + G ++D+GTSL+
Sbjct: 238 GEATFGGIDESAYTGEINYAPVRSREHWEVELPKYAFGDNEFILENTG--GVIDTGTSLI 295
Query: 297 AGPTPVVTEINHAIGGE----GVVSAECKLV 323
P V ++N IG + G + +CK V
Sbjct: 296 NLPVDVAEKLNAQIGAKKSRTGQYTVDCKKV 326
>gi|28849951|ref|NP_788787.1| pregnancy-associated glycoprotein 2 precursor [Bos taurus]
gi|2499823|sp|Q28057.1|PAG2_BOVIN RecName: Full=Pregnancy-associated glycoprotein 2; Short=PAG 2;
Flags: Precursor
gi|797279|gb|AAA65822.1| aspartic proteinase [Bos taurus]
gi|296471711|tpg|DAA13826.1| TPA: pregnancy-associated glycoprotein 2 precursor [Bos taurus]
Length = 376
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 105/278 (37%), Positives = 163/278 (58%), Gaps = 8/278 (2%)
Query: 38 LDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDIL--PLKNFMDAQYFGEIGIGSPP 95
L L + R T +E+ + + +RL +D I PL+N++D Y G I IG+PP
Sbjct: 19 LPLKKMKTLRETLREKNLLNNFLEEQAYRLSKNDSKITIHPLRNYLDTAYVGNITIGTPP 78
Query: 96 QNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNTYTEIGKSCEINYGSGSISGF 155
Q F V+FDTGS+NLWVP C S +CY H + + S+++ E+G I YGSG I GF
Sbjct: 79 QEFRVVFDTGSANLWVPCITCT-SPACYTHKTFNPQNSSSFREVGSPITIFYGSGIIQGF 137
Query: 156 FSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVGDAVPVWDNMVEQ 215
D V +G++V +Q F + E L FDGI+GL F + + D +P++DN+
Sbjct: 138 LGSDTVRIGNLVSPEQSFGLSLEEYGFDSL--PFDGILGLAFPAMGIEDTIPIFDNLWSH 195
Query: 216 GLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGYWQFELGDILIGN 275
G SE VF+F+LN + EG ++FGGVD +++KG+ ++PV++ +WQ + +I + N
Sbjct: 196 GAFSEPVFAFYLNT--NKPEGSVVMFGGVDHRYYKGELNWIPVSQTSHWQISMNNISM-N 252
Query: 276 QSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE 313
+ C GC A++D+GTS++ GPT +VT I+ +
Sbjct: 253 GTVTACSCGCEALLDTGTSMIYGPTKLVTNIHKLMNAR 290
>gi|194854120|ref|XP_001968292.1| GG24793 [Drosophila erecta]
gi|190660159|gb|EDV57351.1| GG24793 [Drosophila erecta]
Length = 404
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 151/237 (63%), Gaps = 4/237 (1%)
Query: 77 LKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFS-ISCYFHSRYKSRKSNT 135
L N M+ Y+G IGIG+P Q F V+FDTGS+NLWVPS++C + ++C HS+Y S S+T
Sbjct: 84 LGNSMNMYYYGLIGIGTPEQLFKVVFDTGSANLWVPSAQCLATDVACQQHSQYNSSASST 143
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
+ G++ I YG+GS+SG+ + D V + + + +Q F EA + +F FDGI+G+
Sbjct: 144 FVASGQNFSIQYGTGSVSGYLAMDTVTINGLAILNQTFGEAVSQPGASFTDVAFDGILGM 203
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTY 255
G+++IA VP + N+ E+GL+ E VF F+L R+ A EGG++ GG D G+ TY
Sbjct: 204 GYQQIAEDFVVPPFYNLYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTY 263
Query: 256 VPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGG 312
PVT++GYWQF + +I + GC AI D+GTSL+A P+ ++N+ IGG
Sbjct: 264 TPVTQQGYWQFAVNNITWNGT---LISSGCQAIADTGTSLIAVPSAAYIQLNNLIGG 317
>gi|50419019|ref|XP_458031.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
gi|49653697|emb|CAG86094.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
Length = 416
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 107/251 (42%), Positives = 153/251 (60%), Gaps = 8/251 (3%)
Query: 76 PLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFHSRYKSRKSNT 135
PL N+++AQYF EI +G+P Q+F VI DTGSSNLWVPS C S++C+ HS+Y S+T
Sbjct: 93 PLTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNLWVPSEDCS-SLACFLHSKYAHDSSST 151
Query: 136 YTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGL 195
Y G S I YGSG++ G+ SQD + +GD+++ Q F EAT E L F +FDGI+GL
Sbjct: 152 YKANGSSFSIQYGSGAMEGYVSQDTLAIGDLIIPKQDFAEATSEPGLAFAFGKFDGILGL 211
Query: 196 GFREIAVGDAVPVWDNMVEQGLVSEEVFSFWL-NRDPDAEEGGEIVFGGVDPKHFKGKHT 254
+ I+V VP N +EQGL+ E F+F+L + + E+GG FGG+D + GK
Sbjct: 212 AYNTISVNKIVPPVYNAIEQGLLEEPRFAFYLGDTSKNEEDGGVATFGGIDEDLYTGKVV 271
Query: 255 YVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE- 313
+PV +K YW+ I +G++ + + G A +D+GTSL+ P+ + IN IG E
Sbjct: 272 DLPVRRKAYWEVAFEGIGLGDEYAELTKTGAA--IDTGTSLITLPSSLAEIINSKIGAEK 329
Query: 314 ---GVVSAECK 321
G EC+
Sbjct: 330 SWSGQYQIECE 340
>gi|126306831|ref|XP_001370729.1| PREDICTED: renin-like [Monodelphis domestica]
Length = 389
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 196/361 (54%), Gaps = 29/361 (8%)
Query: 25 SNGLRRIGLKKRRLDLHSLNAARITRKERYMGGAGVSGVRHRLGDSDEDILPLKNFMDAQ 84
S+GL+RI LKK S+ + M GV + L N+ D Q
Sbjct: 20 SDGLQRIALKKMISVKESMKMRGKHLENLNMAENSWHGVVSPI--------ILTNYEDTQ 71
Query: 85 YFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKC-YFSISCYFHSRYKSRKSNTYTEIGKSC 143
Y+GEI IGSPPQ F V+FDTGSS+ WVPSS+C +C FH+RY + KS+TY G +
Sbjct: 72 YYGEINIGSPPQTFKVVFDTGSSDFWVPSSQCDPLYTACEFHNRYDASKSSTYKMNGSNF 131
Query: 144 EINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFLLARFDGIIGLGFREIAVG 203
I+Y SG + GF SQD + +G++ V QVF E T + F LA FDGI+GLG+ + ++
Sbjct: 132 IIHYASGRVKGFLSQDILTIGEIKVT-QVFGEVTALPLIPFGLAWFDGILGLGYPKRSMS 190
Query: 204 DAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVDPKHFKGKHTYVPVTKKGY 263
PV+DN++ +G++ E+VFS + +R + GGE++ GG DP +++G Y+ ++ +
Sbjct: 191 GITPVFDNIMAEGVLKEDVFSIYYSRS-SGKNGGELILGGSDPNYYQGTFHYINTSRPHF 249
Query: 264 WQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTEINHAIGGE---GVVSAEC 320
WQ ++ + + + CE GC A+VD+GTS + GPT + + AIG E G +C
Sbjct: 250 WQIQMQGVAVKSYVLS-CEDGCPAVVDTGTSFITGPTDSIRGLMTAIGAEEDGGEYLVKC 308
Query: 321 KLVVSQYGDLIW-----DLLVSG----LLPEKVCQQIGLCAFNGAEYVRLGIPITRVLFV 371
L S D+ + D + G L E Q+ L A NG + P T L+V
Sbjct: 309 DL-ASTLPDISFNFDGKDFTLQGSDYVLEDENQSDQMCLVAINGLDVS----PPTGPLWV 363
Query: 372 L 372
L
Sbjct: 364 L 364
>gi|255926586|gb|ACU40898.1| preprochymosin [Capra hircus]
Length = 381
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 117/307 (38%), Positives = 178/307 (57%), Gaps = 18/307 (5%)
Query: 11 CLWVLASCLLLPASSNGLRRIGLKK-----RRLDLHSLNAARITRKERYMGGAGVSGVRH 65
CL VL + L + + RI L K + L H L +K++Y GVS
Sbjct: 3 CLVVLLAVFALSHGAE-ITRIPLYKGKPLRKALKEHGL-LEDFLQKQQY----GVSSEYS 56
Query: 66 RLGDSDEDILPLKNFMDAQYFGEIGIGSPPQNFSVIFDTGSSNLWVPSSKCYFSISCYFH 125
G+ +PL N++D+QYFG+I +G+PPQ F+V+FDTGSS+ WVPS C S +C H
Sbjct: 57 GFGEVAS--VPLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKNH 113
Query: 126 SRYKSRKSNTYTEIGKSCEINYGSGSISGFFSQDNVEVGDVVVKDQVFIEATREGSLTFL 185
R+ RKS+T+ +GK I YG+GS+ G D V V ++V Q +T+E F
Sbjct: 114 QRFDPRKSSTFQNLGKPLSIRYGTGSMQGILGYDTVTVSNIVDTQQTVGLSTQEPGDVFT 173
Query: 186 LARFDGIIGLGFREIAVGDAVPVWDNMVEQGLVSEEVFSFWLNRDPDAEEGGEIVFGGVD 245
A FDGI+G+ + +A +VPV+D+M+++ LV++++FS +++R+ +G + G +D
Sbjct: 174 YAEFDGILGMAYPSLASEYSVPVFDSMMDRRLVAQDLFSVYMDRN---GQGSMLTLGAID 230
Query: 246 PKHFKGKHTYVPVTKKGYWQFELGDILIGNQSTGVCEGGCAAIVDSGTSLLAGPTPVVTE 305
P ++ G +VPVT + YWQF + + I + CEGGC AI+D+GTS L GP+ +
Sbjct: 231 PSYYTGSLHWVPVTLQKYWQFTVDSVTISG-AVVACEGGCQAILDTGTSKLVGPSSDILN 289
Query: 306 INHAIGG 312
I AIG
Sbjct: 290 IQQAIGA 296
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.140 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,611,652,800
Number of Sequences: 23463169
Number of extensions: 302462913
Number of successful extensions: 585962
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3330
Number of HSP's successfully gapped in prelim test: 2000
Number of HSP's that attempted gapping in prelim test: 572080
Number of HSP's gapped (non-prelim): 6142
length of query: 392
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 248
effective length of database: 8,980,499,031
effective search space: 2227163759688
effective search space used: 2227163759688
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)