BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018226
(359 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255578112|ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 514
Score = 603 bits (1556), Expect = e-170, Method: Compositional matrix adjust.
Identities = 286/346 (82%), Positives = 317/346 (91%), Gaps = 6/346 (1%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTP---NGGLYRIGLKKRKFDLNNRVAARLDSKEGES 57
MG +FK F CL+L P+V +T N GL RIGLKKRKFD NNRVAA+ +SKEGE+
Sbjct: 1 MGTIFK---PALFFCLILLPLVCATASSSNDGLVRIGLKKRKFDQNNRVAAQFESKEGEA 57
Query: 58 FRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVP 117
FR SI+KY +RGNLG++ D DIV+LKNYMDAQYFGEIGIGTPPQ FTVIFDTGSSNLWVP
Sbjct: 58 FRASIKKYHIRGNLGDAEDIDIVSLKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVP 117
Query: 118 SSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQE 177
SSKCYFS+ACYFHSKY+SG+SSTYKKNGKSADIHYGTGAISGFFS+D+VK+G+LV+K+QE
Sbjct: 118 SSKCYFSVACYFHSKYKSGQSSTYKKNGKSADIHYGTGAISGFFSQDNVKVGELVIKNQE 177
Query: 178 FIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD 237
FIEATREPS+TFL+AKFDGILGLGFQEISVG AVPVWYNMVNQGLV EPVFSFWFNRNAD
Sbjct: 178 FIEATREPSITFLVAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRNAD 237
Query: 238 EEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSG 297
E+EGGEIVFGGMDP+HYKGEHTYVPVTQKGYWQFDMGDV+IDG+TTG C+ GCAAIADSG
Sbjct: 238 EDEGGEIVFGGMDPNHYKGEHTYVPVTQKGYWQFDMGDVLIDGKTTGICSSGCAAIADSG 297
Query: 298 TSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
TSLLAGPTTIIT+VNHAIGATG+VSQECKAVV+QYGE II MLLAK
Sbjct: 298 TSLLAGPTTIITEVNHAIGATGVVSQECKAVVAQYGETIIAMLLAK 343
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/343 (79%), Positives = 305/343 (88%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + K+ T FLCLLLFP+VFS N L RIGLKKRKFD N R+AA LDSKE E+FR
Sbjct: 1 MGRIVKTTTVTLFLCLLLFPIVFSISNERLVRIGLKKRKFDQNYRLAAHLDSKEREAFRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S++KY L+GNL ES D DIVALKNY+DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 61 SLKKYRLQGNLQESEDIDIVALKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY HS+Y+S RSSTYK NGK ADI YGTGAISGFFSED+V++GDLVVK+QEFIE
Sbjct: 121 CYFSIACYLHSRYKSSRSSTYKANGKPADIQYGTGAISGFFSEDNVQVGDLVVKNQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREPS+TFL+AKFDGILGLGFQEISVG AVPVWYNMVNQGLV EPVFSFWFNR+ +++
Sbjct: 181 ATREPSITFLVAKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRDPEDDI 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGGMDP H+KG+HTYVP+T+KGYWQFDMGDV+I QTTG CAGGC+AIADSGTSL
Sbjct: 241 GGEVVFGGMDPKHFKGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
+ GPT II QVNHAIGA+G+VSQECK VVSQYGE II+MLL+K
Sbjct: 301 ITGPTAIIAQVNHAIGASGVVSQECKTVVSQYGETIIDMLLSK 343
>gi|224056377|ref|XP_002298827.1| predicted protein [Populus trichocarpa]
gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 566 bits (1458), Expect = e-159, Method: Compositional matrix adjust.
Identities = 270/322 (83%), Positives = 297/322 (92%), Gaps = 5/322 (1%)
Query: 23 FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG-ESGDADIVA 81
S PN GL RIGLKKRK++ NNR+AA+L+SKEGES I+KY L NLG ++ D DIV+
Sbjct: 6 LSPPNDGLIRIGLKKRKYERNNRLAAKLESKEGES----IKKYHLLRNLGGDAEDTDIVS 61
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
LKNYMDAQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ACYFHSKY+S S TY
Sbjct: 62 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSHSRTY 121
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K+NGKSA+IHYGTGAISGFFS+DHVK+GDLVVK+QEFIEATREPS+TFL+AKFDGILGLG
Sbjct: 122 KENGKSAEIHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLG 181
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
FQEISVGKAVPVWYNMV QGLV EPVFSFWFNRNADE+EGGEIVFGG+DPDHYKGEHTYV
Sbjct: 182 FQEISVGKAVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHYKGEHTYV 241
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVTQKGYWQFDMGDV+I GQT+GFCA GCAAIADSGTSLLAGPTTIIT+VNHAIGATG+V
Sbjct: 242 PVTQKGYWQFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHAIGATGVV 301
Query: 322 SQECKAVVSQYGEEIINMLLAK 343
SQECKAVV+QYG+ I+ MLLAK
Sbjct: 302 SQECKAVVAQYGDTIMEMLLAK 323
>gi|359483345|ref|XP_003632941.1| PREDICTED: aspartic proteinase isoform 2 [Vitis vinifera]
gi|359483347|ref|XP_002262915.2| PREDICTED: aspartic proteinase isoform 1 [Vitis vinifera]
Length = 514
Score = 559 bits (1441), Expect = e-157, Method: Compositional matrix adjust.
Identities = 257/319 (80%), Positives = 292/319 (91%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL+RIGLKK K D N+++AARL+SKEGES R SIRKY GNLG+S D DIV LKNYMDA
Sbjct: 29 GLFRIGLKKMKLDQNDQLAARLESKEGESLRASIRKYFRHGNLGDSQDTDIVGLKNYMDA 88
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ CYFHSKY+S +SSTY+KNGKSA
Sbjct: 89 QYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKNGKSA 148
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
DIHYGTGAISGFFSED+VK+GDLVVK+QEFIEATREPS+TFL+AKFDGILGLGFQEISVG
Sbjct: 149 DIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEISVG 208
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
AVPVWYNMV QGLV EPVFSFW NR D++EGGE+VFGG+DPDH+KGEHTYVPVTQKGY
Sbjct: 209 NAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQKGY 268
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQFDMG+V+IDG+TTG+CAGGCAAIADSGTSLLAGPT ++ +NHAIGATG+VSQECK V
Sbjct: 269 WQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQECKTV 328
Query: 329 VSQYGEEIINMLLAKVSAK 347
V+QYGE I+++LL++ S +
Sbjct: 329 VAQYGETIMDLLLSEASPQ 347
>gi|449466825|ref|XP_004151126.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 513
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 263/347 (75%), Positives = 299/347 (86%), Gaps = 1/347 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K A F+C +FP+VF N G RIGLK+RKF NNRVA+++ +KEG S +
Sbjct: 1 MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S+ KY NLG+S D DIV LKNY++AQYFGEIGIGTPPQ F VIFDTGSSNLWVPSSK
Sbjct: 61 SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+AC HSKY+S RSSTYKKNGKSA I YGTGAISG+FSED+VK+GDL+VK Q+FIE
Sbjct: 121 C-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKKQDFIE 179
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREPSLTF+LA+FDGILGLGF+EISVG AVPVWYNMV+Q LV EPVFSFWFNRNADEE+
Sbjct: 180 ATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQ 239
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DPDHYKGEHTYVPVT+KGYWQFDMGDV+I+G TTGFC+GGC+AIADSGTSL
Sbjct: 240 GGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSL 299
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIITQVNHAIGA+G+VS+ECKAVV++YGE II MLLAK K
Sbjct: 300 LAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPK 346
>gi|224115794|ref|XP_002317126.1| predicted protein [Populus trichocarpa]
gi|222860191|gb|EEE97738.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 262/347 (75%), Positives = 303/347 (87%), Gaps = 2/347 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ K+I L LLF VV S N GL RIGLKK KFD NNR+AARLDS+E + R
Sbjct: 1 MGVNLKAIGGFVLLSFLLFAVVLSESNDGLLRIGLKKVKFDKNNRIAARLDSQE--ALRA 58
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRKY+L GNLGES D DIVALKNY DAQY+GEIG+GTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 59 SIRKYNLLGNLGESEDTDIVALKNYFDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSK 118
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CY S+ACYFHSKY SG+SS+YKKNGKSA+I YG+G+ISGFFS D V++G+LVVKDQEFIE
Sbjct: 119 CYLSVACYFHSKYNSGKSSSYKKNGKSAEIQYGSGSISGFFSIDAVEVGNLVVKDQEFIE 178
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EPS+TFL+ KFDGILGLGF+EI+VG AVPVW NM+ QGL+ EPVFSFW NRNAD+EE
Sbjct: 179 ATKEPSITFLVGKFDGILGLGFKEIAVGGAVPVWDNMIKQGLIKEPVFSFWLNRNADDEE 238
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGGMDP+HYKG+HTYVPVTQKGYWQFDMGDV++ ++TG+CAGGCAAIADSGTSL
Sbjct: 239 GGEIVFGGMDPNHYKGKHTYVPVTQKGYWQFDMGDVIVGDKSTGYCAGGCAAIADSGTSL 298
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPT IIT +NHAIGA+G+VSQ+CKAVVSQYGE I+++LL++V K
Sbjct: 299 LAGPTAIITMINHAIGASGVVSQQCKAVVSQYGEVIMDLLLSEVQPK 345
>gi|225460913|ref|XP_002279049.1| PREDICTED: aspartic proteinase [Vitis vinifera]
gi|297737462|emb|CBI26663.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 259/343 (75%), Positives = 296/343 (86%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG +++ FL +L+F FS +GGL RIGLKKR FD NR+AAR++SK+GE+ T
Sbjct: 1 MGTKCRTVAVALFLSILMFSPEFSASDGGLVRIGLKKRAFDQTNRLAARIESKQGEALGT 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRKY+L GN S +VAL NYMDAQYFGEI IGTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 61 SIRKYNLHGNAAGSKHTYVVALHNYMDAQYFGEISIGTPPQKFTVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+ACYFHSKY+S +SSTYKKNG SADIHYGTGAISGFFS+D VK+GDL V +QEFIE
Sbjct: 121 CYFSVACYFHSKYKSSQSSTYKKNGTSADIHYGTGAISGFFSKDDVKVGDLAVINQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EPS+TF LAKFDGILGLGFQEISVG AVPVWYNM+NQ L+ EP+FSFWFNRN++EE
Sbjct: 181 ATKEPSITFALAKFDGILGLGFQEISVGNAVPVWYNMINQELIKEPIFSFWFNRNSNEEV 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+D DHYKG+HTYVPVT+KGYWQFD+GDVMI G+TTGFCA GC+AIADSGTSL
Sbjct: 241 GGEIVFGGIDSDHYKGKHTYVPVTKKGYWQFDLGDVMIGGKTTGFCASGCSAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
LAGPTTIIT+VNHAIGA+G VSQEC+AVV QYG+ II+MLL K
Sbjct: 301 LAGPTTIITEVNHAIGASGFVSQECRAVVQQYGQIIIDMLLTK 343
>gi|261264941|gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
Length = 513
Score = 549 bits (1415), Expect = e-154, Method: Compositional matrix adjust.
Identities = 283/343 (82%), Positives = 310/343 (90%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K++ A FFLC LLFP+VFS NGGL RIGLKK K D NNRVAA+L+SK+GE
Sbjct: 1 MGTKLKTVVATFFLCFLLFPLVFSASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRKY LRGN G+ D DIV+LKNYMDAQYFGEIG+GTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 61 SIRKYYLRGNSGDPEDIDIVSLKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+ACYFHSKY+S SSTYKKNGK ADIHYGTGAISG+FS+DHVK+GDLVVK+QEFIE
Sbjct: 121 CYFSVACYFHSKYKSSSSSTYKKNGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREPS+TFL+AKFDGILGLGF+EISVG AVPVWYNMV QGLV EPVFSFWFNRN DEEE
Sbjct: 181 ATREPSITFLVAKFDGILGLGFKEISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEE 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DP+HYKG+HTYVPVTQKGYWQFDMGDV+IDGQTTGFCA GC+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
LAGPTTIIT+VNHAIGATG+VSQECKAVV++YGE II MLL K
Sbjct: 301 LAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETIIKMLLEK 343
>gi|449503193|ref|XP_004161880.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 549 bits (1414), Expect = e-154, Method: Compositional matrix adjust.
Identities = 262/350 (74%), Positives = 300/350 (85%), Gaps = 4/350 (1%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K A F+C +FP+VF N G RIGLK+RKF NNRVA+++ +KEG S +
Sbjct: 1 MGTRLKLFIAVLFICFFMFPMVFCASNDGKVRIGLKRRKFGQNNRVASKIATKEGISLKN 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S+ KY NLG+S D DIV LKNY++AQYFGEIGIGTPPQ F VIFDTGSSNLWVPSSK
Sbjct: 61 SVEKYQPSANLGDSDDFDIVGLKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQ---E 177
C FS+AC HSKY+S RSSTYKKNGKSA I YGTGAISG+FSED+VK+GDL+VK++ +
Sbjct: 121 C-FSVACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGYFSEDNVKVGDLIVKNRSLFD 179
Query: 178 FIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD 237
FIEATREPSLTF+LA+FDGILGLGF+EISVG AVPVWYNMV+Q LV EPVFSFWFNRNAD
Sbjct: 180 FIEATREPSLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNAD 239
Query: 238 EEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSG 297
EE+GGEIVFGG+DPDHYKGEHTYVPVT+KGYWQFDMGDV+I+G TTGFC+GGC+AIADSG
Sbjct: 240 EEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSG 299
Query: 298 TSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
TSLLAGPTTIITQVNHAIGA+G+VS+ECKAVV++YGE II MLLAK K
Sbjct: 300 TSLLAGPTTIITQVNHAIGASGVVSEECKAVVAEYGETIIKMLLAKDQPK 349
>gi|261264943|gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
Length = 513
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 281/343 (81%), Positives = 308/343 (89%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K++ A FFLC LLFP+VFS NGGL RIGLKK K D NNRVAA+L+SK+GE
Sbjct: 1 MGTKLKTVVATFFLCFLLFPLVFSASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRKY LRGN G+ D DIV+LKNYMDAQYFGEIG+GTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 61 SIRKYYLRGNSGDPEDIDIVSLKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+ACYFHSKY+S SSTYKKNGK ADIHYGTGAISG+FS+DHVK+GDLVVK+QEFIE
Sbjct: 121 CYFSVACYFHSKYKSSSSSTYKKNGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREPS+TFL+AKFDGILGLGF+EISVG AVPVWYNMV QGLV EPVFSFWFNRN DEEE
Sbjct: 181 ATREPSITFLVAKFDGILGLGFKEISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEE 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DP+HYKG+HTYVPVTQKGYWQFDMGDV+IDGQTTGFC C+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCVTTCSAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
LAGPTTIIT+VNHAIGATG+VSQECKAVV++YGE II MLL K
Sbjct: 301 LAGPTTIITEVNHAIGATGVVSQECKAVVAEYGETIIKMLLEK 343
>gi|224118038|ref|XP_002331542.1| predicted protein [Populus trichocarpa]
gi|222873766|gb|EEF10897.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 257/347 (74%), Positives = 299/347 (86%), Gaps = 2/347 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ K+I FL LLF VV S N GL RIGLKK K D NNR+AARLDSKE + R
Sbjct: 1 MGVNLKAIVGFVFLSFLLFAVVSSASNDGLLRIGLKKVKLDKNNRIAARLDSKE--TLRA 58
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRKY+L GNLGES D DIVALKNY+D+QY+GEIG+G+PPQ FTVIFDTGSSNLWVPSSK
Sbjct: 59 SIRKYNLCGNLGESEDTDIVALKNYLDSQYYGEIGVGSPPQKFTVIFDTGSSNLWVPSSK 118
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CY S+ACYFHSKY SG+SSTYKKNGKSA+I YG+G+ISGFFS D V++G LVVKDQEFIE
Sbjct: 119 CYLSVACYFHSKYDSGKSSTYKKNGKSAEIRYGSGSISGFFSNDAVEVGGLVVKDQEFIE 178
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP++TFL+AKFDGILGLGF+EISVG AVPVW NM+ GL+ EPVFSFW NRNA++EE
Sbjct: 179 ATKEPNITFLVAKFDGILGLGFKEISVGDAVPVWDNMIKHGLIKEPVFSFWLNRNAEDEE 238
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGGMDP+HYKG+HT+VPVT+KGYWQF+MGDV I + TG+CA GCAAIADSGTSL
Sbjct: 239 GGEIVFGGMDPNHYKGKHTFVPVTRKGYWQFNMGDVHIGDKPTGYCASGCAAIADSGTSL 298
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIIT +N AIGA+G+VSQ+CKAVVSQYGE I+++LL++ K
Sbjct: 299 LAGPTTIITMINQAIGASGVVSQQCKAVVSQYGEAIMDLLLSQAQPK 345
>gi|449454758|ref|XP_004145121.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449472326|ref|XP_004153558.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 514
Score = 535 bits (1379), Expect = e-150, Method: Compositional matrix adjust.
Identities = 249/338 (73%), Positives = 289/338 (85%), Gaps = 3/338 (0%)
Query: 13 FLCLLLF---PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
FLCL L +V S N GL R+GLKK D NR+AARL+SK+ E + + RKY+ G
Sbjct: 10 FLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYNPNG 69
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
NLGES D DIVALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPS+KC FS+AC+F
Sbjct: 70 NLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACHF 129
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H++Y+S RSSTYKKNG SA I YGTGA+SGFFS D+VK+GDLVVK+Q FIEATREP LTF
Sbjct: 130 HARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLTF 189
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
L+AKFDG+LGLGFQEI+VG AVPVWYNMV QGLV EPVFSFW NRNA+EEEGGEIVFGG+
Sbjct: 190 LVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGV 249
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP HYKG+HTYVPVTQKGYWQFDMGDV+IDG+ TG+C GGC+AIADSGTSLLAGPTTI+T
Sbjct: 250 DPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGCSAIADSGTSLLAGPTTIVT 309
Query: 310 QVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
+NHAIGA G++SQECKAVV QYG+ I+++LL++ K
Sbjct: 310 MINHAIGAKGVMSQECKAVVQQYGQTIMDLLLSEADPK 347
>gi|12231176|dbj|BAB20971.1| aspartic proteinase 3 [Nepenthes alata]
Length = 507
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 256/336 (76%), Positives = 299/336 (88%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F+ ++L P+VFS N GL RIGLKK+ FD NNR+AARL+++EGE+ R+S+RKY L GNL
Sbjct: 4 LFVFIILLPLVFSDSNDGLLRIGLKKKIFDQNNRIAARLETEEGEARRSSLRKYYLHGNL 63
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G + DIVALKNYMDAQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ CYFH+
Sbjct: 64 GNPEETDIVALKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVPCYFHA 123
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY+S SS+YKKNGKSADIHYGTGAISGFFSED+V++GDL VK QEFIEA+REPS+TFL+
Sbjct: 124 KYKSSISSSYKKNGKSADIHYGTGAISGFFSEDNVQVGDLAVKAQEFIEASREPSVTFLV 183
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQEISVG A PVWYNMVNQGLV EPVFSFW NR EEEGGEIVFGG+DP
Sbjct: 184 AKFDGILGLGFQEISVGNATPVWYNMVNQGLVKEPVFSFWLNRKVGEEEGGEIVFGGVDP 243
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+H+KG H+YVPVT KGYWQFDMGDV+IDG+ T +C GGC+AIADSGTSLLAGPT+++T +
Sbjct: 244 NHFKGTHSYVPVTHKGYWQFDMGDVLIDGKATEYCEGGCSAIADSGTSLLAGPTSVVTMI 303
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
NHAIGATG+VS+ECKAVVSQYG+ I+++LLA+VS +
Sbjct: 304 NHAIGATGVVSEECKAVVSQYGQTIMDLLLAEVSPE 339
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 533 bits (1373), Expect = e-149, Method: Compositional matrix adjust.
Identities = 255/347 (73%), Positives = 292/347 (84%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K + F+ LLF VV S N GL RIGLKK K D NNR+AARLDSK+GE+ R
Sbjct: 1 MGTTIKVVVLSLFISSLLFSVVSSVSNDGLVRIGLKKMKLDPNNRLAARLDSKDGEALRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
I+KY R NLG+S + DIVALKNYMDAQY+GEIGIGTP Q FTVIFDTGSSNLWV S+K
Sbjct: 61 FIKKYRFRNNLGDSEETDIVALKNYMDAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+ACYFH KY++ SSTYKK+GK A I YGTGAISGFFS DHV++GDLVVKDQEFIE
Sbjct: 121 CYFSVACYFHEKYKASDSSTYKKDGKPASIQYGTGAISGFFSYDHVQVGDLVVKDQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP LTF++AKFDGILGLGF+EISVG AVPVWYNM+ QGL+ EPVFSFW NRN DEE
Sbjct: 181 ATKEPGLTFMVAKFDGILGLGFKEISVGDAVPVWYNMIKQGLIKEPVFSFWLNRNVDEEA 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DP+HYKG+HTYVPVTQKGYWQFDMGDV+I + TG+CAG CAAIADSGTSL
Sbjct: 241 GGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGP+T+IT +NHAIGATG+VSQECKAVV QYG II++L+A+ +
Sbjct: 301 LAGPSTVITMINHAIGATGVVSQECKAVVQQYGRTIIDLLIAEAQPQ 347
>gi|384040313|gb|AFH58568.1| aspartic acid protease [Ananas comosus]
Length = 514
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 252/345 (73%), Positives = 290/345 (84%), Gaps = 2/345 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG ++ L +LL + GL RIGLKKR D NNR+AARL KE E
Sbjct: 1 MGTRGGALAVAILLSVLLHQSILLASADGLVRIGLKKRPIDENNRIAARLVEKE-EGPLL 59
Query: 61 SIRKYSLRGN-LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
+ R+Y LRG L E + DI+ALKNYM+AQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSS
Sbjct: 60 AARRYGLRGAPLKEGEETDIIALKNYMNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSS 119
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KCYFSIAC FH+KY+SGRSS+Y KNGKSA IHYGTGAISGFFS DHVK+GDLVVK Q+FI
Sbjct: 120 KCYFSIACLFHTKYKSGRSSSYHKNGKSASIHYGTGAISGFFSTDHVKVGDLVVKTQDFI 179
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EPS+TF++AKFDGILGLGFQEISVG AVPVWYNMV+QGL+ EPVFSFWFNRNA++
Sbjct: 180 EATKEPSVTFVVAKFDGILGLGFQEISVGNAVPVWYNMVDQGLIKEPVFSFWFNRNANDG 239
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGEIVFGG DP+HYKG HTYVPVTQKGYWQF+MGDV++ GQ+TGFC GGCAAIADSGTS
Sbjct: 240 EGGEIVFGGADPNHYKGNHTYVPVTQKGYWQFEMGDVLVGGQSTGFCNGGCAAIADSGTS 299
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
LLAGPTTII ++N IGA+G+VSQECKAVV++YG++I+ MLLA+V
Sbjct: 300 LLAGPTTIIAEINQKIGASGVVSQECKAVVAEYGQQILQMLLAEV 344
>gi|20800441|gb|AAB03843.2| aspartic proteinase [Vigna unguiculata]
gi|33339734|gb|AAQ14346.1| aspartic proteinase [Vigna unguiculata]
Length = 513
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 247/347 (71%), Positives = 290/347 (83%), Gaps = 1/347 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG I+ F+ LLF V PN GL RIGLKK K D NNR+AAR+ S + +SFR
Sbjct: 1 MGNNKNVISLCLFVTTLLFSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSND-DSFRA 59
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRK+ L+ N +G+ DIVALKNY+DAQY+GEI IGT PQ FTVIFDTGSSNLWVPSS+
Sbjct: 60 SIRKFHLQNNFAGTGETDIVALKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSR 119
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH+KYRSGRSSTY++NG +A I YGTGAI+GFFS D+V++GD+VVK+QEFIE
Sbjct: 120 CTFSLACYFHAKYRSGRSSTYRRNGTAAAIQYGTGAIAGFFSYDNVRVGDIVVKNQEFIE 179
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREP + FL AKFDGILGLGFQEISVG AVPVWYNMV QGL+ EPVFSFW NR +EEE
Sbjct: 180 ATREPGVVFLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWLNRKTEEEE 239
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP HYKGEHTYVPVT+KGYWQFDMGDV+I G+ TG+CAGGCAAIADSGTSL
Sbjct: 240 GGELVFGGVDPAHYKGEHTYVPVTRKGYWQFDMGDVLIGGKPTGYCAGGCAAIADSGTSL 299
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPT IIT +NHAIGA+G++SQECK VV++YG+ I+N+LLA+ K
Sbjct: 300 LAGPTAIITMINHAIGASGVMSQECKTVVAEYGQTILNLLLAETQPK 346
>gi|223929912|gb|ACN24614.1| aspartic acid protease [Phaseolus vulgaris]
Length = 513
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 250/347 (72%), Positives = 285/347 (82%), Gaps = 1/347 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG I+ F+ LL V PN GL RIGLKK K D NNR+AAR+ SK+ +SFR
Sbjct: 1 MGKNMNVISWCLFVTTLLLSAVSCAPNDGLRRIGLKKIKLDPNNRLAARIGSKD-DSFRA 59
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRK+ L+ N G + D DIVALKNY+DAQYFGEI IGT PQ FTVIFDTGSSNLWVPSS
Sbjct: 60 SIRKFHLQNNFGGTEDTDIVALKNYLDAQYFGEIAIGTSPQKFTVIFDTGSSNLWVPSSL 119
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH+KYRS +SSTYKKNG +A I YGTGAISGFFS D V++GD+VVK QEFIE
Sbjct: 120 CTFSVACYFHAKYRSSKSSTYKKNGTAAAIQYGTGAISGFFSYDSVRVGDIVVKSQEFIE 179
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREP + FL AKFDGILGLGFQEISVG AVPVWYNMV QGL+ EPVFSFWFNR +EEE
Sbjct: 180 ATREPGVVFLAAKFDGILGLGFQEISVGNAVPVWYNMVEQGLIKEPVFSFWFNRKPEEEE 239
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DP HYKG+HTYVPVT+KGYW+FDMGDV+I G+ TG+CA GC AIADSGTSL
Sbjct: 240 GGEIVFGGVDPAHYKGKHTYVPVTRKGYWRFDMGDVLIGGKPTGYCADGCLAIADSGTSL 299
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIIT +NHAIGA GI+SQECK VV++YG+ I+N+LLA+ K
Sbjct: 300 LAGPTTIITMINHAIGAAGIMSQECKTVVAEYGQTILNLLLAETQPK 346
>gi|255567717|ref|XP_002524837.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223535897|gb|EEF37557.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 456
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 240/332 (72%), Positives = 279/332 (84%), Gaps = 3/332 (0%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
+LL PVVFS N L R+GLKK+KFD N A +D KEGE+ R + +KY+L N S
Sbjct: 13 VLLLPVVFSLHNDALVRVGLKKKKFDQVNIPAGTVDFKEGEAMRAATKKYNLVEN---SD 69
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D DIV LKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLW+PSSKCYFS+ACYFHSKY++
Sbjct: 70 DVDIVELKNYLDAQYYGEIAIGTPPQTFTVIFDTGSSNLWIPSSKCYFSVACYFHSKYKA 129
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTY+KNG SA I YGTG+ISGFFS+D+VK+GDLV+++Q+FIEAT+EP +TFL AKFD
Sbjct: 130 SESSTYQKNGTSAAIRYGTGSISGFFSQDNVKVGDLVIRNQDFIEATKEPGVTFLAAKFD 189
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILGLGFQEISVGKA+PVWYNMVN+GLV E VFSFW NRN EEGGEIVFGGMDP+HYK
Sbjct: 190 GILGLGFQEISVGKAIPVWYNMVNEGLVKEQVFSFWLNRNVQAEEGGEIVFGGMDPNHYK 249
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+HTYVPVTQKGYWQFDMG+V+I + TG CA GC AIADSGTSLLAGPTT+ITQ+NHAI
Sbjct: 250 GQHTYVPVTQKGYWQFDMGEVLIGNEITGLCADGCKAIADSGTSLLAGPTTVITQINHAI 309
Query: 316 GATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
GA+GIVSQECK VV QYG+ I+ ML A+ +
Sbjct: 310 GASGIVSQECKTVVEQYGKFILEMLTAQAQPQ 341
>gi|255554815|ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 511
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 237/322 (73%), Positives = 281/322 (87%), Gaps = 3/322 (0%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
PN GL R+GLKK K D N+R+AARL+SK E+ R S+RKY LRG +S D DIVALKNY
Sbjct: 26 PNDGLVRLGLKKMKLDENSRLAARLESKNAEALRASVRKYGLRG---DSKDTDIVALKNY 82
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPSSKC FS+AC+FHS+Y+SG+SSTYKKNG
Sbjct: 83 LDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSSKCIFSVACFFHSRYKSGQSSTYKKNG 142
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
KSA+IHYG+GAISGFFS D+V +G+LVVKDQEFIEAT+EP +TF+ AKFDGILGLGFQEI
Sbjct: 143 KSAEIHYGSGAISGFFSSDNVVVGNLVVKDQEFIEATKEPGVTFVAAKFDGILGLGFQEI 202
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SVG AVPVWYNM+ QGL+ EPVFSFW NRN EEGGEIVFGG+D +HYKG+HTYVPVTQ
Sbjct: 203 SVGNAVPVWYNMIKQGLIKEPVFSFWLNRNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQ 262
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
KGYWQF+MGDV+I + T +CAGGC+AIADSGTSLLAGPTT++T +N AIGATG+ SQEC
Sbjct: 263 KGYWQFEMGDVLIGHKPTEYCAGGCSAIADSGTSLLAGPTTVVTLINEAIGATGVASQEC 322
Query: 326 KAVVSQYGEEIINMLLAKVSAK 347
K V++QYGE I+++L+A+ K
Sbjct: 323 KTVIAQYGETIMDLLIAEAQPK 344
>gi|297849560|ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 506
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 244/347 (70%), Positives = 284/347 (81%), Gaps = 10/347 (2%)
Query: 3 MVFKSITAGFFLCL--LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
M S T F L + LLF FS N G +R+GLKK K D NR+AAR++SK+ +
Sbjct: 1 MKIYSTTVAFSLIVSFLLFFSAFSERNDGTFRVGLKKLKLDSKNRLAARVESKQDKP--- 57
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
+R YSL G S DAD+V LKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSSK
Sbjct: 58 -LRAYSL----GNSEDADVVVLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSK 112
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+AC H KY+S RSSTY+KNGKSA IHYGTGAI+GFFS D V +GDLVVKDQEFIE
Sbjct: 113 CYFSLACLLHPKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIE 172
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP +TF++AKFDGILGLGFQEISVG A PVWYNM+ QGL+ EPVFSFWFNRNADEEE
Sbjct: 173 ATKEPGITFVVAKFDGILGLGFQEISVGNATPVWYNMLKQGLIKEPVFSFWFNRNADEEE 232
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP+H+KG+HTYVPVTQKGYWQFDMGDV+I G TGFC GC+AIADSGTSL
Sbjct: 233 GGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSL 292
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIIT +NHAIGA G+VSQ+CK VV QYG+ I+++LL++ K
Sbjct: 293 LAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPK 339
>gi|15221141|ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana]
gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor
gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana]
Length = 506
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 236/347 (68%), Positives = 282/347 (81%), Gaps = 8/347 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
M + +++ + LL F+ N G +R+GLKK K D NR+AAR++SK+ + R
Sbjct: 1 MKIYSRTVAVSLIVSFLLCFSAFAERNDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
LG+SGDAD+V LKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSSK
Sbjct: 61 Y--------RLGDSGDADVVVLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSK 112
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+AC H KY+S RSSTY+KNGK+A IHYGTGAI+GFFS D V +GDLVVKDQEFIE
Sbjct: 113 CYFSLACLLHPKYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIE 172
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP +TF++AKFDGILGLGFQEISVGKA PVWYNM+ QGL+ EPVFSFW NRNADEEE
Sbjct: 173 ATKEPGITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEE 232
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP+H+KG+HTYVPVTQKGYWQFDMGDV+I G TGFC GC+AIADSGTSL
Sbjct: 233 GGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSL 292
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIIT +NHAIGA G+VSQ+CK VV QYG+ I+++LL++ K
Sbjct: 293 LAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPK 339
>gi|1326165|gb|AAB03108.1| aspartic protease [Brassica napus]
Length = 506
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 241/348 (69%), Positives = 285/348 (81%), Gaps = 10/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ K++ + LLF + N G +R+GLKK KFD +R+AA + SK+ +
Sbjct: 1 MGIYSKTVALSLIVSFLLFLSASAERNDGTFRVGLKKLKFDPRSRIAAPVGSKQLKP--- 57
Query: 61 SIRKYSLRG-NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
LRG LG+SGDADIV LKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSS
Sbjct: 58 ------LRGYGLGDSGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSS 111
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KCYFSIAC FHSKY+S RSSTY+KNGKSA IHYGTGAI+GFFS D V +GDLVVKDQEFI
Sbjct: 112 KCYFSIACLFHSKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFI 171
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP +TF+LAKFDGILGLGFQEISVG A PVWYNM+ QGL+ EPVFSFW NRNA++E
Sbjct: 172 EATKEPGITFVLAKFDGILGLGFQEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDE 231
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGE+VFGG+DP+H+KGEHTYVPVTQKGYWQFDMGDV+I G TG+C GC+AIADSGTS
Sbjct: 232 EGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFDMGDVLIGGAPTGYCESGCSAIADSGTS 291
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTT+IT +NHAIGA G+VSQ+CK VV QYG+ I+++LL++ K
Sbjct: 292 LLAGPTTVITMINHAIGAAGVVSQQCKIVVDQYGQTILDLLLSETQPK 339
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 237/318 (74%), Positives = 278/318 (87%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L R+GLKKRK D NR+++ K S SI K+ L LG S DADI++LKNYMDAQ
Sbjct: 30 LLRVGLKKRKLDQINRLSSHYGCKGKGSTSPSIWKHGLGNGLGNSDDADIISLKNYMDAQ 89
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YFGEIGIG+PPQ FTVIFDTGSSNLWVPS+KCYFSIACY H KY+S +SSTY KNGKSA
Sbjct: 90 YFGEIGIGSPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHPKYKSFKSSTYAKNGKSAA 149
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
IHYGTGAISGFFS+DHVK+GDLVV++Q+FIEAT+EPS+TF+ AKFDGILGLGFQEISVG
Sbjct: 150 IHYGTGAISGFFSQDHVKMGDLVVENQDFIEATKEPSITFVAAKFDGILGLGFQEISVGD 209
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
AVP WYNM++QGLVNEPVFSFW NR ++EEEGGEIVFGG+DP+HYKGEHTYVPVT+KGYW
Sbjct: 210 AVPAWYNMIDQGLVNEPVFSFWLNRKSEEEEGGEIVFGGVDPNHYKGEHTYVPVTRKGYW 269
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVV 329
QFDM DV++ G+TTG+C+GGC+AIADSGTSLLAGPTTII Q+NHAIGA+G+VSQECKAVV
Sbjct: 270 QFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIIVQINHAIGASGLVSQECKAVV 329
Query: 330 SQYGEEIINMLLAKVSAK 347
SQYG+ I++ L+A+ +
Sbjct: 330 SQYGKAILDALVAEAQPQ 347
>gi|3551952|gb|AAC34854.1| senescence-associated protein 4 [Hemerocallis hybrid cultivar]
Length = 517
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 236/344 (68%), Positives = 283/344 (82%), Gaps = 4/344 (1%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG I L +L+F + S GL RI LKK+ FD +RV++RL + E E +
Sbjct: 1 MGTKHGGILPALILSMLVFQLALSASAEGLVRINLKKKPFDEKSRVSSRLSADEDEPLKA 60
Query: 61 SIRKYSLRGNLGESGDA-DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
+Y LRG L + D+ DI++LKNYM+AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+
Sbjct: 61 ---RYGLRGGLNDGADSTDIISLKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSA 117
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KCYFSIAC H+KY+SGRSSTY KNGK A IHYGTGAI+G+FSEDHV++GD VVK QEFI
Sbjct: 118 KCYFSIACLLHTKYKSGRSSTYHKNGKPAAIHYGTGAIAGYFSEDHVELGDFVVKGQEFI 177
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP +TFL+AKFDGILGLGF+EISVG AVP+WYNMV QGLV E VFSFW NR +++
Sbjct: 178 EATKEPGVTFLVAKFDGILGLGFKEISVGGAVPLWYNMVEQGLVKEAVFSFWLNRKSEDG 237
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGEIVFGG+DP H+KGEH YVPVTQKGYWQFDMGDV++ GQ+TGFC GGCAAIADSGTS
Sbjct: 238 EGGEIVFGGVDPSHHKGEHVYVPVTQKGYWQFDMGDVLVGGQSTGFCEGGCAAIADSGTS 297
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
L+AGPTT+IT++NH IGA G+VSQECKAVV QYG++I++ML+A+
Sbjct: 298 LIAGPTTVITEINHKIGAAGVVSQECKAVVQQYGQQILDMLIAQ 341
>gi|109675118|gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
Length = 508
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 236/328 (71%), Positives = 283/328 (86%), Gaps = 5/328 (1%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI 79
P+VFS N GL R+G+KKRK D N+ +DS S RT Y L GN+G+S D DI
Sbjct: 20 PMVFSVSNDGLIRVGIKKRKLDQINQAFGGIDSNGANSART----YHLGGNIGDS-DTDI 74
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ALKNY+DAQYFGEI IG+PPQ FTVIFDTGSSNLWVPS++CYFS+ACY H KY+S SS
Sbjct: 75 IALKNYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARCYFSLACYLHPKYKSSHSS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG SA I YGTG+ISG+FS D+VK+GDL+VKDQ+FIEATREP +TFL AKFDGILG
Sbjct: 135 TYKKNGTSAAIRYGTGSISGYFSNDNVKVGDLIVKDQDFIEATREPGITFLAAKFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGFQEISVGK+VPVWYNMVNQGLV +PVFSFWFNRNA EEEGGE+VFGG+DP+H+KG+HT
Sbjct: 195 LGFQEISVGKSVPVWYNMVNQGLVKKPVFSFWFNRNAQEEEGGELVFGGVDPNHFKGKHT 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YVPVT KGYWQFDMGDV++ G+TTGFC+GGC+AIADSGTSLLAGPTTIITQ+NH IGA+G
Sbjct: 255 YVPVTHKGYWQFDMGDVLVGGETTGFCSGGCSAIADSGTSLLAGPTTIITQINHVIGASG 314
Query: 320 IVSQECKAVVSQYGEEIINMLLAKVSAK 347
+VSQECK++V++YG+ I+++L +K + +
Sbjct: 315 VVSQECKSLVTEYGKTILDLLESKAAPQ 342
>gi|1354272|gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 506 bits (1304), Expect = e-141, Method: Compositional matrix adjust.
Identities = 233/325 (71%), Positives = 274/325 (84%), Gaps = 8/325 (2%)
Query: 23 FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
F+ N G +R+GLKK K D NR+AAR++SK+ + R LG+SGDAD+V L
Sbjct: 3 FAERNDGTFRVGLKKLKLDSKNRLAARVESKQEKPLRAY--------RLGDSGDADVVVL 54
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYK 142
KNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSSKCYFS+AC H KY+S RSSTY+
Sbjct: 55 KNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKCYFSLACLLHPKYKSSRSSTYE 114
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
KNGK+A IHYGTGAI+GFFS D V +GDLVVKDQEFIEAT+EP +TF++AKFDGILGLGF
Sbjct: 115 KNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEPGITFVVAKFDGILGLGF 174
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
QEISVGKA PVWYNM+ QGL+ EPVFSFW NRNADEEEGGE+VFGG+DP+H+KG+HTYVP
Sbjct: 175 QEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVP 234
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
VTQKGYWQFDMGDV+I G TGFC GC+AIADSGTSLLAGPTTIIT +NHAIGA G+VS
Sbjct: 235 VTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVS 294
Query: 323 QECKAVVSQYGEEIINMLLAKVSAK 347
Q+CK VV QYG+ I+++LL++ K
Sbjct: 295 QQCKTVVDQYGQTILDLLLSETQPK 319
>gi|357132502|ref|XP_003567869.1| PREDICTED: phytepsin-like [Brachypodium distachyon]
Length = 505
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 237/336 (70%), Positives = 281/336 (83%), Gaps = 7/336 (2%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F +LL ++ ++ GL RI LKKR D NNRVA RL +EG+ + +L
Sbjct: 10 LFAAVLLQALLPASEAEGLVRIALKKRPIDQNNRVATRLSGEEGQHLGGA-------NSL 62
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G + DIVAL+NYM+AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIACYFHS
Sbjct: 63 GSEDEGDIVALQNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYFHS 122
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+Y++G+SSTYKKNGK A I YGTG+I+G+FSED V +GDLVVKDQEFIEAT+EP +TF++
Sbjct: 123 RYKAGQSSTYKKNGKPAAIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGVTFMV 182
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQEISVGKAVPVWY M+ QGL+++PVFSFWFNR+A E EGGEIVFGGMDP
Sbjct: 183 AKFDGILGLGFQEISVGKAVPVWYKMIEQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDP 242
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY GEHTYVPVTQKGYWQFDMGDV++ G++TGFCAGGCAAIADSGTSLLAGPT IIT++
Sbjct: 243 KHYIGEHTYVPVTQKGYWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEI 302
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
N IGA G+VSQECK VVSQYG++I+++LLA+ K
Sbjct: 303 NEKIGAAGVVSQECKTVVSQYGQQILDLLLAETQPK 338
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 243/321 (75%), Positives = 281/321 (87%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
N L R+GLKKRK D NR ++ K ES +IRKY L LG S DADI++LKNYM
Sbjct: 27 NDRLLRVGLKKRKLDQINRFSSLYGCKGKESINPAIRKYGLGNGLGNSDDADIISLKNYM 86
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
+AQYFGEIGIGTPPQ FT+IFDTGSSNLWVPS+KCYFSIACYFHSKY+S SS+Y KNGK
Sbjct: 87 NAQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAKCYFSIACYFHSKYKSSLSSSYTKNGK 146
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
SA+IHYGTGAISGFFS+DHVK+GDLVV++Q+FIEATREPS+TF+ AKFDGILGLGFQEIS
Sbjct: 147 SAEIHYGTGAISGFFSQDHVKLGDLVVENQDFIEATREPSITFVAAKFDGILGLGFQEIS 206
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG AVPVWYNMV QGLVNEPVFSFW NRNA EEEGGEIVFGG+DP+HYKGEHT+VPVT K
Sbjct: 207 VGNAVPVWYNMVKQGLVNEPVFSFWLNRNATEEEGGEIVFGGVDPNHYKGEHTFVPVTHK 266
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECK 326
GYWQFDM DV++ G+TTG+C+GGC+AIADSGTSLLAGPTTI+ Q+NHAIGA+G+VSQECK
Sbjct: 267 GYWQFDMDDVLVGGETTGYCSGGCSAIADSGTSLLAGPTTIVAQINHAIGASGVVSQECK 326
Query: 327 AVVSQYGEEIINMLLAKVSAK 347
AVV+QYG I++ML+++ K
Sbjct: 327 AVVAQYGTAILDMLISETQPK 347
>gi|312282703|dbj|BAJ34217.1| unnamed protein product [Thellungiella halophila]
Length = 506
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 236/347 (68%), Positives = 279/347 (80%), Gaps = 8/347 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ +++ + LLF S N G R+GLKK K D NR+AAR+ S++ + R
Sbjct: 1 MGIYSRTVAVSLIVSFLLFFSASSERNDGTVRVGLKKLKLDPKNRLAARISSEQEKPLRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
+LG+SGDADIVALKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSSK
Sbjct: 61 F--------SLGDSGDADIVALKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSK 112
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIAC H KY+S RSSTY+KNGKSA IHYGTGAI+GFFS D V +GDLVVKDQEFIE
Sbjct: 113 CYFSIACLLHPKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIE 172
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP +TF+LAKFDGILGLGF+EISVG A PVWYNM+ QGL+ EPVFSFW NRNA+++E
Sbjct: 173 ATKEPGITFVLAKFDGILGLGFKEISVGNAAPVWYNMLKQGLIKEPVFSFWLNRNAEDDE 232
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP+H+KG+HTYVPVTQKGYWQFDMGDV+I TGFC GC+AIADSGTSL
Sbjct: 233 GGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGNAPTGFCESGCSAIADSGTSL 292
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTTIIT +NHAIGA G+VSQ+CK VV QYG I+ +LL++ K
Sbjct: 293 LAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGRTILELLLSETQPK 339
>gi|115465497|ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor
gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica
Group]
gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group]
gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group]
gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group]
Length = 509
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 240/319 (75%), Positives = 275/319 (86%), Gaps = 9/319 (2%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN---LGESGDADIVALKNY 85
GL RI LKKR D N+RVAARL +EG R+ LRG G G+ DIVALKNY
Sbjct: 27 GLVRIALKKRPIDENSRVAARLSGEEGA------RRLGLRGANSLGGGGGEGDIVALKNY 80
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
M+AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIAC+FHS+Y+SG+SSTY+KNG
Sbjct: 81 MNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNG 140
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
K A I YGTG+I+GFFSED V +GDLVVKDQEFIEAT+EP LTF++AKFDGILGLGFQEI
Sbjct: 141 KPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEI 200
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SVG AVPVWY MV QGLV+EPVFSFWFNR++DE EGGEIVFGGMDP HYKG HTYVPV+Q
Sbjct: 201 SVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQ 260
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
KGYWQF+MGDV+I G+TTGFCA GC+AIADSGTSLLAGPT IIT++N IGATG+VSQEC
Sbjct: 261 KGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQEC 320
Query: 326 KAVVSQYGEEIINMLLAKV 344
K VVSQYG++I+++LLA+
Sbjct: 321 KTVVSQYGQQILDLLLAET 339
>gi|1030715|dbj|BAA06876.1| aspartic protease [Oryza sativa]
gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa]
Length = 509
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 240/319 (75%), Positives = 275/319 (86%), Gaps = 9/319 (2%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN---LGESGDADIVALKNY 85
GL RI LKKR D N+RVAARL +EG R+ LRG G G+ DIVALKNY
Sbjct: 27 GLVRIALKKRPIDENSRVAARLSGEEGA------RRLGLRGANSLGGGGGEGDIVALKNY 80
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
M+AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIAC+FHS+Y+SG+SSTY+KNG
Sbjct: 81 MNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNG 140
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
K A I YGTG+I+GFFSED V +GDLVVKDQEFIEAT+EP LTF++AKFDGILGLGFQEI
Sbjct: 141 KPAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEI 200
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SVG AVPVWY MV QGLV+EPVFSFWFNR++DE EGGEIVFGGMDP HYKG HTYVPV+Q
Sbjct: 201 SVGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQ 260
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
KGYWQF+MGDV+I G+TTGFCA GC+AIADSGTSLLAGPT IIT++N IGATG+VSQEC
Sbjct: 261 KGYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQEC 320
Query: 326 KAVVSQYGEEIINMLLAKV 344
K VVSQYG++I+++LLA+
Sbjct: 321 KTVVSQYGQQILDLLLAET 339
>gi|22330379|ref|NP_176419.2| phytepsin [Arabidopsis thaliana]
gi|79320483|ref|NP_001031219.1| phytepsin [Arabidopsis thaliana]
gi|75331143|sp|Q8VYL3.1|APA2_ARATH RecName: Full=Aspartic proteinase A2; AltName: Full=Aspartic
protease 57; Short=AtASP57; Flags: Precursor
gi|17979428|gb|AAL49856.1| putative aspartic protease [Arabidopsis thaliana]
gi|23297031|gb|AAN13225.1| putative aspartic protease [Arabidopsis thaliana]
gi|222424000|dbj|BAH19961.1| AT1G62290 [Arabidopsis thaliana]
gi|332195825|gb|AEE33946.1| phytepsin [Arabidopsis thaliana]
gi|332195826|gb|AEE33947.1| phytepsin [Arabidopsis thaliana]
Length = 513
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 230/348 (66%), Positives = 284/348 (81%), Gaps = 3/348 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ +++ F+ LLF +S N G +R+GLKK K D NNR+A R SK+ E+ R+
Sbjct: 1 MGVYSRAVAFSVFVSFLLFFTAYSKRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60
Query: 61 SIRKYSLRGNLG-ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
S+R Y+ NLG +SGDADIV LKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPS
Sbjct: 61 SLRSYN--NNLGGDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSG 118
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KC+FS++CYFH+KY+S RSSTYKK+GK A IHYG+G+ISGFFS D V +GDLVVKDQEFI
Sbjct: 119 KCFFSLSCYFHAKYKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFI 178
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
E T EP LTFL+AKFDG+LGLGFQEI+VG A PVWYNM+ QGL+ PVFSFW NR+ E
Sbjct: 179 ETTSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSE 238
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGEIVFGG+DP H++GEHT+VPVTQ+GYWQFDMG+V+I G++TG+C GC+AIADSGTS
Sbjct: 239 EGGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTS 298
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPT ++ +N AIGA+G+VSQ+CK VV QYG+ I+++LLA+ K
Sbjct: 299 LLAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQPK 346
>gi|226506070|ref|NP_001150729.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195641348|gb|ACG40142.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 518
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 233/317 (73%), Positives = 273/317 (86%), Gaps = 2/317 (0%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
GGL R+ LKK+ D N RVAARL ++E + R +R + G+ G D+D++AL Y +
Sbjct: 33 GGLVRVALKKQPVDQNARVAARLSAEERQ--RLLLRGANALGSAGGDDDSDVIALNXYXN 90
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPSSKCYFSIACYFHS+Y+SG+SSTYKKNGK
Sbjct: 91 AQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKP 150
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
A I YGTGAI+GFFSED VK+GDL V DQEFIEAT+EP LTF++AKFDGILGLGFQEISV
Sbjct: 151 AAIQYGTGAIAGFFSEDSVKLGDLDVNDQEFIEATKEPGLTFMVAKFDGILGLGFQEISV 210
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
G A PVWYNMV QGL+++PVFSFWFNR+A E EGGEIVFGGMD HYKG+HTYVPVTQKG
Sbjct: 211 GNATPVWYNMVKQGLISDPVFSFWFNRHAGEGEGGEIVFGGMDSSHYKGDHTYVPVTQKG 270
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQF+MGDV++DG++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK
Sbjct: 271 YWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKT 330
Query: 328 VVSQYGEEIINMLLAKV 344
VVSQYG++I+++LLA+
Sbjct: 331 VVSQYGQQILDLLLAET 347
>gi|356555682|ref|XP_003546159.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 507
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 235/347 (67%), Positives = 279/347 (80%), Gaps = 7/347 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + + F L LLFP+VF PN GL RIGLKK K D ++ V + FR+
Sbjct: 1 MGNMPNVVVLCFCLWTLLFPLVFCAPNDGLRRIGLKKVKLDTDDVVGFK-------EFRS 53
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRK+ L+ LG + D D+VALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 54 SIRKHHLQNILGGAEDTDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSK 113
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+AC+ H++YRS +SSTY++NG SA I YGTGAISGFFS D VK+GD+VVKDQEFIE
Sbjct: 114 CYFSVACFMHARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIE 173
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREP +TF+ AKFDGILGLGFQEISVG AVPVWY MV QGLV +PVFSFW NR +EE
Sbjct: 174 ATREPGVTFVAAKFDGILGLGFQEISVGYAVPVWYTMVEQGLVKDPVFSFWLNRKPEEEN 233
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG DP HYKG+HTYVPVT+KGYWQFDMGDV+I G+ TG+C C+AIADSGTSL
Sbjct: 234 GGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDVLISGKPTGYCTNDCSAIADSGTSL 293
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTT+IT +N AIGA G+VS+EC++VV+QYG+ I+ +LLA+ K
Sbjct: 294 LAGPTTVITMINQAIGAAGVVSKECRSVVNQYGQTILELLLAEAKPK 340
>gi|297837199|ref|XP_002886481.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332322|gb|EFH62740.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 231/348 (66%), Positives = 283/348 (81%), Gaps = 3/348 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ +++ F+ LLF V S N G +R+GLKK K D NNR+A R SK+ E+ R+
Sbjct: 1 MGVYTRTVAVSLFVWFLLFFTVSSQRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60
Query: 61 SIRKYSLRGNLG-ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
S+ Y+ NLG +SGDADIV LKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPS
Sbjct: 61 SLPSYN--NNLGSDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSG 118
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KC+FS++C+FH+K++S RSSTYKK+GK A IHYG+G+ISGFFS D V +GDLVVKDQEFI
Sbjct: 119 KCFFSLSCFFHAKFKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFI 178
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT EP LTFL+AKFDG+LGLGFQEI+VG A PVWYNM+ QGL+ PVFSFW NR+ E
Sbjct: 179 EATSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIERPVFSFWLNRDPKSE 238
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGEIVFGG+DP H+KGEHT+VPVTQ+GYWQFDMG+V+I G +TG+C GC+AIADSGTS
Sbjct: 239 EGGEIVFGGVDPKHFKGEHTFVPVTQRGYWQFDMGEVLIAGDSTGYCGSGCSAIADSGTS 298
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPT +I +N AIGA+G+VSQ+CK VV QYG+ I+++LLA+ K
Sbjct: 299 LLAGPTAVIAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQPK 346
>gi|351724625|ref|NP_001237064.1| aspartic proteinase 1 precursor [Glycine max]
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/347 (69%), Positives = 289/347 (83%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG +I + LL V+ PN GL RIGLKK K D NR+AAR+ SK+ +SFR
Sbjct: 1 MGNRMNAIVLCLLVSTLLVSAVYCAPNAGLRRIGLKKIKLDPKNRLAARVGSKDVDSFRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIR++ L+ N G + + DIVALKNY+DAQY+GEI IGT PQ F VIFDTGSSNLWVPSSK
Sbjct: 61 SIRQFHLQNNFGGTEETDIVALKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPSSK 120
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH+KY+S +SST+KKNG +A I YGTGAISGFFS D V++G++VVK+QEFIE
Sbjct: 121 CTFSVACYFHAKYKSSKSSTFKKNGTAAAIQYGTGAISGFFSYDSVRVGEIVVKNQEFIE 180
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREP +TFL AKFDGILGLGFQEISVG A PVWYNMV+QGL+ EPVFSFWFNRN +EEE
Sbjct: 181 ATREPGVTFLAAKFDGILGLGFQEISVGNAAPVWYNMVDQGLLKEPVFSFWFNRNPEEEE 240
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+DP HYKG+HTYVPVT+KGYWQFDMGDV+I G+ TG+CA GC+AIADSGTSL
Sbjct: 241 GGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAIADSGTSL 300
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTT+IT +NHAIGA+G++SQECK +V++YG+ I+++LLA+ K
Sbjct: 301 LAGPTTVITMINHAIGASGVMSQECKTIVAEYGQTILDLLLAETQPK 347
>gi|223946977|gb|ACN27572.1| unknown [Zea mays]
gi|238014788|gb|ACR38429.1| unknown [Zea mays]
gi|413946556|gb|AFW79205.1| aspartic proteinase oryzasin-1 isoform 1 [Zea mays]
gi|413946557|gb|AFW79206.1| aspartic proteinase oryzasin-1 isoform 2 [Zea mays]
Length = 510
Score = 496 bits (1276), Expect = e-138, Method: Compositional matrix adjust.
Identities = 232/316 (73%), Positives = 276/316 (87%), Gaps = 3/316 (0%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL R+ LKK D N RVAARL ++E + R +R + G+ G+ D+D++ALKNYM+A
Sbjct: 27 GLVRVALKKLPVDQNGRVAARLSAEERQ--RLLLRGANALGSGGDD-DSDVIALKNYMNA 83
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIG+G+P Q FTVIFDTGSSNLWVPSSKCYFSIACYFHS+Y+SG+SSTYKKNGK A
Sbjct: 84 QYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPA 143
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+I+GFFSED V +GDLVVKDQEFIEAT+EP LTF++AKFDGILGLGFQEISVG
Sbjct: 144 AIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVG 203
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PVWYNMV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYKG+HT+VPVT+KGY
Sbjct: 204 NATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGY 263
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQF+MGDV++DG++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK V
Sbjct: 264 WQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTV 323
Query: 329 VSQYGEEIINMLLAKV 344
VSQYG++I+++LLA+
Sbjct: 324 VSQYGQQILDLLLAET 339
>gi|226503984|ref|NP_001148782.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195622118|gb|ACG32889.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 510
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 232/316 (73%), Positives = 276/316 (87%), Gaps = 3/316 (0%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL R+ LKK D N RVAARL ++E + R +R + G+ G+ D+D++ALKNYM+A
Sbjct: 27 GLVRVALKKLPVDQNGRVAARLSAEERQ--RLLLRGSNALGSGGDD-DSDVIALKNYMNA 83
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIG+G+P Q FTVIFDTGSSNLWVPSSKCYFSIACYFHS+Y+SG+SSTYKKNGK A
Sbjct: 84 QYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKCYFSIACYFHSRYKSGQSSTYKKNGKPA 143
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+I+GFFSED V +GDLVVKDQEFIEAT+EP LTF++AKFDGILGLGFQEISVG
Sbjct: 144 AIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVG 203
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PVWYNMV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYKG+HT+VPVT+KGY
Sbjct: 204 NATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGY 263
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQF+MGDV++DG++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK V
Sbjct: 264 WQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTV 323
Query: 329 VSQYGEEIINMLLAKV 344
VSQYG++I+++LLA+
Sbjct: 324 VSQYGQQILDLLLAET 339
>gi|2160151|gb|AAB60773.1| Strong similarity to Brassica aspartic protease (gb|X77260)
[Arabidopsis thaliana]
Length = 433
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 230/361 (63%), Positives = 284/361 (78%), Gaps = 16/361 (4%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ +++ F+ LLF +S N G +R+GLKK K D NNR+A R SK+ E+ R+
Sbjct: 1 MGVYSRAVAFSVFVSFLLFFTAYSKRNDGTFRVGLKKLKLDPNNRLATRFGSKQEEALRS 60
Query: 61 SIRKYSLRGNLG-ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
S+R Y+ NLG +SGDADIV LKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPS
Sbjct: 61 SLRSYN--NNLGGDSGDADIVPLKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSG 118
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KC+FS++CYFH+KY+S RSSTYKK+GK A IHYG+G+ISGFFS D V +GDLVVKDQEFI
Sbjct: 119 KCFFSLSCYFHAKYKSSRSSTYKKSGKRAAIHYGSGSISGFFSYDAVTVGDLVVKDQEFI 178
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
E T EP LTFL+AKFDG+LGLGFQEI+VG A PVWYNM+ QGL+ PVFSFW NR+ E
Sbjct: 179 ETTSEPGLTFLVAKFDGLLGLGFQEIAVGNATPVWYNMLKQGLIKRPVFSFWLNRDPKSE 238
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGEIVFGG+DP H++GEHT+VPVTQ+GYWQFDMG+V+I G++TG+C GC+AIADSGTS
Sbjct: 239 EGGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAIADSGTS 298
Query: 300 LLAGPT-------------TIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSA 346
LLAGPT ++ +N AIGA+G+VSQ+CK VV QYG+ I+++LLA+
Sbjct: 299 LLAGPTVSKYHEFIVLFQLAVVAMINKAIGASGVVSQQCKTVVDQYGQTILDLLLAETQP 358
Query: 347 K 347
K
Sbjct: 359 K 359
>gi|4589716|dbj|BAA76870.1| aspartic proteinase [Helianthus annuus]
Length = 509
Score = 493 bits (1269), Expect = e-137, Method: Compositional matrix adjust.
Identities = 235/336 (69%), Positives = 277/336 (82%), Gaps = 10/336 (2%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI 79
P FS+ GGL R+GLKKRK + NRV+ S EG T R + L S + D+
Sbjct: 20 PTAFSSTKGGLLRVGLKKRKTNQFNRVSEHGLSMEG----TDRRNFGFYDTLRNS-EGDV 74
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ LKNYMDAQYFGEIGIGTPPQ FTV+FDTGS+NLWVPSSKC+ S+AC FH KY++ RSS
Sbjct: 75 IVLKNYMDAQYFGEIGIGTPPQKFTVVFDTGSANLWVPSSKCFLSVACLFHQKYKASRSS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +A I YGTGAISG FS D VK+GDLVVK+Q+FIEATREP +TFL AKFDGILG
Sbjct: 135 TYKKNGTAAAIQYGTGAISGVFSRDSVKLGDLVVKEQDFIEATREPGITFLAAKFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+Q+ISVGKAVPVWYNMVNQGLV EPVFSFWFNR+ EEEGGE+VFGG+DP+H+KG+HT
Sbjct: 195 LGYQDISVGKAVPVWYNMVNQGLVQEPVFSFWFNRHTGEEEGGELVFGGVDPNHFKGKHT 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YVPVTQKGYWQFDMGDV+I +TTGFC+GGCAAIADSGTSLLAGPTTIITQ+NHAIGA G
Sbjct: 255 YVPVTQKGYWQFDMGDVLIGDKTTGFCSGGCAAIADSGTSLLAGPTTIITQINHAIGAAG 314
Query: 320 IVSQECKAVVSQYGEEIINMLLA-----KVSAKLSL 350
++SQ+CK +V QYG+ II MLL+ K+ ++++L
Sbjct: 315 VMSQQCKTLVDQYGKTIIEMLLSEAQPDKICSRMNL 350
>gi|356522015|ref|XP_003529645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 514
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 238/325 (73%), Positives = 281/325 (86%)
Query: 23 FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
+ PN GL RIGLKK K D NR+AAR+ SK+ +SFR SIRK+ L+ N G S + DIVAL
Sbjct: 23 YCAPNDGLRRIGLKKIKLDPKNRLAARIGSKDVDSFRASIRKFHLQNNFGGSEETDIVAL 82
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYK 142
KNY+DAQY+GEI IGT PQ FTVIFDTGSSNLWVPSSKC FS+ACYFH+KY+S +SSTYK
Sbjct: 83 KNYLDAQYYGEIAIGTSPQKFTVIFDTGSSNLWVPSSKCTFSVACYFHAKYKSSKSSTYK 142
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
KNG +A I YGTGAISGFFS D V++GD+ VK+QEFIEATREP +TFL AKFDGILGLGF
Sbjct: 143 KNGTAAAIQYGTGAISGFFSYDSVRVGDIFVKNQEFIEATREPGVTFLAAKFDGILGLGF 202
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
QEISVG AVPVWYNMV+QGL+ EPVFSFWFNR +EEEGGEIVFGG+DP HYKG+HTYVP
Sbjct: 203 QEISVGNAVPVWYNMVDQGLIKEPVFSFWFNRKPEEEEGGEIVFGGVDPAHYKGKHTYVP 262
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
VT+KGYWQFDMGDV+I G+ TG+CA GC+AIADSGTSLLAGPTT+IT +NHAIGA+G++S
Sbjct: 263 VTRKGYWQFDMGDVLIGGKPTGYCADGCSAIADSGTSLLAGPTTVITMINHAIGASGVMS 322
Query: 323 QECKAVVSQYGEEIINMLLAKVSAK 347
QECK VV++YG+ I+++LL++ K
Sbjct: 323 QECKTVVAEYGQTILDLLLSETQPK 347
>gi|122890420|emb|CAM12780.1| aspartic proteinase [Fagopyrum esculentum]
Length = 506
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 235/328 (71%), Positives = 279/328 (85%), Gaps = 6/328 (1%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI 79
P+ S N L R+GLKKRK D NR A+R K + ++KY L G+ D I
Sbjct: 18 PIALSVANNDLVRVGLKKRKLDPTNRPASRFGCK-----KHLMQKYGLGNGFGDD-DTGI 71
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
++LKNYMDAQYFGEI IGTP Q FTVIFDTGSSNLWVPS KCY SIAC+FHSKY+S +SS
Sbjct: 72 ISLKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSS 131
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNGKSA+IHYGTGAISG+FS+D+VK+GDLVV++QEFIEATREPSLTF+ AKFDGILG
Sbjct: 132 TYVKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILG 191
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFW NRNADEE GGEIVFGG+DP H+KGEHT
Sbjct: 192 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEVGGEIVFGGIDPAHHKGEHT 251
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+PVTQKGYWQFD+ DV++ G++TGFC+GGC+AIADSGTSLLAGPT ++ Q+NHAIGA+G
Sbjct: 252 YLPVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASG 311
Query: 320 IVSQECKAVVSQYGEEIINMLLAKVSAK 347
+VSQECK VVSQYG++I+++L+++ +
Sbjct: 312 VVSQECKTVVSQYGKQILDLLVSQTQPR 339
>gi|77808107|gb|AAV84085.2| aspartic proteinase 9 [Fagopyrum esculentum]
Length = 506
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 235/328 (71%), Positives = 279/328 (85%), Gaps = 6/328 (1%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI 79
P+ S N L R+GLKKRK D NR A+R K + ++KY L G+ D I
Sbjct: 18 PISLSVANNDLVRVGLKKRKLDPTNRPASRFGCK-----KHLMQKYGLGNGFGDD-DTGI 71
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
++LKNYMDAQYFGEI IGTP Q FTVIFDTGSSNLWVPS KCY SIAC+FHSKY+S +SS
Sbjct: 72 ISLKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKCYLSIACFFHSKYKSSKSS 131
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNGKSA+IHYGTGAISG+FS+D+VK+GDLVV++QEFIEATREPSLTF+ AKFDGILG
Sbjct: 132 TYVKNGKSAEIHYGTGAISGYFSQDNVKVGDLVVENQEFIEATREPSLTFVAAKFDGILG 191
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFW NRNADEE GGEIVFGG+DP H+KGEHT
Sbjct: 192 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWLNRNADEEIGGEIVFGGIDPAHHKGEHT 251
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+PVTQKGYWQFD+ DV++ G++TGFC+GGC+AIADSGTSLLAGPT ++ Q+NHAIGA+G
Sbjct: 252 YLPVTQKGYWQFDLDDVLVGGESTGFCSGGCSAIADSGTSLLAGPTPVVAQINHAIGASG 311
Query: 320 IVSQECKAVVSQYGEEIINMLLAKVSAK 347
+VSQECK VVSQYG++I+++L+++ +
Sbjct: 312 VVSQECKTVVSQYGKQILDLLVSQTQPR 339
>gi|356532081|ref|XP_003534602.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 232/347 (66%), Positives = 281/347 (80%), Gaps = 7/347 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + + F L LLF +VF PN GL RIGLKK K + + D + + FR+
Sbjct: 1 MGNMSNVVVFCFCLWTLLFSLVFCAPNDGLGRIGLKKVKLNTH-------DVEGLKEFRS 53
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
SIRK+ L+ LG + + D+VALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPSSK
Sbjct: 54 SIRKHHLQNILGGAEETDVVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSK 113
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIAC+ H++YRS +SSTY++NG SA I YGTGAISGFFS D VK+GD+VVKDQEFIE
Sbjct: 114 CYFSIACFMHARYRSSQSSTYRENGTSAAIQYGTGAISGFFSNDDVKVGDIVVKDQEFIE 173
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATREP +TF+ AKFDGILGLGFQ+ISVG AVPVWY+MV QGLV +PVFSFW NR +EE
Sbjct: 174 ATREPGVTFVAAKFDGILGLGFQDISVGYAVPVWYSMVEQGLVKDPVFSFWLNRKPEEEN 233
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG DP HYKG+HTYVPVT+KGYWQFDMGDV+I G+ TG+CA C+AIADSGTSL
Sbjct: 234 GGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDVLIAGKPTGYCADDCSAIADSGTSL 293
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LAGPTT++T +N AIGA+G+VS+EC++VV+QYG+ I+ +LLA+ K
Sbjct: 294 LAGPTTVVTMINQAIGASGVVSKECRSVVNQYGQTILELLLAEAKPK 340
>gi|2811025|sp|O04057.1|ASPR_CUCPE RecName: Full=Aspartic proteinase; Flags: Precursor
gi|1944181|dbj|BAA19607.1| aspartic endopeptidase [Cucurbita pepo]
Length = 513
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 245/338 (72%), Positives = 289/338 (85%), Gaps = 4/338 (1%)
Query: 13 FLCLLL---FPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
FLCL L F +V S N GL R+GLKK K D NR+AAR++SK+ E + + RKY+ +G
Sbjct: 10 FLCLFLLVSFNIVSSASNDGLLRVGLKKIKLDPENRLAARVESKDAEILKAAFRKYNPKG 69
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
NLGES D DIVALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWV +C FS+AC+F
Sbjct: 70 NLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWV-LCECLFSVACHF 128
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H++Y+S RSS+YKKNG SA I YGTGA+SGFFS D+VK+GDLVVK+Q FIEATREPSLTF
Sbjct: 129 HARYKSSRSSSYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKEQVFIEATREPSLTF 188
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
L+AKFDG+LGLGFQEI+VG AVPVWYNMV QGLV EPVFSFW NRN +EEEGGEIVFGG+
Sbjct: 189 LVAKFDGLLGLGFQEIAVGNAVPVWYNMVEQGLVKEPVFSFWLNRNVEEEEGGEIVFGGV 248
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP HY+G+HTYVPVTQKGYWQFDMGDV+IDG+ TGFC GGC+AIADSGTSLLAGPT +IT
Sbjct: 249 DPKHYRGKHTYVPVTQKGYWQFDMGDVLIDGEPTGFCDGGCSAIADSGTSLLAGPTPVIT 308
Query: 310 QVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
+NHAIGA G+VSQ+CKAVV+QYG+ I+++LL++ K
Sbjct: 309 MINHAIGAKGVVSQQCKAVVAQYGQTIMDLLLSEADPK 346
>gi|224068986|ref|XP_002302872.1| predicted protein [Populus trichocarpa]
gi|222844598|gb|EEE82145.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 240/339 (70%), Positives = 279/339 (82%), Gaps = 11/339 (3%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFD-LNNRVAAR----LDSKEG--ESFRTSIRKYSLR 68
LL FPVV S + GL RIGLKK+K D L RV + +EG S + +KY
Sbjct: 4 LLSFPVVLSARDDGLMRIGLKKKKLDHLGRRVVPGSVNFIPEEEGGGASKPAATKKYY-- 61
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
N+GE+ +ADIVALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ACY
Sbjct: 62 -NIGET-EADIVALKNYLDAQYYGEITIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACY 119
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
FHSKY+S S+TY KNG SA I YGTG+ISGFFS+D V++GDLVVK+Q FIEAT+EP +T
Sbjct: 120 FHSKYKSSASTTYVKNGTSAAIQYGTGSISGFFSQDSVEVGDLVVKNQGFIEATKEPGVT 179
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
FL +KFDGILGLGFQEISVG AVPVWYNMVNQGLV E VFSFW NRN + EEGGEIVFGG
Sbjct: 180 FLASKFDGILGLGFQEISVGNAVPVWYNMVNQGLVKEKVFSFWLNRNVEGEEGGEIVFGG 239
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+DP+HYKGEHTYVPVT KGYWQFDMGD++I +TTG CAGGC AIADSGTSLLAGPTT+I
Sbjct: 240 VDPNHYKGEHTYVPVTHKGYWQFDMGDLLIGTETTGLCAGGCKAIADSGTSLLAGPTTVI 299
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
TQ+N+AIGA+GIVS+ECK VV+QYG+ I+ ML+A+ +
Sbjct: 300 TQINNAIGASGIVSEECKTVVAQYGKIILEMLVAQAQPR 338
>gi|556819|emb|CAA57510.1| cyprosin [Cynara cardunculus]
Length = 509
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 244/344 (70%), Positives = 286/344 (83%), Gaps = 7/344 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K+ FL LL P FS NGGL R+GLKKRK D N++ S G S
Sbjct: 1 MGTAIKASVLALFLFFLLSPTAFSVSNGGLLRVGLKKRKVDQINQL-----SGHGVSMEA 55
Query: 61 SIRK-YSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
RK + G L +SG +DI+ALKNYMDAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPS+
Sbjct: 56 KARKDFGFGGALRDSG-SDIIALKNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSA 114
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KCYFS+AC FHSKY+S SSTYKKNG SA I YGTG+ISGF S+D VK+GDLVVK+Q+FI
Sbjct: 115 KCYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFI 174
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP +TFL AKFDGILGLGFQEISVGK+VP+WYNMVNQGLV EPVFSFWFNRNADEE
Sbjct: 175 EATKEPGITFLAAKFDGILGLGFQEISVGKSVPLWYNMVNQGLVQEPVFSFWFNRNADEE 234
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
EGGE+VFGG+DP+H+KG+HTYVPVT+KGYWQFDMGDV+I+ +TTGFC+ GCAAIADSGTS
Sbjct: 235 EGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFDMGDVLIEDKTTGFCSDGCAAIADSGTS 294
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
LLAGPT IIT++NHAIGA G++SQ+CK +VSQYG+ +I MLL++
Sbjct: 295 LLAGPTAIITEINHAIGAKGVMSQQCKTLVSQYGKTMIEMLLSE 338
>gi|357480353|ref|XP_003610462.1| Aspartic proteinase [Medicago truncatula]
gi|355511517|gb|AES92659.1| Aspartic proteinase [Medicago truncatula]
Length = 519
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/326 (69%), Positives = 273/326 (83%), Gaps = 10/326 (3%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLD-------SKEGESFRTSIRKYSLRGNLGESGDADIVA 81
GL RI LKK + D NN++AA +K+ +S ++SIRKY+L N E+ DIVA
Sbjct: 31 GLRRIALKKIQLDRNNKLAAAAAAAAGGRRTKDTDSLQSSIRKYNLANNYQET---DIVA 87
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
LKNY+DAQY+GEI IGT PQ FTVIFDTGSSNLWVPSSKC FS+ACYFH+KY+S +S+TY
Sbjct: 88 LKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSKCTFSVACYFHAKYKSTKSTTY 147
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+KNG +A I YGTGAISGFFS D VK+GD+VVK+QEFIEAT+EP +TFL+AKFDGILGLG
Sbjct: 148 RKNGTAAAIQYGTGAISGFFSYDSVKVGDIVVKNQEFIEATKEPGVTFLVAKFDGILGLG 207
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
FQEISVG AVPVWYNMV QGL+ EPVFSFW NR +EEEGGEIVFGG+DP HYKG HTYV
Sbjct: 208 FQEISVGNAVPVWYNMVEQGLIQEPVFSFWLNRKPEEEEGGEIVFGGVDPAHYKGNHTYV 267
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PV +KGYWQFDMGDV IDG++TG+C GC+AIADSGTSLLAGPTT+IT +NHAIGA+G+V
Sbjct: 268 PVKRKGYWQFDMGDVTIDGKSTGYCVDGCSAIADSGTSLLAGPTTVITMINHAIGASGVV 327
Query: 322 SQECKAVVSQYGEEIINMLLAKVSAK 347
S+ECK +V++YG+ I+N+LLA+ K
Sbjct: 328 SKECKTIVAEYGQTILNLLLAEAQPK 353
>gi|509163|emb|CAA48939.1| cyprosin [Cynara cardunculus]
Length = 474
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 226/309 (73%), Positives = 265/309 (85%), Gaps = 5/309 (1%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEI 94
LKKRK ++ N S + + RKY +RGN +S D +++ALKNYMDAQYFGEI
Sbjct: 1 LKKRKVNILNHPGEHAGSNDA----NARRKYGVRGNFRDS-DGELIALKNYMDAQYFGEI 55
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
GIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+AC FHSKYRS S+TYKKNGKSA I YGT
Sbjct: 56 GIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAIQYGT 115
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G+ISGFFS+D VK+GDL+VK+Q+FIEAT+EP +TFL AKFDGILGLGFQEISVG AVPVW
Sbjct: 116 GSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDAVPVW 175
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
Y M+NQGLV EPVFSFW NRNADE+EGGE+VFGG+DP+H+KGEHTYVPVTQKGYWQF+MG
Sbjct: 176 YTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMG 235
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGE 334
DV+I +TTGFCA GCAAIADSGTSLLAG TTI+TQ+N AIGA G++SQ+CK++V QYG+
Sbjct: 236 DVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGK 295
Query: 335 EIINMLLAK 343
+I MLL++
Sbjct: 296 SMIEMLLSE 304
>gi|1665867|emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
Length = 509
Score = 480 bits (1236), Expect = e-133, Method: Compositional matrix adjust.
Identities = 243/343 (70%), Positives = 286/343 (83%), Gaps = 5/343 (1%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K+ FL +LL P FS NGGL R+GLKKRK D N++ S EG++ +
Sbjct: 1 MGTAIKASLLALFLFVLLSPTAFSASNGGLLRVGLKKRKVDQINQLRNHGASMEGKARK- 59
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
+ G+L +S D+DI+ LKNYMDAQY+GEIGIG+P Q FTVIFDTGSSNLWVPS+K
Sbjct: 60 ---DFGFGGSLRDS-DSDIIELKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAK 115
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+AC FHSKY+S SSTYKKNG SA I YGTG+ISGF S+D VK+GDLVVK+Q+FIE
Sbjct: 116 CYFSVACLFHSKYKSSHSSTYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIE 175
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT+EP +TFL AKFDGILGLGFQEISVGK+VPVWYNMVNQGLV EPVFSFWFNRNADEEE
Sbjct: 176 ATKEPGVTFLAAKFDGILGLGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNADEEE 235
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP+H+KG+HTYVPVTQKGYWQF+MGDV+I+ +TTGFCA GCAAIADSGTSL
Sbjct: 236 GGELVFGGVDPNHFKGKHTYVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAIADSGTSL 295
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
LAGPT IITQ+NHAIGA G++SQ+CK +V QYG+ II MLL++
Sbjct: 296 LAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGKTIIEMLLSE 338
>gi|1169175|sp|P40782.2|CYPR1_CYNCA RecName: Full=Cyprosin; Flags: Precursor
gi|1585067|prf||2124255A cyprosin
Length = 473
Score = 480 bits (1235), Expect = e-133, Method: Compositional matrix adjust.
Identities = 226/309 (73%), Positives = 265/309 (85%), Gaps = 5/309 (1%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEI 94
LKKRK ++ N S + + RKY +RGN +S D +++ALKNYMDAQYFGEI
Sbjct: 1 LKKRKVNILNHPGEHAGSNDA----NARRKYGVRGNFRDS-DGELIALKNYMDAQYFGEI 55
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
GIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+AC FHSKYRS S+TYKKNGKSA I YGT
Sbjct: 56 GIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACLFHSKYRSTDSTTYKKNGKSAAIQYGT 115
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G+ISGFFS+D VK+GDL+VK+Q+FIEAT+EP +TFL AKFDGILGLGFQEISVG AVPVW
Sbjct: 116 GSISGFFSQDSVKLGDLLVKEQDFIEATKEPGITFLAAKFDGILGLGFQEISVGDAVPVW 175
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
Y M+NQGLV EPVFSFW NRNADE+EGGE+VFGG+DP+H+KGEHTYVPVTQKGYWQF+MG
Sbjct: 176 YTMLNQGLVQEPVFSFWLNRNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMG 235
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGE 334
DV+I +TTGFCA GCAAIADSGTSLLAG TTI+TQ+N AIGA G++SQ+CK++V QYG+
Sbjct: 236 DVLIGDKTTGFCASGCAAIADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGK 295
Query: 335 EIINMLLAK 343
+I MLL++
Sbjct: 296 SMIEMLLSE 304
>gi|302144105|emb|CBI23210.3| unnamed protein product [Vitis vinifera]
Length = 429
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 217/262 (82%), Positives = 246/262 (93%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
MDAQYFGEIGIGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ CYFHSKY+S +SSTY+KNG
Sbjct: 1 MDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKCYFSVPCYFHSKYKSSQSSTYRKNG 60
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
KSADIHYGTGAISGFFSED+VK+GDLVVK+QEFIEATREPS+TFL+AKFDGILGLGFQEI
Sbjct: 61 KSADIHYGTGAISGFFSEDNVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEI 120
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SVG AVPVWYNMV QGLV EPVFSFW NR D++EGGE+VFGG+DPDH+KGEHTYVPVTQ
Sbjct: 121 SVGNAVPVWYNMVKQGLVKEPVFSFWLNRKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQ 180
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
KGYWQFDMG+V+IDG+TTG+CAGGCAAIADSGTSLLAGPT ++ +NHAIGATG+VSQEC
Sbjct: 181 KGYWQFDMGEVLIDGETTGYCAGGCAAIADSGTSLLAGPTAVVAMINHAIGATGVVSQEC 240
Query: 326 KAVVSQYGEEIINMLLAKVSAK 347
K VV+QYGE I+++LL++ S +
Sbjct: 241 KTVVAQYGETIMDLLLSEASPQ 262
>gi|425892460|gb|AFB73927.2| preprocirsin [Cirsium vulgare]
Length = 509
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 232/324 (71%), Positives = 276/324 (85%), Gaps = 5/324 (1%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI 79
P S N GL R+GLKKRK D N+++ S EG++ + + G L +S D+DI
Sbjct: 20 PTAISVSNDGLIRVGLKKRKVDQINQLSGHGASMEGKARK----DFGFGGTLRDS-DSDI 74
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ALKNYMDAQY+GEIGIG PPQ FTVIFDTGSSNLWVPS+KCYFS+AC FHSKY+S SS
Sbjct: 75 IALKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKCYFSVACLFHSKYKSSHSS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG SA I YGTG+ISGF S+D VK+GDLVVK+Q+FIEAT+EP +TFL AKFDGILG
Sbjct: 135 TYKKNGTSAAIQYGTGSISGFVSQDSVKLGDLVVKEQDFIEATKEPGITFLAAKFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGFQEISVGK+VPVWYNMVNQGLV EPVFSFWFNRNA+EEEGGE+VFGG+DP+H+KG+HT
Sbjct: 195 LGFQEISVGKSVPVWYNMVNQGLVQEPVFSFWFNRNANEEEGGELVFGGVDPNHFKGKHT 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YVPVT+KGYWQF+MGDV+I+ +TTGFC+ GCAAIADSGTSLLAGPT IIT++NHA GA G
Sbjct: 255 YVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAIADSGTSLLAGPTAIITEINHASGAKG 314
Query: 320 IVSQECKAVVSQYGEEIINMLLAK 343
++SQ+CK +VSQYG+ II MLL++
Sbjct: 315 VMSQQCKTLVSQYGKSIIEMLLSE 338
>gi|115439013|ref|NP_001043786.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|113533317|dbj|BAF05700.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|215701483|dbj|BAG92907.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188796|gb|EEC71223.1| hypothetical protein OsI_03158 [Oryza sativa Indica Group]
gi|222618996|gb|EEE55128.1| hypothetical protein OsJ_02912 [Oryza sativa Japonica Group]
gi|385717674|gb|AFI71272.1| unnamed protein [Oryza sativa Japonica Group]
Length = 522
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 224/324 (69%), Positives = 267/324 (82%), Gaps = 11/324 (3%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN---------LGESGDADI 79
G+ RI LKKR+ D RV L ++ + R R++ N + DI
Sbjct: 30 GVVRIALKKRQVDETGRVGGHLAGEDAQ--RLLARRHGFLTNDAARAASRKARAEAEGDI 87
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
VALKNY++AQY+GEI IGTPPQ FTVIFDTGSSNLWVPSSKC+ SIACYFHS+Y++G+SS
Sbjct: 88 VALKNYLNAQYYGEIAIGTPPQMFTVIFDTGSSNLWVPSSKCHLSIACYFHSRYKAGQSS 147
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNGK A IHYGTGAISG+FS+D VK+GD+ VK+Q+FIEATREPS+TF++AKFDGILG
Sbjct: 148 TYKKNGKPASIHYGTGAISGYFSQDSVKVGDVAVKNQDFIEATREPSITFMVAKFDGILG 207
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF+EISVG AVP+WYNMV QGLV +PVFSFWFNR+ADE +GGEIVFGG+DP+HYKG HT
Sbjct: 208 LGFKEISVGNAVPIWYNMVRQGLVVDPVFSFWFNRHADEGQGGEIVFGGIDPNHYKGNHT 267
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YVPVT+KGYWQF+MGDV+I G +TGFCA GCAAIADSGTSLL GPT IITQ+N IGATG
Sbjct: 268 YVPVTRKGYWQFNMGDVLIGGNSTGFCAAGCAAIADSGTSLLTGPTAIITQINEKIGATG 327
Query: 320 IVSQECKAVVSQYGEEIINMLLAK 343
+VSQECKAVVSQYG++I++ L A+
Sbjct: 328 VVSQECKAVVSQYGQQILDQLRAE 351
>gi|297809619|ref|XP_002872693.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318530|gb|EFH48952.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 230/349 (65%), Positives = 276/349 (79%), Gaps = 8/349 (2%)
Query: 1 MGMVFKSITAGFFL-CLLLFPVVFSTPNG-GLYRIGLKKRKFDLNNRVAARLDSKEGESF 58
MG F+S F L CL+L G G RIGLKKRK D +NR+A++L K S+
Sbjct: 1 MGTRFQSFLLVFLLSCLILISTALCERKGDGTIRIGLKKRKLDRSNRLASQLFLKNRGSW 60
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
S + Y R N +AD+V LKNY+DAQY+G+I IGTPPQ FTVIFDTGSSNLW+PS
Sbjct: 61 --SPKDY-FRLN---DANADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPS 114
Query: 119 SKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
+KCY S+ACYFHSKY++ +SS+Y+KNGK A I YGTGAISG+FS D VK+GD+VVK+QEF
Sbjct: 115 TKCYLSVACYFHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEF 174
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
IEAT EP +TFLLAKFDGILGLGF+EISVG + PVWYNMV +GLV +PVFSFW NRN +
Sbjct: 175 IEATTEPGITFLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKDPVFSFWLNRNPQD 234
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
+EGGEIVFGG+DP H+KGEHTYVPVT KGYWQFDMGD+ I G+ TG+CA GC+AIADSGT
Sbjct: 235 QEGGEIVFGGVDPKHFKGEHTYVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGT 294
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
SLL GP+T+IT +NHAIGA GIVS+ECKAVV QYG+ ++N LLA+ K
Sbjct: 295 SLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPK 343
>gi|15233518|ref|NP_192355.1| phytepsin [Arabidopsis thaliana]
gi|75338508|sp|Q9XEC4.1|APA3_ARATH RecName: Full=Aspartic proteinase A3; Flags: Precursor
gi|4773885|gb|AAD29758.1|AF076243_5 putative aspartic protease [Arabidopsis thaliana]
gi|13937238|gb|AAK50111.1|AF372974_1 AT4g04460/T26N6_7 [Arabidopsis thaliana]
gi|7267203|emb|CAB77914.1| putative aspartic protease [Arabidopsis thaliana]
gi|332656990|gb|AEE82390.1| phytepsin [Arabidopsis thaliana]
Length = 508
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 227/349 (65%), Positives = 273/349 (78%), Gaps = 7/349 (2%)
Query: 1 MGMVFKSITAGFFL-CLLLFPVVFSTPNG-GLYRIGLKKRKFDLNNRVAARLDSKEGESF 58
MG F+S F L CL+L NG G RIGLKKRK D +NR+A++L K S
Sbjct: 1 MGTRFQSFLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNRGSH 60
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+ + L +AD+V LKNY+DAQY+G+I IGTPPQ FTVIFDTGSSNLW+PS
Sbjct: 61 WSPKHYFRLNDE-----NADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPS 115
Query: 119 SKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
+KCY S+ACYFHSKY++ +SS+Y+KNGK A I YGTGAISG+FS D VK+GD+VVK+QEF
Sbjct: 116 TKCYLSVACYFHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEF 175
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
IEAT EP +TFLLAKFDGILGLGF+EISVG + PVWYNMV +GLV EP+FSFW NRN +
Sbjct: 176 IEATSEPGITFLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKD 235
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
EGGEIVFGG+DP H+KGEHT+VPVT KGYWQFDMGD+ I G+ TG+CA GC+AIADSGT
Sbjct: 236 PEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGT 295
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
SLL GP+T+IT +NHAIGA GIVS+ECKAVV QYG+ ++N LLA+ K
Sbjct: 296 SLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQEDPK 344
>gi|334186351|ref|NP_001190671.1| phytepsin [Arabidopsis thaliana]
gi|332656991|gb|AEE82391.1| phytepsin [Arabidopsis thaliana]
Length = 504
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 229/356 (64%), Positives = 278/356 (78%), Gaps = 8/356 (2%)
Query: 1 MGMVFKSITAGFFL-CLLLFPVVFSTPNG-GLYRIGLKKRKFDLNNRVAARLDSKEGESF 58
MG F+S F L CL+L NG G RIGLKKRK D +NR+A++L K S
Sbjct: 1 MGTRFQSFLLVFLLSCLILISTASCERNGDGTIRIGLKKRKLDRSNRLASQLFLKNRGSH 60
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+ + L +AD+V LKNY+DAQY+G+I IGTPPQ FTVIFDTGSSNLW+PS
Sbjct: 61 WSPKHYFRLNDE-----NADMVPLKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPS 115
Query: 119 SKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
+KCY S+ACYFHSKY++ +SS+Y+KNGK A I YGTGAISG+FS D VK+GD+VVK+QEF
Sbjct: 116 TKCYLSVACYFHSKYKASQSSSYRKNGKPASIRYGTGAISGYFSNDDVKVGDIVVKEQEF 175
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
IEAT EP +TFLLAKFDGILGLGF+EISVG + PVWYNMV +GLV EP+FSFW NRN +
Sbjct: 176 IEATSEPGITFLLAKFDGILGLGFKEISVGNSTPVWYNMVEKGLVKEPIFSFWLNRNPKD 235
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
EGGEIVFGG+DP H+KGEHT+VPVT KGYWQFDMGD+ I G+ TG+CA GC+AIADSGT
Sbjct: 236 PEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAIADSGT 295
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLA-KVSAKLSLRHY 353
SLL GP+T+IT +NHAIGA GIVS+ECKAVV QYG+ ++N LLA KV +++ + Y
Sbjct: 296 SLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGKTMLNSLLAQKVCSQIGVCAY 351
>gi|449433980|ref|XP_004134774.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449526063|ref|XP_004170034.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 219/325 (67%), Positives = 269/325 (82%), Gaps = 1/325 (0%)
Query: 24 STPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN-LGESGDADIVAL 82
S N G RIGLKK K+D N+R A L+SK+GE +S+ K++ GN L ES +ADIV L
Sbjct: 26 SASNEGFLRIGLKKIKYDQNSRFKALLESKKGEFLGSSVGKHNQWGNNLEESKNADIVPL 85
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYK 142
KNY+DAQY+GEIGIGTPPQ FTVIFDTGSSNLWVPS+KC FS+AC+FH+KY+SGRSSTYK
Sbjct: 86 KNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYK 145
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
+NG SA I YG+GAISGFFS D+V++GD++V++QE IEAT ++TF+ AKFDGILGLGF
Sbjct: 146 RNGTSAAIQYGSGAISGFFSYDNVQVGDVIVRNQELIEATSMSTMTFMAAKFDGILGLGF 205
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
QEI+ G AVPVWYNMV Q LV E VFSFW NRNA+E+EGGE+VFGG+DP H+KG+HTYVP
Sbjct: 206 QEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVP 265
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
VT KGYWQFD+GD++I G+TT +CAGGC+AIADSGTSLLAGP+ I+ +N AIGA +
Sbjct: 266 VTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAH 325
Query: 323 QECKAVVSQYGEEIINMLLAKVSAK 347
ECKA+VSQYG I+++LLAK +
Sbjct: 326 PECKAIVSQYGRAIMDLLLAKAQPE 350
>gi|1168536|sp|P42210.1|ASPR_HORVU RecName: Full=Phytepsin; AltName: Full=Aspartic proteinase;
Contains: RecName: Full=Phytepsin 32 kDa subunit;
Contains: RecName: Full=Phytepsin 29 kDa subunit;
Contains: RecName: Full=Phytepsin 16 kDa subunit;
Contains: RecName: Full=Phytepsin 11 kDa subunit; Flags:
Precursor
gi|18904|emb|CAA39602.1| aspartic proteinase [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 228/319 (71%), Positives = 265/319 (83%), Gaps = 6/319 (1%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL RI LKKR D N+RVA L E + + L + DIVALKNYM+A
Sbjct: 29 GLVRIALKKRPIDRNSRVATGLSGGEEQPLLSGANP------LRSEEEGDIVALKNYMNA 82
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIACY HS+Y++G SSTYKKNGK A
Sbjct: 83 QYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYKKNGKPA 142
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+I+G+FSED V +GDLVVKDQEFIEAT+EP +TFL+AKFDGILGLGF+EISVG
Sbjct: 143 AIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFKEISVG 202
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
KAVPVWY M+ QGLV++PVFSFW NR+ DE EGGEI+FGGMDP HY GEHTYVPVTQKGY
Sbjct: 203 KAVPVWYKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGY 262
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQFDMGDV++ G++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK +
Sbjct: 263 WQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTI 322
Query: 329 VSQYGEEIINMLLAKVSAK 347
VSQYG++I+++LLA+ K
Sbjct: 323 VSQYGQQILDLLLAETQPK 341
>gi|5822248|pdb|1QDM|A Chain A, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822249|pdb|1QDM|B Chain B, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase.
gi|5822250|pdb|1QDM|C Chain C, Crystal Structure Of Prophytepsin, A Zymogen Of A Barley
Vacuolar Aspartic Proteinase
Length = 478
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 226/316 (71%), Positives = 263/316 (83%), Gaps = 6/316 (1%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
RI LKKR D N+RVA L E + + L + DIVALKNYM+AQYF
Sbjct: 2 RIALKKRPIDRNSRVATGLSGGEEQPLLSGANP------LRSEEEGDIVALKNYMNAQYF 55
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
GEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIACY HS+Y++G SSTYKKNGK A I
Sbjct: 56 GEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYKAGASSTYKKNGKPAAIQ 115
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+I+G+FSED V +GDLVVKDQEFIEAT+EP +TFL+AKFDGILGLGF+EISVGKAV
Sbjct: 116 YGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGLGFKEISVGKAV 175
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PVWY M+ QGLV++PVFSFW NR+ DE EGGEI+FGGMDP HY GEHTYVPVTQKGYWQF
Sbjct: 176 PVWYKMIEQGLVSDPVFSFWLNRHVDEGEGGEIIFGGMDPKHYVGEHTYVPVTQKGYWQF 235
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQ 331
DMGDV++ G++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK +VSQ
Sbjct: 236 DMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTIVSQ 295
Query: 332 YGEEIINMLLAKVSAK 347
YG++I+++LLA+ K
Sbjct: 296 YGQQILDLLLAETQPK 311
>gi|73912433|dbj|BAE20413.1| aspartic proteinase [Triticum aestivum]
Length = 508
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 226/320 (70%), Positives = 265/320 (82%), Gaps = 6/320 (1%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL RI LKKR D N+RVA L +E L + DIV+LKNYM+A
Sbjct: 27 GLVRIALKKRAIDRNSRVAKSLSDREEVHLLGGASN-----TLPSEEEGDIVSLKNYMNA 81
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSIACY H++Y++G SSTYKKNGK A
Sbjct: 82 QYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACYLHARYKAGASSTYKKNGKPA 141
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+I+G+FSED V +GDLVVKDQEFIEAT+EP +TFL+AKFDGILGLGF+EISVG
Sbjct: 142 AIQYGTGSIAGYFSEDSVTVGDLVVKDQEFIEATKEPGVTFLVAKFDGILGLGFKEISVG 201
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADE-EEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
KAVPVWYNMV QGL+++PVFSFW NR+AD+ EGGEI+FGGMDP HY GEHTYVP TQKG
Sbjct: 202 KAVPVWYNMVEQGLISDPVFSFWLNRHADDEGEGGEIIFGGMDPKHYVGEHTYVPATQKG 261
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQFDMGDV++ G++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK
Sbjct: 262 YWQFDMGDVLVGGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKT 321
Query: 328 VVSQYGEEIINMLLAKVSAK 347
+VSQYG++I+++LLA+ K
Sbjct: 322 IVSQYGQQILDLLLAETQPK 341
>gi|388517285|gb|AFK46704.1| unknown [Medicago truncatula]
Length = 510
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 230/336 (68%), Positives = 271/336 (80%), Gaps = 6/336 (1%)
Query: 14 LCL--LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
LCL LLF +V PN GL RIGLKK K + N L SK ES +SIR Y+ + L
Sbjct: 12 LCLWTLLFSLVSCAPNEGLRRIGLKKNKLEPKNL----LGSKGCESSWSSIRNYASKNIL 67
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G +G+AD+VALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSN WVPS KCYFS+AC H+
Sbjct: 68 GGAGEADVVALKNYLDAQYYGEISIGTPPQTFTVIFDTGSSNTWVPSVKCYFSLACLVHA 127
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY+S +SSTYK NG A I YGTGA+SGFFS D+VK+GD+VVKD EFIEATREP LTF+
Sbjct: 128 KYKSSQSSTYKPNGTHAAIQYGTGAVSGFFSYDNVKVGDVVVKDVEFIEATREPGLTFVA 187
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDG+LGLGFQEISVG AVP+WY MV QGLV +PVFSFW NRN +EE+GGE+VFGG+DP
Sbjct: 188 AKFDGLLGLGFQEISVGNAVPIWYKMVKQGLVKDPVFSFWLNRNPNEEQGGELVFGGVDP 247
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
H+KGEHTYVPVT+KGYWQF MGDV+IDG+ TG+CA C+AIADSGTSLLAGPTT+IT +
Sbjct: 248 AHFKGEHTYVPVTRKGYWQFAMGDVLIDGKPTGYCANDCSAIADSGTSLLAGPTTVITMI 307
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
N AIGA+G+ SQEC+ VV QYG I+ +L+A+ K
Sbjct: 308 NQAIGASGVYSQECRTVVDQYGHSILQLLVAEAQPK 343
>gi|148910494|gb|ABR18322.1| unknown [Picea sitchensis]
Length = 471
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 217/336 (64%), Positives = 265/336 (78%), Gaps = 3/336 (0%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG- 72
L LL+ V + N L RI LKK+ D AAR+ ++EG RKY LRG L
Sbjct: 9 LALLVLTSVCAA-NDCLARIELKKKGLDQKTLQAARIVAREGGLSNEVNRKYGLRGGLSY 67
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
ES + V LKNY+DAQY+GEIG+GTPPQ FTVIFDTGSSNLWVPS+KCY SIACYFHS
Sbjct: 68 SESARGEYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHS 127
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY++ +SS+Y NGK +I YG+G++SG+ +DHV GDLVVKDQ F E T+EP LTFL
Sbjct: 128 KYKASQSSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLA 187
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQ+ISVG VPVWYNMVNQGL+ EPVFSFW NR +EEGGEIVFGG+DP
Sbjct: 188 AKFDGILGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDP 247
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+H+KG+HTYVPVT++GYWQF+MGD +I GQ+TGFC+GGCAAI DSGTSLLAGP+ I+ Q+
Sbjct: 248 NHFKGKHTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQI 307
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
N AIGA+G+ SQECK+VVSQYG+ I+ +L+A+ + +
Sbjct: 308 NEAIGASGLASQECKSVVSQYGDLIMELLMAQTNPQ 343
>gi|171854659|dbj|BAG16519.1| putative aspartic protease [Capsicum chinense]
Length = 506
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 218/326 (66%), Positives = 262/326 (80%), Gaps = 10/326 (3%)
Query: 22 VFSTPNGGLYRIGLKKRKFDLNNRVAAR---LDSKEGESFRTSIRKYSLRGNLGESGDAD 78
V + L RIGLKK D+N+ AAR L + G+ +K + D D
Sbjct: 19 VLPASSDNLLRIGLKKHHVDVNSINAARVARLQDRYGKHLNGLEKK-------SDGSDVD 71
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
IV LKNY+DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPSS+CYFSIAC+FH KY++G+S
Sbjct: 72 IVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSRCYFSIACWFHHKYKAGKS 131
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STY +NGKS I YGTG+ISG FS+D+V++GDLVVKDQ FIEATREPS+TF++ KFDGIL
Sbjct: 132 STYTRNGKSCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATREPSITFIIGKFDGIL 191
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGFQEISVG A PVWYNMV+QGLV EPVFSFWFNR+A +EGGE+VFGG+DP H+KG H
Sbjct: 192 GLGFQEISVGNATPVWYNMVDQGLVKEPVFSFWFNRDASTKEGGELVFGGVDPKHFKGNH 251
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVP+TQKGYWQF+MGD +I +TG+CAGGCAAI DSGTSLLAGPTTI+TQ+NHAIGA
Sbjct: 252 TYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQLNHAIGAE 311
Query: 319 GIVSQECKAVVSQYGEEIINMLLAKV 344
G+VS ECK +VSQYGE + ++L++ V
Sbjct: 312 GVVSAECKTIVSQYGEVLWDLLVSGV 337
>gi|82623417|gb|ABB87123.1| aspartic protease precursor-like [Solanum tuberosum]
Length = 506
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 224/339 (66%), Positives = 268/339 (79%), Gaps = 15/339 (4%)
Query: 14 LC--LLLFPVVFST---PNGGLYRIGLKKRKFDLNNRVAAR---LDSKEGESFRTSIRKY 65
LC LLL+ + S +G L RIGLKK + D+N+ AAR L + G+ +K
Sbjct: 6 LCAALLLWAITCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRYGKHVNGIEKKS 65
Query: 66 SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 125
S D DIV LKNY+DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPSSKCYFSI
Sbjct: 66 S-------DSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFSI 118
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC+ HSKY++ +SSTY ++G+S I YGTG+ISG FS D+V++GDLVVKDQ FIEATREP
Sbjct: 119 ACWIHSKYKASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATREP 178
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
S+TF++AKFDGILGLGFQEISVG PVWYNMV QGLV E VFSFWFNR+A+ +EGGE+V
Sbjct: 179 SITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGELV 238
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG+DP H+KG HTYVP+TQKGYWQF+MGD +I +TG+CAGGCAAI DSGTSLLAGPT
Sbjct: 239 FGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPT 298
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
TI+TQ+NHAIGA GIVS ECK +VSQYGE I ++L++ V
Sbjct: 299 TIVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGV 337
>gi|294440430|gb|ADE74632.1| aspartic protease 1 [Nicotiana tabacum]
Length = 506
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 220/336 (65%), Positives = 270/336 (80%), Gaps = 9/336 (2%)
Query: 14 LC--LLLFPVVF---STPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
LC LLL+ +V+ + L R+GLKK+ D+N+ AAR+ + R +
Sbjct: 6 LCAALLLWAIVYFVLPVSSDNLLRVGLKKQSLDVNSINAARVARLQD---RYGKNVNGIE 62
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
LG+S D DIV+LKNY+DAQY+GEIG+G+PPQ F VIFDTGSSNLWVPSS+CYFSIAC+
Sbjct: 63 KKLGDS-DLDIVSLKNYLDAQYYGEIGVGSPPQKFKVIFDTGSSNLWVPSSRCYFSIACW 121
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
FHSKY++ +S+TY +NG+S I YGTG+ISG FS+D+V++GDLVVKDQ FIEATREPS+T
Sbjct: 122 FHSKYKASKSTTYTRNGESCSIRYGTGSISGHFSQDNVQVGDLVVKDQVFIEATREPSIT 181
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F++AKFDGILGLGFQEISVG A PVWYNMV QGLV E VFSFW NR+A +EGGE+VFGG
Sbjct: 182 FIIAKFDGILGLGFQEISVGNATPVWYNMVGQGLVKEQVFSFWINRDATAKEGGELVFGG 241
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+D +H+KG HTYVP+TQKGYWQF+MGD +I +TG CAGGCAAI DSGTSLLAGPTT++
Sbjct: 242 VDSNHFKGNHTYVPLTQKGYWQFNMGDFLIGNASTGVCAGGCAAIVDSGTSLLAGPTTVV 301
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
TQ+NHAIGA GIVS ECK +VSQYGE I N+L++ V
Sbjct: 302 TQINHAIGAEGIVSMECKTIVSQYGEMIWNLLVSGV 337
>gi|148906206|gb|ABR16259.1| unknown [Picea sitchensis]
Length = 509
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 217/336 (64%), Positives = 265/336 (78%), Gaps = 3/336 (0%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG- 72
L LL+ V + N L RI LKK+ D AAR+ ++EG RKY LRG L
Sbjct: 9 LALLVLTSVCAA-NDCLARIELKKKGLDQKTLQAARIVAREGGLSNEVNRKYGLRGGLSY 67
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
ES + V LKNY+DAQY+GEIG+GTPPQ FTVIFDTGSSNLWVPS+KCY SIACYFHS
Sbjct: 68 SESARGEYVPLKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKCYLSIACYFHS 127
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY++ +SS+Y NGK +I YG+G++SG+ +DHV GDLVVKDQ F E T+EP LTFL
Sbjct: 128 KYKASQSSSYCVNGKPFNIQYGSGSVSGYLGQDHVTAGDLVVKDQVFAEVTQEPGLTFLA 187
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQ+ISVG VPVWYNMVNQGL+ EPVFSFW NR +EEGGEIVFGG+DP
Sbjct: 188 AKFDGILGLGFQKISVGNVVPVWYNMVNQGLIKEPVFSFWMNRKVGDEEGGEIVFGGVDP 247
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+H+KG+HTYVPVT++GYWQF+MGD +I GQ+TGFC+GGCAAI DSGTSLLAGP+ I+ Q+
Sbjct: 248 NHFKGKHTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAIVDSGTSLLAGPSGIVAQI 307
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
N AIGA+G+ SQECK+VVSQYG+ I+ +L+A+ + +
Sbjct: 308 NEAIGASGLASQECKSVVSQYGDLIMELLMAQTNPQ 343
>gi|297848226|ref|XP_002891994.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
gi|297337836|gb|EFH68253.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 453 bits (1166), Expect = e-125, Method: Compositional matrix adjust.
Identities = 221/334 (66%), Positives = 266/334 (79%), Gaps = 7/334 (2%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFR-TSIRKYSLRGN 70
+FL +LL P T + L+ +GLKKR+ +L++ A+R+ K S R T+ ++ G
Sbjct: 18 YFLAILLHP----TTSSDLFHVGLKKRRLELDDIRASRVIRKLKHSQRLTNYPSFATLG- 72
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
G+S + D V LKNY+DAQY+G IGIGTP Q F VIFDTGSSNLWVPSSKCY S+ACY H
Sbjct: 73 -GDSSNQDQVILKNYLDAQYYGVIGIGTPSQEFEVIFDTGSSNLWVPSSKCYLSLACYLH 131
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY+S +S TY KNGK+ I YG+G+ISGFFSED+VK+GDLVVK+QEFIEATRE SLTFL
Sbjct: 132 PKYKSTKSKTYIKNGKTCTITYGSGSISGFFSEDNVKVGDLVVKNQEFIEATREGSLTFL 191
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
LAKFDG+LGLGFQEISVG AVPVWYNMV+QGLV + VFSFW NR+ + E GGEIVFGG+D
Sbjct: 192 LAKFDGLLGLGFQEISVGNAVPVWYNMVDQGLVRDKVFSFWLNRDTEAEVGGEIVFGGVD 251
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P H+KG+HTYVPVT+KGYWQF+MGD+ + +TGFC GC AI DSGTSLLAGPTT+I Q
Sbjct: 252 PAHFKGKHTYVPVTRKGYWQFNMGDIFVGSNSTGFCEQGCDAIMDSGTSLLAGPTTVIAQ 311
Query: 311 VNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
+NHAIGA GIVS ECK VVSQYGE I N+L+ +V
Sbjct: 312 INHAIGAEGIVSAECKDVVSQYGEMIWNLLVKRV 345
>gi|350535356|ref|NP_001234702.1| aspartic protease precursor [Solanum lycopersicum]
gi|951449|gb|AAB18280.1| aspartic protease precursor [Solanum lycopersicum]
Length = 506
Score = 453 bits (1165), Expect = e-125, Method: Compositional matrix adjust.
Identities = 220/338 (65%), Positives = 270/338 (79%), Gaps = 13/338 (3%)
Query: 14 LC--LLLFPVVFST---PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
LC LLL+ + S +G L+RIGLKK + D+++ AAR+ + + +
Sbjct: 6 LCAALLLWAIACSALPASSGDLFRIGLKKHRLDVDSIKAARVAKLQDRYGK------HVN 59
Query: 69 GNLGESGDADI--VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIA 126
G +S D+DI V LKNY+DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPSSKCYFSIA
Sbjct: 60 GIEKKSSDSDIYKVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFSIA 119
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C+ HSKY++ +SSTY ++G+S I YGTG+ISG FS D+V++GDLVVKDQ FIEATREPS
Sbjct: 120 CWIHSKYQASKSSTYTRDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATREPS 179
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
+TF++AKFDGILGLGFQEISVG PVWYNMV QGLV EPVFSFWFNR+A+ +EGGE+VF
Sbjct: 180 ITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGELVF 239
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG+DP H+KG HT VP+TQKGYWQF+MGD +I +TG+CAGGCAAI DSGTSLLAGPTT
Sbjct: 240 GGVDPKHFKGNHTCVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTT 299
Query: 307 IITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
I+TQ+NHAIGA GIVS ECK +VSQYGE I ++L++ +
Sbjct: 300 IVTQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGI 337
>gi|449533814|ref|XP_004173866.1| PREDICTED: aspartic proteinase-like, partial [Cucumis sativus]
Length = 290
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 210/281 (74%), Positives = 238/281 (84%), Gaps = 3/281 (1%)
Query: 13 FLCLLLF---PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
FLCL L +V S N GL R+GLKK D NR+AARL+SK+ E + + RKYS G
Sbjct: 10 FLCLFLLVSLNIVSSVSNDGLLRVGLKKINLDPENRLAARLESKDAEILKAAFRKYSPNG 69
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
NLGES D DIVALKNY+DAQY+GEI IGTPPQ FTVIFDTGSSNLWVPS+KC FS+AC+F
Sbjct: 70 NLGESSDTDIVALKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKCLFSVACHF 129
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H++Y+S RSSTYKKNG SA I YGTGA+SGFFS D+VK+GDLVVK+Q FIEATREP LTF
Sbjct: 130 HARYKSSRSSTYKKNGTSASIRYGTGAVSGFFSYDNVKVGDLVVKNQLFIEATREPGLTF 189
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
L+AKFDG+LGLGFQEI+VG AVPVWYNMV QGLV EPVFSFW NRNA+EEEGGEIVFGG+
Sbjct: 190 LVAKFDGLLGLGFQEIAVGSAVPVWYNMVEQGLVKEPVFSFWLNRNAEEEEGGEIVFGGV 249
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGC 290
DP HY G+HTYVPVTQKGYWQFDMGDV+IDG+ TG+C GGC
Sbjct: 250 DPKHYTGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGC 290
>gi|224124910|ref|XP_002319454.1| predicted protein [Populus trichocarpa]
gi|222857830|gb|EEE95377.1| predicted protein [Populus trichocarpa]
Length = 507
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 216/342 (63%), Positives = 263/342 (76%), Gaps = 9/342 (2%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSI 62
++ K+ C LL + GL RIGLKKR DL A + +EG + +
Sbjct: 5 ILLKAFCLWALTCFLL-----PASSNGLVRIGLKKRHLDLQTIKDAIIARQEG---KAGV 56
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
S +LG S D DI+ LKNY+DAQY GEIGIG+PPQNFTV+FDTGSSNLWVPSSKCY
Sbjct: 57 GASSRVHDLGSS-DGDIIPLKNYLDAQYLGEIGIGSPPQNFTVVFDTGSSNLWVPSSKCY 115
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
FSIACYFHSKY+S RSSTY KNG +IHYG+G++SGFFS+D+V++GDLVVKDQ F+EAT
Sbjct: 116 FSIACYFHSKYKSSRSSTYTKNGNFCEIHYGSGSVSGFFSQDNVQVGDLVVKDQVFVEAT 175
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
+E SL+F+L KFDGILGLGFQEISVG VP+WYNM+ Q LV++ VFSFW NRN + +EGG
Sbjct: 176 KEGSLSFILGKFDGILGLGFQEISVGNVVPLWYNMIQQDLVDDEVFSFWLNRNPEAKEGG 235
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
E+VFGG+DP H+KG+HTYVPVTQKGYWQ +MGD +I +TG C GGCAAI DSGTSLLA
Sbjct: 236 ELVFGGVDPKHFKGKHTYVPVTQKGYWQINMGDFLIGKHSTGLCEGGCAAIVDSGTSLLA 295
Query: 303 GPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
GPT IIT++NHAIGA G+VS ECK VVS YG+ I ++++ V
Sbjct: 296 GPTPIITEINHAIGAEGLVSAECKEVVSHYGDLIWELIISGV 337
>gi|110162110|emb|CAL07969.1| aspartic proteinase [Cynara cardunculus]
Length = 506
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/326 (65%), Positives = 253/326 (77%), Gaps = 7/326 (2%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRK-YSLRGNLGESGDAD 78
P FS NGGL R+GLKKRK D +++ A G + RK + R L SG
Sbjct: 20 PTAFSVSNGGLLRVGLKKRKVDRLDQLRAH-----GVHMLGNARKDFGFRRTLRVSGSG- 73
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
IVAL N D Y+GEIGIGTPPQNF VIFDTGSS+LWVPSSKCY S+AC H +Y SG S
Sbjct: 74 IVALTNDRDTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSSKCYTSLACVIHPRYESGDS 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STYK+NG +A I YGTGAI GF+S+D V++GDLVV+ Q+FIE T E FL FDGIL
Sbjct: 134 STYKRNGTTASIQYGTGAIVGFYSQDSVEVGDLVVEQQDFIETTEEDDTVFLARDFDGIL 193
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGFQEIS GKAVPVWYNMVNQGLV E VFSFW NRN DEEEGGE+VFGG+DP+H++G H
Sbjct: 194 GLGFQEISAGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNH 253
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVPVT+KGYWQF+MGDV+I +++GFCAGGCAAIADSGTSL+AGPT IITQ+N AIGA
Sbjct: 254 TYVPVTRKGYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSLIAGPTAIITQINQAIGAK 313
Query: 319 GIVSQECKAVVSQYGEEIINMLLAKV 344
G+++Q+CK +VSQYG+ +I ML ++V
Sbjct: 314 GVLNQQCKTLVSQYGKNMIQMLTSEV 339
>gi|75338567|sp|Q9XFX4.1|CARDB_CYNCA RecName: Full=Procardosin-B; Contains: RecName: Full=Cardosin-B
heavy chain; AltName: Full=Cardosin-B 34 kDa subunit;
Contains: RecName: Full=Cardosin-B light chain; AltName:
Full=Cardosin-B 14 kDa subunit; Flags: Precursor
gi|4582534|emb|CAB40349.1| preprocardosin B [Cynara cardunculus]
Length = 506
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/336 (62%), Positives = 253/336 (75%), Gaps = 7/336 (2%)
Query: 20 PVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRK-YSLRGNLGESGDAD 78
P FS NGGL R+GLKKRK D +++ A G + RK + R L +SG
Sbjct: 20 PTAFSVSNGGLLRVGLKKRKVDRLDQLRAH-----GVHMLGNARKDFGFRRTLSDSGSG- 73
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
IVAL N D Y+GEIGIGTPPQNF VIFDTGSS+LWVPS+KC S+AC H +Y SG S
Sbjct: 74 IVALTNDRDTAYYGEIGIGTPPQNFAVIFDTGSSDLWVPSTKCDTSLACVIHPRYDSGDS 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STYK NG +A I YGTGAI GF+S+D V++GDLVV+ Q+FIE T E FL ++FDGIL
Sbjct: 134 STYKGNGTTASIQYGTGAIVGFYSQDSVEVGDLVVEHQDFIETTEEDDTVFLKSEFDGIL 193
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGFQEIS GKAVPVWYNMVNQGLV E VFSFW NRN DEEEGGE+VFGG+DP+H++G H
Sbjct: 194 GLGFQEISAGKAVPVWYNMVNQGLVEEAVFSFWLNRNVDEEEGGELVFGGVDPNHFRGNH 253
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVPVT+KGYWQF+MGDV+I +++GFCAGGCAAIADSGTS AGPT IITQ+N AIGA
Sbjct: 254 TYVPVTRKGYWQFEMGDVLIGDKSSGFCAGGCAAIADSGTSFFAGPTAIITQINQAIGAK 313
Query: 319 GIVSQECKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
G+++Q+CK +V QYG+ +I ML ++V H K
Sbjct: 314 GVLNQQCKTLVGQYGKNMIQMLTSEVQPDKICSHMK 349
>gi|359487701|ref|XP_002276363.2| PREDICTED: aspartic proteinase oryzasin-1-like [Vitis vinifera]
gi|296089851|emb|CBI39670.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/337 (62%), Positives = 261/337 (77%), Gaps = 5/337 (1%)
Query: 8 ITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
+ A F L L+ P++ +G + RIGLKKR D NN AR+ +G+ + KY
Sbjct: 6 VWAAFCLWALICPLLPVYSHGSV-RIGLKKRPLDFNNMRTARIAQMQGKIGGGVMSKY-- 62
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
+ + D + V+LKNY+DAQYFGEIGIGTPPQNFTV+FDTGSSNLWVPSSKCYFSIAC
Sbjct: 63 --HGFDDPDGEFVSLKNYLDAQYFGEIGIGTPPQNFTVVFDTGSSNLWVPSSKCYFSIAC 120
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
+FH+KY++ SSTY K G+ +IHYG+G+ISGFFS+D+V++G LVVKDQ FIEATRE SL
Sbjct: 121 FFHNKYKARLSSTYTKIGRPGEIHYGSGSISGFFSQDNVEVGSLVVKDQVFIEATREGSL 180
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
TF LAKFDGI+GLGFQ ISVG A PVW M+ QGL++E +FSFW NRN + EGGEIVFG
Sbjct: 181 TFALAKFDGIMGLGFQGISVGNATPVWSTMLQQGLLHEELFSFWLNRNPNANEGGEIVFG 240
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G+D H++G+HT+VPVTQ GYWQF MGD +I QTTG C GGC+AI DSGTSL+AGPT +
Sbjct: 241 GVDKRHFRGKHTFVPVTQAGYWQFRMGDFLISNQTTGVCEGGCSAIVDSGTSLIAGPTLV 300
Query: 308 ITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
+TQ+NHAIGA GIVS ECK VVSQYG + ++L++ V
Sbjct: 301 VTQINHAIGAEGIVSMECKEVVSQYGNMMWDLLVSGV 337
>gi|357131833|ref|XP_003567538.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 503
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 204/319 (63%), Positives = 247/319 (77%), Gaps = 4/319 (1%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ RI L K++ D AA+L ++ R+ Y G D DIV L NY+D
Sbjct: 27 GVLRINLSKKRLDKEALTAAKLARQQRNVLRSGDGSYRYLG----VSDDDIVPLDNYLDT 82
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QY+GEIG+GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY H KY+S +SSTYKKNG++
Sbjct: 83 QYYGEIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHHKYKSTKSSTYKKNGETC 142
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YG+G+I+GFFSED V +GDLVVK+Q+FIE TRE S +F++ KFDGILGLGF EISVG
Sbjct: 143 TISYGSGSIAGFFSEDSVLVGDLVVKNQKFIETTREASPSFIIGKFDGILGLGFPEISVG 202
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PVW +M Q L+ + +FSFW NR+ D GGE+VFGG+D HYKG+HTYVPVT+KGY
Sbjct: 203 SAPPVWQSMQEQKLIAKDIFSFWLNRDPDAPTGGELVFGGVDQKHYKGKHTYVPVTRKGY 262
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQFDMGD++I GQ+TGFCAGGCAAI DSGTSLLAGPTTI+ QVNHAIGA GI+S ECK V
Sbjct: 263 WQFDMGDLLIGGQSTGFCAGGCAAIVDSGTSLLAGPTTIVAQVNHAIGAEGIISMECKEV 322
Query: 329 VSQYGEEIINMLLAKVSAK 347
V +YGE I+ +L+A+ +
Sbjct: 323 VREYGEMILELLVAQTRPQ 341
>gi|357134751|ref|XP_003568979.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 498
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 208/327 (63%), Positives = 254/327 (77%), Gaps = 11/327 (3%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
++ ++P GL RI L K+ + AA+L +++ +S I S ++DIV
Sbjct: 21 LLHASPPDGLLRINLNKKSLNYEALNAAKL-ARQQDSVHLKI----------SSSNSDIV 69
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L +Y++ QYFG IG+GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY H KY+S +SST
Sbjct: 70 PLVDYLNTQYFGVIGVGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHHKYKSSKSST 129
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G+SA I YG+GAISGFFS D+V +GDLVVK Q+FIE TRE S TF++ KFDGILGL
Sbjct: 130 YKADGESAKITYGSGAISGFFSNDNVLVGDLVVKKQKFIETTRETSATFIIGKFDGILGL 189
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF EISVGKA PVW +M Q L+ + VFSFW NRNAD GGE+VFGG+D +HYKG HTY
Sbjct: 190 GFPEISVGKAPPVWMSMQKQKLLADDVFSFWLNRNADATSGGELVFGGVDSNHYKGNHTY 249
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF+MGD++IDGQ+TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI
Sbjct: 250 VPVSRKGYWQFNMGDLLIDGQSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGI 309
Query: 321 VSQECKAVVSQYGEEIINMLLAKVSAK 347
+S ECK VVSQYGE I+++LLA+ +
Sbjct: 310 ISTECKEVVSQYGEMILDLLLAQTEPQ 336
>gi|50540937|gb|AAT77954.1| Asp [Solanum tuberosum]
Length = 497
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/339 (63%), Positives = 257/339 (75%), Gaps = 27/339 (7%)
Query: 14 LC--LLLFPVVFST---PNGGLYRIGLKKRKFDLNNRVAAR---LDSKEGESFRTSIRKY 65
LC LLL+ + S +G L RIGLKK + D+N+ AAR L + G+ +K
Sbjct: 6 LCAALLLWAITCSALPASSGDLLRIGLKKHRLDVNSIKAARVAKLQDRYGKHVNGIEKKS 65
Query: 66 SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 125
S D DIV LKNY+DAQY+GEIGIG+PPQ F VIFDTGSSNLWVPSSKCYFSI
Sbjct: 66 S-------DSDIDIVPLKNYLDAQYYGEIGIGSPPQKFKVIFDTGSSNLWVPSSKCYFSI 118
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC+ H ++G+S I Y TG+ISG FS D+V++GDLVVKDQ FIEATREP
Sbjct: 119 ACWIH------------RDGESCSIRYETGSISGHFSMDNVQVGDLVVKDQVFIEATREP 166
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
S+TF++AKFDGILGLGFQEISVG PVWYNMV QGLV EPVFSFWFNR+A+ +EGGE+V
Sbjct: 167 SITFIVAKFDGILGLGFQEISVGNTTPVWYNMVGQGLVKEPVFSFWFNRDANAKEGGELV 226
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG+DP H+KG HTYVP+TQKGYWQF+MGD +I +TG+CAGGCAAI DSGTSLLAGPT
Sbjct: 227 FGGVDPKHFKGNHTYVPLTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPT 286
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
TI+ Q+NHAIGA GIVS ECK +VSQYGE I ++L++ V
Sbjct: 287 TIVAQINHAIGAEGIVSMECKTIVSQYGEMIWDLLVSGV 325
>gi|326510801|dbj|BAJ91748.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/320 (64%), Positives = 250/320 (78%), Gaps = 13/320 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLGESGDADIVALKNYMD 87
GL RI L KR + AA+ + R+Y +LR G S D+DIV L +Y++
Sbjct: 29 GLLRINLNKRSLTHKSLAAAK-----------AARQYGALRLKSGNS-DSDIVPLVDYLN 76
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
QY+G IG+GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY H KYRS RS+TYK +G++
Sbjct: 77 TQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHPKYRSSRSTTYKADGEN 136
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+GAISGFFS D+V +GDLVVK+Q+FIEATRE S++F+L KFDGILGLG+ +ISV
Sbjct: 137 CKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATRETSVSFILGKFDGILGLGYPDISV 196
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
GKA PVW +M Q L+ + VFSFW NR++D GGE+VFGGMDP HYKG HTYVPV++KG
Sbjct: 197 GKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPHHYKGNHTYVPVSRKG 256
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI+S ECK
Sbjct: 257 YWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKE 316
Query: 328 VVSQYGEEIINMLLAKVSAK 347
VVSQYGE I+ ML+A+ +
Sbjct: 317 VVSQYGEMILEMLIAQTQPQ 336
>gi|168029783|ref|XP_001767404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681300|gb|EDQ67728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 204/315 (64%), Positives = 249/315 (79%), Gaps = 9/315 (2%)
Query: 32 RIGLKKRKFDLNN-RVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQY 90
RI LKK+ L + R AA + ++F S + LR GE DIVAL NY+DAQY
Sbjct: 29 RIALKKKPVTLQSVRNAASRTIQRAKTFTRS--EDELRD--GE----DIVALNNYLDAQY 80
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADI 150
FGEIGIG+PPQ F VIFDTGSSNLWVPS+KCY S+ACYFH +Y+SG+SSTYK++G S I
Sbjct: 81 FGEIGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAI 140
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YGTG++ GF S+D V +GDL VK Q F EAT+EP LTF++AKFDGILGLGF+EISV +
Sbjct: 141 QYGTGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRV 200
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
P WYNM++QGLV EPVFSFW NRN DE GGE+V GG+DP H+KGEH Y PVT+KGYWQ
Sbjct: 201 TPPWYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQ 260
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVS 330
FD+GDV I+G+TTGFCA GC AIADSGTSLLAGP+ I+ ++N AIGATG+VSQ+CK VV
Sbjct: 261 FDLGDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQ 320
Query: 331 QYGEEIINMLLAKVS 345
QYG++I+ MLLA+++
Sbjct: 321 QYGDQIVEMLLAQMN 335
>gi|326494022|dbj|BAJ85473.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511208|dbj|BAJ87618.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 430 bits (1105), Expect = e-118, Method: Compositional matrix adjust.
Identities = 207/320 (64%), Positives = 250/320 (78%), Gaps = 13/320 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLGESGDADIVALKNYMD 87
GL RI L KR + AA+ + R+Y +LR G S D+DIV L +Y++
Sbjct: 29 GLLRINLNKRSLTHESLAAAK-----------AARQYGALRLKSGNS-DSDIVPLVDYLN 76
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
QY+G IG+GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY H KYRS RS+TYK +G++
Sbjct: 77 TQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYLHPKYRSSRSTTYKADGEN 136
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+GAISGFFS D+V +GDLVVK+Q+FIEATRE S++F+L KFDGILGLG+ +ISV
Sbjct: 137 CKITYGSGAISGFFSNDNVLVGDLVVKNQKFIEATRETSVSFILGKFDGILGLGYPDISV 196
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
GKA PVW +M Q L+ + VFSFW NR++D GGE+VFGGMDP HYKG HTYVPV++KG
Sbjct: 197 GKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPHHYKGNHTYVPVSRKG 256
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI+S ECK
Sbjct: 257 YWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKE 316
Query: 328 VVSQYGEEIINMLLAKVSAK 347
VVSQYGE I+ ML+A+ +
Sbjct: 317 VVSQYGEMILEMLIAQTQPQ 336
>gi|40641523|emb|CAE52913.1| putative vacuaolar aspartic proteinase [Physcomitrella patens]
Length = 504
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 204/315 (64%), Positives = 249/315 (79%), Gaps = 9/315 (2%)
Query: 32 RIGLKKRKFDLNN-RVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQY 90
RI LKK+ L + R AA + ++F S + LR GE DIVAL NY+DAQY
Sbjct: 29 RIALKKKPVTLQSVRNAASRTIQRAKTFTRS--EDELRD--GE----DIVALNNYLDAQY 80
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADI 150
FGEIGIG+PPQ F VIFDTGSSNLWVPS+KCY S+ACYFH +Y+SG+SSTYK++G S I
Sbjct: 81 FGEIGIGSPPQPFAVIFDTGSSNLWVPSAKCYLSLACYFHHRYKSGKSSTYKEDGTSFAI 140
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YGTG++ GF S+D V +GDL VK Q F EAT+EP LTF++AKFDGILGLGF+EISV +
Sbjct: 141 QYGTGSMEGFLSQDDVTLGDLTVKGQVFAEATKEPGLTFVVAKFDGILGLGFKEISVNRV 200
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
P WYNM++QGLV EPVFSFW NRN DE GGE+V GG+DP H+KGEH Y PVT+KGYWQ
Sbjct: 201 TPPWYNMLDQGLVKEPVFSFWLNRNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQ 260
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVS 330
FD+GDV I+G+TTGFCA GC AIADSGTSLLAGP+ I+ ++N AIGATG+VSQ+CK VV
Sbjct: 261 FDLGDVTINGRTTGFCANGCTAIADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKMVVQ 320
Query: 331 QYGEEIINMLLAKVS 345
QYG++I+ MLLA+++
Sbjct: 321 QYGDQIVEMLLAQMN 335
>gi|413946823|gb|AFW79472.1| hypothetical protein ZEAMMB73_587615 [Zea mays]
Length = 488
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 208/348 (59%), Positives = 258/348 (74%), Gaps = 7/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + A F++ L ++ + GL RI L K++ D AA+L KE +
Sbjct: 42 MGQTHLLLLACFWV-LSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKE-----S 95
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++R+ + DIV L NY+D QYFG+I IGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 96 NLRRSVGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSK 155
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY H +Y+S +S TY KNG+S I YG+G I+GFFSED+V +G+LVV++Q+FIE
Sbjct: 156 CYFSIACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIE 215
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
TRE S TF++ KFDGILGLGF EISVG A P+W +M Q LV + VFSFW NR+ D
Sbjct: 216 TTRETSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASS 275
Query: 241 -GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGE+VFGG+DP HYKG+HTYVPVT+KGYWQFDMGD++I G +TGFCAGGCAAI DSGTS
Sbjct: 276 GGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTS 335
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTTI+ QVNHAIGA GI+S ECK VVS+YGE I+ +L+++ S +
Sbjct: 336 LLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQ 383
>gi|413946821|gb|AFW79470.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946822|gb|AFW79471.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
Length = 545
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 208/348 (59%), Positives = 258/348 (74%), Gaps = 7/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + A F++ L ++ + GL RI L K++ D AA+L KE +
Sbjct: 42 MGQTHLLLLACFWV-LSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKE-----S 95
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++R+ + DIV L NY+D QYFG+I IGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 96 NLRRSVGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSK 155
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY H +Y+S +S TY KNG+S I YG+G I+GFFSED+V +G+LVV++Q+FIE
Sbjct: 156 CYFSIACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIE 215
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
TRE S TF++ KFDGILGLGF EISVG A P+W +M Q LV + VFSFW NR+ D
Sbjct: 216 TTRETSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASS 275
Query: 241 -GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGE+VFGG+DP HYKG+HTYVPVT+KGYWQFDMGD++I G +TGFCAGGCAAI DSGTS
Sbjct: 276 GGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTS 335
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTTI+ QVNHAIGA GI+S ECK VVS+YGE I+ +L+++ S +
Sbjct: 336 LLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQ 383
>gi|226497182|ref|NP_001152501.1| retrotransposon protein SINE subclass precursor [Zea mays]
gi|195624058|gb|ACG33859.1| retrotransposon protein SINE subclass [Zea mays]
gi|195656921|gb|ACG47928.1| retrotransposon protein SINE subclass [Zea mays]
gi|413946824|gb|AFW79473.1| retrotransposon protein SINE subclass isoform 1 [Zea mays]
gi|413946825|gb|AFW79474.1| retrotransposon protein SINE subclass isoform 2 [Zea mays]
gi|413946826|gb|AFW79475.1| retrotransposon protein SINE subclass isoform 3 [Zea mays]
Length = 504
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 208/348 (59%), Positives = 258/348 (74%), Gaps = 7/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + A F++ L ++ + GL RI L K++ D AA+L KE +
Sbjct: 1 MGQTHLLLLACFWV-LSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKE-----S 54
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++R+ + DIV L NY+D QYFG+I IGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 55 NLRRSVGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSK 114
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY H +Y+S +S TY KNG+S I YG+G I+GFFSED+V +G+LVV++Q+FIE
Sbjct: 115 CYFSIACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIE 174
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
TRE S TF++ KFDGILGLGF EISVG A P+W +M Q LV + VFSFW NR+ D
Sbjct: 175 TTRETSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASS 234
Query: 241 -GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGE+VFGG+DP HYKG+HTYVPVT+KGYWQFDMGD++I G +TGFCAGGCAAI DSGTS
Sbjct: 235 GGGELVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTS 294
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTTI+ QVNHAIGA GI+S ECK VVS+YGE I+ +L+++ S +
Sbjct: 295 LLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQ 342
>gi|194706186|gb|ACF87177.1| unknown [Zea mays]
Length = 504
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 207/348 (59%), Positives = 257/348 (73%), Gaps = 7/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + A F++ L ++ + GL RI L K++ D AA+L KE +
Sbjct: 1 MGQTHLLLLACFWV-LSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKE-----S 54
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++R+ + DIV L NY+D QYFG+I IGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 55 NLRRSVGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSK 114
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY H +Y+S +S TY KNG+S I YG+G I+GFFSED+V +G+LVV++Q+FIE
Sbjct: 115 CYFSIACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIE 174
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
TRE S TF++ KFDGILGLGF EISVG A P+W +M Q LV + VFSFW NR+ D
Sbjct: 175 TTRETSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASS 234
Query: 241 -GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGE+VFGG+DP HYKG+HTYVP T+KGYWQFDMGD++I G +TGFCAGGCAAI DSGTS
Sbjct: 235 GGGELVFGGVDPKHYKGDHTYVPATRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTS 294
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTTI+ QVNHAIGA GI+S ECK VVS+YGE I+ +L+++ S +
Sbjct: 295 LLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQ 342
>gi|115461973|ref|NP_001054586.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|78099760|sp|P42211.2|ASPRX_ORYSJ RecName: Full=Aspartic proteinase; Flags: Precursor
gi|46485798|gb|AAS98423.1| aspartic proteinase [Oryza sativa Japonica Group]
gi|113578137|dbj|BAF16500.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|215694423|dbj|BAG89416.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 496
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 196/327 (59%), Positives = 251/327 (76%), Gaps = 12/327 (3%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
++ + G R+ L K++ D + AA+L + +T S D+D V
Sbjct: 20 LLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRLLKTG------------SSDSDPV 67
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L +Y++ QY+G IG+G+PPQNFTVIFDTGSSNLWVPS+KCYFSIACY HS+Y S +SS+
Sbjct: 68 PLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYNSKKSSS 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G++ I YG+GAISGFFS+D+V +GDLVVK+Q+FIEATRE S+TF++ KFDGILGL
Sbjct: 128 YKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFIIGKFDGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ EISVGKA P+W +M Q L+ + VFSFW NR+ D GGE+VFGGMDP HYKG+HTY
Sbjct: 188 GYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI
Sbjct: 248 VPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGI 307
Query: 321 VSQECKAVVSQYGEEIINMLLAKVSAK 347
+S ECK VVS+YGE I+N+L+A+ +
Sbjct: 308 ISTECKEVVSEYGEMILNLLIAQTDPQ 334
>gi|219887925|gb|ACL54337.1| unknown [Zea mays]
Length = 504
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 208/348 (59%), Positives = 257/348 (73%), Gaps = 7/348 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG + A F++ L ++ + GL RI L K++ D AA+L KE +
Sbjct: 1 MGQTHLLLLACFWV-LSTCSLLLDASSDGLLRINLNKKRLDKEALTAAKLAKKE-----S 54
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++R+ + DIV L NY+D QYFG+I IGTPPQNFTVIFDTGSSNLWVPSSK
Sbjct: 55 NLRRSVGADQYLSASTDDIVPLDNYLDTQYFGQISIGTPPQNFTVIFDTGSSNLWVPSSK 114
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFSIACY H +Y+S +S TY KNG+S I YG+G I+GFFSED+V +G+LVV++Q+FIE
Sbjct: 115 CYFSIACYLHHRYKSTKSKTYTKNGESCTITYGSGQIAGFFSEDNVLVGNLVVQNQKFIE 174
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
TRE S TF++ KFDGILGLGF EISVG A P+W +M Q LV + VFSFW NR+ D
Sbjct: 175 TTRETSPTFIIGKFDGILGLGFPEISVGGAPPIWQSMKQQKLVAKDVFSFWLNRDPDASS 234
Query: 241 -GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGE VFGG+DP HYKG+HTYVPVT+KGYWQFDMGD++I G +TGFCAGGCAAI DSGTS
Sbjct: 235 GGGEPVFGGVDPKHYKGDHTYVPVTRKGYWQFDMGDLIIGGHSTGFCAGGCAAIVDSGTS 294
Query: 300 LLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
LLAGPTTI+ QVNHAIGA GI+S ECK VVS+YGE I+ +L+++ S +
Sbjct: 295 LLAGPTTIVAQVNHAIGAEGIISTECKEVVSEYGEMILELLISQTSPQ 342
>gi|222630120|gb|EEE62252.1| hypothetical protein OsJ_17039 [Oryza sativa Japonica Group]
Length = 501
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 196/327 (59%), Positives = 251/327 (76%), Gaps = 12/327 (3%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
++ + G R+ L K++ D + AA+L + +T S D+D V
Sbjct: 20 LLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRLLKTG------------SSDSDPV 67
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L +Y++ QY+G IG+G+PPQNFTVIFDTGSSNLWVPS+KCYFSIACY HS+Y S +SS+
Sbjct: 68 PLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYNSKKSSS 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G++ I YG+GAISGFFS+D+V +GDLVVK+Q+FIEATRE S+TF++ KFDGILGL
Sbjct: 128 YKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFIIGKFDGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ EISVGKA P+W +M Q L+ + VFSFW NR+ D GGE+VFGGMDP HYKG+HTY
Sbjct: 188 GYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI
Sbjct: 248 VPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGI 307
Query: 321 VSQECKAVVSQYGEEIINMLLAKVSAK 347
+S ECK VVS+YGE I+N+L+A+ +
Sbjct: 308 ISTECKEVVSEYGEMILNLLIAQTDPQ 334
>gi|218143|dbj|BAA02242.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 496
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/327 (59%), Positives = 250/327 (76%), Gaps = 12/327 (3%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
++ + G R+ L K++ D + AA+L + +T S D+D V
Sbjct: 20 LLLHASSDGFLRVNLNKKRLDKEDLTAAKLAQQGNRLLKTG------------SSDSDPV 67
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L +Y++ QY+G IG+G+PPQNFTVIFDTGSSNLWVPS+KCYFSIACY HS+Y S +SS+
Sbjct: 68 PLVDYLNTQYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKCYFSIACYLHSRYNSKKSSS 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G++ I YG+GAISGFFS+D+V +GD VVK+Q+FIEATRE S+TF++ KFDGILGL
Sbjct: 128 YKADGETCKITYGSGAISGFFSKDNVLVGDQVVKNQKFIEATRETSVTFIIGKFDGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ EISVGKA P+W +M Q L+ + VFSFW NR+ D GGE+VFGGMDP HYKG+HTY
Sbjct: 188 GYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDPKHYKGDHTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI
Sbjct: 248 VPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGI 307
Query: 321 VSQECKAVVSQYGEEIINMLLAKVSAK 347
+S ECK VVS+YGE I+N+L+A+ +
Sbjct: 308 ISTECKEVVSEYGEMILNLLIAQTDPQ 334
>gi|114786427|gb|ABI78942.1| aspartic protease [Ipomoea batatas]
Length = 508
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 218/350 (62%), Positives = 268/350 (76%), Gaps = 16/350 (4%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSI 62
M +K + A L ++ V+ ++ L R+GLKK D N+ AA+ +G+ + +
Sbjct: 1 MAWKYLCASILLWVIACSVLPASSEK-LLRVGLKKNPLDFNSIKAAKAARVQGKCGKGAN 59
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
K LG+S D IV+LKNY+DAQY+GEI IG+PPQ FTVIFDTGSSNLWVPSSKCY
Sbjct: 60 NK------LGDS-DTGIVSLKNYLDAQYYGEISIGSPPQKFTVIFDTGSSNLWVPSSKCY 112
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
FSIACYFHSKY+S +SSTY K G S I YG+G+ISGF S+D+V +GDLVVKDQ FIE T
Sbjct: 113 FSIACYFHSKYKSSKSSTYTKIGTSCSITYGSGSISGFLSQDNVGVGDLVVKDQVFIETT 172
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
+EPSLTF+LAKFDG+LGLGFQEISV VPVWYNMV QGLV+EPVFSFW NR+ + EEGG
Sbjct: 173 KEPSLTFVLAKFDGLLGLGFQEISVEDVVPVWYNMVEQGLVDEPVFSFWLNRDTNAEEGG 232
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
E++FGG+DP+H+KG+HTYVPVTQKGYWQF+MGD +I +TGFC GGCAAI DSGTSLL
Sbjct: 233 ELIFGGVDPNHFKGKHTYVPVTQKGYWQFEMGDFLIGNSSTGFCEGGCAAIVDSGTSLLT 292
Query: 303 GPTT--------IITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
GPTT I+T++NHAIGA G+VS ECK +VSQYG I ++L++ V
Sbjct: 293 GPTTIVTEINHAIVTEINHAIGAEGVVSTECKEIVSQYGNMIWDLLVSGV 342
>gi|56182674|gb|AAV84086.1| aspartic proteinase 12 [Fagopyrum esculentum]
Length = 387
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 191/236 (80%), Positives = 221/236 (93%)
Query: 108 DTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVK 167
DTGSSNLWVPS+KCYFSIAC+FHSKY+S +S T+ KNG SA I YGTGAISGFFS D+VK
Sbjct: 1 DTGSSNLWVPSAKCYFSIACFFHSKYKSSKSITHVKNGTSAAIRYGTGAISGFFSRDNVK 60
Query: 168 IGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPV 227
IGDLVV++QEFIEATREPS+TF+ AKFDGILGLGFQEISVGKAVPVWYNM++QGL++EPV
Sbjct: 61 IGDLVVENQEFIEATREPSITFIAAKFDGILGLGFQEISVGKAVPVWYNMIDQGLISEPV 120
Query: 228 FSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA 287
FSFWFNRNA+EEEGGE+VFGG+DPDH++G+HTYVPVTQKGYWQFDM DV+IDG +TGFCA
Sbjct: 121 FSFWFNRNAEEEEGGELVFGGIDPDHFRGQHTYVPVTQKGYWQFDMDDVLIDGMSTGFCA 180
Query: 288 GGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
GGCAAIADSGTSLLAGP ++ Q+NHAIGATGIVSQECK VV++YG+EII MLL++
Sbjct: 181 GGCAAIADSGTSLLAGPMAVVAQINHAIGATGIVSQECKTVVAEYGKEIIEMLLSE 236
>gi|357511707|ref|XP_003626142.1| Aspartic proteinase [Medicago truncatula]
gi|355501157|gb|AES82360.1| Aspartic proteinase [Medicago truncatula]
Length = 504
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 205/337 (60%), Positives = 252/337 (74%), Gaps = 17/337 (5%)
Query: 14 LCLLLFPV-VFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN-- 70
CLL F + + + G+ RIGL+KR DL+N A ++ +R+ LR
Sbjct: 10 FCLLAFTCSLLPSFSFGMMRIGLQKRPLDLHNMDAFKM-----------VREQQLRSGRP 58
Query: 71 ---LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
+S D IV LKNYMDAQYFGEI IGTPPQ FTVIFDTGSSNLWVPSSKCYFS+AC
Sbjct: 59 MMLAHKSSDDAIVPLKNYMDAQYFGEIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLAC 118
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
Y H+ Y++ +S TY KNG S I YGTG+ISG+FS+D+VK+G VVK Q+FIEATRE SL
Sbjct: 119 YTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSL 178
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
+FL KFDGI GLGFQEISV +A+PVWYNM+ Q L+ E VFSFW N N + ++GGE+VFG
Sbjct: 179 SFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFG 238
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G+DP H+KG+HTYVPVT+KGYWQ +MGD I G +TG C GGCAAI DSGTSLLAGPT +
Sbjct: 239 GVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPV 298
Query: 308 ITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
+ ++NHAIGA G++S ECK VVSQYGE I ++L++ V
Sbjct: 299 VAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGV 335
>gi|73912435|dbj|BAE20414.1| aspartic proteinase [Triticum aestivum]
Length = 498
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 199/320 (62%), Positives = 249/320 (77%), Gaps = 13/320 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLGESGDADIVALKNYMD 87
GL RI L K+ + AA+ + R++ +LR G S D+DIV L +Y++
Sbjct: 29 GLLRINLNKKSLTHESLAAAK-----------AARQHDALRLKSGNS-DSDIVPLVDYLN 76
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
QY+G IG+GTPPQNFTVIFDTGSSNLWVPS+KCYFSIACY H KY+S +SSTYK +G++
Sbjct: 77 TQYYGVIGLGTPPQNFTVIFDTGSSNLWVPSAKCYFSIACYLHPKYKSSKSSTYKADGET 136
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+GAISGFFS D+V +GDLVVK+Q+FI TRE S++F++ KFDGILGLG+ +ISV
Sbjct: 137 CKITYGSGAISGFFSNDNVLVGDLVVKNQKFIGTTRETSVSFIVGKFDGILGLGYPDISV 196
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
GKA PVW +M Q L+ + VFSFW NR++D GGE+VFGGMDPDHYKG HTYVPV+++G
Sbjct: 197 GKAPPVWLSMQEQKLLADDVFSFWLNRDSDALSGGELVFGGMDPDHYKGNHTYVPVSRRG 256
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QVNHAIGA GI+S ECK
Sbjct: 257 YWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGAEGIISTECKE 316
Query: 328 VVSQYGEEIINMLLAKVSAK 347
VVSQYGE I+ +L+A+ +
Sbjct: 317 VVSQYGEMILELLIAQTQPQ 336
>gi|75267434|sp|Q9XFX3.1|CARDA_CYNCA RecName: Full=Procardosin-A; Contains: RecName: Full=Cardosin-A
intermediate form 35 kDa subunit; Contains: RecName:
Full=Cardosin-A heavy chain; AltName: Full=Cardosin-A 31
kDa subunit; Contains: RecName: Full=Cardosin-A
intermediate form 30 kDa subunit; Contains: RecName:
Full=Cardosin-A light chain; AltName: Full=Cardosin-A 15
kDa subunit; Flags: Precursor
gi|4581209|emb|CAB40134.1| preprocardosin A [Cynara cardunculus]
Length = 504
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 212/354 (59%), Positives = 252/354 (71%), Gaps = 9/354 (2%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG K+ FL LL P VFS + GL RIGLKKRK D +++ R EG + +
Sbjct: 1 MGTSIKANVLALFLFYLLSPTVFSVSDDGLIRIGLKKRKVDRIDQLRGRRALMEGNARK- 59
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
+ RG + +SG A +VAL N D YFGEIGIGTPPQ FTVIFDTGSS LWVPSSK
Sbjct: 60 ---DFGFRGTVRDSGSA-VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSK 115
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C S AC HS Y S SSTYK+NG I YGTG+I+GFFS+D V IGDLVVK+Q+FIE
Sbjct: 116 CINSKACRAHSMYESSDSSTYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIE 175
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT E FL FDGILGL FQ ISV PVWYNM+NQGLV E FSFW NRN DEEE
Sbjct: 176 ATDEADNVFLHRLFDGILGLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEE 231
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE+VFGG+DP+H++G+HTYVPVT + YWQF +GDV+I ++TGFCA GC A ADSGTSL
Sbjct: 232 GGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSL 291
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
L+GPT I+TQ+NHAIGA G+++Q+CK VVS+YG +II ML +K+ H K
Sbjct: 292 LSGPTAIVTQINHAIGANGVMNQQCKTVVSRYGRDIIEMLRSKIQPDKICSHMK 345
>gi|357511711|ref|XP_003626144.1| Aspartic proteinase [Medicago truncatula]
gi|355501159|gb|AES82362.1| Aspartic proteinase [Medicago truncatula]
Length = 426
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/320 (62%), Positives = 244/320 (76%), Gaps = 16/320 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN-----LGESGDADIVALKN 84
+ RIGL+KR DL+N A ++ +R+ LR +S D IV LKN
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKM-----------VREQQLRSGRPMMLAHKSSDDAIVPLKN 49
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDAQYFGEI IGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ACY H+ Y++ +S TY KN
Sbjct: 50 YMDAQYFGEIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKN 109
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G S I YGTG+ISG+FS+D+VK+G VVK Q+FIEATRE SL+FL KFDGI GLGFQE
Sbjct: 110 GTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQE 169
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV +A+PVWYNM+ Q L+ E VFSFW N N + ++GGE+VFGG+DP H+KG+HTYVPVT
Sbjct: 170 ISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVT 229
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+KGYWQ +MGD I G +TG C GGCAAI DSGTSLLAGPT ++ ++NHAIGA G++S E
Sbjct: 230 EKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVE 289
Query: 325 CKAVVSQYGEEIINMLLAKV 344
CK VVSQYGE I ++L++ V
Sbjct: 290 CKEVVSQYGELIWDLLVSGV 309
>gi|357511709|ref|XP_003626143.1| Aspartic proteinase [Medicago truncatula]
gi|355501158|gb|AES82361.1| Aspartic proteinase [Medicago truncatula]
Length = 478
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 200/320 (62%), Positives = 244/320 (76%), Gaps = 16/320 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN-----LGESGDADIVALKN 84
+ RIGL+KR DL+N A ++ +R+ LR +S D IV LKN
Sbjct: 1 MMRIGLQKRPLDLHNMDAFKM-----------VREQQLRSGRPMMLAHKSSDDAIVPLKN 49
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDAQYFGEI IGTPPQ FTVIFDTGSSNLWVPSSKCYFS+ACY H+ Y++ +S TY KN
Sbjct: 50 YMDAQYFGEIAIGTPPQTFTVIFDTGSSNLWVPSSKCYFSLACYTHNWYKAKKSKTYNKN 109
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G S I YGTG+ISG+FS+D+VK+G VVK Q+FIEATRE SL+FL KFDGI GLGFQE
Sbjct: 110 GTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEATREGSLSFLAGKFDGIFGLGFQE 169
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV +A+PVWYNM+ Q L+ E VFSFW N N + ++GGE+VFGG+DP H+KG+HTYVPVT
Sbjct: 170 ISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGGELVFGGVDPKHFKGKHTYVPVT 229
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+KGYWQ +MGD I G +TG C GGCAAI DSGTSLLAGPT ++ ++NHAIGA G++S E
Sbjct: 230 EKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVE 289
Query: 325 CKAVVSQYGEEIINMLLAKV 344
CK VVSQYGE I ++L++ V
Sbjct: 290 CKEVVSQYGELIWDLLVSGV 309
>gi|359487589|ref|XP_003633616.1| PREDICTED: aspartic proteinase-like [Vitis vinifera]
Length = 510
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 205/335 (61%), Positives = 260/335 (77%), Gaps = 5/335 (1%)
Query: 10 AGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
A F L L FP++ ++ +G L RIGLKK + D N AAR+ + +S ++ S+
Sbjct: 10 AAFCLWALTFPLLQASSDG-LVRIGLKKWRLDYNRIRAARMARRA-KSIGGVVK--SMYQ 65
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
LG+S D + V L+NYMDAQY+GEIGIGTPPQNFTV+FDTGS+NLWVPS+KC+FSIAC F
Sbjct: 66 GLGDS-DGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSIACLF 124
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HSKY S S+TY GK +IHYG+G+ISG FS+D+V++G + +K+Q FIEATRE SL F
Sbjct: 125 HSKYNSRLSTTYIDLGKEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREASLVF 184
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+L KFDGILGLGF+EI VG A PVWYN++ QGLV E +FSFW NR+ +GGEIVFGG+
Sbjct: 185 VLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGV 244
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D H+KG+HTY +TQKGYWQF+MG+ +I Q+TGFC GCAAI DSGTSL+AGPT I+T
Sbjct: 245 DKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVT 304
Query: 310 QVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
++NHAIGA GIVSQECK VVSQYG I ++L+++V
Sbjct: 305 EINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRV 339
>gi|356534977|ref|XP_003536026.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 207/316 (65%), Positives = 253/316 (80%), Gaps = 2/316 (0%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL RIGLKKR DL++ AAR+ ++ R + Y +LG+ D IV LKNY+DA
Sbjct: 27 GLMRIGLKKRDLDLDSIRAARMVREKPRLGRPVLGAYD--HDLGKPIDEGIVPLKNYLDA 84
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QY+GEIGIGTPPQ F VIFDTGSSNLWVPSSKCYFSIACY H Y+S +S TY KNG S
Sbjct: 85 QYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFSIACYTHHWYKSKKSKTYTKNGTSC 144
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YG+G+ISGFFS+DHVK+GD+VVK+Q+FIEATRE SL+F+LAKFDG+LGLGFQEISV
Sbjct: 145 KIGYGSGSISGFFSKDHVKVGDVVVKNQDFIEATREGSLSFVLAKFDGLLGLGFQEISVE 204
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
AVPVWYNMV Q LV+E VFSFW N + ++GGE++FGG+DP H+KG+H YVPVT+KGY
Sbjct: 205 NAVPVWYNMVKQNLVSEQVFSFWLNGDPKAKDGGELIFGGIDPKHFKGDHIYVPVTKKGY 264
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQ +MGD I G +TG C GGCAAI DSGTSLLAGPTT++T++NHAIGA G++S ECK V
Sbjct: 265 WQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTTVVTEINHAIGAEGVLSVECKEV 324
Query: 329 VSQYGEEIINMLLAKV 344
VS+YGE + ++L++ V
Sbjct: 325 VSEYGELLWDLLVSGV 340
>gi|356505735|ref|XP_003521645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 508
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 199/316 (62%), Positives = 247/316 (78%), Gaps = 2/316 (0%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ RIGLKKR D+++ AAR + S R+ + + +G+S D+V LKNYMDA
Sbjct: 26 GILRIGLKKRPLDIDSINAARKAREGLRSGRSMMGAHDQY--IGKSKGEDLVPLKNYMDA 83
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIGIGTPPQ FTV+FDTGSSNLWVPSSKCYF++ACY H+ Y + +S T+ KNG S
Sbjct: 84 QYFGEIGIGTPPQPFTVVFDTGSSNLWVPSSKCYFTLACYTHNWYTAKKSKTHAKNGTSC 143
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+ISGFFS+D+VK+G VVK Q+FIEAT E SLTFL AKFDGILGLGFQEISV
Sbjct: 144 KISYGTGSISGFFSQDNVKVGSAVVKHQDFIEATHEGSLTFLSAKFDGILGLGFQEISVE 203
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
+VPVWY MV Q L++E VFSFW N + + ++GGE+VFGG+DP H+KG HTYVP+T+KGY
Sbjct: 204 NSVPVWYKMVEQKLISEKVFSFWLNGDPNAKKGGELVFGGVDPKHFKGNHTYVPITEKGY 263
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQ ++GD I G +TG C GGCAAI DSGTSLLAGPT ++ ++NHAIGA G++S ECK V
Sbjct: 264 WQIEIGDFFIGGVSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEV 323
Query: 329 VSQYGEEIINMLLAKV 344
VSQYGE I ++L++ V
Sbjct: 324 VSQYGELIWDLLVSGV 339
>gi|351725345|ref|NP_001237345.1| aspartic proteinase 2 [Glycine max]
gi|15425751|dbj|BAB64296.1| aspartic proteinase 2 [Glycine max]
Length = 508
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 199/316 (62%), Positives = 246/316 (77%), Gaps = 2/316 (0%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ RIGLKKR DL++ AAR + S R + + +G+S DIV LKNY+DA
Sbjct: 26 GILRIGLKKRPLDLDSINAARKAREGLRSVRPMMGAHDQF--IGKSKGEDIVPLKNYLDA 83
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QYFGEIGIG PPQ FTV+FDTGSSNLWVPSSKCYF++ACY H+ Y + +S T+ KNG S
Sbjct: 84 QYFGEIGIGIPPQPFTVVFDTGSSNLWVPSSKCYFTLACYTHNWYTAKKSKTHVKNGTSC 143
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I+YGTG+ISGFFS+D+VK+G VVK Q+FIEAT E SLTFL AKFDGILGLGFQEISV
Sbjct: 144 KINYGTGSISGFFSQDNVKVGSAVVKHQDFIEATHEGSLTFLSAKFDGILGLGFQEISVE 203
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
AVPVW+ MV Q L++E VFSFW N + + ++GGE+VFGG+DP H+KG HTYVP+T+KGY
Sbjct: 204 NAVPVWFKMVEQKLISEKVFSFWLNGDPNAKKGGELVFGGVDPKHFKGNHTYVPITEKGY 263
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQ +MGD + G +TG C GGCAAI DSGTSLLAGPT ++ ++NHAIGA G++S ECK V
Sbjct: 264 WQIEMGDFFVGGVSTGVCEGGCAAIVDSGTSLLAGPTPVVAEINHAIGAEGVLSVECKEV 323
Query: 329 VSQYGEEIINMLLAKV 344
VSQYGE I ++L++ V
Sbjct: 324 VSQYGELIWDLLVSGV 339
>gi|255543036|ref|XP_002512581.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223548542|gb|EEF50033.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 494
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/350 (59%), Positives = 258/350 (73%), Gaps = 24/350 (6%)
Query: 10 AGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
A F L L + ++ NG L +I LKKR DL++ AAR +E +T I S+
Sbjct: 9 AAFCLWALTCSFLPASSNG-LMKISLKKRPLDLDSINAARTARQER---KTRIAASSML- 63
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
S D D++ LKNY+D QYFGEI IG+PPQ FTVIFDTGSSNLW+PS+KCYFS+ACYF
Sbjct: 64 ---HSPDPDMIPLKNYLDTQYFGEISIGSPPQTFTVIFDTGSSNLWIPSAKCYFSLACYF 120
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS+Y+S RS+TY +NG + I YGTG+I GFFS+D V++G+LVV++Q FIEATRE SLTF
Sbjct: 121 HSRYKSSRSTTYIRNGTTCKIRYGTGSIVGFFSQDTVEVGNLVVRNQVFIEATREGSLTF 180
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LAKFDGI GLGFQEISVG AVPVWYNMV QGLV +PVFSFW N + D +EGGE+VFGG+
Sbjct: 181 VLAKFDGIFGLGFQEISVGDAVPVWYNMVQQGLVGDPVFSFWLNNDPDAKEGGELVFGGV 240
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D HY+G+HTYVPVTQKGYWQF+MGD +I +T DSGTSLLAGPT I+
Sbjct: 241 DEKHYRGKHTYVPVTQKGYWQFNMGDFIIGNHST-----------DSGTSLLAGPTPIVA 289
Query: 310 QVNHAIGATGIVSQECKAVVSQYGEEIINMLLA-----KVSAKLSLRHYK 354
++NHAIGA GIVS ECK VVSQYG I ++L++ KV ++L L ++
Sbjct: 290 EINHAIGAEGIVSAECKEVVSQYGNLIWDLLISGVQPGKVCSQLGLCTFR 339
>gi|356575293|ref|XP_003555776.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/321 (65%), Positives = 253/321 (78%), Gaps = 12/321 (3%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN---LGESGDADIVALK 83
GL RIGLKKR DL++ AAR+ R ++R + L N +G+ D IV LK
Sbjct: 26 GLLRIGLKKRDLDLDSIRAARM-------VRENLRLGRPVLGANDQYIGKPTDEGIVPLK 78
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
NY+DAQY+GEIGIGTPPQ F VIFDTGSSNLWVPSSKCYFSIACY H Y+S +S TY K
Sbjct: 79 NYLDAQYYGEIGIGTPPQKFNVIFDTGSSNLWVPSSKCYFSIACYTHHWYKSKKSKTYTK 138
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG S I YG+G+ISGFFS+DHVK+GD+VVK+Q+FIEATRE SL+F+LAKFDG+LGLGFQ
Sbjct: 139 NGTSCKIRYGSGSISGFFSKDHVKVGDVVVKNQDFIEATREGSLSFVLAKFDGLLGLGFQ 198
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
EISV AVPVWYNMV Q LV+E VFSFW N + + GGE+VFGG+DP H+KGEH YVPV
Sbjct: 199 EISVENAVPVWYNMVKQNLVSEQVFSFWLNGDPKVKNGGELVFGGVDPKHFKGEHIYVPV 258
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
T+KGYWQ +MGD I G +TG C GGCAAI DSGTSLLAGPTT++T++NHAIGA G++S
Sbjct: 259 TKKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLAGPTTVVTEINHAIGAEGVLSV 318
Query: 324 ECKAVVSQYGEEIINMLLAKV 344
ECK VVS+YGE + ++L++ V
Sbjct: 319 ECKEVVSEYGELLWDLLVSGV 339
>gi|115436054|ref|NP_001042785.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|8467954|dbj|BAA96578.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113532316|dbj|BAF04699.1| Os01g0290000 [Oryza sativa Japonica Group]
gi|215694819|dbj|BAG90010.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215701475|dbj|BAG92899.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222618242|gb|EEE54374.1| hypothetical protein OsJ_01384 [Oryza sativa Japonica Group]
Length = 495
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/332 (62%), Positives = 246/332 (74%), Gaps = 12/332 (3%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
L +L V+ GL RI L K++ D A+L +E R LG+
Sbjct: 12 LWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESHRLRAD--------GLGD 63
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
DIV L NY+D QYFGEIGIGTPPQNFTVIFDTGSSNLWVPS KCYFSIACY H +Y
Sbjct: 64 ----DIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSIACYLHHRY 119
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
+S SS+YKKNG+S I YG+G+I+GFFSED V +GDL VK+Q FIE TREPSLTF++ K
Sbjct: 120 KSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREPSLTFIIGK 179
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGLGF EISVG A P+W M Q L+ + VFSFW NR+ D GGE++FGG+DP+H
Sbjct: 180 FDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELIFGGVDPNH 239
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
YKG HTYVPVT+KGYWQF+MGD++ID +TGFC+GGCAAIADSGTSLL GPTTI+ Q+NH
Sbjct: 240 YKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPTTIVAQINH 299
Query: 314 AIGATGIVSQECKAVVSQYGEEIINMLLAKVS 345
AIGA GIVS ECK VV YG+ I+ ML+A+ S
Sbjct: 300 AIGAEGIVSMECKQVVRDYGDMILEMLIAQAS 331
>gi|218188020|gb|EEC70447.1| hypothetical protein OsI_01478 [Oryza sativa Indica Group]
Length = 495
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/332 (62%), Positives = 246/332 (74%), Gaps = 12/332 (3%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
L +L V+ GL RI L K++ D A+L +E R LG+
Sbjct: 12 LWILSCAVLLHASPDGLLRISLNKKRLDKKTLDGAKLAREESHRLRAD--------GLGD 63
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
DIV L NY+D QYFGEIGIGTPPQNFTVIFDTGSSNLWVPS KCYFSIACY H +Y
Sbjct: 64 ----DIVPLDNYLDTQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSVKCYFSIACYLHHRY 119
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
+S SS+YKKNG+S I YG+G+I+GFFSED V +GDL VK+Q FIE TREPSLTF++ K
Sbjct: 120 KSKGSSSYKKNGESCSISYGSGSIAGFFSEDSVLVGDLAVKNQMFIETTREPSLTFIIGK 179
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGLGF EISVG A P+W M Q L+ + VFSFW NR+ D GGE++FGG+DP+H
Sbjct: 180 FDGILGLGFPEISVGGAPPIWQGMKEQQLIEKDVFSFWLNRDPDAPTGGELIFGGVDPNH 239
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
YKG HTYVPVT+KGYWQF+MGD++ID +TGFC+GGCAAIADSGTSLL GPTTI+ Q+NH
Sbjct: 240 YKGSHTYVPVTRKGYWQFEMGDLLIDDYSTGFCSGGCAAIADSGTSLLGGPTTIVAQINH 299
Query: 314 AIGATGIVSQECKAVVSQYGEEIINMLLAKVS 345
AIGA GIVS ECK VV YG+ I+ ML+A+ S
Sbjct: 300 AIGAEGIVSMECKQVVRDYGDMILEMLIAQAS 331
>gi|12231180|dbj|BAB20973.1| aspartic proteinase 5 [Nepenthes alata]
Length = 358
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 219/348 (62%), Positives = 263/348 (75%), Gaps = 6/348 (1%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F C L+ FST GL RIGLK++ D N+ A R+ K G + +YS
Sbjct: 10 FCFCALI-SCFFSTSADGLVRIGLKRQFSDSNSIRAVRIARKAGMNQGLKRFQYSF---- 64
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G+S D DIV LKNY+DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ACYFHS
Sbjct: 65 GDS-DTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSVACYFHS 123
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY+S +SSTY K GKS +I YG+G+ISGFFS+D V++G+L VK+Q FIEA+RE SLTF L
Sbjct: 124 KYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREKSLTFAL 183
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQEISVG VPVWYNMV QGLV+E VFSFWFNR+ + GGEIVFGG+D
Sbjct: 184 AKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIVFGGIDE 243
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
H+ GEH YVP+T+KGYWQF+MG+ +I +TGFC GGC AI DSGTSLLAGP ++T+V
Sbjct: 244 KHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPMHVVTEV 303
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKLSLRHYKKWLII 359
NHAIGA GI S ECK VV QYG+ I ++L++ V LS W I+
Sbjct: 304 NHAIGAEGIASMECKEVVYQYGDMIWDLLVSGVCPPLSSLLPLSWCIL 351
>gi|13897888|gb|AAK48494.1|AF259982_1 putative aspartic protease [Ipomoea batatas]
Length = 504
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 208/328 (63%), Positives = 254/328 (77%), Gaps = 9/328 (2%)
Query: 19 FPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDAD 78
P +S N L R+GLKKR DL + AA+ ++ G + + K LG+S D
Sbjct: 20 LPAAYSDNN--LLRVGLKKRPLDLESIKAAK-GARLGGKYGKGVNK-----KLGDS-DEG 70
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
IV+L NY+DAQY+GEI IG+PPQNFTVIFDTGSSNLWVPSSKCY SIACYFHSKY+S +S
Sbjct: 71 IVSLNNYLDAQYYGEISIGSPPQNFTVIFDTGSSNLWVPSSKCYLSIACYFHSKYKSSKS 130
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STY + GKS I YG+ +ISGF S+D V++GDL+VKDQ FIE TREPSLTF++AKFDGIL
Sbjct: 131 STYTQIGKSCSITYGSVSISGFLSQDDVQLGDLLVKDQVFIETTREPSLTFIIAKFDGIL 190
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGFQEISV VPVWY+MV QGLV+EPVFSFW NR+ E GGE+VFGG+DP H+KGEH
Sbjct: 191 GLGFQEISVENVVPVWYDMVEQGLVDEPVFSFWLNRDPKAEVGGELVFGGVDPKHFKGEH 250
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVPVTQKGYWQ D+GD +I +TG+C GGCA I DSGTSLL GPT ++T++N+AIG
Sbjct: 251 TYVPVTQKGYWQIDLGDFLIGNSSTGYCEGGCAVIVDSGTSLLTGPTAVVTEINYAIGPE 310
Query: 319 GIVSQECKAVVSQYGEEIINMLLAKVSA 346
G+V ECK VVS+YGE I ++L++ + A
Sbjct: 311 GVVCAECKEVVSEYGEMIWDLLVSGLRA 338
>gi|12231178|dbj|BAB20972.1| aspartic proteinase 4 [Nepenthes alata]
Length = 505
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 217/344 (63%), Positives = 264/344 (76%), Gaps = 11/344 (3%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F C L+ FST GL RIGLK++ D N+ A R+ K G + +YS
Sbjct: 10 FCFCALI-SCFFSTSADGLVRIGLKRQFSDSNSIRAVRIARKAGMNQGLKRFQYSF---- 64
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G+S D DIV LKNY+DAQY+GEIGIG+PPQ F+VIFDTGSSNLWVPSSKCYFS+ACYFHS
Sbjct: 65 GDS-DTDIVYLKNYLDAQYYGEIGIGSPPQKFSVIFDTGSSNLWVPSSKCYFSVACYFHS 123
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY+S +SSTY K GKS +I YG+G+ISGFFS+D V++G+L VK+Q FIEA+RE SLTF L
Sbjct: 124 KYKSSKSSTYTKIGKSCEIDYGSGSISGFFSQDIVEVGNLAVKNQVFIEASREKSLTFAL 183
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILGLGFQEISVG VPVWYNMV QGLV+E VFSFWFNR+ + GGEIVFGG+D
Sbjct: 184 AKFDGILGLGFQEISVGDVVPVWYNMVEQGLVSEKVFSFWFNRDPKAKIGGEIVFGGIDE 243
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
H+ GEH YVP+T+KGYWQF+MG+ +I +TGFC GGC AI DSGTSLLAGP ++T+V
Sbjct: 244 KHFVGEHIYVPITRKGYWQFEMGNFLIGNYSTGFCRGGCDAIVDSGTSLLAGPMHVVTEV 303
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLA-----KVSAKLSL 350
NHAIGA GI S ECK VV QYG+ I ++L++ K+ ++L+L
Sbjct: 304 NHAIGAEGIASMECKEVVYQYGDMIWDLLVSGVQPDKICSQLAL 347
>gi|357130655|ref|XP_003566963.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 520
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/336 (60%), Positives = 253/336 (75%), Gaps = 15/336 (4%)
Query: 22 VFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDAD--- 78
+ + P GL R+ LKK D + A E+ R +R+Y N +G +
Sbjct: 26 LLAAPAEGLVRVALKKHPVDEHGLAAGE------EAQRLLLRRYGHVFNDASAGASSKPS 79
Query: 79 ------IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
V LKN ++AQY+GE+GIGTPPQNFTVIFDTGS+NLWVPSS CYFSIACYFH +
Sbjct: 80 TAAKGGSVTLKNCLNAQYYGEVGIGTPPQNFTVIFDTGSANLWVPSSNCYFSIACYFHPR 139
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y +G+S TYKKNGK +IHYGTGAISG+ S+D V++G +VVK Q+FIEAT EPS+TF+
Sbjct: 140 YNAGQSKTYKKNGKHVEIHYGTGAISGYLSQDSVQVGGVVVKKQDFIEATGEPSITFMFG 199
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILGLGF+E+ +P+WYNMV+QGLV + +FSFWFNR+A E +GGEIVFGG+DP
Sbjct: 200 KFDGILGLGFKEMLYLSVLPIWYNMVSQGLVGDLIFSFWFNRHAGEGQGGEIVFGGIDPS 259
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
H+KG HTYVPV +KGYWQFDM DV+I G +TGFC GCAA+ADSGTSLL+GPT I+TQ+N
Sbjct: 260 HHKGNHTYVPVPKKGYWQFDMSDVLIGGNSTGFCKDGCAAMADSGTSLLSGPTAIVTQIN 319
Query: 313 HAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKL 348
IGATG+VSQECKAVVSQYG++I+++LL K+
Sbjct: 320 KKIGATGVVSQECKAVVSQYGKQILDLLLKYSRKKI 355
>gi|168033581|ref|XP_001769293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679399|gb|EDQ65847.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/324 (61%), Positives = 242/324 (74%), Gaps = 26/324 (8%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
RI LKK+ DL + +A + RT R +L G+ D V L NYMDAQYF
Sbjct: 29 RIPLKKKSIDLQSVRSA--------AARTLQRANALAGSANSLRGGDAVDLNNYMDAQYF 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
GEIGIG+PPQ F+VIFDTGSSNLWVPS+KCY S+ACYFH +Y+S +SSTYK++G S I
Sbjct: 81 GEIGIGSPPQPFSVIFDTGSSNLWVPSAKCYLSLACYFHRRYKSSKSSTYKEDGTSFAIQ 140
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG++ GF S+D V +GDL VK Q F EAT+EP +TF+ AKFDGILGLGF+EISV +
Sbjct: 141 YGTGSMEGFLSQDDVTLGDLTVKWQVFAEATKEPGVTFVSAKFDGILGLGFKEISVDRVT 200
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PVWYNM++QGLV EPVFSFW NR++DE +GGE+VFGG+DPDH+KGEHTY PVT+KGYWQF
Sbjct: 201 PVWYNMLDQGLVKEPVFSFWLNRDSDESDGGELVFGGVDPDHFKGEHTYTPVTRKGYWQF 260
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQ 331
D+GD GC+AIADSGTSLLAGP+ I+ ++N AIGATGIVSQ+CK VV Q
Sbjct: 261 DLGD-------------GCSAIADSGTSLLAGPSGIVAEINQAIGATGIVSQQCKMVVQQ 307
Query: 332 YGEEIINMLLA-----KVSAKLSL 350
YGE+I+ ML+A KV A L L
Sbjct: 308 YGEQIVEMLVAQMNPGKVCASLGL 331
>gi|302761358|ref|XP_002964101.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
gi|300167830|gb|EFJ34434.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
Length = 505
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 184/269 (68%), Positives = 227/269 (84%), Gaps = 1/269 (0%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
DI L+NY+DAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPSS+C FS AC+ H +Y+S +
Sbjct: 74 DIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHHRYKSRK 133
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTYK +G S I YGTG ++GF S D+V IGD+VVKDQ F E+T EP L FL+AKFDGI
Sbjct: 134 SSTYKPDGTSIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFLVAKFDGI 193
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMDPDHYKG 256
LGLGF+ IS G+ PVWYNM+ Q L+++PVFSFW NR+A DEE+GGEIVFGG++ D +KG
Sbjct: 194 LGLGFKAISKGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKG 253
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+H Y PVT++GYWQF+MGDV +DGQ+TGFCA GCAAIADSGTSLLAGPT I+ Q+N AIG
Sbjct: 254 KHVYTPVTREGYWQFNMGDVAVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQINQAIG 313
Query: 317 ATGIVSQECKAVVSQYGEEIINMLLAKVS 345
ATG+VS+ECK VV+QYG+ I+ +LLA+V+
Sbjct: 314 ATGLVSEECKMVVAQYGDLIVELLLAQVT 342
>gi|302820804|ref|XP_002992068.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
gi|300140190|gb|EFJ06917.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
Length = 499
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 184/275 (66%), Positives = 229/275 (83%), Gaps = 1/275 (0%)
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G + DI L+NY+DAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPSS+C FS AC+ H
Sbjct: 62 GANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+Y+S +SSTYK + S I YGTG ++GF S D+V IGD+VVKDQ F E+T EP L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMD 250
AKFDGILGLGF+ IS+G+ PVWYNM+ Q L+++PVFSFW NR+A DEE+GGEIVFGG++
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
D +KG+H Y PVT++GYWQF+MGDV++DGQ+TGFCA GCAAIADSGTSLLAGPT I+ Q
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLAGPTGIVAQ 301
Query: 311 VNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVS 345
+N AIGATG+VS+ECK VV+QYG+ I+ +LLA+V+
Sbjct: 302 INQAIGATGLVSEECKMVVTQYGDLIVELLLAQVT 336
>gi|302761356|ref|XP_002964100.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
gi|300167829|gb|EFJ34433.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
Length = 503
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 182/269 (67%), Positives = 226/269 (84%), Gaps = 1/269 (0%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
DI L+NY+DAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPSS+C FS AC+ H +Y+S +
Sbjct: 74 DIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHRRYKSRK 133
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTYK + S I YGTG ++GF S D+V IGD+VVKDQ F E+T EP L FL AKFDGI
Sbjct: 134 SSTYKPDDASIAIQYGTGQMAGFLSTDYVTIGDVVVKDQTFAESTSEPGLVFLFAKFDGI 193
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMDPDHYKG 256
LGLGF+ IS+G+ PVWYNM+ Q L+++PVFSFW NR+A DEE+GGEIVFGG++ D +KG
Sbjct: 194 LGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVNKDRFKG 253
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+H Y PVT++GYWQF+MGDV++DGQ+TGFCA GCAAIADSGTSLL GPT I+ Q+N AIG
Sbjct: 254 KHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQINQAIG 313
Query: 317 ATGIVSQECKAVVSQYGEEIINMLLAKVS 345
ATG+VS+ECK VV+QYG+ I+ +LLA+V+
Sbjct: 314 ATGLVSEECKMVVAQYGDLIVELLLAQVT 342
>gi|302761354|ref|XP_002964099.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
gi|300167828|gb|EFJ34432.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
Length = 497
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 183/275 (66%), Positives = 229/275 (83%), Gaps = 1/275 (0%)
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G + DI L+NY+DAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPSS+C FS AC+ H
Sbjct: 62 GANSLTDIEPLRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRCIFSPACWLHR 121
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+Y+S +SSTYK + S I YG+G ++GFFS D+V IGD+VVKDQ F E+T EP L FL
Sbjct: 122 RYKSRKSSTYKPDDASIAIQYGSGQMAGFFSTDYVTIGDVVVKDQTFAESTSEPGLVFLF 181
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMD 250
AKFDGILGLGF+ IS+G+ PVWYNM+ Q L+++PVFSFW NR+A DEE+GGEIVFGG++
Sbjct: 182 AKFDGILGLGFKAISMGQVTPVWYNMLAQKLISQPVFSFWLNRDASDEEDGGEIVFGGVN 241
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
D +KG+H Y PVT++GYWQF+MGDV++DGQ+TGFCA GCAAIADSGTSLL GPT I+ Q
Sbjct: 242 KDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAAIADSGTSLLVGPTGIVAQ 301
Query: 311 VNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVS 345
+N AIGATG+VS+ECK VV+QYG+ I+ +LLA+V+
Sbjct: 302 INQAIGATGLVSEECKMVVAQYGDLIVELLLAQVT 336
>gi|356542078|ref|XP_003539498.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase oryzasin-1-like
[Glycine max]
Length = 449
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 187/273 (68%), Positives = 220/273 (80%), Gaps = 16/273 (5%)
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
G S D I+ LKNYM+AQYFGEIGIGT PQ FTVIFDTGSSNLWVPSSKCYFS+ACY
Sbjct: 20 GGYENSDDTSIIRLKNYMNAQYFGEIGIGTLPQKFTVIFDTGSSNLWVPSSKCYFSVACY 79
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
HS+Y+S +SST KNG SA+IHYGTG ISGFF++DHVK+ DLVV DQ+FIEATR
Sbjct: 80 LHSRYKSSQSSTCNKNGSSAEIHYGTGHISGFFTQDHVKVXDLVVYDQDFIEATR----- 134
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
+GFQEISVG A P+WYNM+NQ + +PVFSFW NRN +EE+GG+IVFGG
Sbjct: 135 -----------VGFQEISVGNAAPIWYNMLNQHFLTQPVFSFWLNRNTNEEQGGQIVFGG 183
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+D DHYKGEHTYVPVTQKGYWQ ++GDV+I+G+TTG CA C AI DSGTSLLAGPT +I
Sbjct: 184 IDSDHYKGEHTYVPVTQKGYWQIEIGDVLINGKTTGLCAAKCLAIVDSGTSLLAGPTGVI 243
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIINMLL 341
Q+NHAIGA GIVSQECKA+V+QYG+ I++ L+
Sbjct: 244 AQINHAIGAVGIVSQECKALVAQYGKTILDKLI 276
>gi|223949795|gb|ACN28981.1| unknown [Zea mays]
gi|413917601|gb|AFW57533.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
gi|413917602|gb|AFW57534.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 509
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/279 (66%), Positives = 227/279 (81%), Gaps = 2/279 (0%)
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
LG SG D V L +Y++ QY+G +GIGTPPQNFTVIFDTGSSNLWVPSS+CYFSIACY
Sbjct: 70 RLGASGGGD-VPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYL 128
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H +Y+S +SSTYK +G++ I YG+G+I+GFFS+D V +GDL VK+Q+FIE TRE S+TF
Sbjct: 129 HHRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITF 188
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD-EEEGGEIVFGG 248
++ KFDGILGLG+ EISVGKA P+W +M Q L+ E VFSFW NR+ D GGE+VFGG
Sbjct: 189 IIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGG 248
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+DP H+ G HTYVPV++KGYWQFDMGD++IDG +TGFCA GCAAI DSGTSLLAGPT II
Sbjct: 249 VDPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAII 308
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
QVN AIGA GI+S ECK VVSQYGE I++ML+A+ +
Sbjct: 309 AQVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQ 347
>gi|226532912|ref|NP_001146573.1| hypothetical protein [Zea mays]
gi|219887869|gb|ACL54309.1| unknown [Zea mays]
gi|413917600|gb|AFW57532.1| hypothetical protein ZEAMMB73_218341 [Zea mays]
Length = 494
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/279 (66%), Positives = 227/279 (81%), Gaps = 2/279 (0%)
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
LG SG D V L +Y++ QY+G +GIGTPPQNFTVIFDTGSSNLWVPSS+CYFSIACY
Sbjct: 55 RLGASGGGD-VPLVDYLNTQYYGVVGIGTPPQNFTVIFDTGSSNLWVPSSRCYFSIACYL 113
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H +Y+S +SSTYK +G++ I YG+G+I+GFFS+D V +GDL VK+Q+FIE TRE S+TF
Sbjct: 114 HHRYKSAKSSTYKADGETCKITYGSGSIAGFFSDDDVLVGDLTVKNQKFIETTRESSITF 173
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD-EEEGGEIVFGG 248
++ KFDGILGLG+ EISVGKA P+W +M Q L+ E VFSFW NR+ D GGE+VFGG
Sbjct: 174 IIGKFDGILGLGYPEISVGKAPPIWQSMQEQELLAEDVFSFWLNRSPDAAAAGGELVFGG 233
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+DP H+ G HTYVPV++KGYWQFDMGD++IDG +TGFCA GCAAI DSGTSLLAGPT II
Sbjct: 234 VDPAHFSGNHTYVPVSRKGYWQFDMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAII 293
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
QVN AIGA GI+S ECK VVSQYGE I++ML+A+ +
Sbjct: 294 AQVNEAIGADGIISTECKEVVSQYGEMILDMLIAQTDPQ 332
>gi|293335451|ref|NP_001169605.1| uncharacterized protein LOC100383486 precursor [Zea mays]
gi|224030337|gb|ACN34244.1| unknown [Zea mays]
Length = 556
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/277 (67%), Positives = 222/277 (80%), Gaps = 1/277 (0%)
Query: 69 GNLGE-SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
G GE SG IVALKN+++AQYFG+IG+G PPQNFTV+FDTGS+NLWVPS+KC+FS+AC
Sbjct: 115 GGQGEGSGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLAC 174
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
FH KY S +SSTYK NG A IHYGTG I+GF+S+D V +G+LVV++QEFIEAT EP
Sbjct: 175 LFHPKYDSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGF 234
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
TFLLAKFDGILGL FQEISV ++PVWYNMVNQ LV +PVFSFW NRN + EGGEIVFG
Sbjct: 235 TFLLAKFDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFG 294
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G D HYKG HTY VT+KGYWQF+MGD +I G++TG C GCAAIADSGTSL+AGP
Sbjct: 295 GSDEQHYKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVA 354
Query: 308 ITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
I Q+N IGA G+V+QECK VV+ YG +I +L A+
Sbjct: 355 IAQINEQIGAAGVVNQECKQVVAGYGLQIAGLLEAQT 391
>gi|414881317|tpg|DAA58448.1| TPA: hypothetical protein ZEAMMB73_088821 [Zea mays]
Length = 557
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/276 (67%), Positives = 222/276 (80%), Gaps = 1/276 (0%)
Query: 69 GNLGE-SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
G GE SG IVALKN+++AQYFG+IG+G PPQNFTV+FDTGS+NLWVPS+KC+FS+AC
Sbjct: 115 GGQGEGSGSIAIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLAC 174
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
FH KY S +SSTYK NG A IHYGTG I+GF+S+D V +G+LVV++QEFIEAT EP
Sbjct: 175 LFHPKYDSRQSSTYKPNGTPASIHYGTGGIAGFYSQDQVTVGNLVVQNQEFIEATHEPGF 234
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
TFLLAKFDGILGL FQEISV ++PVWYNMVNQ LV +PVFSFW NRN + EGGEIVFG
Sbjct: 235 TFLLAKFDGILGLAFQEISVEGSLPVWYNMVNQNLVAQPVFSFWLNRNPFDGEGGEIVFG 294
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G D HYKG HTY VT+KGYWQF+MGD +I G++TG C GCAAIADSGTSL+AGP
Sbjct: 295 GSDEQHYKGSHTYTRVTRKGYWQFEMGDFLIGGRSTGICVDGCAAIADSGTSLIAGPLVA 354
Query: 308 ITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAK 343
I Q+N IGA G+V+QECK VV+ YG +I +L A+
Sbjct: 355 IAQINEQIGAAGVVNQECKQVVAGYGLQIAGLLEAQ 390
>gi|357135633|ref|XP_003569413.1| PREDICTED: aspartic proteinase oryzasin-1-like [Brachypodium
distachyon]
Length = 560
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 185/281 (65%), Positives = 224/281 (79%), Gaps = 1/281 (0%)
Query: 67 LRGNLG-ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 125
+RGN G +I+ LKNYM+AQYFG+IG+G PPQNFTV+FDTGSSN+WVPS+KC FS+
Sbjct: 112 IRGNRSVHDGQQNIIPLKNYMNAQYFGQIGVGCPPQNFTVVFDTGSSNIWVPSAKCIFSL 171
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
ACYFH KY S SSTYK+NG A IHYG+GAI GF+SED V IG+LVVK+QEFIE T E
Sbjct: 172 ACYFHPKYVSRWSSTYKENGTPASIHYGSGAIYGFYSEDQVTIGNLVVKNQEFIETTYEH 231
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
TFL AKFDGILGLGF+EISV + PVWYNM++QGLV E FSFW NR+A++ EGGEIV
Sbjct: 232 GFTFLAAKFDGILGLGFKEISVEGSDPVWYNMIDQGLVKEKSFSFWLNRDANDGEGGEIV 291
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG DP HYKG HTY VT+K YWQF+MGD +I G++TG C GCAAIADSGTSL+AGP
Sbjct: 292 FGGSDPKHYKGSHTYTRVTRKAYWQFEMGDFLIGGKSTGICVDGCAAIADSGTSLIAGPV 351
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSA 346
+I Q+N IGA G+ ++ECK VV+ YG+++I +L AK +A
Sbjct: 352 AVIAQINEKIGANGVANEECKQVVAGYGQQMIELLEAKQTA 392
>gi|356556454|ref|XP_003546541.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 505
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 196/341 (57%), Positives = 241/341 (70%), Gaps = 9/341 (2%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNGG-LYRIGLKKRKFDLNNRVAARLDSKEGESFRTS 61
M FK + G + + F+T N G L RIGLK+R DL AAR+ KE R
Sbjct: 1 MDFKYLLVGMCVWAWFGSITFATSNDGRLMRIGLKRRTLDLQCLKAARI--KEAGHHR-- 56
Query: 62 IRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
L G D DIV LKNY+DAQYFGEI IG+PPQ F V+FDTGSSNLWVPSSKC
Sbjct: 57 ----DLGGVNRNCCDEDIVYLKNYLDAQYFGEISIGSPPQYFNVVFDTGSSNLWVPSSKC 112
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
FSIACYFHSKYRS SSTY + G I YG G+I GFFS+D+V++GD+++KDQEF E
Sbjct: 113 IFSIACYFHSKYRSKISSTYTEIGIPCKIPYGQGSIFGFFSQDNVQVGDIIIKDQEFAEI 172
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
TRE SL FDGILGLGFQ+ SVGK PVWYNM+ GL++ +FS W N++ EE G
Sbjct: 173 TREGSLALPALPFDGILGLGFQDTSVGKVTPVWYNMLEGGLISHKIFSLWLNQDPSEEMG 232
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GEIVFGG+D H++GEHTYVP++QKGYWQ D+GD+++ +TG C GGCAA+ DSGTSL+
Sbjct: 233 GEIVFGGIDYRHFRGEHTYVPLSQKGYWQIDLGDILLANNSTGLCEGGCAAVVDSGTSLI 292
Query: 302 AGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLA 342
AGPTT++TQ+NHAIGA G S ECK+++ YG+ I L+A
Sbjct: 293 AGPTTVVTQINHAIGAEGYTSFECKSILHNYGDSIWESLIA 333
>gi|218188712|gb|EEC71139.1| hypothetical protein OsI_02961 [Oryza sativa Indica Group]
Length = 540
Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust.
Identities = 184/295 (62%), Positives = 226/295 (76%), Gaps = 5/295 (1%)
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S R + RG+L +ALKN+++AQYFGEIG+G PPQNFTV+FDTGSSNLWVPS+K
Sbjct: 125 SYRGFRERGSLKNGTQNHPLALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAK 184
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH KY S SSTY +NG A IHYGTG+I G++S+D V IGDLVV +QEFIE
Sbjct: 185 CVFSLACYFHRKYESRSSSTYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIE 244
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT EP LTFL AKFDGILGLGF+EISV A PVWYNM+ Q LV + VFSFW NRNA++
Sbjct: 245 ATHEPGLTFLAAKFDGILGLGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDIN 304
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG D HYKG+HTY VT+K YWQF+MGD +I G++TG C GCA IADSGTSL
Sbjct: 305 GGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSL 364
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLL-----AKVSAKLSL 350
+AGP I Q++ IGATG+ ++ECK VV+++G E++ +L A+V +K+ L
Sbjct: 365 IAGPIAAIAQIHAHIGATGVANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGL 419
>gi|242053731|ref|XP_002456011.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
gi|241927986|gb|EES01131.1| hypothetical protein SORBIDRAFT_03g028820 [Sorghum bicolor]
Length = 567
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/267 (68%), Positives = 217/267 (81%), Gaps = 1/267 (0%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
+IVALKN+++AQYFG+IG+G PPQNFTV+FDTGS+NLWVPS+KC+FS+AC FH KY S +
Sbjct: 135 NIVALKNFLNAQYFGQIGVGCPPQNFTVVFDTGSANLWVPSAKCFFSLACLFHPKYDSSQ 194
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTYK NG A IHYGTG I+GF+S+D V +G+LVV++QEFIEAT EP TFLLAKFDGI
Sbjct: 195 SSTYKPNGTPASIHYGTGGIAGFYSQDEVTVGNLVVQNQEFIEATHEPGFTFLLAKFDGI 254
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMDPDHYKG 256
LGL FQEISV +VPVWYNMVNQ LV +PVFSFW NRN D EEGGEIVFGG D HYKG
Sbjct: 255 LGLAFQEISVEGSVPVWYNMVNQSLVPQPVFSFWLNRNPFDGEEGGEIVFGGSDEQHYKG 314
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
HTY VT+K YWQF+MGD +I ++TG C GCAAIADSGTSL+AGP I Q+N IG
Sbjct: 315 SHTYTRVTRKAYWQFEMGDFLIGERSTGICVDGCAAIADSGTSLIAGPLVAIAQINEQIG 374
Query: 317 ATGIVSQECKAVVSQYGEEIINMLLAK 343
A G+V+ ECK VV+ YG E++ +L A+
Sbjct: 375 AAGVVNHECKQVVAGYGLEMVELLKAQ 401
>gi|115438741|ref|NP_001043650.1| Os01g0631900 [Oryza sativa Japonica Group]
gi|55297073|dbj|BAD68642.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113533181|dbj|BAF05564.1| Os01g0631900 [Oryza sativa Japonica Group]
Length = 522
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 184/295 (62%), Positives = 226/295 (76%), Gaps = 5/295 (1%)
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S R + RG+L +ALKN+++AQYFGEIG+G PPQNFTV+FDTGSSNLWVPS+K
Sbjct: 110 SYRGFRERGSLKNGTQNHPLALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAK 169
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH KY S SSTY +NG A IHYGTG+I G++S+D V IGDLVV +QEFIE
Sbjct: 170 CVFSLACYFHRKYESRSSSTYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIE 229
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT EP LTFL AKFDGILGLGF+EISV A PVWYNM+ Q LV + VFSFW NRNA++
Sbjct: 230 ATHEPGLTFLAAKFDGILGLGFKEISVEGADPVWYNMIQQSLVTDKVFSFWLNRNANDIN 289
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG D HYKG+HTY VT+K YWQF+MGD +I G++TG C GCA IADSGTSL
Sbjct: 290 GGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTSL 349
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLL-----AKVSAKLSL 350
+AGP I Q++ IGATG+ ++ECK VV+++G E++ +L A+V +K+ L
Sbjct: 350 IAGPIAAIAQIHAHIGATGVANEECKQVVARHGHEMLELLQDKTPPAQVCSKIGL 404
>gi|297736824|emb|CBI26025.3| unnamed protein product [Vitis vinifera]
Length = 500
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/318 (58%), Positives = 238/318 (74%), Gaps = 14/318 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD----ADIVALKN 84
GL RIGLKK+ DL AAR+ G F LG+ D A+ V LKN
Sbjct: 28 GLVRIGLKKKPLDLARLHAARI--TRGNGFHAQ--------GLGKVDDNYPKANTVYLKN 77
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPSSKCYFSIACYFH++YR+ S TY KN
Sbjct: 78 YMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSSKCYFSIACYFHARYRAVLSRTYSKN 137
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G+ I+YG+G+ISGFFS+DHV+IG++V+K+Q F EAT+E F LA+FDGILGLGFQ
Sbjct: 138 GRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFTEATKEGLFAFSLAQFDGILGLGFQN 197
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
SVGK P+WYNMV Q LV+ + SFW NR+ + GGE++FGG+D H+ G+HT+VP+T
Sbjct: 198 ASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAKIGGEVIFGGVDWRHFMGDHTFVPIT 257
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+K YWQ ++GD++I G +TGFC GGCAAI D+GTS++AGPTT++TQ+NHAIGA GIVS
Sbjct: 258 RKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTSMIAGPTTVVTQINHAIGAEGIVSFN 317
Query: 325 CKAVVSQYGEEIINMLLA 342
CK VV++YG I L++
Sbjct: 318 CKNVVNKYGRLIWQFLVS 335
>gi|359477267|ref|XP_002275241.2| PREDICTED: aspartic proteinase [Vitis vinifera]
Length = 502
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/318 (58%), Positives = 238/318 (74%), Gaps = 14/318 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD----ADIVALKN 84
GL RIGLKK+ DL AAR+ G F LG+ D A+ V LKN
Sbjct: 28 GLVRIGLKKKPLDLARLHAARI--TRGNGFHAQ--------GLGKVDDNYPKANTVYLKN 77
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDAQY+GEIGIG+PPQ F+V+FDTGSSNLWVPSSKCYFSIACYFH++YR+ S TY KN
Sbjct: 78 YMDAQYYGEIGIGSPPQTFSVVFDTGSSNLWVPSSKCYFSIACYFHARYRAVLSRTYSKN 137
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G+ I+YG+G+ISGFFS+DHV+IG++V+K+Q F EAT+E F LA+FDGILGLGFQ
Sbjct: 138 GRHCKINYGSGSISGFFSQDHVQIGEIVIKNQVFTEATKEGLFAFSLAQFDGILGLGFQN 197
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
SVGK P+WYNMV Q LV+ + SFW NR+ + GGE++FGG+D H+ G+HT+VP+T
Sbjct: 198 ASVGKIPPIWYNMVQQSLVSMEIVSFWLNRDPKAKIGGEVIFGGVDWRHFMGDHTFVPIT 257
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+K YWQ ++GD++I G +TGFC GGCAAI D+GTS++AGPTT++TQ+NHAIGA GIVS
Sbjct: 258 RKDYWQIEVGDILIAGSSTGFCEGGCAAIVDTGTSMIAGPTTVVTQINHAIGAEGIVSFN 317
Query: 325 CKAVVSQYGEEIINMLLA 342
CK VV++YG I L++
Sbjct: 318 CKNVVNKYGRLIWQFLVS 335
>gi|168031065|ref|XP_001768042.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680680|gb|EDQ67114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 455
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 185/294 (62%), Positives = 227/294 (77%), Gaps = 3/294 (1%)
Query: 58 FRTSIRKYSLRGN-LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWV 116
R + RG +G GD VAL NY+DAQY+G I IGTP Q FTV+FDTGSSNLWV
Sbjct: 1 MRAQQARGGTRGQGVGSGGDE--VALVNYLDAQYYGVIEIGTPKQEFTVVFDTGSSNLWV 58
Query: 117 PSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQ 176
PS+KCY S+AC+FH +Y++ +SSTYK++G I YGTG++ GF S D V +GDL VK Q
Sbjct: 59 PSAKCYLSLACFFHHRYKARKSSTYKQDGTPFAIQYGTGSMEGFLSIDDVTLGDLTVKAQ 118
Query: 177 EFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA 236
F EAT+EP +TFL A+ DGILGLGF+EISV PVWYNM+ Q LV EPVFSFW NR+
Sbjct: 119 VFAEATKEPGVTFLAAEMDGILGLGFKEISVNDVNPVWYNMLYQKLVQEPVFSFWLNRDV 178
Query: 237 DEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADS 296
+ E+GGE+V GG+DP H+KG HTY PVT+ GYWQFDMGDV++DGQ+TGFCAGGCAAIADS
Sbjct: 179 EGEKGGELVLGGVDPHHFKGNHTYTPVTRLGYWQFDMGDVLLDGQSTGFCAGGCAAIADS 238
Query: 297 GTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKLSL 350
GTSLLAGPT I+ ++N+AIGATGI+S ECK VV QY + II ML++K+ L +
Sbjct: 239 GTSLLAGPTGIVAEINYAIGATGIISGECKLVVDQYADFIIQMLMSKLLTPLKI 292
>gi|255556616|ref|XP_002519342.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
gi|223541657|gb|EEF43206.1| Aspartic proteinase oryzasin-1 precursor, putative [Ricinus
communis]
Length = 500
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/319 (58%), Positives = 236/319 (73%), Gaps = 13/319 (4%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L +IGLKKR+ DL + AAR+ + + + G AD+V LKNY+D Q
Sbjct: 30 LVKIGLKKRRLDLYSINAARITIADASA------------SFGWP-KADVVYLKNYLDTQ 76
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
Y+GE+ IG+PPQ FTV+FDTGSSNLWVPSSKC SI CYFHSK+R+ S TY K G
Sbjct: 77 YYGEVAIGSPPQTFTVVFDTGSSNLWVPSSKCVLSITCYFHSKFRAKMSRTYTKIGLPCK 136
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G+ISGFFS+D+VK+GD V+DQEF+E TRE L FL +FDGILGLGFQEI+VG+
Sbjct: 137 IDYGSGSISGFFSQDYVKLGDATVRDQEFVEVTREGLLAFLGTQFDGILGLGFQEITVGQ 196
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
A PVWYNMV QG VN+ +FS W NR+ GGEIVFGG+D H++GEHTYVPVT+KGYW
Sbjct: 197 ATPVWYNMVRQGHVNQKLFSLWLNRDPTAGMGGEIVFGGLDWRHFRGEHTYVPVTEKGYW 256
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVV 329
Q ++GDV I ++TG C GCAAI DSGTS +AGPTTI+TQ+NHAIGA GIVS ECK+VV
Sbjct: 257 QIEVGDVFIAKKSTGMCEYGCAAIVDSGTSFIAGPTTIVTQINHAIGAQGIVSLECKSVV 316
Query: 330 SQYGEEIINMLLAKVSAKL 348
+++G+ I L++ + ++
Sbjct: 317 TKFGDLIWESLISGLRPEI 335
>gi|296089849|emb|CBI39668.3| unnamed protein product [Vitis vinifera]
Length = 430
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 176/259 (67%), Positives = 216/259 (83%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
MDAQY+GEIGIGTPPQNFTV+FDTGS+NLWVPS+KC+FSIAC FHSKY S S+TY G
Sbjct: 1 MDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTKCHFSIACLFHSKYNSRLSTTYIDLG 60
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
K +IHYG+G+ISG FS+D+V++G + +K+Q FIEATRE SL F+L KFDGILGLGF+EI
Sbjct: 61 KEGEIHYGSGSISGVFSQDNVQVGSMAIKNQVFIEATREASLVFVLGKFDGILGLGFEEI 120
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
VG A PVWYN++ QGLV E +FSFW NR+ +GGEIVFGG+D H+KG+HTY +TQ
Sbjct: 121 VVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATDGGEIVFGGVDKRHFKGQHTYASITQ 180
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
KGYWQF+MG+ +I Q+TGFC GCAAI DSGTSL+AGPT I+T++NHAIGA GIVSQEC
Sbjct: 181 KGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSLIAGPTAIVTEINHAIGAEGIVSQEC 240
Query: 326 KAVVSQYGEEIINMLLAKV 344
K VVSQYG I ++L+++V
Sbjct: 241 KEVVSQYGNMIWDLLISRV 259
>gi|357450315|ref|XP_003595434.1| Aspartic proteinase [Medicago truncatula]
gi|355484482|gb|AES65685.1| Aspartic proteinase [Medicago truncatula]
Length = 507
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/336 (53%), Positives = 234/336 (69%), Gaps = 10/336 (2%)
Query: 15 CLLLFPV--VFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
CL ++ + ++ N L RI LKKR D+ + +R+ + I + L
Sbjct: 11 CLWIWSLSLAYTISNDNLMRISLKKRNLDIQSLNTSRI--------KKVIHERDLESVDT 62
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
G D+V LKNY D QY+GEIGIG+PPQ F V+FDTGSSNLWVPSS+C FSIACYFHSK
Sbjct: 63 NYGSKDVVYLKNYFDVQYYGEIGIGSPPQYFNVVFDTGSSNLWVPSSRCIFSIACYFHSK 122
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
YRSG SSTY + G +I Y G I GFFS+D+VK+GD+ VKDQEF E TRE + L
Sbjct: 123 YRSGISSTYNEIGVPCEIPYDEGYIYGFFSQDNVKVGDINVKDQEFCEITREGNFALLAL 182
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILGLGFQ++SVGK PVWYNM+ QG +++ VFS WFN++ E GGEIVFGG+D
Sbjct: 183 PFDGILGLGFQDVSVGKVTPVWYNMIEQGHISDKVFSLWFNKDPMAEVGGEIVFGGVDKR 242
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
H++G+HTY P++QKGYWQ ++GD+++ TTG C GGCAAI DSGTSL+AGPT ++TQ+N
Sbjct: 243 HFRGDHTYFPISQKGYWQIEVGDILLANNTTGLCEGGCAAIVDSGTSLIAGPTGVVTQIN 302
Query: 313 HAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKL 348
H IG G VS ECK +V YG I L++ ++ ++
Sbjct: 303 HVIGTEGYVSYECKNIVHNYGNLIWESLISGLNPEI 338
>gi|510880|emb|CAA56373.1| putative aspartic protease [Brassica oleracea]
Length = 255
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/264 (69%), Positives = 211/264 (79%), Gaps = 10/264 (3%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG+ K + + LLF + N G +R+GLKK K D +R+AAR+ SK+ +
Sbjct: 1 MGIYSKPVAVSLIVSFLLFLSASAERNDGTFRVGLKKLKLDRKSRIAARVGSKQLKP--- 57
Query: 61 SIRKYSLRG-NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
LRG LG+SGDADIV LKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPSS
Sbjct: 58 ------LRGYGLGDSGDADIVTLKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSS 111
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
KCYFSIAC FHSKY+S RSSTY+KNGKSA IHYGTGAI+GFFS D V +GDLVVKDQEFI
Sbjct: 112 KCYFSIACLFHSKYKSSRSSTYEKNGKSAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFI 171
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP +TF+LAKFDGILGLGFQEISVG A PVWYNM+ QGL EPVFSFW NRNA++E
Sbjct: 172 EATKEPGITFVLAKFDGILGLGFQEISVGNAAPVWYNMLKQGLYKEPVFSFWLNRNAEDE 231
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPV 263
EGGE+VFGG+DP+HYKGEH YVPV
Sbjct: 232 EGGELVFGGVDPNHYKGEHIYVPV 255
>gi|356547093|ref|XP_003541952.1| PREDICTED: LOW QUALITY PROTEIN: cyprosin-like, partial [Glycine
max]
Length = 470
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/321 (59%), Positives = 233/321 (72%), Gaps = 34/321 (10%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
VV S P+ G+ R+GL+K KFD ++ + G S D I+
Sbjct: 12 VVLSGPSNGIIRVGLEKNKFD---------------------QRKTPFGGYENSDDTSII 50
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
LKNYM+AQYFGEIGIGTP Q FTVIFDTGSSNLWVPSSKCYFS+ACY HS+Y+S +SST
Sbjct: 51 RLKNYMNAQYFGEIGIGTP-QKFTVIFDTGSSNLWVPSSKCYFSVACYLHSRYKSSQSST 109
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
KNG SA+I YGTG ISGFFS+D+VK+GDL+V TR LL + ++ L
Sbjct: 110 QNKNGSSAEIRYGTGQISGFFSQDYVKVGDLIV-------LTR----XILLNEHFCVI-L 157
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F+ ISVGK P+WYNM+NQ L+ +PVFSFW NRN DE++GG+IVFGG+D DHY GEHTY
Sbjct: 158 QFKSISVGKVSPIWYNMLNQHLLAQPVFSFWLNRNTDEKQGGQIVFGGVDSDHYXGEHTY 217
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT KGYWQ ++GDV+ID +TT FCA C+AI DSGTSLLAGPT I Q+NHAIGA G+
Sbjct: 218 VPVTHKGYWQTEIGDVLIDRKTTEFCASKCSAIDDSGTSLLAGPTGAIAQINHAIGAVGV 277
Query: 321 VSQECKAVVSQYGEEIINMLL 341
V+QECKAVV+QYG+ I++ L+
Sbjct: 278 VNQECKAVVAQYGKTILDKLI 298
>gi|302756359|ref|XP_002961603.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
gi|300170262|gb|EFJ36863.1| hypothetical protein SELMODRAFT_230037 [Selaginella moellendorffii]
Length = 423
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 169/260 (65%), Positives = 209/260 (80%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
MDAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPS KC S +C+FH +Y++G+SSTYK NG
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRYKAGQSSTYKPNG 60
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
S I YG+G++SGF S D V +G L VK + F EAT EP LTF+ AKFDGI+GLGFQ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ + VP+WY++V Q LV EPVFSFW NR+A + GGE+V GG+DP H+KG+H Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+GYW+ MGDV+IDG TG C+ GCAAI DSGTSLLAGP+ II ++NHAIGA+G+VSQEC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 326 KAVVSQYGEEIINMLLAKVS 345
K +V QYG IIN+LLA+VS
Sbjct: 241 KLIVDQYGNIIINLLLAQVS 260
>gi|302775562|ref|XP_002971198.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
gi|300161180|gb|EFJ27796.1| hypothetical protein SELMODRAFT_147484 [Selaginella moellendorffii]
Length = 423
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 168/260 (64%), Positives = 209/260 (80%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
MDAQY+GEIGIG+PPQ FTVIFDTGSSNLWVPS KC S +C+FH ++++G+SSTYK NG
Sbjct: 1 MDAQYYGEIGIGSPPQEFTVIFDTGSSNLWVPSGKCVLSPSCWFHRRFKAGQSSTYKPNG 60
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
S I YG+G++SGF S D V +G L VK + F EAT EP LTF+ AKFDGI+GLGFQ I
Sbjct: 61 TSISIQYGSGSMSGFLSVDDVTLGKLTVKGEVFAEATSEPGLTFMAAKFDGIMGLGFQAI 120
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ + VP+WY++V Q LV EPVFSFW NR+A + GGE+V GG+DP H+KG+H Y P+T+
Sbjct: 121 AQARVVPIWYHIVEQQLVKEPVFSFWLNRDATDGNGGELVLGGVDPKHFKGKHNYAPITR 180
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+GYW+ MGDV+IDG TG C+ GCAAI DSGTSLLAGP+ II ++NHAIGA+G+VSQEC
Sbjct: 181 EGYWEIRMGDVLIDGHGTGMCSKGCAAIVDSGTSLLAGPSAIIAEINHAIGASGVVSQEC 240
Query: 326 KAVVSQYGEEIINMLLAKVS 345
K +V QYG IIN+LLA+VS
Sbjct: 241 KLIVDQYGNIIINLLLAQVS 260
>gi|222424506|dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]
Length = 389
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/222 (76%), Positives = 196/222 (88%)
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC H KY+S RSSTY+KNGK+A IHYGTGAI+GFFS D V +GDLVVKDQEFIEAT+EP
Sbjct: 1 ACLLHPKYKSSRSSTYEKNGKAAAIHYGTGAIAGFFSNDAVTVGDLVVKDQEFIEATKEP 60
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
+TF++AKFDGILGLGFQEISVGKA PVWYNM+ QGL+ EPVFSFW NRNADEEEGGE+V
Sbjct: 61 GITFVVAKFDGILGLGFQEISVGKAAPVWYNMLKQGLIKEPVFSFWLNRNADEEEGGELV 120
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG+DP+H+KG+HTYVPVTQKGYWQFDMGDV+I G TGFC GC+AIADSGTSLLAGPT
Sbjct: 121 FGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAIADSGTSLLAGPT 180
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
TIIT +NHAIGA G+VSQ+CK VV QYG+ I+++LL++ K
Sbjct: 181 TIITMINHAIGAAGVVSQQCKTVVDQYGQTILDLLLSETQPK 222
>gi|224106994|ref|XP_002314336.1| predicted protein [Populus trichocarpa]
gi|222863376|gb|EEF00507.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/325 (54%), Positives = 228/325 (70%), Gaps = 12/325 (3%)
Query: 24 STPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALK 83
S + GL R+GLKKR DLN+ AAR+ + SF + +A+IV LK
Sbjct: 8 SLSSDGLARVGLKKRNLDLNSIHAARITRPQATSFARV------------TSNAEIVYLK 55
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
NY+D QY+GEIGIG+PPQ FTV+FDTGSSNLWVPSSKC SI CYFHSK+ + S TY K
Sbjct: 56 NYLDTQYYGEIGIGSPPQIFTVVFDTGSSNLWVPSSKCLLSITCYFHSKFIARLSRTYTK 115
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
G I YG+G++SGF S+DHVK+GD ++ +Q +++E L L +FDGILGL FQ
Sbjct: 116 IGIPCKIQYGSGSVSGFLSQDHVKVGDDIIINQVSSASSKEGFLALLGVQFDGILGLAFQ 175
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+I+V KA PVWYNM QG V++ VFS W NRN E GGE+VFGG+D H+KG+HTYVPV
Sbjct: 176 DIAVAKATPVWYNMAEQGHVSQKVFSLWLNRNPSSELGGEVVFGGLDWRHFKGDHTYVPV 235
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
T +GYWQ +GD+ I +TG CAGGC+AI DSGTS L+GPT I+ Q+NHAIGA GIVS
Sbjct: 236 TGRGYWQIQVGDIFIANNSTGLCAGGCSAIVDSGTSFLSGPTRIVAQINHAIGARGIVSL 295
Query: 324 ECKAVVSQYGEEIINMLLAKVSAKL 348
ECK VVS+Y I + +++ + ++
Sbjct: 296 ECKEVVSKYWNSIWDSMISGLRPEI 320
>gi|356545806|ref|XP_003541325.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 495
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 180/347 (51%), Positives = 236/347 (68%), Gaps = 19/347 (5%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNG-GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTS 61
M FK + +C + +T +G G+ R+ LK+R D+N+ +AR+
Sbjct: 1 MGFKHLLLVTSVCAWFVSLAVTTSSGDGVTRVSLKRRSLDINSLNSARI----------- 49
Query: 62 IRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
+G + AD V LKNY+DAQYFGEIGIG+PPQ+F V+FDTGSSNLWVPS+KC
Sbjct: 50 ------KGVVNHL-KADGVYLKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKC 102
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
SIACYFHSKYRS S+TY K G I YG G + GF S+D++++GD+++KDQ+F E
Sbjct: 103 VLSIACYFHSKYRSKLSNTYTKIGTPCKIPYGHGHVPGFISQDNLRVGDIIIKDQQFAEI 162
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T+E L FL FDGILGLGFQ SV + PVWYNM+ QGLV + +FS W N++ + G
Sbjct: 163 TKEGPLAFLAMHFDGILGLGFQNKSVRQVTPVWYNMIEQGLVTQKIFSLWLNQDPVAKLG 222
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GEIVFGG+D H+KGEHTYVP+TQK YWQ ++GD+ I TG C GGCAAI DSGTSL+
Sbjct: 223 GEIVFGGIDWRHFKGEHTYVPLTQKDYWQIEVGDIQIANNPTGLCEGGCAAIIDSGTSLI 282
Query: 302 AGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKL 348
AGPT I+TQ+NHAIGA G VS ECK ++ YG+ I +++ + ++
Sbjct: 283 AGPTKIVTQINHAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEI 329
>gi|356565563|ref|XP_003551009.1| PREDICTED: aspartic proteinase oryzasin-1-like [Glycine max]
Length = 494
Score = 366 bits (939), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 178/347 (51%), Positives = 236/347 (68%), Gaps = 19/347 (5%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNG-GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTS 61
M FK + +C +V +T +G GL R+ LK+R D+++ +A++
Sbjct: 1 MGFKYLLVVTCVCAWFGSLVVTTSSGDGLMRVSLKRRSLDISSLNSAKIKE--------- 51
Query: 62 IRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
+ AD V LKNY+DAQYFGEIGIG+PPQ+F V+FDTGSSNLWVPS+KC
Sbjct: 52 ---------VVNHLKADGVYLKNYLDAQYFGEIGIGSPPQSFRVVFDTGSSNLWVPSAKC 102
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
SIACYFHSKYRS S+TY K G I YG G I GF S+D++++GD+++KDQ+F E
Sbjct: 103 VLSIACYFHSKYRSKLSNTYTKIGTPCKIPYGRGHIPGFISQDNIRVGDIIIKDQQFAEI 162
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T+E L FL FDGILGLGFQ SVG+ PVWYNM+ QG V++ +FS W N++ + G
Sbjct: 163 TKEGPLAFLAMHFDGILGLGFQNKSVGQVTPVWYNMIEQGHVSQKIFSLWLNQDPVAKVG 222
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GEIVFGG+D H+KG+HTYVP+TQK YWQ ++GD++I TG C GGCAAI DSGTSL+
Sbjct: 223 GEIVFGGIDWRHFKGDHTYVPLTQKDYWQIEVGDILIANNPTGLCEGGCAAIIDSGTSLI 282
Query: 302 AGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKL 348
AGPT I+TQ+N AIGA G VS ECK ++ YG+ I +++ + ++
Sbjct: 283 AGPTKIVTQINRAIGAEGYVSYECKNIIHNYGDSIWEYIISGLKPEI 329
>gi|147780252|emb|CAN65745.1| hypothetical protein VITISV_037763 [Vitis vinifera]
Length = 504
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 191/344 (55%), Positives = 238/344 (69%), Gaps = 34/344 (9%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
M M + A F L L FP++ ++ +G L RIGLKK + D N AAR+ + +S
Sbjct: 1 MKMRQGYLWAAFCLWALTFPLLQASSDG-LVRIGLKKWRLDYNRIRAARMARRA-KSIGG 58
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++ S+ LG+S D + V L+NYMDAQY+GEIGIGTPPQNFTV+FDTGS+NLWVPS+K
Sbjct: 59 VVK--SMYQGLGDS-DGESVLLRNYMDAQYYGEIGIGTPPQNFTVVFDTGSANLWVPSTK 115
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C+FSIAC FHSKY S S+T S H+ FIE
Sbjct: 116 CHFSIACLFHSKYNSRLSTT------STKCHFSV-----------------------FIE 146
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
ATRE SL F+L KFDGILGLGF+EI VG A PVWYN++ QGLV E +FSFW NR+ +
Sbjct: 147 ATREASLVFVLGKFDGILGLGFEEIVVGNATPVWYNLLRQGLVQEDIFSFWLNRDPQATD 206
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGEIVFGG+D H+KG+HTY +TQKGYWQF+MG+ +I Q+TGFC GCAAI DSGTSL
Sbjct: 207 GGEIVFGGVDKRHFKGQHTYASITQKGYWQFEMGEFLIGYQSTGFCEAGCAAIVDSGTSL 266
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
+AGPT I+T++NHAIGA GIVSQECK VVSQYG I ++L+++V
Sbjct: 267 IAGPTAIVTEINHAIGAEGIVSQECKEVVSQYGNMIWDLLISRV 310
>gi|117662285|gb|ABK55693.1| aspartic proteinase [Cucumis sativus]
Length = 196
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 167/196 (85%), Positives = 185/196 (94%)
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC HSKY+S RSSTYKKNGKSA I YGTGAISG FSED+VK+GDL+VK Q+FIEATREP
Sbjct: 1 ACLLHSKYKSKRSSTYKKNGKSASIKYGTGAISGCFSEDNVKVGDLIVKKQDFIEATREP 60
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
SLTF+LA+FDGILGLGF+EISVG AVPVWYNMV+Q LV EPVFSFWFNRNADEE+GGEIV
Sbjct: 61 SLTFVLAQFDGILGLGFKEISVGDAVPVWYNMVDQNLVKEPVFSFWFNRNADEEQGGEIV 120
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG+DPDHYKGEHTYVPVT+KGYWQFDMGDV+I+G TTGFC+GGC+AIADSGTSLLAGPT
Sbjct: 121 FGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAIADSGTSLLAGPT 180
Query: 306 TIITQVNHAIGATGIV 321
TIITQVNHAIGA+G+V
Sbjct: 181 TIITQVNHAIGASGVV 196
>gi|33347413|gb|AAQ15289.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 167/192 (86%), Positives = 180/192 (93%)
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNGK A I YGTGAISGFFSEDHV +GDLVVKDQEFIEAT+EP +TFL AKFDGILGL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLAAKFDGILGL 64
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GFQEISVG AVPVWYNMVNQGL+ EPVFSFWFNRNADEEEGGEIVFGG+DP+HYKG+HTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSFWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVTQKGYWQFDMGDVMIDGQTTGFCA GC+AIADSGTSLL GPTTIIT++NHAIGA+GI
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 321 VSQECKAVVSQY 332
VSQECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|33347411|gb|AAQ15288.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 356 bits (914), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 166/192 (86%), Positives = 180/192 (93%)
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNGK A I YGTGAISGFFSEDHV +GDLVVKDQEFIEAT+EP +TFL+AKFDGILGL
Sbjct: 5 YNKNGKPAAIQYGTGAISGFFSEDHVTVGDLVVKDQEFIEATKEPGITFLVAKFDGILGL 64
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GFQEISVG AVPVWYNMVNQGL+ EPVFS WFNRNADEEEGGEIVFGG+DP+HYKG+HTY
Sbjct: 65 GFQEISVGNAVPVWYNMVNQGLLKEPVFSLWFNRNADEEEGGEIVFGGVDPNHYKGKHTY 124
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVTQKGYWQFDMGDVMIDGQTTGFCA GC+AIADSGTSLL GPTTIIT++NHAIGA+GI
Sbjct: 125 VPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAIADSGTSLLVGPTTIITELNHAIGASGI 184
Query: 321 VSQECKAVVSQY 332
VSQECK VV++Y
Sbjct: 185 VSQECKTVVAEY 196
>gi|413946558|gb|AFW79207.1| hypothetical protein ZEAMMB73_486493 [Zea mays]
Length = 382
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 156/208 (75%), Positives = 186/208 (89%)
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
G+ + +GK A I YGTG+I+GFFSED V +GDLVVKDQEFIEAT+EP LTF++AKFD
Sbjct: 3 GKKTKTYMSGKPAAIRYGTGSIAGFFSEDSVTLGDLVVKDQEFIEATKEPGLTFMVAKFD 62
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILGLGFQEISVG A PVWYNMV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYK
Sbjct: 63 GILGLGFQEISVGNATPVWYNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYK 122
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+HT+VPVT+KGYWQF+MGDV++DG++TGFCAGGCAAIADSGTSLLAGPT IIT++N I
Sbjct: 123 GDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKI 182
Query: 316 GATGIVSQECKAVVSQYGEEIINMLLAK 343
GA G+VSQECK VVSQYG++I+++LLA+
Sbjct: 183 GAAGVVSQECKTVVSQYGQQILDLLLAE 210
>gi|307103455|gb|EFN51715.1| hypothetical protein CHLNCDRAFT_59800 [Chlorella variabilis]
Length = 523
Score = 340 bits (871), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 156/266 (58%), Positives = 202/266 (75%), Gaps = 2/266 (0%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYR 134
DA+ V + N+MDAQY+GEIG+G+PPQ+F VIFDTGSSNLWVPSSKC Y S+ACY HSKY
Sbjct: 57 DAEPVPITNFMDAQYYGEIGLGSPPQSFQVIFDTGSSNLWVPSSKCSYLSVACYLHSKYY 116
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+ RS TYK++G+ I YG+G +SGF S+D + +G L V+ Q F EAT EPSL F+ A+F
Sbjct: 117 AERSHTYKEDGREFAIQYGSGQLSGFLSQDTLSMGGLKVEGQVFAEATMEPSLAFIAARF 176
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+GF EI+VGK P + NM+ Q L+ EPVFSFW NR + EEGGE+V GG+DPDH+
Sbjct: 177 DGILGMGFPEIAVGKVTPPFQNMLQQSLLPEPVFSFWLNRKVEGEEGGELVLGGVDPDHF 236
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
GEHT+VPVT++G+WQF M + ++G FC GGC AIAD+GTSLL GP +I +N A
Sbjct: 237 VGEHTWVPVTRRGFWQFKMDGMEVEGGGE-FCKGGCQAIADTGTSLLVGPPDVIDAINAA 295
Query: 315 IGATGIVSQECKAVVSQYGEEIINML 340
IGA ++ ++CK +V QY EII ++
Sbjct: 296 IGAEPVLVEQCKEMVHQYLPEIIKLI 321
>gi|4389326|pdb|1B5F|A Chain A, Native Cardosin A From Cynara Cardunculus L.
gi|6729875|pdb|1B5F|C Chain C, Native Cardosin A From Cynara Cardunculus L
Length = 239
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 166/239 (69%), Positives = 188/239 (78%), Gaps = 4/239 (1%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
+VAL N D YFGEIGIGTPPQ FTVIFDTGSS LWVPSSKC S AC HS Y S S
Sbjct: 4 VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYESSDS 63
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STYK+NG I YGTG+I+GFFS+D V IGDLVVK+Q+FIEAT E FL FDGIL
Sbjct: 64 STYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFDGIL 123
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GL FQ ISV PVWYNM+NQGLV E FSFW NRN DEEEGGE+VFGG+DP+H++G+H
Sbjct: 124 GLSFQTISV----PVWYNMLNQGLVKERRFSFWLNRNVDEEEGGELVFGGLDPNHFRGDH 179
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TYVPVT + YWQF +GDV+I ++TGFCA GC A ADSGTSLL+GPT I+TQ+NHAIGA
Sbjct: 180 TYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIGA 238
>gi|339460405|gb|AEJ76922.1| aspartic protease [Dimocarpus longan]
Length = 222
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 162/223 (72%), Positives = 185/223 (82%), Gaps = 1/223 (0%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRT 60
MG F++ FL LLL P FS P GL RIGLKK+K D +RV+ +++S EGE+ R
Sbjct: 1 MGSKFRAFWVALFLSLLLSPTAFSAPKDGLVRIGLKKKKLDQISRVSGQINSNEGEAIRA 60
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
I+KY+LR NLG+S D DIV+LKNYMDAQYFGE+GIGTP Q FTVIFDTGSSNLWVPSSK
Sbjct: 61 PIKKYNLRSNLGDS-DTDIVSLKNYMDAQYFGEVGIGTPSQTFTVIFDTGSSNLWVPSSK 119
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
CYFS+ACYFHSKYRS +SSTYKKNG SA I YGTGA+SGFFS+D VK+GDL VK+Q+FIE
Sbjct: 120 CYFSVACYFHSKYRSTQSSTYKKNGTSAAIQYGTGAVSGFFSQDSVKVGDLFVKNQDFIE 179
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLV 223
AT+E S+TFL AKFDGILGLGFQEISVG AVPVW NMVNQGLV
Sbjct: 180 ATKEASITFLAAKFDGILGLGFQEISVGNAVPVWDNMVNQGLV 222
>gi|384245845|gb|EIE19337.1| putative aspartic protease [Coccomyxa subellipsoidea C-169]
Length = 508
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 167/336 (49%), Positives = 222/336 (66%), Gaps = 13/336 (3%)
Query: 10 AGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAAR--LDSKEGESFRTSIRKYSL 67
AGF L L + + R+ LKKR D A + L ++ + ++R
Sbjct: 9 AGFLSLLCLSIGLLAQAQQSPLRVPLKKRTLDAEQVRATQTALHARNVRNVANALR---- 64
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIA 126
GE +ADI L +++DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPSS+C YF +A
Sbjct: 65 ----GEPEEADI-PLLDFLDAQYYGEIGLGTPEQKFTVVFDTGSSNLWVPSSQCSYFDLA 119
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H+K+ + +S TY+ NG I YG+G++SGFFS D + +G L V++Q F EAT+EP
Sbjct: 120 CLLHNKFYASKSRTYQANGTDFAIQYGSGSLSGFFSTDVLSLGSLNVQNQTFAEATKEPG 179
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
L F+ AKFDGILGL F EIS+G+ P + NMV QGLV EPVFSFW NRN GGE+V
Sbjct: 180 LAFVAAKFDGILGLAFPEISIGEVTPPFQNMVQQGLVPEPVFSFWLNRNDPSGPGGELVL 239
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG+DP HY GEH +V VT++ YWQFD+G + + G T CA GC AIADSGTSL+ GP+
Sbjct: 240 GGVDPSHYTGEHLWVNVTRRAYWQFDLGGISVPG-TNSPCADGCQAIADSGTSLIVGPSD 298
Query: 307 IITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLA 342
I ++N AIGA G++ EC+ +V QY EI+ +++
Sbjct: 299 EIAEINRAIGAKGVLPAECRELVRQYVPEIMKAVIS 334
>gi|418731269|gb|AFX67029.1| aspartic protease, partial [Solanum tuberosum]
Length = 372
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 148/202 (73%), Positives = 176/202 (87%)
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
++G+S I YGTG+ISG FS D+V++GDLVVKDQ FIEATREPS+TF++AKFDGILGLGF
Sbjct: 2 RDGESCSIRYGTGSISGHFSMDNVQVGDLVVKDQVFIEATREPSITFIVAKFDGILGLGF 61
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
QEISVG PVWYNMV QGLV E VFSFWFNR+A+ +EGGE+VFGG+DP H+KG HTYVP
Sbjct: 62 QEISVGNTTPVWYNMVGQGLVKESVFSFWFNRDANAKEGGELVFGGVDPKHFKGNHTYVP 121
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
+TQKGYWQF+MGD +I +TG+CAGGCAAI DSGTSLLAGPTTI+TQ+NHAIGA GIVS
Sbjct: 122 LTQKGYWQFNMGDFLIGNTSTGYCAGGCAAIVDSGTSLLAGPTTIVTQINHAIGAEGIVS 181
Query: 323 QECKAVVSQYGEEIINMLLAKV 344
ECK +VSQYGE I ++L++ V
Sbjct: 182 MECKTIVSQYGEMIWDLLVSGV 203
>gi|303285091|ref|XP_003061836.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457166|gb|EEH54466.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/315 (52%), Positives = 209/315 (66%), Gaps = 14/315 (4%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADI-VALKNYMDA 88
L R+ L KR D V AR+ T + + R N DAD V++ NYMDA
Sbjct: 54 LPRVSLSKRVVDARA-VHARV-------VATRANEANARLNSMYGADADARVSITNYMDA 105
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKS 147
QYFG + IGTPPQ+F V+FDTGSSNLWVPSSKC F+ I C H KY + SST+ +NG
Sbjct: 106 QYFGAVSIGTPPQSFDVVFDTGSSNLWVPSSKCKFTQIPCDLHHKYDAKASSTHAQNGTD 165
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+G++SGF S D V G L + Q F EATREP L F+ AKFDGILG+G+ ISV
Sbjct: 166 FAIQYGSGSLSGFLSADVVGWGGLEIASQTFAEATREPGLAFMFAKFDGILGMGWDTISV 225
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQ 265
K VP +YN QGLV + VFSFW NR+ + GGE+V GG+DP HY GEH ++PVT+
Sbjct: 226 DKVVPPFYNAYAQGLVPDDVFSFWLNRDESHPDGPGGELVLGGVDPAHYVGEHAWLPVTR 285
Query: 266 KGYWQFDMGDVMIDGQTTGFC--AGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
+GYWQ M DV++DG + G C GCAAI D+GTSLLAGP +I ++N IGA I+++
Sbjct: 286 EGYWQVRMDDVIVDGASAGECDETDGCAAILDTGTSLLAGPKDVIEKINAKIGARPILNE 345
Query: 324 ECKAVVSQYGEEIIN 338
EC+ ++ QYGEE+I+
Sbjct: 346 ECRVMIEQYGEELID 360
>gi|33352213|emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 202/311 (64%), Gaps = 18/311 (5%)
Query: 58 FRTSIRKYSLRGNLGE-------------SGDADIVALKNYMDAQYFGEIGIGTPPQNFT 104
R ++RK + LG S D V LKN+MDAQY+GEIG+GTPPQ F
Sbjct: 30 LRVTLRKTEMLTTLGRPRPYLLGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFN 89
Query: 105 VIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSE 163
VIFDTGS+NLWVPSSKC F+IAC H KY + +S TYK NG I YGTG++ G+ S+
Sbjct: 90 VIFDTGSANLWVPSSKCALFNIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQ 149
Query: 164 DHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLV 223
D + G L +KDQ F EA EP LTF+ AKFDGILG+GF ISV P + +V +G +
Sbjct: 150 DVLTWGGLTIKDQGFAEAINEPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGL 209
Query: 224 NEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTT 283
PVFSFW NR+ + GGE+V GG+DP H+ GEHT+VPVT++GYWQF M + + +
Sbjct: 210 AAPVFSFWLNRDPNAPNGGELVLGGIDPTHFTGEHTWVPVTRQGYWQFTMEGLDLGPGSQ 269
Query: 284 GFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINML--- 340
CA GCAAIAD+GTSL+AGP+ + +NHAIGAT +S +C+ +V Y +II L
Sbjct: 270 KMCAKGCAAIADTGTSLIAGPSDEVAALNHAIGATSALSAQCRQLVRDYLPQIIAQLHDL 329
Query: 341 -LAKVSAKLSL 350
L +V A + L
Sbjct: 330 PLDQVCASIGL 340
>gi|302840660|ref|XP_002951885.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
gi|300262786|gb|EFJ46990.1| hypothetical protein VOLCADRAFT_81669 [Volvox carteri f.
nagariensis]
Length = 559
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 219/340 (64%), Gaps = 25/340 (7%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
L+ V+ S +G L+R+ LKK++ L + G R Y L LG G
Sbjct: 15 LVACTVLASGDSGALHRVQLKKKQLSL---------ATYGRP-----RPY-LNNMLGYGG 59
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYR 134
D V L N+MDAQY+GE+ +GTP Q F VIFDTGSSNLWVPSSKC +F+IAC H +Y
Sbjct: 60 D---VPLHNFMDAQYYGEVSLGTPQQYFQVIFDTGSSNLWVPSSKCSFFNIACRLHRRYY 116
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+ RS TYK NG + I YG+G++ GF SED + G L V +Q F EA EP LTF+ AKF
Sbjct: 117 AARSKTYKANGTAFSIQYGSGSLDGFISEDILGWGGLAVPEQGFAEAVNEPGLTFVAAKF 176
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+GF ISV VP + +V+ GL++EPVFSFW NR++ GGE+V GG+DP H+
Sbjct: 177 DGILGMGFPAISVSGVVPPFTRLVDSGLLSEPVFSFWLNRDSSAAVGGELVLGGVDPAHF 236
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
GEHT+V VT++GYWQF++ + + Q C GC AIAD+GTSL+AGP + +NHA
Sbjct: 237 TGEHTWVDVTRRGYWQFNLDGIHLGSQR--LCTQGCPAIADTGTSLIAGPVDEVAAINHA 294
Query: 315 IGATGIVSQECKAVVSQYGEEII----NMLLAKVSAKLSL 350
IGAT +S +C+ +V +Y EI+ N+ L +V A + L
Sbjct: 295 IGATSALSAQCRTLVREYLPEIVAALHNLPLDQVCASIGL 334
>gi|145352062|ref|XP_001420378.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580612|gb|ABO98671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 454
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 149/269 (55%), Positives = 192/269 (71%), Gaps = 8/269 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V + NYMDAQY+GEI IG P Q F V+FDTGSSNLWVPSSKC + I C H+K+ S S
Sbjct: 16 VDVHNYMDAQYYGEIEIGNPRQKFQVVFDTGSSNLWVPSSKCGFLQIPCDLHAKFDSRAS 75
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
TY+ +G I YG+G++SGF S+D VK+GDLVV+ Q F EAT+EP + FL +KFDGIL
Sbjct: 76 ETYEADGTPFAIQYGSGSLSGFLSKDEVKVGDLVVQGQYFAEATKEPGIAFLFSKFDGIL 135
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE-----EEGGEIVFGGMDPDH 253
GLGF I+V K PV+YNM+ QGLV +FSFW NR + + E GGE++FGG DPDH
Sbjct: 136 GLGFDNIAVDKVKPVFYNMMEQGLVENKMFSFWLNRTSTKDGMPSEVGGELIFGGSDPDH 195
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG--GCAAIADSGTSLLAGPTTIITQV 311
+ GEHTY PVT++GYWQ M D +DG++ G C G GC IAD+GTSLLAGPT I+ ++
Sbjct: 196 FIGEHTYAPVTREGYWQIKMDDFKVDGRSLGACDGDDGCQVIADTGTSLLAGPTEIVNKI 255
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINML 340
N IGA ++ +EC+ ++ QY E+ + L
Sbjct: 256 NDYIGAHSMIGEECRLLIDQYAEQFVEDL 284
>gi|291223847|ref|XP_002731917.1| PREDICTED: putative gut cathepsin D-like aspartic protease-like
[Saccoglossus kowalevskii]
Length = 389
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 164/310 (52%), Positives = 206/310 (66%), Gaps = 12/310 (3%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
LL ++F GL RI L K R R S G +I+ +L G+L +
Sbjct: 4 LLICLLFVGLACGLQRIHLHKF------RSVRRQLSDVG----VTIKDLALSGSLKYTQG 53
Query: 77 ADIV-ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYR 134
A I LKNY+DAQY+GEIG+GTP Q F V+FDTGSSNLWVPS KC + IAC FH KY
Sbjct: 54 APIPEVLKNYLDAQYYGEIGLGTPQQKFNVVFDTGSSNLWVPSKKCPITDIACLFHKKYD 113
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S +SSTYK NG +I YG+G++ GF SED + I D+V K Q F EAT+EP L F+ AKF
Sbjct: 114 STKSSTYKVNGTKFEIQYGSGSMEGFLSEDSIAISDVVAKSQTFAEATKEPGLAFVAAKF 173
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+G+ +ISV VPV NM+ Q L+ +PVFSF+ +RN ++ +GGE+ GG DP +Y
Sbjct: 174 DGILGMGYPQISVDGVVPVIDNMIQQQLIEKPVFSFYLDRNVNDSQGGELFLGGSDPKYY 233
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G TYVPVT+KGYWQF M + + G + FC GGC AIAD+GTSL+AGPT + +N A
Sbjct: 234 TGNFTYVPVTRKGYWQFKMDGITLGGSASQFCKGGCQAIADTGTSLIAGPTEEVQAINKA 293
Query: 315 IGATGIVSQE 324
IGAT IVS E
Sbjct: 294 IGATPIVSGE 303
>gi|336454164|gb|AEI58896.1| cathepsin D [Pinctada maxima]
Length = 390
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 216/319 (67%), Gaps = 17/319 (5%)
Query: 16 LLLFPVV-FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES 74
L+LF V+ F + L RI L K K R + G S + +KYS G G +
Sbjct: 4 LILFAVISFICYSDALVRIKLHKIK------SVRRTLQEVGTSIESLQQKYSGYGITGPA 57
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKY 133
+ L NY+DAQY+G IGIGTP QNF V+FDTGSSNLWVPS KC + IAC H+KY
Sbjct: 58 PEP----LSNYLDAQYYGVIGIGTPAQNFKVVFDTGSSNLWVPSKKCKVTDIACLLHNKY 113
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
S +SSTYKKNG +I YGTG+++GF S D V + + VK Q F EAT++P +TF+ AK
Sbjct: 114 DSSKSSTYKKNGTDFEIRYGTGSLTGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAK 173
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+ F++ISV VPV+YNMV QGLV +P+FSF+ +R+ EGGE++ GG D H
Sbjct: 174 FDGILGMAFEKISVDGVVPVFYNMVKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKH 233
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
YKG TY+PVT++GYWQF+M V + G + FC+GGC AIAD+GTSL+AGPT+ I+++N
Sbjct: 234 YKGNFTYLPVTRQGYWQFEMDGVSVGG-SAKFCSGGCNAIADTGTSLIAGPTSEISKLNK 292
Query: 314 AIGATGIVSQE----CKAV 328
AIGA +V+ E C A+
Sbjct: 293 AIGAKPLVAGEYTVDCNAI 311
>gi|218196057|gb|EEC78484.1| hypothetical protein OsI_18377 [Oryza sativa Indica Group]
Length = 389
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 143/216 (66%), Positives = 181/216 (83%)
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+ +S +SS+YK +G++ I YG+GAISGFFS+D+V +GDLVVK+Q+FIEATRE S+TF++
Sbjct: 12 QIQSKKSSSYKADGETCKITYGSGAISGFFSKDNVLVGDLVVKNQKFIEATRETSVTFII 71
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
KFDGILGLG+ EISVGKA P+W +M Q L+ + VFSFW NR+ D GGE+VFGGMDP
Sbjct: 72 GKFDGILGLGYPEISVGKAPPIWQSMQEQELLADDVFSFWLNRDPDASSGGELVFGGMDP 131
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HYKG+HTYVPV++KGYWQF+MGD++IDG +TGFCA GCAAI DSGTSLLAGPT I+ QV
Sbjct: 132 KHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAIVDSGTSLLAGPTAIVAQV 191
Query: 312 NHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAK 347
NHAIGA GI+S ECK VVS+YGE I+N+L+A+ +
Sbjct: 192 NHAIGAEGIISTECKEVVSEYGEMILNLLIAQTDPQ 227
>gi|329754204|gb|AEC03508.1| cathepsin-D [Polyrhachis vicina]
Length = 384
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 151/244 (61%), Positives = 179/244 (73%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQNF VIFDTGSSNLWVPS KC+F+ IAC H+KY + +SST
Sbjct: 56 LSNYLDAQYYGAISIGTPPQNFKVIFDTGSSNLWVPSKKCHFTNIACLLHNKYDTTKSST 115
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YKKNG IHYG+G++SG+ S D V IG L VKDQ F EA EP L F+ AKFDGILG+
Sbjct: 116 YKKNGTDFAIHYGSGSLSGYLSTDTVTIGGLKVKDQTFAEAMSEPGLAFVAAKFDGILGM 175
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+YNMV QGLV++PVFSF+ NR+ D +EGGE++ GG DP+HYKG+ TY
Sbjct: 176 AYTTISVDGVTPVFYNMVKQGLVSQPVFSFYLNRDPDAKEGGELILGGSDPNHYKGDFTY 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV +K YWQF M V I G C GC AIAD+GTSL+AGP I +N AIGAT I
Sbjct: 236 VPVDRKAYWQFKMDSVQI-GSDLKLCKQGCEAIADTGTSLIAGPVKEIEAINKAIGATPI 294
Query: 321 VSQE 324
V E
Sbjct: 295 VGGE 298
>gi|87241358|gb|ABD33216.1| Peptidase A1, pepsin [Medicago truncatula]
Length = 396
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 142/222 (63%), Positives = 178/222 (80%)
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
+ACY H+ Y++ +S TY KNG S I YGTG+ISG+FS+D+VK+G VVK Q+FIEAT
Sbjct: 6 LQLACYTHNWYKAKKSKTYNKNGTSCKISYGTGSISGYFSQDNVKVGSSVVKHQDFIEAT 65
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
RE SL+FL KFDGI GLGFQEISV +A+PVWYNM+ Q L+ E VFSFW N N + ++GG
Sbjct: 66 REGSLSFLAGKFDGIFGLGFQEISVERALPVWYNMLEQNLIGEKVFSFWLNGNPNAKKGG 125
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
E+VFGG+DP H+KG+HTYVPVT+KGYWQ +MGD I G +TG C GGCAAI DSGTSLLA
Sbjct: 126 ELVFGGVDPKHFKGKHTYVPVTEKGYWQIEMGDFFIGGLSTGVCEGGCAAIVDSGTSLLA 185
Query: 303 GPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKV 344
GPT ++ ++NHAIGA G++S ECK VVSQYGE I ++L++ V
Sbjct: 186 GPTPVVAEINHAIGAEGVLSVECKEVVSQYGELIWDLLVSGV 227
>gi|218944225|gb|ACL13150.1| cathepsin D [Azumapecten farreri]
Length = 396
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 155/307 (50%), Positives = 198/307 (64%), Gaps = 9/307 (2%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYS-LRGNLG 72
LC+ V + + L+RI L + K R + G S +Y+ L G
Sbjct: 4 LCIFALLAVIAC-SSALHRIKLHRVK------TVRRSLQEVGTSINLLKNRYTGLSDRNG 56
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHS 131
D L NY+DAQY+G I IGTP Q F V+FDTGSSNLWVPS KC S IAC H+
Sbjct: 57 RLLGPDPEPLSNYLDAQYYGAIQIGTPAQEFKVVFDTGSSNLWVPSKKCKLSDIACLLHN 116
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY S +SSTYK+NG +I YGTG+++GF S D V IGD+ VK Q F EA +P +TF+
Sbjct: 117 KYDSTKSSTYKQNGTHFEIRYGTGSLTGFLSTDSVTIGDITVKGQTFAEAITQPGITFVA 176
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+G+ ISV VPV+YNMV Q LV+ PVFSF+ +R+ D GGE++ GG DP
Sbjct: 177 AKFDGILGMGYDTISVDHVVPVFYNMVQQKLVDSPVFSFYLDRDPDASAGGELIIGGSDP 236
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY G +Y P+T+KGYWQFDM + + G+ + +C GGC+AIAD+GTSLL GPT + Q+
Sbjct: 237 KHYSGNFSYAPITKKGYWQFDMAGIQVGGKASAYCNGGCSAIADTGTSLLVGPTAEVQQL 296
Query: 312 NHAIGAT 318
N IGAT
Sbjct: 297 NKQIGAT 303
>gi|440803835|gb|ELR24718.1| aspartic proteinase, partial [Acanthamoeba castellanii str. Neff]
Length = 489
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 147/269 (54%), Positives = 186/269 (69%), Gaps = 2/269 (0%)
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
G + + N++DAQY+GEI IG PPQ F V+ DTGSSNLWVPS +C ++ IAC H KY
Sbjct: 77 GGKGVEPISNFLDAQYYGEISIGNPPQYFNVVLDTGSSNLWVPSIQCPWYEIACDLHHKY 136
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
+SSTYK NG + I YG+GA+SGF S D+V I L K Q F EA EP L F+ A+
Sbjct: 137 DHSKSSTYKANGTNFQIQYGSGAMSGFLSADNVVIAGLTAKGQLFAEAVAEPGLAFVAAQ 196
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGLGF ISV PVWY ++ Q V EPVF+FW NR+ GGE+V GG+D H
Sbjct: 197 FDGILGLGFDTISVDGVPPVWYTLLAQSQVAEPVFAFWLNRDPSGISGGELVLGGVDESH 256
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC-AGGCAAIADSGTSLLAGPTTIITQVN 312
Y G+ TY P+T++GYWQF D +I+G++ GFC AGGC AIAD+GTSLLAGP+ I+ Q+N
Sbjct: 257 YTGDFTYTPITKEGYWQFLAHDFLINGKSMGFCPAGGCKAIADTGTSLLAGPSKIVAQIN 316
Query: 313 HAIGATGIVSQECKAVVSQYGEEIINMLL 341
I ATGI+ EC +V+QY +II +L
Sbjct: 317 KMINATGILESECDMLVNQYAGQIIQYIL 345
>gi|380746491|gb|AFE48185.1| cathepsin D [Pinctada margaritifera]
Length = 390
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 160/311 (51%), Positives = 208/311 (66%), Gaps = 13/311 (4%)
Query: 16 LLLFPVV-FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES 74
L+LF V+ F + L RI L K K R + G S + +KYS G G +
Sbjct: 4 LILFAVISFICYSDALVRIKLHKIK------SVRRTLQEVGTSIESLQQKYSGYGITGPA 57
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKY 133
+ L NY+DAQY+G IGIGTP QNF V+FDTGSSNLWVPS KC FS IAC H+KY
Sbjct: 58 PEP----LSNYLDAQYYGVIGIGTPAQNFKVVFDTGSSNLWVPSKKCKFSDIACLLHNKY 113
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
S +SSTYKKN + +I YGTG+++GF S D V + + VK Q F EAT++P +TF+ AK
Sbjct: 114 DSSKSSTYKKNDTTFEIRYGTGSLTGFLSTDTVTVAGIAVKGQTFAEATQQPGITFVAAK 173
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+ F +ISV VPV+YNM+ QGLV +P+FSF+ +R+ EGGE++ GG D H
Sbjct: 174 FDGILGMAFDKISVDGVVPVFYNMIKQGLVPQPIFSFYLDRDPSASEGGELILGGSDTKH 233
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
YKG TY+PVT++GYW+F M V + G+ FC GGC IAD+GTSL+AGP++ + ++N
Sbjct: 234 YKGNFTYLPVTRQGYWEFKMDGVSV-GENHKFCTGGCNTIADTGTSLIAGPSSEVKKLNA 292
Query: 314 AIGATGIVSQE 324
AIGAT I E
Sbjct: 293 AIGATAIPGGE 303
>gi|405951067|gb|EKC19012.1| Lysosomal aspartic protease [Crassostrea gigas]
Length = 439
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 163/321 (50%), Positives = 204/321 (63%), Gaps = 14/321 (4%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
+ +LLF + + L RI L K + + R G + RKY N E
Sbjct: 7 VSILLFVAAYVS---ALQRIKLHKIDKTVRETLLER-----GTTAEYLKRKY----NRYE 54
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSK 132
+G + L NYMDAQY+G I IGTPPQNF VIFDTGSSNLWVPS KC S IAC H+K
Sbjct: 55 TG-PEPEPLSNYMDAQYYGPISIGTPPQNFKVIFDTGSSNLWVPSKKCKLSDIACLLHNK 113
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S +SSTYK NG +I YGTG++ GF S D V +GD+ VKDQ F EAT +P +TF+ A
Sbjct: 114 YDSTKSSTYKANGTDFEIRYGTGSLKGFLSTDTVTVGDIKVKDQTFAEATEQPGITFVAA 173
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+GF EISV PV+ NMV Q LV P+FSF+ +RN GGE++ GG DP
Sbjct: 174 KFDGILGMGFPEISVKGVTPVFNNMVAQKLVPAPIFSFYLDRNPTGTPGGEMILGGSDPK 233
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+Y G TYV VT+KGYWQF M V ++G+ + +C+GGC AIAD+GTSLLAGP+T + +N
Sbjct: 234 YYSGNFTYVNVTRKGYWQFKMDGVKVNGKASKYCSGGCNAIADTGTSLLAGPSTEVKSLN 293
Query: 313 HAIGATGIVSQECKAVVSQYG 333
IGA + E S+ G
Sbjct: 294 AMIGAKPFAAGEYTVDCSKIG 314
>gi|332024025|gb|EGI64243.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 381
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 150/284 (52%), Positives = 190/284 (66%), Gaps = 9/284 (3%)
Query: 38 RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIG 97
++ L+ + R KE + T +R ++ N L NY+DAQY+G I IG
Sbjct: 20 QRIPLHKTDSIRKALKEVGTDLTQVRTFTTTDNYTPE------PLSNYLDAQYYGVISIG 73
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
TPPQNF VIFDTGSSNLWVPS KC+ + IAC H+KY S +S+TYKKNG I YG+G+
Sbjct: 74 TPPQNFKVIFDTGSSNLWVPSKKCHITNIACLLHNKYTSEKSTTYKKNGTIFAIRYGSGS 133
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
+SGF SED V + L V+ Q F EA EP + F+ AKFDGILG+G+ ISV PV+YN
Sbjct: 134 LSGFLSEDVVTVAGLAVQHQTFAEAISEPGIAFVAAKFDGILGMGYSTISVDGVTPVFYN 193
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
MV Q LV++ VFSF+ NR++ EGGE++ GG DPDHY+GE TY+PVT+KGYWQF M V
Sbjct: 194 MVKQNLVSQAVFSFYLNRDSSAAEGGEMILGGSDPDHYEGEFTYIPVTRKGYWQFKMDGV 253
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ FC GC AIAD+GTSL+AGPT+ I +N IGAT I
Sbjct: 254 QVKDH--AFCKEGCQAIADTGTSLIAGPTSEIKDINEMIGATSI 295
>gi|332376487|gb|AEE63383.1| unknown [Dendroctonus ponderosae]
Length = 388
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/266 (56%), Positives = 188/266 (70%), Gaps = 5/266 (1%)
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
T +++ LR G + L NY+DAQYFG I IGTPPQ F V+FDTGSSNLWVPS
Sbjct: 41 THVQQVRLRYTSGAEPVPE--PLSNYLDAQYFGAISIGTPPQKFVVVFDTGSSNLWVPSK 98
Query: 120 KCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
KC F+ IAC H+KY S +SSTYK+NG I YG+G++SGF S D V + D+ VK Q F
Sbjct: 99 KCSFTNIACLLHNKYDSSKSSTYKENGTEFAIRYGSGSLSGFLSTDVVGVSDINVKGQTF 158
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
EA EP L F+ AKFDGILGL + ISV VP++YNMVNQG+V++ VFSF+ NRN D
Sbjct: 159 AEALSEPGLAFVAAKFDGILGLAYSRISVDGVVPLFYNMVNQGIVSQAVFSFYLNRNPDG 218
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
+ GGE++FGG DP++Y G TY+PV ++ YWQF M +V++ GQ T FC GGC AIAD+GT
Sbjct: 219 KVGGELIFGGSDPNYYSGNFTYLPVDRQAYWQFKMDEVIV-GQKT-FCKGGCEAIADTGT 276
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQE 324
SL+AGP + +N AIGAT +V E
Sbjct: 277 SLIAGPVDEVKALNEAIGATPLVGGE 302
>gi|312861579|gb|ADR10277.1| cathepsin D [Branchiostoma belcheri]
Length = 395
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 161/320 (50%), Positives = 205/320 (64%), Gaps = 12/320 (3%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
FL +L VVF++ L+RI L K K R + G ++ + K G
Sbjct: 3 FLSVLFALVVFAS---ALHRIPLTKMK------TVRRQLADVGITYDQVLDK-DYSGKYY 52
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHS 131
DA L NY+DAQY+G I IGTP QNF V+FDTGSSNLWVPS KC S IAC H+
Sbjct: 53 NIKDAP-EPLTNYLDAQYYGPISIGTPAQNFQVVFDTGSSNLWVPSKKCKLSDIACLLHN 111
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY S +SSTY KNG I YG+G+++GF SED V IG L V++Q F EA +P +TF+
Sbjct: 112 KYDSTQSSTYMKNGTDFAIRYGSGSLTGFLSEDTVTIGGLKVQNQTFAEAVTQPGITFVA 171
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+G+ ISV VP +YNMV Q LV++PVFSF+ NR+ GE++ GG DP
Sbjct: 172 AKFDGILGMGYDTISVDGVVPPFYNMVQQKLVDKPVFSFYLNRDPSSTTRGELLLGGTDP 231
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y G+ T++ VT+ GYWQF M +MI+G+ T +C GGCAAIAD+GTSL+AGPTT + +
Sbjct: 232 KYYTGDFTFLDVTKPGYWQFKMDGIMINGKATDYCKGGCAAIADTGTSLIAGPTTEVQAL 291
Query: 312 NHAIGATGIVSQECKAVVSQ 331
N IGAT I E SQ
Sbjct: 292 NKQIGATPIPGGEYMVDCSQ 311
>gi|350411706|ref|XP_003489428.1| PREDICTED: lysosomal aspartic protease-like [Bombus impatiens]
Length = 386
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 202/320 (63%), Gaps = 11/320 (3%)
Query: 42 LNNRVAARLDSKEGESFRTSIRKYSL---RGNLGESGDADIVALKNYMDAQYFGEIGIGT 98
L N R+ + +S R ++Y+ + ++ + G L NY+DAQY+G I IGT
Sbjct: 14 LANADLQRITLHKMDSVRKQFKEYNTEVYQAHMVQGGFPQPEPLSNYLDAQYYGVISIGT 73
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
P Q+F VIFDTGSSNLWVPS KC+ + IAC H KY + +SSTYKKNG I YG+G++
Sbjct: 74 PSQDFKVIFDTGSSNLWVPSQKCHLTNIACKLHHKYDNTKSSTYKKNGTDFAIRYGSGSL 133
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
SG+ S D V I L V DQ F EA EP + F+ AKFDGILG+ + I+V PV+YNM
Sbjct: 134 SGYLSTDVVNIAGLKVSDQTFAEALSEPGMAFVAAKFDGILGMAYSRIAVDGVTPVFYNM 193
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
V QGLV +PVFSF+ NRN D++ GGE++ GG DP+HY+G TYVPV +KGYWQF M +
Sbjct: 194 VKQGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGIK 253
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYG 333
+ Q C GC AIAD+GTSL+AGP + +N AIGAT I + E C ++ +
Sbjct: 254 VGSQHLAICEKGCEAIADTGTSLIAGPVKEVEAINSAIGATNIAAGEAMVDCNSIPNL-- 311
Query: 334 EEIINMLLAKVSAKLSLRHY 353
IN +L S L+ + Y
Sbjct: 312 -PTINFVLGGRSFPLTGKDY 330
>gi|255085919|ref|XP_002508926.1| predicted protein [Micromonas sp. RCC299]
gi|226524204|gb|ACO70184.1| predicted protein [Micromonas sp. RCC299]
Length = 557
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 166/357 (46%), Positives = 215/357 (60%), Gaps = 21/357 (5%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTP----NGGLYRIGLKKRKFDLNNRVAARLDSKEGE 56
M + +SI A F +C L + L R + KR V +D
Sbjct: 1 MVPILRSIVALFLVCALCLAAAPGASALVESSHLPRAKVHKRALGPPETVKKCVDVAR-- 58
Query: 57 SFRTSIRKYSLRGNLGESGDADIVAL--------KNYMDAQYFGEIGIGTPPQNFTVIFD 108
R ++S R + D D L NYMDAQY+G + IGTPPQ+F V+FD
Sbjct: 59 --RARYERFSARLHDEPHRDPDGPTLAGGTPECISNYMDAQYYGAVSIGTPPQSFLVVFD 116
Query: 109 TGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVK 167
TGSSNLW+PS+KC F I C H KYRSG SSTYK G I YG+G++SGF S+D V
Sbjct: 117 TGSSNLWIPSAKCSFLQIPCDLHQKYRSGDSSTYKALGDPFAIQYGSGSLSGFLSQDTVT 176
Query: 168 IGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPV 227
L +KDQ F EAT+EP + FL +KFDGILG+G+ ISV P +YN V+QGLV E V
Sbjct: 177 WAGLEIKDQVFAEATKEPGIAFLFSKFDGILGMGWDTISVNGVKPPFYNAVDQGLVVENV 236
Query: 228 FSFWFNR---NADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTG 284
FSFW NR + EGGEIV GG+DP H+ GEHT++ VT++GYWQ M DV++ G + G
Sbjct: 237 FSFWLNRDADEGGDGEGGEIVLGGVDPAHFVGEHTWLNVTREGYWQIAMDDVLLGGVSVG 296
Query: 285 FCA-GGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINML 340
C GCAAI D+GTSLLAGPT ++ +N IGA ++ +EC+ ++ QYG+E+I L
Sbjct: 297 QCGKKGCAAIVDTGTSLLAGPTKVVEALNKRIGAKSVLGEECRVMIDQYGDELIRDL 353
>gi|257228998|gb|ACV53024.1| cathepsin D2 [Homarus americanus]
Length = 385
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/310 (49%), Positives = 196/310 (63%), Gaps = 16/310 (5%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSI-RKYSLRGNLGESG 75
L+F V+ + L RI L K K + R +E ++ T RK+ RG + E
Sbjct: 4 LIFLVIVALATAELPRIPLHKIK-------SVRRTLQEVDTAVTRAHRKWGNRGPMPE-- 54
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYR 134
L NYMDAQY+G I IGTPPQ+F V+FDTGSSNLWVPS +C+++ IAC H+KY
Sbjct: 55 -----PLSNYMDAQYYGPISIGTPPQSFRVVFDTGSSNLWVPSKQCHYTNIACMIHNKYD 109
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+ +SSTYKKNG I YG+G++SG+ S D V +G L V+ Q F EA EP L F+ AKF
Sbjct: 110 ARKSSTYKKNGTDFAIQYGSGSLSGYLSTDTVAVGSLAVRQQTFAEALSEPGLAFVAAKF 169
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+GF I+V PV+YNMV Q L+ PVFSF+ NR+ EGGE++ GG DP++Y
Sbjct: 170 DGILGMGFDNIAVDGVTPVFYNMVKQSLIPAPVFSFYLNRDPSSPEGGELILGGSDPNYY 229
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G TY+PV +KGYWQ M + ++G FC GGC AIAD+GTSL+A P +N
Sbjct: 230 SGNFTYIPVDRKGYWQIKMDGIQMNGARVPFCEGGCEAIADTGTSLIAAPVEEARSINKK 289
Query: 315 IGATGIVSQE 324
IGA I S E
Sbjct: 290 IGAKPIASGE 299
>gi|336454162|gb|AEI58895.1| cathepsin D [Pteria penguin]
Length = 392
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 204/314 (64%), Gaps = 14/314 (4%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
++ LL V + + L RI L K K R + G S KY++
Sbjct: 4 LWIVALLCAVAYCS---ALQRIKLHKFK------SVRRTLQEVGTSIEALQNKYNVYKVE 54
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
G + + L NYMDAQY+G+I IGTP Q+F VIFDTGSSNLWVPS KC S IAC H
Sbjct: 55 GPAPEP----LSNYMDAQYYGDITIGTPGQSFKVIFDTGSSNLWVPSKKCKLSDIACLLH 110
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+KY S +SSTYK NG +I YGTG+++GF S D V + + VK Q F EAT++P +TF+
Sbjct: 111 NKYDSSKSSTYKANGTDFEIRYGTGSLTGFLSTDTVTVAGIAVKGQTFAEATQQPGITFV 170
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+G+Q ISV VPV+YNMV Q LV VFSF+ NR+ +GGE++ GG D
Sbjct: 171 AAKFDGILGMGYQTISVDGVVPVFYNMVKQNLVPASVFSFYLNRDPGASDGGELILGGSD 230
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+YKG TY+PVT++GYW+F M +M++G+ + +C+GGC AIAD+GTSLLAGP T +
Sbjct: 231 SKYYKGNFTYLPVTKQGYWRFKMDGIMMNGKASKYCSGGCKAIADTGTSLLAGPKTEVDA 290
Query: 311 VNHAIGATGIVSQE 324
+N IGAT + + E
Sbjct: 291 LNKQIGATPLAAGE 304
>gi|66560290|ref|XP_392857.2| PREDICTED: lysosomal aspartic protease [Apis mellifera]
Length = 385
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 198/314 (63%), Gaps = 16/314 (5%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F +L F + N + RI L K + + + E ++T I +G+L
Sbjct: 1 MFHAILCFCAFIAIANADVTRIPLHKI-----DSIRKQFKEYNTEIYQTHI----FQGDL 51
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
+ L NY+DAQY+G I IGTPPQ+F VIFDTGSSNLWVPS KC+ + IAC H
Sbjct: 52 PQPE-----PLSNYLDAQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACKLH 106
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY + +SSTYKKNG I YG+G++SG+ S D V I + + DQ F EA EP L F+
Sbjct: 107 RKYDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDTVDIAGMKISDQTFAEALSEPGLAFV 166
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + +ISV PV+YNMV QGLV +PVFSF+ NRN D++ GGE++ GG D
Sbjct: 167 AAKFDGILGMAYSKISVDGVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKYGGELILGGSD 226
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P+HY+G TYVPV +KGYWQF M + I G C GC AIAD+GTSL+AGP I
Sbjct: 227 PNHYEGSFTYVPVDKKGYWQFRMDSIQI-GSDLKVCQQGCEAIADTGTSLIAGPVKEIEA 285
Query: 311 VNHAIGATGIVSQE 324
+N AIGAT I + E
Sbjct: 286 INKAIGATPIAAGE 299
>gi|224548868|dbj|BAH24176.1| aspartic proteinase [Sitophilus zeamais]
Length = 389
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 147/291 (50%), Positives = 195/291 (67%), Gaps = 11/291 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGE----SGDADIVALKNYMDAQYFGEIGIGTPPQNFT 104
R+ +G+S R ++R + S D L NY+DAQY+G I IGTPPQNF
Sbjct: 23 RVSLTKGKSVRNTLRDVGTHVQQVKLRYVSVDPSPEPLTNYLDAQYYGPISIGTPPQNFN 82
Query: 105 VIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSE 163
VIFDTGSSNLWVPS KC +IAC H+KY + +SSTYK+NG I YG+G++SG+ S
Sbjct: 83 VIFDTGSSNLWVPSKKCELLNIACLLHNKYDATKSSTYKENGTEFAITYGSGSLSGYLST 142
Query: 164 DHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLV 223
D + +G + VKDQ F EA +EP LTF+ AKFDGILG+ + ISV PV+YNM++Q LV
Sbjct: 143 DSLSVGSVQVKDQTFGEAIKEPGLTFIAAKFDGILGMAYPRISVDGVTPVFYNMIDQNLV 202
Query: 224 NEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTT 283
P+FSF+ NR+ + + GGEI+ GG DP++Y+G+ TY+PV ++ YWQF M V + Q+
Sbjct: 203 AAPIFSFYLNRDPNAQTGGEIILGGSDPNYYEGDFTYLPVDRQAYWQFKMDSVQVADQS- 261
Query: 284 GFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVS 330
C GGC AIAD+GTSL+AGPT I +N AIGA+ IV E C ++ S
Sbjct: 262 -LCKGGCEAIADTGTSLIAGPTEEIAALNKAIGASAIVGGEYIVDCNSISS 311
>gi|146217392|gb|ABQ10738.1| cathepsin D [Penaeus monodon]
Length = 386
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 146/291 (50%), Positives = 192/291 (65%), Gaps = 12/291 (4%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEI 94
+K KF R +D+ + + RK+ +G + E L NYMDAQY+G I
Sbjct: 21 IKLHKFKSARRSLQEVDT----AVKVVHRKWGNKGPMPE-------PLSNYMDAQYYGPI 69
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYG 153
IGTPPQ+F V+FDTGSSNLWVPS +C+F+ IAC H+KY + +SSTYKKNG DI YG
Sbjct: 70 TIGTPPQSFRVVFDTGSSNLWVPSKQCHFTNIACLIHNKYDATKSSTYKKNGTKFDIQYG 129
Query: 154 TGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPV 213
+G++SG+ S D V +G + VKDQ F EA EP L F+ AKFDGILG+ + I+V PV
Sbjct: 130 SGSLSGYLSTDTVSVGSVSVKDQTFAEAMSEPGLAFVAAKFDGILGMAYDRIAVDGVTPV 189
Query: 214 WYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM 273
+YNMVNQ +V P+FSF+ NR+ EGGE++ GG DP +Y G+ TYVPV ++GYWQF M
Sbjct: 190 FYNMVNQNVVPAPIFSFYLNRDPAAAEGGELILGGSDPAYYTGDFTYVPVDRQGYWQFKM 249
Query: 274 GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+ ++G T FC GGC AIAD+GTSL+A P+ +N IGA I+ E
Sbjct: 250 DGLQMNGTTVPFCDGGCEAIADTGTSLIAAPSEEARLINKKIGAKPIMGGE 300
>gi|380018765|ref|XP_003693293.1| PREDICTED: lysosomal aspartic protease-like [Apis florea]
Length = 385
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 154/312 (49%), Positives = 198/312 (63%), Gaps = 19/312 (6%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
LCL F + N + RI L K + + + E ++T I L+G+ +
Sbjct: 6 LCLCAF---IAIANADITRIPLHKI-----DSIRKQFKEYNTEIYQTHI----LQGDFPQ 53
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSK 132
L NY+DAQY+G I IGTPPQ+F VIFDTGSSNLWVPS KC+ + IAC H K
Sbjct: 54 PE-----PLSNYLDAQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACKLHRK 108
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y + +SSTYKKNG I YG+G++SG+ S D V I + + DQ F EA EP L F+ A
Sbjct: 109 YDNTKSSTYKKNGTDFAIRYGSGSLSGYLSTDTVDIAGMKISDQTFAEALSEPGLAFVAA 168
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+ + +I+V PV+YNMV QGLV +PVFSF+ NRN D++ GGE++ GG DP+
Sbjct: 169 KFDGILGMAYSKIAVDDVTPVFYNMVKQGLVPQPVFSFYLNRNPDDKYGGELILGGSDPN 228
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
HY+G TYVPV +KGYWQF M + I G C GC AIAD+GTSL+AGP + +N
Sbjct: 229 HYEGSFTYVPVDKKGYWQFKMDSIQI-GSDLKVCQQGCEAIADTGTSLIAGPVKEVGAIN 287
Query: 313 HAIGATGIVSQE 324
AIGAT I + E
Sbjct: 288 KAIGATPIAAGE 299
>gi|260810438|ref|XP_002599971.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
gi|229285255|gb|EEN55983.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
Length = 388
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 204/317 (64%), Gaps = 12/317 (3%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
LL+ + +T N L+RI L K K R ++ G + I+ YS G
Sbjct: 4 LLVLLAIVATANA-LHRIPLTKMK------TVRRHLAEVGVPYDKIIKDYS--GKYYNMT 54
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYR 134
L NY+DAQYFG I IGTPPQ+F V+FDTGSSNLWVPS KC++S IAC H+KY
Sbjct: 55 GPQPEPLSNYLDAQYFGPISIGTPPQSFQVVFDTGSSNLWVPSKKCHYSNIACLLHNKYD 114
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+ +SSTYKKNG+ I YG+G++SGF S+D V + + VKDQ F EA EP + F+ AKF
Sbjct: 115 ASKSSTYKKNGEKFAIQYGSGSLSGFLSQDTVSVAGIEVKDQTFAEALSEPGMAFVAAKF 174
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+G+ I+V VP +YNMV+QG V EPVFSF+ NR+ GGE++ GG DP++Y
Sbjct: 175 DGILGMGYSNIAVDGVVPPFYNMVSQGAVPEPVFSFYLNRDPSATAGGELILGGADPNYY 234
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G+ T++ VT+KGYWQF M + + G T FC GC AIAD+GTSL+AGP + +++
Sbjct: 235 TGDFTFLDVTRKGYWQFKMDGINVGGST--FCQEGCQAIADTGTSLIAGPIEEVNKLHKQ 292
Query: 315 IGATGIVSQECKAVVSQ 331
IGAT + E K S+
Sbjct: 293 IGATPLAGGEYKVDCSK 309
>gi|226437842|gb|ACO56332.1| putative gut cathepsin D-like aspartic protease [Callosobruchus
maculatus]
Length = 389
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 140/244 (57%), Positives = 177/244 (72%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ F VIFDTGSSNLWVPS C+F+ IAC H+KY S +SST
Sbjct: 61 LSNYLDAQYYGPISIGTPPQTFKVIFDTGSSNLWVPSKLCHFTNIACLLHNKYDSSKSST 120
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YKKNG + I YG+G++ GF S DHV G L V++Q F EA EP + F+ AKFDGILG+
Sbjct: 121 YKKNGTAFAIRYGSGSLDGFLSTDHVSFGGLKVENQTFAEAMNEPGMAFVAAKFDGILGM 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V PV+YNMV+Q LV++PVFSF+ NR+ +GGE++ GG D HYKGE TY
Sbjct: 181 GYSRIAVDGVPPVFYNMVSQKLVSQPVFSFYLNRDPAAPQGGELILGGSDKAHYKGEFTY 240
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PV ++ YWQF M V + +TT CA GC AIAD+GTSL+AGP+ + +N AIGAT I
Sbjct: 241 LPVDRQAYWQFKMDKVQVGPETT-LCAKGCEAIADTGTSLIAGPSEEVKAINKAIGATPI 299
Query: 321 VSQE 324
+ E
Sbjct: 300 MGGE 303
>gi|91093044|ref|XP_966517.1| PREDICTED: similar to cathepsin D isoform 1 [Tribolium castaneum]
gi|270002651|gb|EEZ99098.1| hypothetical protein TcasGA2_TC004989 [Tribolium castaneum]
Length = 384
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 139/244 (56%), Positives = 174/244 (71%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IG PPQNF V+FDTGSSNLWVPS KC+++ IAC H+KY S +S T
Sbjct: 56 LSNYLDAQYYGPISIGNPPQNFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSQSKT 115
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YKKNG I YG+G++SGF S D V +G L V+ Q F EA EP L F+ AKFDGILG+
Sbjct: 116 YKKNGTDFAIQYGSGSLSGFLSTDIVTVGGLKVQQQTFAEAMSEPGLAFVAAKFDGILGM 175
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+YNM+ Q LV +PVFSF+ NR+ +GGEI+ GG DP HYKG+ TY
Sbjct: 176 AYNRISVDGVTPVFYNMIQQNLVAQPVFSFYLNRDPSAAQGGEIILGGSDPAHYKGDFTY 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ V ++ YWQF M + + G+ T FCA GC AIAD+GTSL+AGP + + +N AIGAT I
Sbjct: 236 LSVDRQAYWQFKMDSISVGGKNT-FCANGCEAIADTGTSLIAGPVSEVQGINKAIGATPI 294
Query: 321 VSQE 324
V E
Sbjct: 295 VGGE 298
>gi|170063951|ref|XP_001867326.1| lysosomal aspartic protease [Culex quinquefasciatus]
gi|167881401|gb|EDS44784.1| lysosomal aspartic protease [Culex quinquefasciatus]
Length = 387
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/282 (52%), Positives = 187/282 (66%), Gaps = 8/282 (2%)
Query: 49 RLDSKEGESFRTSIRKY-----SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNF 103
R+ + +S RT R LR G L NYMDAQYFG I IGTPPQ+F
Sbjct: 22 RVQLHKTDSARTHFRNVDTEVKQLRLKYSNYGGPAPEPLSNYMDAQYFGAITIGTPPQSF 81
Query: 104 TVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFS 162
V+FDTGSSNLWVPS +C F+ IAC H+KY + +SST++KNG + I YG+G++SG+ S
Sbjct: 82 KVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSSTFEKNGTAFAIQYGSGSLSGYLS 141
Query: 163 EDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGL 222
D V +G + ++ Q F EA EP L F+ AKFDGILGLG+ ISV VP +YNM NQGL
Sbjct: 142 TDTVTVGGVAIQKQTFAEAINEPGLVFVAAKFDGILGLGYSSISVDGVVPPFYNMYNQGL 201
Query: 223 VNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT 282
++ PVFSF+ NR+ EGGEI+FGG D Y G+ TY+PV +K YWQF M V +
Sbjct: 202 IDSPVFSFYLNRDPSAAEGGEIIFGGSDSAKYTGDFTYLPVDRKAYWQFKMDSVKVG--D 259
Query: 283 TGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
T FC GC AIAD+GTSL+AGPT+ +T +N AIG T I++ E
Sbjct: 260 TEFCNNGCEAIADTGTSLIAGPTSEVTAINKAIGGTPIINGE 301
>gi|156406785|ref|XP_001641225.1| predicted protein [Nematostella vectensis]
gi|156228363|gb|EDO49162.1| predicted protein [Nematostella vectensis]
Length = 370
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 148/255 (58%), Positives = 181/255 (70%), Gaps = 7/255 (2%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSS 139
L NYMDAQY+GEI IGTPPQ FTV+FDTGSSNLWVPS KC + +IAC H KY S +SS
Sbjct: 42 PLINYMDAQYYGEITIGTPPQKFTVVFDTGSSNLWVPSKKCSWTNIACLLHDKYDSTKSS 101
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF S D V +G + VK Q F EA +EP LTF+ AKFDGILG
Sbjct: 102 TYKKNGTEFAIRYGSGSLSGFLSIDTVSVGGIDVKGQTFAEALKEPGLTFVAAKFDGILG 161
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV + VPV+Y+MV Q LV PVFSF+ NR GGE++ GG DP +YKG +
Sbjct: 162 MGFSSISVDQVVPVFYDMVLQKLVPAPVFSFYLNREPGASPGGELLLGGSDPKYYKGNFS 221
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YVPVTQ+GYWQF M + + + FC+ GC AIAD+GTSL+AGPT I ++N+ IGA
Sbjct: 222 YVPVTQEGYWQFKMDGISV--KEGSFCSDGCQAIADTGTSLIAGPTDEIEKLNNLIGAKI 279
Query: 320 IVSQE----CKAVVS 330
I+ E C A+ S
Sbjct: 280 IIGGEYTVNCSAIDS 294
>gi|321472775|gb|EFX83744.1| hypothetical protein DAPPUDRAFT_92408 [Daphnia pulex]
Length = 379
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 146/292 (50%), Positives = 191/292 (65%), Gaps = 6/292 (2%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEI 94
+K ++ L + R K ES TSI+ + E+G LKNYMDAQY+G+I
Sbjct: 5 VKLQRVTLEKVPSVR---KTLESVGTSIKVIQKKWGASEAGPTP-EELKNYMDAQYYGQI 60
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYG 153
+GTPPQ F V+FDTGS+NLWVPS+ C+ + +AC H+KY G+S TYK NG I YG
Sbjct: 61 TLGTPPQTFNVVFDTGSANLWVPSTHCHLTNLACLLHNKYNGGKSQTYKANGTDFAIQYG 120
Query: 154 TGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPV 213
+G +SG+ S D + +G +VKDQ F EA EPSLTF+ AKFDGILG+ + ISV PV
Sbjct: 121 SGKLSGYLSTDTLGLGGALVKDQTFAEAISEPSLTFVAAKFDGILGMSYPSISVNGVPPV 180
Query: 214 WYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM 273
+ NM+ QGLV +PVFSFW +RN D +GGEI FGG DP+ Y GE ++ PVT+K YWQF +
Sbjct: 181 FNNMIEQGLVEDPVFSFWLSRNPDAAQGGEITFGGADPERYTGEISWAPVTRKAYWQFKV 240
Query: 274 GDVMIDGQTTG-FCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
V + + G FC GGC IAD+GTSL+AGP I ++N IG I++ E
Sbjct: 241 DGVQVSNEADGAFCQGGCQMIADTGTSLIAGPVDEIKKLNTLIGGIPIMAGE 292
>gi|332514729|gb|AEE69372.1| cathepsin D [Fasciola gigantica]
Length = 429
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 203/319 (63%), Gaps = 17/319 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR-GNLGES 74
+LL ++FS + RI K R F + + S + ES + KY+ R G++ E
Sbjct: 4 VLLICLLFSAALCDVLRI--KLRPFKTTRQELSEYGSLDWESSQRLFGKYAGRNGSIPEQ 61
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
L NY+DAQY+GEIGIGTPPQ F VIFDTGSSNLWVPS +C Y S AC+ H+KY
Sbjct: 62 -------LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKY 114
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SSTY+ NG + I YGTG++SGF S D ++G + VK Q F EA +EP + F+ AK
Sbjct: 115 NYAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAK 174
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+GF+ ISVG V V+ NM+ QGLV EPVFSF+ NRNA + GGE++ GG+DP++
Sbjct: 175 FDGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNY 234
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y G+ TYVPVT + YWQF + + G CA GC AIAD+GTSL+AGP + +N
Sbjct: 235 YTGDITYVPVTHEAYWQFKVDKIEFPG--VSICADGCQAIADTGTSLIAGPKKEVDALNE 292
Query: 314 AIGAT----GIVSQECKAV 328
IG T GI C +
Sbjct: 293 QIGGTWMPGGIYVVNCDKI 311
>gi|322796189|gb|EFZ18765.1| hypothetical protein SINV_10075 [Solenopsis invicta]
Length = 366
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 147/283 (51%), Positives = 185/283 (65%), Gaps = 8/283 (2%)
Query: 49 RLDSKEGESFRTSIRKY-----SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNF 103
R+ ++ ES R ++RK +R + L NY+DAQY+GEI IGTPPQ F
Sbjct: 1 RIPLQKTESVRNALRKVGTELTQVRLATTRHLHSSTEPLSNYLDAQYYGEITIGTPPQKF 60
Query: 104 TVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFS 162
VIFDTGSSNLWVPS KC Y +IAC H+KY S +S TY+KNG I YGTG++SGF S
Sbjct: 61 KVIFDTGSSNLWVPSKKCRYTNIACLLHNKYDSRKSITYQKNGTPFAIRYGTGSLSGFLS 120
Query: 163 EDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGL 222
D V + L V++Q F EA EP LTF+ AKFDGILG+G+ ISV PV+YNMV Q L
Sbjct: 121 TDVVNVAGLNVQNQTFAEAVSEPGLTFVAAKFDGILGMGYSTISVDGVTPVFYNMVKQKL 180
Query: 223 VNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT 282
V +P+FSF+ NR+ +GGE++ GG DP+HY G TYV VT+KGYWQF M + + +
Sbjct: 181 VPQPIFSFYLNRDPTAAQGGEMILGGSDPEHYVGSMTYVDVTRKGYWQFTMDRITVGDSS 240
Query: 283 TG--FCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
C C AIAD+GTSL+AGPT I ++N IGAT I Q
Sbjct: 241 PSHILCKNTCQAIADTGTSLIAGPTVEINEINKQIGATMIGGQ 283
>gi|383859202|ref|XP_003705085.1| PREDICTED: lysosomal aspartic protease-like [Megachile rotundata]
Length = 384
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 197/314 (62%), Gaps = 24/314 (7%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
FLC+L+ N L RI L K +R+ ++L + + +T I +
Sbjct: 7 FLCVLV------AANAELRRIKLHKI-----DRIRSQLKEYDTDLVQTRIVQ-------- 47
Query: 73 ESGDADIV-ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
GD + L NY+DAQY+G I IGTPPQ F VIFDTGSSNLWVPS KC+ + IAC H
Sbjct: 48 --GDVILPEPLSNYLDAQYYGVINIGTPPQKFRVIFDTGSSNLWVPSKKCHLTNIACKLH 105
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY S +SSTYKKNG I YG+G++SG+ S D V + + V DQ F EA EP L F+
Sbjct: 106 YKYDSTKSSTYKKNGTDFSIRYGSGSLSGYLSTDMVDVAGIKVNDQTFAEALSEPGLAFV 165
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGI+G+ + I+V PV+YNMV QGLV++PVFSF+ NR+ + E GGE++ GG D
Sbjct: 166 AAKFDGIMGMAYSTIAVDGVTPVFYNMVKQGLVSQPVFSFYLNRDPNAEFGGEMILGGSD 225
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P+HY G TYVPV +KGYWQF M V + G C GC AIAD+GTSL+AGP I
Sbjct: 226 PNHYVGPFTYVPVDKKGYWQFAMDRVEV-GSDVKVCEKGCEAIADTGTSLIAGPVKEIEL 284
Query: 311 VNHAIGATGIVSQE 324
+N IGAT I + E
Sbjct: 285 LNKKIGATPIAAGE 298
>gi|147743000|sp|P85137.1|CARDF_CYNCA RecName: Full=Cardosin-F; Contains: RecName: Full=Cardosin-F heavy
chain; Contains: RecName: Full=Cardosin-F light chain
Length = 281
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 153/247 (61%), Positives = 174/247 (70%), Gaps = 34/247 (13%)
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+SG A +VAL N D Y+GEIGIGTPPQ FTVIFDTGSS LWVPSSK HS
Sbjct: 2 DSGSA-VVALTNDRDTSYYGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSM 52
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S SSTYK S+D V IGDLVVK+Q+FIEAT E FL
Sbjct: 53 YESSGSSTYK-------------------SQDSVTIGDLVVKEQDFIEATEEADNVFLNR 93
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILGL FQ ISV PVWYNM+NQGLV FSFW NRN DEEEGGE+VFGG+DP+
Sbjct: 94 LFDGILGLSFQTISV----PVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPN 147
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
H++G+HTYVPVT + YWQF +GDV+I ++TGFCA GC A ADSGTSLL+GPT I+TQ+N
Sbjct: 148 HFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQIN 207
Query: 313 HAIGATG 319
HAIGA G
Sbjct: 208 HAIGANG 214
>gi|116282368|gb|ABJ97285.1| cathepsin D-like aspartic protease [Fasciola hepatica]
Length = 429
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 199/305 (65%), Gaps = 13/305 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR-GNLGES 74
+LL ++FS + RI K R F + + S + ES + KY+ R G++ E
Sbjct: 4 VLLICLLFSAALCDVLRI--KLRPFKTTRQELSEYGSLDWESSQRLFGKYAGRNGSIPEQ 61
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
L NY+DAQY+GEIGIGTPPQ F VIFDTGSSNLWVPS +C Y S AC+ H+KY
Sbjct: 62 -------LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKY 114
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SSTY+ NG + I YGTG++SGF S D ++G + VK Q F EA +EP + F+ AK
Sbjct: 115 NYAASSTYQVNGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAK 174
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+GF+ ISVG + V+ NM+ QGLV EPVFSF+ NRNA + GGE++ GG+DP++
Sbjct: 175 FDGILGMGFRSISVGGLITVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNY 234
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y G+ TYVPVT + YWQF + + G CA GC AIAD+GTSL+AGP + +N
Sbjct: 235 YTGDITYVPVTHEAYWQFKVDKIEFPG--VSICADGCQAIADTGTSLIAGPKKEVDALNE 292
Query: 314 AIGAT 318
IG T
Sbjct: 293 QIGGT 297
>gi|227018334|gb|ACP18833.1| aspartic proteinase 1 [Chrysomela tremula]
Length = 386
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 147/304 (48%), Positives = 191/304 (62%), Gaps = 17/304 (5%)
Query: 22 VFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA 81
V +T N R+ L K + + L S+ +S ++KY+ G
Sbjct: 12 VLATVNCDFVRVPLHKM-----DTAKSTLQSRGYKSNENLVKKYTTDG---------YAP 57
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NYMDAQY+GEI IGTP Q F VIFDTGSSNLW+PS KC ++AC H++Y S +SST
Sbjct: 58 LTNYMDAQYYGEITIGTPGQKFNVIFDTGSSNLWIPSHKCKLLNVACRTHNQYNSDKSST 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YG+G++ GF S D V++ L VKDQ F EAT EP L F+ KFDGILGL
Sbjct: 118 YTSNGTDFSITYGSGSLKGFLSSDIVEVAGLTVKDQIFAEATEEPGLAFIAGKFDGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV + P +Y ++ QG+V EPVFSF+ NR+ + E GGEIVFGG DP +Y G+ TY
Sbjct: 178 AYDTISVNQVTPFFYKLIEQGVVKEPVFSFYLNRDPNAEVGGEIVFGGSDPKYYTGDFTY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+KGYWQ M ++D T C GGC AI D+GTSL+ GP+ I ++ A+GAT I
Sbjct: 238 LPVTRKGYWQIKMDKAVVDSNT--LCDGGCQAIVDTGTSLITGPSDEIEKIVKAVGATAI 295
Query: 321 VSQE 324
+ E
Sbjct: 296 TAGE 299
>gi|195997283|ref|XP_002108510.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589286|gb|EDV29308.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 389
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 155/310 (50%), Positives = 201/310 (64%), Gaps = 12/310 (3%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
LL+ +F + L R+ L K K +A + G + KY LG S
Sbjct: 4 LLVIAALFLISSDALVRVPLYKFKKTPREHLA-----EVGITSSMLSEKY----ELGASR 54
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYR 134
+A + L NY+DAQY+GEI IGTPPQ F V+FDTGSSNLWVPSSKC F +IAC FHSKY
Sbjct: 55 NATEM-LNNYLDAQYYGEISIGTPPQKFKVLFDTGSSNLWVPSSKCSFLNIACLFHSKYD 113
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+SSTYKKN I YGTG+++GF S D V+I + VK+Q F EA EP LTF+ A+F
Sbjct: 114 HSKSSTYKKNSTKFSIRYGTGSLTGFLSVDTVRIQGVSVKNQGFAEAVSEPGLTFVAAQF 173
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+G+QEI+V PV+ N++ Q V + VFSF+ NR + GGE++ GG D HY
Sbjct: 174 DGILGMGYQEIAVDGVPPVFNNIMAQKQVGKSVFSFYLNRKEGAKPGGELILGGSDSKHY 233
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G TY+PVT+KGYWQF M + + G+ + FC GGC AIAD+GTSLLAGPT + ++
Sbjct: 234 SGNFTYLPVTKKGYWQFKMDGISVKGKGS-FCKGGCQAIADTGTSLLAGPTAEVNKIQTL 292
Query: 315 IGATGIVSQE 324
IGAT +++ E
Sbjct: 293 IGATPLLNGE 302
>gi|205363469|gb|ACI04164.1| cathepsin D-like aspartic protease precursor [Fasciola hepatica]
Length = 429
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 156/319 (48%), Positives = 202/319 (63%), Gaps = 17/319 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR-GNLGES 74
+LL ++FS + R K R F + + S + ES + KY+ R G++ E
Sbjct: 4 VLLICLLFSAALCDVLRT--KLRPFKTTRQELSEYGSLDWESSQRLFGKYAGRNGSIPEQ 61
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
L NY+DAQY+GEIGIGTPPQ F VIFDTGSSNLWVPS +C Y S AC+ H+KY
Sbjct: 62 -------LNNYLDAQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWACWLHNKY 114
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SSTY+ NG + I YGTG++SGF S D ++G + VK Q F EA +EP + F+ AK
Sbjct: 115 NYAASSTYQANGTAFSIQYGTGSVSGFISVDSFEVGGVEVKGQPFGEAIKEPGIVFVFAK 174
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+GF+ ISVG V V+ NM+ QGLV EPVFSF+ NRNA + GGE++ GG+DP++
Sbjct: 175 FDGILGMGFRSISVGGLVTVFENMIAQGLVPEPVFSFYLNRNASDPVGGELLLGGIDPNY 234
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y G+ TYVPVT + YWQF + + G CA GC AIAD+GTSL+AGP + +N
Sbjct: 235 YTGDITYVPVTHEAYWQFKVDKIEFPG--VSICADGCQAIADTGTSLIAGPKKEVDALNE 292
Query: 314 AIGAT----GIVSQECKAV 328
IG T GI C +
Sbjct: 293 QIGGTWMPGGIYVVNCDKI 311
>gi|46309251|dbj|BAD15111.1| cathepsin D [Todarodes pacificus]
Length = 392
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 138/244 (56%), Positives = 174/244 (71%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTP QNF V+FDTGSSNLWVPS KC S IAC H+KY S +SST
Sbjct: 65 LSNYLDAQYYGVISIGTPAQNFKVVFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTQSST 124
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK NG I YG+G++ GF S D V IG + +K Q F EAT +P L F+ AKFDGILG+
Sbjct: 125 YKANGTDFHIQYGSGSLDGFLSTDTVAIGSVAIKAQTFAEATNQPGLVFVAAKFDGILGM 184
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K PV+Y +++Q LV++PVFSF+ NR+ +EGGE++ GG DP HY G TY
Sbjct: 185 AYDTISVDKVTPVFYQIISQKLVDQPVFSFYLNRDPSGKEGGELILGGSDPKHYTGNFTY 244
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+KGYWQ M D ++ G+ T FC+GGC AIAD+GTSL+AGP I ++N AIG +
Sbjct: 245 LPVTRKGYWQIKM-DKVVSGENT-FCSGGCQAIADTGTSLIAGPVDEIKKLNEAIGGRAL 302
Query: 321 VSQE 324
E
Sbjct: 303 PGGE 306
>gi|24417300|gb|AAN60260.1| unknown [Arabidopsis thaliana]
Length = 168
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 132/168 (78%), Positives = 150/168 (89%)
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
S D V +GDLVVKDQEF+EAT+E +TF++AK DGILGLGFQEISVGKA PVWYNM+ QG
Sbjct: 1 SNDAVTVGDLVVKDQEFMEATKELGITFVVAKXDGILGLGFQEISVGKAAPVWYNMLKQG 60
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+ EPVFSFW NRNADEEEGGE+VFGG+DP+H+KG+HTYVPVTQKGYWQFDMGDV+I G
Sbjct: 61 LIKEPVFSFWLNRNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGA 120
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVV 329
TGFC GC+AIADSGTSLLAGPTTIIT +NHAIGA G+VSQ+CK VV
Sbjct: 121 PTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVV 168
>gi|56118817|ref|NP_001008172.1| MGC89016 protein precursor [Xenopus (Silurana) tropicalis]
gi|51950197|gb|AAH82490.1| MGC89016 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 216/361 (59%), Gaps = 32/361 (8%)
Query: 7 SITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYS 66
S+ + + LLF V FS GL RI L + K AR G + KY
Sbjct: 2 SVISHLVVTCLLF-VAFSN---GLERIKLHRFK------SVARTLHDVGSAVEHVRMKY- 50
Query: 67 LRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-I 125
+ ++ + + L NYMD QY+G I IGTPPQ+F V+FDTGSSNLWVPS KC ++ I
Sbjct: 51 VDNHMKSAPEP----LTNYMDVQYYGVISIGTPPQSFRVVFDTGSSNLWVPSKKCKWTDI 106
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC+ H KY S +SSTYK NG IHYGTG+++GF S D V +G L VK Q F EA +P
Sbjct: 107 ACWLHRKYDSKKSSTYKANGTEFAIHYGTGSLTGFLSTDTVSVGSLSVKSQTFAEAITQP 166
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
+TF+ AKFDGILG+ + ISV VPV+ NMVNQ LV++ +FSF+ +R+A +EGGEI+
Sbjct: 167 GITFVAAKFDGILGMAYPSISVDGVVPVFNNMVNQKLVDQAIFSFYLSRDASAKEGGEII 226
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTG---------FCAGGCAAIADS 296
GG DPDHY G TY+ VT+K YWQ M V + ++ +C GGC AIAD+
Sbjct: 227 LGGSDPDHYVGNFTYLDVTRKAYWQIKMDSVTVSSESECMNAMMVGGEYCKGGCQAIADT 286
Query: 297 GTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRH 352
GTSL+ GP++ + ++N IGA I+S E C + S IN +L S L+ +
Sbjct: 287 GTSLIVGPSSDVEKLNAEIGALPIISGEYWINCSKIASL---PTINFVLGGKSFSLTGKD 343
Query: 353 Y 353
Y
Sbjct: 344 Y 344
>gi|347451476|gb|AEO94539.1| aspartate protease cathepsin D [Triatoma infestans]
Length = 393
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 139/267 (52%), Positives = 183/267 (68%), Gaps = 4/267 (1%)
Query: 60 TSIRKYSLRGNLGESGDADIV-ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
T++ + +L+ + G + L NY+DAQY+G I +G+PPQ+F V+FDTGSSNLWVPS
Sbjct: 42 TAVEQLTLKYDTGNGVEGPFPEPLSNYLDAQYYGAITLGSPPQSFRVVFDTGSSNLWVPS 101
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQE 177
KC F+IAC+ H KY S S TY NG+ I YG+G++SGF S+D + IG + V +Q
Sbjct: 102 KKCSRFNIACWVHRKYDSSNSKTYVPNGEKFAIQYGSGSLSGFLSQDQLSIGGVTVANQT 161
Query: 178 FIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD 237
F EA EP + F+ AKFDGILGLG+ ISV K P +YNM QG V PVFSF+ NR+
Sbjct: 162 FAEAVNEPGMVFVAAKFDGILGLGYDTISVDKVTPPFYNMYQQGAVQNPVFSFYLNRDPA 221
Query: 238 EEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSG 297
GGEI+FGG DP+ Y G+ TYVPV ++GYWQF+M V+++G+T FC GGC AIAD+G
Sbjct: 222 AAVGGEIIFGGSDPEKYVGDFTYVPVDKQGYWQFNMDKVIVNGKT--FCKGGCQAIADTG 279
Query: 298 TSLLAGPTTIITQVNHAIGATGIVSQE 324
TSL+AGPT + +N +G T I E
Sbjct: 280 TSLIAGPTEDVIALNKLLGGTPIAGGE 306
>gi|238816835|gb|ACR56788.1| aspartic protease 4 [Strongyloides ratti]
Length = 428
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 143/277 (51%), Positives = 187/277 (67%), Gaps = 9/277 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQY+GEI IGTP QNF+VIFDTGSSNLW+PS KC ++IAC H+KY S SST
Sbjct: 61 LRNYMDAQYYGEISIGTPGQNFSVIFDTGSSNLWIPSKKCPIYNIACLLHNKYDSSSSST 120
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +G++ I YGTG++ GF S+D V I DL +DQ F EAT EP +TF+ AKFDGILG+
Sbjct: 121 YVTDGRTMAIQYGTGSMKGFLSKDKVCIADLCAEDQTFAEATSEPGVTFIAAKFDGILGM 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+Q I+V PV+ +++Q V +P+F+FW NR AD+ +GGEI GGMDP HYKG+ TY
Sbjct: 181 AYQNIAVLGVKPVFNTLIDQHKVPQPIFAFWLNRIADDSDGGEITLGGMDPKHYKGDITY 240
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF M + D + C GC AIAD+GTSL+AGP + + IGA +
Sbjct: 241 VPVSRKGYWQFKMDGFVGDNEKIA-CKNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPL 299
Query: 321 VSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
E C V S I+N+++ + LS + Y
Sbjct: 300 ARGEYMVPCDKVSSL---PIVNIVIGGQAFALSGKDY 333
>gi|159468321|ref|XP_001692331.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158278517|gb|EDP04281.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 303
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 139/262 (53%), Positives = 176/262 (67%), Gaps = 14/262 (5%)
Query: 58 FRTSIRKYSLRGNLGE-------------SGDADIVALKNYMDAQYFGEIGIGTPPQNFT 104
R ++RK + LG S D V LKN+MDAQY+GEIG+GTPPQ F
Sbjct: 30 LRVTLRKTEMLTTLGRPRPYLLGEQGLLGSSDQGQVTLKNFMDAQYYGEIGLGTPPQLFN 89
Query: 105 VIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSE 163
VIFDTGS+NLWVPSSKC F+IAC H KY + +S TYK NG I YGTG++ G+ S+
Sbjct: 90 VIFDTGSANLWVPSSKCALFNIACRLHRKYNAAKSKTYKANGTEFAIEYGTGSLDGYISQ 149
Query: 164 DHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLV 223
D + G L +KDQ F EA EP LTF+ AKFDGILG+GF ISV P + +V +G +
Sbjct: 150 DVLTWGGLTIKDQGFAEAINEPGLTFVAAKFDGILGMGFPAISVQHVPPPFTRLVEEGGL 209
Query: 224 NEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTT 283
PVFSFW NR+ + GGE+V GG+DP H+ GEHT+VPVT++GYWQF+M + + +
Sbjct: 210 AAPVFSFWLNRDPNAPNGGELVLGGIDPTHFTGEHTWVPVTRQGYWQFNMEGLDLGPGSQ 269
Query: 284 GFCAGGCAAIADSGTSLLAGPT 305
CA GCAAIAD+GTSL+AGP+
Sbjct: 270 KMCAKGCAAIADTGTSLIAGPS 291
>gi|195332251|ref|XP_002032812.1| GM20753 [Drosophila sechellia]
gi|194124782|gb|EDW46825.1| GM20753 [Drosophila sechellia]
Length = 392
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 139/242 (57%), Positives = 167/242 (69%), Gaps = 5/242 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NYMDAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S T
Sbjct: 65 LSNYMDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKT 124
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG IHYG+G++SG+ S D V I L +KDQ F EA EP L F+ AKFDGILGL
Sbjct: 125 YTKNGTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGL 184
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV K P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY GE TY
Sbjct: 185 GYSSISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTY 244
Query: 261 VPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+PVT+K YWQ M I D Q C GGC IAD+GTSL+A P T +N IG T
Sbjct: 245 LPVTRKAYWQIKMDAASIGDLQ---LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTP 301
Query: 320 IV 321
I+
Sbjct: 302 II 303
>gi|242013446|ref|XP_002427417.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
gi|212511797|gb|EEB14679.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
Length = 383
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 134/245 (54%), Positives = 170/245 (69%), Gaps = 3/245 (1%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSS 139
L NY+DAQY+G I IGTPPQ F VIFDTGSSNLW+PS KC FS IAC H+KY S RSS
Sbjct: 55 PLSNYLDAQYYGPISIGTPPQQFKVIFDTGSSNLWIPSKKCLFSNIACLLHNKYDSSRSS 114
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY +NG I YG+G++SG+ S D V +G L +K Q F EA EP L F+ AKFDGILG
Sbjct: 115 TYIRNGTEFSIQYGSGSLSGYLSTDDVTLGGLTIKRQTFAEAISEPGLAFVAAKFDGILG 174
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+G+ I+V VP +YNM Q LV+ P+FSF+ NRN +E+ GGE++ GG DP++YKG T
Sbjct: 175 MGYMSIAVDGVVPPFYNMYEQRLVDSPIFSFYLNRNPNEKVGGELLLGGSDPNYYKGNFT 234
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+PV +K YWQF M VM++ T C GGC AIAD+GTSL+AGP + ++N +
Sbjct: 235 YLPVNRKAYWQFQMDKVMMEDITV--CRGGCQAIADTGTSLIAGPVEDVNKINKKLNGVP 292
Query: 320 IVSQE 324
+ E
Sbjct: 293 VSGGE 297
>gi|31197673|ref|XP_307784.1| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|347969584|ref|XP_003436430.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|347969586|ref|XP_003436431.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|347969588|ref|XP_003436432.1| AGAP003277-PD [Anopheles gambiae str. PEST]
gi|30179074|gb|EAA03535.2| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|333466215|gb|EGK96172.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|333466216|gb|EGK96173.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|333466217|gb|EGK96174.1| AGAP003277-PD [Anopheles gambiae str. PEST]
Length = 389
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 145/285 (50%), Positives = 190/285 (66%), Gaps = 10/285 (3%)
Query: 45 RVAARLDSKEGESFR---TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQ 101
RVA + + FR T +R+ LR G L NY+DAQYFG I IGTPPQ
Sbjct: 24 RVALQKAPSARDHFRSVGTEVRQ--LRLKYGAPTGPVPEPLSNYLDAQYFGAISIGTPPQ 81
Query: 102 NFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGF 160
+F V+FDTGSSNLWVPS +C F+ IAC H+KY + +SS+++KNG + I YGTG++SG+
Sbjct: 82 SFKVVFDTGSSNLWVPSKQCSFTNIACLMHNKYDAKKSSSFEKNGTAFHIQYGTGSLSGY 141
Query: 161 FSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQ 220
S D V +G + V+ Q F EA +EP L F+ AKFDGILGL ++ ISV +PV+YNM NQ
Sbjct: 142 LSTDTVTVGGVPVEKQTFAEAIQEPGLVFVAAKFDGILGLAYKSISVDGVMPVFYNMFNQ 201
Query: 221 GLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI-D 279
G ++ PVFSF+ NR+ EGGEI+FGG D HY G+ TY+ V +K YWQF M V + D
Sbjct: 202 GKIDAPVFSFYLNRDPSAAEGGEIIFGGSDSKHYTGDFTYLSVDRKAYWQFKMDSVTVGD 261
Query: 280 GQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q +C GC AIAD+GTSL+AGP +T +N AIG T +++ E
Sbjct: 262 AQ---YCNNGCEAIADTGTSLIAGPVAEVTAINKAIGGTPVLNGE 303
>gi|427789779|gb|JAA60341.1| Putative cathepsin d isoform 1 protein [Rhipicephalus pulchellus]
Length = 391
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 151/304 (49%), Positives = 197/304 (64%), Gaps = 16/304 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L R+ LK+ K + R +E + R +SL G LKNY+DAQ
Sbjct: 21 LLRVPLKRMK-------SVRDHLREVGTPLNVTRTWSLGGAFAGPFPE---PLKNYLDAQ 70
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSA 148
Y+G+I +GTPPQ F V+FDTGSSNLWVPSSKC F+ IAC+ H KY S RS+TY+KNG +
Sbjct: 71 YYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSRSTTYQKNGTAF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G++ G S D +G++ V+ Q F E E L F+ AKFDGILGLG+ ISV
Sbjct: 131 EIRYGSGSVKGVLSTDVFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILGLGYPRISVL 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NMV QG+ +PVFS + +RNA + +GGE++FGG+D HY G TYVPVT+KGY
Sbjct: 191 GVPPVFDNMVAQGVAAKPVFSVYLDRNASDPQGGEVLFGGIDKAHYTGNITYVPVTRKGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQF M V + G T FC GGC AIAD+GTSL+AGP+ I ++N AIGA + E
Sbjct: 251 WQFHMDGVTV-GTNTTFCNGGCEAIADTGTSLIAGPSEEIQKLNLAIGAAPFTAGEYLVS 309
Query: 325 CKAV 328
CK++
Sbjct: 310 CKSI 313
>gi|157112486|ref|XP_001657556.1| cathepsin d [Aedes aegypti]
gi|205831550|sp|Q03168.2|ASPP_AEDAE RecName: Full=Lysosomal aspartic protease; Flags: Precursor
gi|108878060|gb|EAT42285.1| AAEL006169-PA [Aedes aegypti]
Length = 387
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/284 (51%), Positives = 189/284 (66%), Gaps = 8/284 (2%)
Query: 45 RVAARLDSKEGESFR---TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQ 101
RV + FR T I++ L+ N SG L NY+DAQY+G I IGTPPQ
Sbjct: 22 RVQLHKTESARQHFRNVDTEIKQLRLKYN-AVSGPVP-EPLSNYLDAQYYGAITIGTPPQ 79
Query: 102 NFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGF 160
+F V+FDTGSSNLWVPS +C F+ IAC H+KY + +SST++KNG + I YG+G++SG+
Sbjct: 80 SFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSSTFEKNGTAFHIQYGSGSLSGY 139
Query: 161 FSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQ 220
S D V +G + V Q F EA EP L F+ AKFDGILGLG+ ISV VPV+YNM NQ
Sbjct: 140 LSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILGLGYSSISVDGVVPVFYNMFNQ 199
Query: 221 GLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDG 280
GL++ PVFSF+ NR+ EGGEI+FGG D + Y G+ TY+ V +K YWQF M V +
Sbjct: 200 GLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFTYLSVDRKAYWQFKMDSVKVG- 258
Query: 281 QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
T FC GC AIAD+GTSL+AGP + +T +N AIG T I++ E
Sbjct: 259 -DTEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTPIMNGE 301
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/256 (53%), Positives = 174/256 (67%), Gaps = 3/256 (1%)
Query: 67 LRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-I 125
LR G + D L NY+DAQY+G I IGTP Q+F V+FDTGSSNLWVPS +CYF+ I
Sbjct: 52 LRLKYGSANDVRPEPLSNYLDAQYYGPISIGTPSQDFKVVFDTGSSNLWVPSKQCYFTNI 111
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC H+KY + +SS+YKKNG IHYG+G++SG+ S D V I L ++ Q F EA EP
Sbjct: 112 ACLMHNKYDANKSSSYKKNGTEFAIHYGSGSLSGYLSTDTVNIAGLGIEGQTFAEALSEP 171
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
L F+ AKFDGILGLG+ I+V P +Y M QGL+++PVFSF+ NR+ EGGEI+
Sbjct: 172 GLVFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQPVFSFYLNRDPKAPEGGEII 231
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
FGG DP+HYKGE TY+PVT+K YWQ M + C GGC IAD+GTSL+A P
Sbjct: 232 FGGSDPNHYKGEFTYLPVTRKAYWQIKMDSASMG--NLNLCQGGCQVIADTGTSLIALPP 289
Query: 306 TIITQVNHAIGATGIV 321
+ T +N AIG T I+
Sbjct: 290 SEATSINKAIGGTPIM 305
>gi|293230|gb|AAA29350.1| aspartic protease [Aedes aegypti]
Length = 387
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 175/244 (71%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ+F V+FDTGSSNLWVPS +C F+ IAC H+KY + +SST
Sbjct: 60 LSNYLDAQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIACLMHNKYNAKKSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
++KNG + I YG+G++SG+ S D V +G + V Q F EA EP L F+ AKFDGILGL
Sbjct: 120 FEKNGTAFHIQYGSGSLSGYLSTDTVGLGGVSVTKQTFAEAINEPGLVFVAAKFDGILGL 179
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV VPV+YNM NQGL++ PVFSF+ NR+ EGGEI+FGG D + Y G+ TY
Sbjct: 180 GYSSISVDGVVPVFYNMFNQGLIDAPVFSFYLNRDPSAAEGGEIIFGGSDSNKYTGDFTY 239
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ V +K YWQF M V + T FC GC AIAD+GTSL+AGP + +T +N AIG T I
Sbjct: 240 LSVDRKAYWQFKMDSVKVG--DTEFCNNGCEAIADTGTSLIAGPVSEVTAINKAIGGTPI 297
Query: 321 VSQE 324
++ E
Sbjct: 298 MNGE 301
>gi|157779726|gb|ABV71391.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 152/304 (50%), Positives = 197/304 (64%), Gaps = 16/304 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L R+ LK+ + +AR +E + +SL LG LKNY+DAQ
Sbjct: 21 LLRVPLKRVQ-------SARNQLREVGTVLNVTHGWSLGNALGGPFPE---PLKNYLDAQ 70
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSA 148
Y+G++ +GTPPQ F V+FDTGSSNLWVPSSKC F+ IAC H KY S +SSTY KNG
Sbjct: 71 YYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSSTYAKNGTQF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G++ G S D +GD+ ++ Q F E RE L F+ AKFDGILGLG+ +ISV
Sbjct: 131 EIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILGLGYPQISVL 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NMV QG+ +PVFS + +RNA + GGE++FGG+D HY G TYVPVT+KGY
Sbjct: 191 NVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQF M V + G FC GGCAAIAD+GTSL+AGPT I ++N AIGA ++ E
Sbjct: 251 WQFHMNGVKV-GDNATFCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAPFMAGEYIVS 309
Query: 325 CKAV 328
CK++
Sbjct: 310 CKSI 313
>gi|83319201|dbj|BAE53722.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 152/304 (50%), Positives = 197/304 (64%), Gaps = 16/304 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L R+ LK+ + +AR +E + +SL LG LKNY+DAQ
Sbjct: 21 LLRVPLKRVQ-------SARNQLREVGTVLNVTHGWSLGNALGGPFPE---PLKNYLDAQ 70
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSA 148
Y+G++ +GTPPQ F V+FDTGSSNLWVPSSKC F+ IAC H KY S +SSTY KNG
Sbjct: 71 YYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYNSKKSSTYAKNGTQF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G++ G S D +GD+ ++ Q F E RE L F+ AKFDGILGLG+ +ISV
Sbjct: 131 EIRYGSGSVKGELSTDVFGLGDIRLQGQTFAEILRESGLAFIAAKFDGILGLGYPQISVL 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NMV QG+ +PVFS + +RNA + GGE++FGG+D HY G TYVPVT+KGY
Sbjct: 191 NVPPVFDNMVAQGVAPKPVFSVYLDRNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQF M V + G FC GGCAAIAD+GTSL+AGPT I ++N AIGA ++ E
Sbjct: 251 WQFHMNGVKV-GDNATFCNGGCAAIADTGTSLIAGPTEEIHKLNVAIGAAPFMAGEYIVS 309
Query: 325 CKAV 328
CK++
Sbjct: 310 CKSI 313
>gi|195027894|ref|XP_001986817.1| GH21578 [Drosophila grimshawi]
gi|193902817|gb|EDW01684.1| GH21578 [Drosophila grimshawi]
Length = 388
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 141/274 (51%), Positives = 181/274 (66%), Gaps = 9/274 (3%)
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
T + + ++R G+ G L NY+DAQY+G I IG+PPQNF V+FDTGSSNLWVPS
Sbjct: 41 TELHRLNIRP--GKDGLIAPEPLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSK 98
Query: 120 KCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
KC+ + IAC H+KY + +SSTY KNG IHYG+G++SG+ S D V I L +KD F
Sbjct: 99 KCHLTNIACLMHNKYDATKSSTYVKNGTEFAIHYGSGSLSGYLSTDTVNIAGLDIKDHTF 158
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
EA EP L F+ AKFDGILGLG+ ISV P +Y M QGL+++PVFSF+ NR+
Sbjct: 159 AEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPSFYAMYEQGLISDPVFSFYLNRDPKA 218
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
EGGEI+FGG DP+HY G+ TY+PVT+KGYWQ M ++ C GGC IAD+GT
Sbjct: 219 PEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQLN--DIELCKGGCQVIADTGT 276
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQE----CKAV 328
SL+A P T +N AIG T I+ + C A+
Sbjct: 277 SLIAAPQDEATSINQAIGGTPILGGQYVVSCDAI 310
>gi|227336874|gb|ACP21315.1| aspartic proteinase precursor [Rhipicephalus microplus]
Length = 391
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 149/304 (49%), Positives = 195/304 (64%), Gaps = 16/304 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L R+ LK+ K + R +E + R +SL G LKNY+DAQ
Sbjct: 21 LLRVPLKRMK-------SVREHLREVGTPLNVTRTWSLGGAFAGPFPE---PLKNYLDAQ 70
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSA 148
Y+G+I +GTPPQ F V+FDTGSSNLWVPSSKC F+ IAC+ H KY S +S+TY+KNG +
Sbjct: 71 YYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKYHSSKSTTYQKNGTAF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G++ G S D +G++ V+ Q F E E L F+ AKFDGILGLG+ ISV
Sbjct: 131 EIRYGSGSVKGVLSADMFGLGNVTVRSQTFAEIIDESGLAFIAAKFDGILGLGYPRISVL 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NMV QG+ PVFS + +RN + +GGE++FGG+D HY G TYVPVT+KGY
Sbjct: 191 GVPPVFDNMVAQGVAANPVFSVYLDRNTSDPQGGEVLFGGIDKAHYTGNITYVPVTRKGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQF M V + G FC GGC AIAD+GTSL+AGPT I ++N AIGA ++ E
Sbjct: 251 WQFHMDGVTV-GTNATFCNGGCEAIADTGTSLIAGPTAEIQKLNMAIGAAPFLAGEYMVS 309
Query: 325 CKAV 328
CK++
Sbjct: 310 CKSI 313
>gi|307167890|gb|EFN61279.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 354
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 137/243 (56%), Positives = 169/243 (69%), Gaps = 3/243 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ F VIFDTGSSNLWVPS C+F+ IAC H KY S +SST
Sbjct: 29 LSNYLDAQYYGVISIGTPPQEFKVIFDTGSSNLWVPSKNCHFTNIACQLHHKYNSKKSST 88
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+ NG S I YG+G++SG+ S D V + L V Q F EA EP L F+ AKFDGILG+
Sbjct: 89 YEPNGASFAIQYGSGSLSGYLSADVVNVAGLNVTSQVFAEAISEPGLAFVAAKFDGILGM 148
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V PV+YNMV Q LV + VFSF+ NR+ E GGE++ GG DPDHY+ + TY
Sbjct: 149 GYSTIAVDGVTPVFYNMVKQKLVPKAVFSFYLNRDPSAEVGGELILGGSDPDHYEADLTY 208
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT+KGYWQF M + + +T FC GC AIAD+GTSL+AGP + +N IGA+ I
Sbjct: 209 VPVTRKGYWQFSMDGIEVGNRT--FCNNGCQAIADTGTSLIAGPVADVAAINKLIGASAI 266
Query: 321 VSQ 323
Q
Sbjct: 267 AGQ 269
>gi|324507249|gb|ADY43078.1| Cathepsin D [Ascaris suum]
Length = 437
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 140/263 (53%), Positives = 178/263 (67%), Gaps = 5/263 (1%)
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
R +SLRG G D LKNYMDAQY+G+I IGTPPQNFTVIFDTGS+NLWVPS KC
Sbjct: 64 RVHSLRGQSGNEIDE---LLKNYMDAQYYGDISIGTPPQNFTVIFDTGSANLWVPSRKCP 120
Query: 123 FS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
F+ IAC H KY + +SSTY ++G+ I YGTG++ GF S D+V + D+ +Q F EA
Sbjct: 121 FTDIACLLHHKYDAAKSSTYAEDGRKLQIQYGTGSMKGFISLDNVCVADVCATEQPFAEA 180
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T EP LTF+ AKFDGILG+ F EI+V PV++ M++Q L+ PVF+FW +RN D++ G
Sbjct: 181 TSEPGLTFIAAKFDGILGMAFPEIAVLGVKPVFHTMIDQQLLAAPVFAFWLDRNPDDQIG 240
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GEI FGG D Y TY PVT++GYWQF M D ++ + CA GC AIAD+GTSL+
Sbjct: 241 GEITFGGTDTKRYVEPITYTPVTRRGYWQFKM-DKVVGEEAVLACANGCQAIADTGTSLI 299
Query: 302 AGPTTIITQVNHAIGATGIVSQE 324
AGP + + IGA + E
Sbjct: 300 AGPKQQVDTIQKFIGAEPLFRGE 322
>gi|340729556|ref|XP_003403066.1| PREDICTED: lysosomal aspartic protease-like [Bombus terrestris]
Length = 385
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 138/276 (50%), Positives = 184/276 (66%), Gaps = 9/276 (3%)
Query: 41 DLNNRVAARLDS--KEGESFRTSI-RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIG 97
DL ++D+ K+ + + T + + + ++GN + L NY+DAQY+G I IG
Sbjct: 17 DLQRITLHKIDTVRKQFKEYNTEVYQAHMVQGNFPQPE-----PLSNYLDAQYYGVISIG 71
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
TP Q+F VIFDTGSSNLWVPS KC+ + IAC H KY + +SSTYKKNG I YG+G+
Sbjct: 72 TPSQDFKVIFDTGSSNLWVPSKKCHLTNIACKLHHKYDNTKSSTYKKNGTDFAIRYGSGS 131
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
+SG+ S D V + L V DQ F EA EP + F+ AKFDGILG+ + +I+V PV+YN
Sbjct: 132 LSGYLSTDVVNVAGLKVSDQTFAEALSEPGMAFVAAKFDGILGMAYSKIAVDGVTPVFYN 191
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
MV QGLV +PVFSF+ NRN D++ GGE++ GG DP+HY+G TYVPV +KGYWQF M +
Sbjct: 192 MVKQGLVPQPVFSFYLNRNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGI 251
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+ Q C GC AIAD+GTSL+AGP + +N
Sbjct: 252 KVGSQHLAICQKGCEAIADTGTSLIAGPVKEVEAIN 287
>gi|195474504|ref|XP_002089531.1| GE23596 [Drosophila yakuba]
gi|194175632|gb|EDW89243.1| GE23596 [Drosophila yakuba]
Length = 392
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/289 (50%), Positives = 183/289 (63%), Gaps = 11/289 (3%)
Query: 41 DLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES-----GDADIV-ALKNYMDAQYFGEI 94
D N + RL F+++ R ++ G + G D+ L NYMDAQY+G I
Sbjct: 18 DSNPQEKPRLLRVPLHKFQSARRHFADVGTELQQLRIRYGGGDVPEPLSNYMDAQYYGPI 77
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYG 153
IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S TY KNG I YG
Sbjct: 78 AIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKNGTEFAIQYG 137
Query: 154 TGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPV 213
+G++SG+ S D V I L +KDQ F EA EP L F+ AKFDGILGLG+ ISV K P
Sbjct: 138 SGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSSISVDKVKPP 197
Query: 214 WYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM 273
+Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY GE TY+PVT+K YWQ M
Sbjct: 198 FYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKM 257
Query: 274 GDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
I D Q C GGC IAD+GTSL+A P T +N IG T I+
Sbjct: 258 DAASIGDLQ---LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPII 303
>gi|327259983|ref|XP_003214815.1| PREDICTED: cathepsin D-like [Anolis carolinensis]
Length = 399
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 145/277 (52%), Positives = 185/277 (66%), Gaps = 9/277 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC H KY S +S+T
Sbjct: 68 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCRLLDIACMLHHKYDSSKSNT 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +NG IHYGTG++SGF S+D V IGD+ VK+Q F EAT EP +TFL AKFDGILGL
Sbjct: 128 YVQNGTKFAIHYGTGSLSGFISQDTVTIGDIAVKNQMFGEATSEPGITFLAAKFDGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF +ISV K P + N + QGL+++ +FSF+ NR+ GGEI+FGG+DP +Y G+ +
Sbjct: 188 GFPKISVDKVTPFFDNAMKQGLLDKNMFSFFLNRDPSSSPGGEIIFGGVDPKYYSGDFNW 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V VT+K YWQ M V + T C GC AI D+GTSL+ GPT + + AIGA I
Sbjct: 248 VNVTRKAYWQVHMDRVEVPSGLT-VCKNGCEAIVDTGTSLITGPTDEVKALQKAIGAKPI 306
Query: 321 VSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ + C+ + + I++ +L S LS +Y
Sbjct: 307 IKGQYILPCEKLATL---PIVSFVLGGRSYSLSAENY 340
>gi|194863696|ref|XP_001970568.1| GG10707 [Drosophila erecta]
gi|190662435|gb|EDV59627.1| GG10707 [Drosophila erecta]
Length = 390
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 138/242 (57%), Positives = 166/242 (68%), Gaps = 5/242 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NYMDAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S T
Sbjct: 63 LSNYMDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKT 122
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG I YG+G++SG+ S D V I L +KDQ F EA EP L F+ AKFDGILGL
Sbjct: 123 YTKNGTEFAIQYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGL 182
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV K P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY GE TY
Sbjct: 183 GYSSISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTY 242
Query: 261 VPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+PVT+K YWQ M I D Q C GGC IAD+GTSL+A P T +N IG T
Sbjct: 243 LPVTRKAYWQIKMDAASIGDLQ---LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTP 299
Query: 320 IV 321
I+
Sbjct: 300 II 301
>gi|21355083|ref|NP_652013.1| cathD [Drosophila melanogaster]
gi|6685167|gb|AAF23824.1|AF220040_1 cathepsin D precursor [Drosophila melanogaster]
gi|7304149|gb|AAF59186.1| cathD [Drosophila melanogaster]
gi|15292549|gb|AAK93543.1| SD07085p [Drosophila melanogaster]
gi|220946566|gb|ACL85826.1| cathD-PA [synthetic construct]
Length = 392
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 138/242 (57%), Positives = 166/242 (68%), Gaps = 5/242 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NYMDAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S T
Sbjct: 65 LSNYMDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKT 124
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG I YG+G++SG+ S D V I L +KDQ F EA EP L F+ AKFDGILGL
Sbjct: 125 YTKNGTEFAIQYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGL 184
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV K P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY GE TY
Sbjct: 185 GYNSISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTY 244
Query: 261 VPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+PVT+K YWQ M I D Q C GGC IAD+GTSL+A P T +N IG T
Sbjct: 245 LPVTRKAYWQIKMDAASIGDLQ---LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTP 301
Query: 320 IV 321
I+
Sbjct: 302 II 303
>gi|449666857|ref|XP_002161366.2| PREDICTED: lysosomal aspartic protease-like [Hydra magnipapillata]
Length = 387
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 206/317 (64%), Gaps = 19/317 (5%)
Query: 9 TAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
T + LL ++FS ++RI +KK + R ++G F+ K +
Sbjct: 3 TLSLTILLLCASLIFSE----IHRIKIKKLE-----TTVRRTLREQGFDFQ----KLGFQ 49
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIAC 127
GES + L+NYMDAQY+G+I +GTPPQ F V+FDTGSSNLWVPSS C + IAC
Sbjct: 50 SKWGESPEV----LRNYMDAQYYGDISLGTPPQPFKVVFDTGSSNLWVPSSHCGWTDIAC 105
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
H+KY +SSTY +NG I YG+G+ SG+ S D +++ D+ VK+Q F EAT EP +
Sbjct: 106 LTHNKYHGDKSSTYVQNGTKFSIQYGSGSCSGYQSIDTLQVADISVKNQMFGEATSEPGI 165
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
F+ AKFDG+LG+G+ +ISV VP +YNMV+Q LV + VFSF+ +RN ++ GGE++ G
Sbjct: 166 AFVAAKFDGLLGMGYSQISVNGVVPPFYNMVDQKLVEDAVFSFYLDRNVNDSTGGELLLG 225
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G+D + G+ TY PVT +GYWQF M V+++G+ FCA GC AIAD+GTSL+AGPT
Sbjct: 226 GVDSSKFVGDITYTPVTVEGYWQFKMDKVVVNGEPM-FCASGCNAIADTGTSLIAGPTEE 284
Query: 308 ITQVNHAIGATGIVSQE 324
+ ++N IGAT IV E
Sbjct: 285 VNKLNQMIGATPIVGGE 301
>gi|21907889|dbj|BAC05689.1| aspartic protease BmAsp-2 [Brugia malayi]
Length = 452
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 151/336 (44%), Positives = 208/336 (61%), Gaps = 15/336 (4%)
Query: 1 MGMVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGES--- 57
M F + L ++ V +T RI L+K+ + + DS E S
Sbjct: 1 MLTAFATALLSIHLLIITLSTVTATQQNHFTRIPLRKQNSLRKHLIET--DSWEAYSKLI 58
Query: 58 -FRTSIRKYSLRG--NLGESGDADIVA-----LKNYMDAQYFGEIGIGTPPQNFTVIFDT 109
F+ +K + NL + + + ++ LKNYMDAQY+GEI IGTPPQNF+V+FDT
Sbjct: 59 NFQIQRKKIQQKMGLNLHFTSEPNAISETDEVLKNYMDAQYYGEISIGTPPQNFSVVFDT 118
Query: 110 GSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI 168
GSSNLWVPS KC F IAC FH+KY+ +S+TYK +G+ I YGTG++ GF S D V I
Sbjct: 119 GSSNLWVPSVKCPFLDIACLFHNKYKGTKSTTYKPDGRKIQIQYGTGSMEGFISLDTVCI 178
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
++ V Q F EAT EP TF++AKFDGILG+ F EISV PV++ M++Q +V++PVF
Sbjct: 179 ANICVTGQPFAEATSEPGATFVMAKFDGILGMAFPEISVLGLNPVFHTMISQKVVHQPVF 238
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG 288
+FW +RN ++ GGEI FGG+D + + TY PV++ GYWQF M V+ G+ G C
Sbjct: 239 AFWLDRNPSDKIGGEITFGGIDANRFVSPITYTPVSRHGYWQFKMDRVLGRGKAIG-CGN 297
Query: 289 GCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
GC AIAD+GTSL+AGP + I ++ IGA + + E
Sbjct: 298 GCQAIADTGTSLIAGPKSQIDKIQEYIGAEHVYAGE 333
>gi|320165710|gb|EFW42609.1| lysosomal aspartic protease [Capsaspora owczarzaki ATCC 30864]
Length = 462
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/268 (51%), Positives = 181/268 (67%), Gaps = 6/268 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRS 138
V L N+ +AQY+GEI IGTPPQ F V+FDTGSSN WVPS+ C + + C H KY S +S
Sbjct: 52 VNLGNFENAQYYGEIEIGTPPQKFKVVFDTGSSNAWVPSATCKITDLPCDLHKKYHSEKS 111
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STY NG + I YG+G+++G+ S+D + L V +Q F EAT EP L F+LA+FDG+L
Sbjct: 112 STYVANGTTFAIQYGSGSLTGYLSQDTFTVAGLKVTNQVFAEATNEPGLAFVLARFDGLL 171
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE--EEGGEIVFGGMDPDHYKG 256
GLGFQEISV VPV+YNMV QGL+N F+FW +RN + GGE+V GG+DP HY G
Sbjct: 172 GLGFQEISVLNVVPVFYNMVAQGLLNSASFAFWLSRNGTSILKPGGELVLGGVDPSHYTG 231
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
TY+PV++ GYWQF + V + T G G IADSGTSLLAGP + ++N IG
Sbjct: 232 AFTYIPVSKPGYWQFALDSVQVGSTTFGANTQG---IADSGTSLLAGPVADVKKINAQIG 288
Query: 317 ATGIVSQECKAVVSQYGEEIINMLLAKV 344
A GI+++EC ++ QY I+ L+ ++
Sbjct: 289 AIGILAEECDMIIEQYEPIIVEGLVQRL 316
>gi|3378673|emb|CAA08878.1| Cathepsin D [Podarcis siculus]
Length = 399
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/278 (52%), Positives = 189/278 (67%), Gaps = 9/278 (3%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSS 139
ALKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS KC+ IAC H KY S +SS
Sbjct: 67 ALKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVKCHLLDIACLLHHKYDSSKSS 126
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+Y KNG IHYGTG++SGF S+DHV IGDL+V++Q F EA ++P +TF+ AKFDGILG
Sbjct: 127 SYVKNGTDFAIHYGTGSLSGFLSQDHVTIGDLIVQNQLFGEAVKQPGITFIAAKFDGILG 186
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + +ISV K +P + N + Q L+ + +FSF+ NR+ GGE++FGG+DP +Y G+ T
Sbjct: 187 LAYPKISVDKVLPFFDNAMKQALMEKNLFSFYLNRDPKGATGGELLFGGVDPQYYTGDFT 246
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+V VT+K YWQ M V +D T C GC AI D+GTSL+ GPT I Q+ AIGA
Sbjct: 247 WVNVTRKAYWQIHMEKVDVDNGLT-VCKDGCEAIVDTGTSLITGPTDEIKQLQKAIGAKP 305
Query: 320 IVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
I+ + C + S +N++L S L+ Y
Sbjct: 306 IIKGQYMLPCDKLSSLPN---VNLVLGGKSYALTPNQY 340
>gi|262232673|gb|ACY38599.1| cathepsin D-like aspartic protease [Anisakis simplex]
Length = 453
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/267 (53%), Positives = 176/267 (65%), Gaps = 5/267 (1%)
Query: 61 SIRKYSLRG-NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
SI + R + G S + L+NYMDAQY+G I IGTPPQNFTVIFDTGSSNLWVPS
Sbjct: 78 SIANHQFRSLSAGSSSNEIDELLRNYMDAQYYGVISIGTPPQNFTVIFDTGSSNLWVPSR 137
Query: 120 KCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
KC ++ IAC+ H KY + +SST+K +G+ I YGTG++ GF S D V + +L +DQ F
Sbjct: 138 KCKWTDIACWLHHKYDAAKSSTHKADGRELQIQYGTGSMKGFISLDTVCVAELCARDQPF 197
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
EA EP +TF+ AKFDGILG+ F EI+ PV+ MVNQ LV EPVF+FW NR D+
Sbjct: 198 AEAASEPGITFVAAKFDGILGMAFPEIAALNVTPVFNTMVNQQLVAEPVFAFWLNRTPDD 257
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT-TGFCAGGCAAIADSG 297
E GGEI FGG DP H+ Y PVT++ YWQF M I GQ T C+ GC AIAD+G
Sbjct: 258 EIGGEITFGGTDPKHFVEPIVYAPVTRRAYWQFKMDK--ISGQDGTLACSDGCQAIADTG 315
Query: 298 TSLLAGPTTIITQVNHAIGATGIVSQE 324
TSL+AGP + + IGA + S E
Sbjct: 316 TSLIAGPKQQVQLIQKYIGAEPLFSGE 342
>gi|156553448|ref|XP_001600543.1| PREDICTED: lysosomal aspartic protease-like [Nasonia vitripennis]
Length = 384
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 140/252 (55%), Positives = 173/252 (68%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+GEIGIG+P Q FTVIFDTGSSNLWVPS KC+ + IAC H+KY S +S +
Sbjct: 57 LSNYLDAQYYGEIGIGSPMQKFTVIFDTGSSNLWVPSKKCHITNIACLLHNKYDSRKSKS 116
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK NG I YG+G++SGF S D V I + VKD F EA EP L F+ AKFDGILG+
Sbjct: 117 YKANGTDFSIRYGSGSLSGFLSTDVVTIAGVDVKDTTFAEAMSEPGLAFVAAKFDGILGM 176
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+YNMV Q LV +P+FSF+ NR+ + + GGE++ GG D HY G+ TY
Sbjct: 177 AYDRISVDGVPPVFYNMVKQNLVPQPIFSFYLNRDPNAKIGGEMILGGSDSAHYTGDFTY 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++K YWQF M + I + FC GC AIAD+GTSL+AGP I +N IGAT I
Sbjct: 237 VPVSRKAYWQFKMDKITIGDKL--FCENGCEAIADTGTSLIAGPVGEIEGINKKIGATPI 294
Query: 321 VSQE----CKAV 328
V+ E C AV
Sbjct: 295 VAGEAMVSCDAV 306
>gi|218847782|ref|NP_001136375.1| cathepsin D-like precursor [Xenopus (Silurana) tropicalis]
gi|159155417|gb|AAI54878.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 399
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 194/313 (61%), Gaps = 5/313 (1%)
Query: 10 AGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG 69
A + ++L P L RI LKK + A D KE + + KYS
Sbjct: 2 ASLLVWVVLLASSLLQPGSALIRIPLKKFPSIRHTFTEAGKDVKELLANEVPL-KYS--P 58
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACY 128
SG+ ALKNY+DAQY+GEIG+G+PPQNFTV+FDTGSSNLWVPS C IAC+
Sbjct: 59 GFPPSGEPTPEALKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACW 118
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H KY S +SSTY KNG + I YGTG++SG+ S+D V IG+L VK Q F EA ++P +T
Sbjct: 119 MHHKYDSSKSSTYVKNGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVT 178
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ AKFDGILG+ + ISV PV+ N++ Q LV +FSF+ NRN D + GGE++ GG
Sbjct: 179 FVAAKFDGILGMAYPVISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGG 238
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
DP +Y G+ Y+ VT+K YWQ M + + Q T C GGC I D+GTSL+ GP +
Sbjct: 239 TDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEV 297
Query: 309 TQVNHAIGATGIV 321
T + AIGA ++
Sbjct: 298 TALQKAIGAVPLI 310
>gi|122938524|gb|ABM69086.1| aspartic proteinase AspMD03 [Musca domestica]
Length = 390
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 172/244 (70%), Gaps = 9/244 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ+F V+FDTGSSNLWVPS KC+ + IAC H+KY + +S T
Sbjct: 63 LSNYLDAQYYGPISIGTPPQDFKVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDATKSKT 122
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+K+NG IHYG+G++SG+ S D V IG L +KDQ F EA EP L F+ AKFDGILGL
Sbjct: 123 FKQNGTEFAIHYGSGSLSGYLSTDTVNIGGLDIKDQTFAEALSEPGLVFVAAKFDGILGL 182
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV P +Y M QGL+++P+FSF+ NR+ EGGEI+FGG DPDHY G+ TY
Sbjct: 183 GYSSISVDGVKPPFYAMYEQGLISQPIFSFYLNRDPKAPEGGEIIFGGSDPDHYTGDFTY 242
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTG--FCA-GGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+PVT+K YWQ M D + G CA GGC IAD+GTSL+A P + T +N AIG
Sbjct: 243 LPVTRKAYWQIKM-----DSASMGDLKCAKGGCQVIADTGTSLIALPPSEATSINQAIGG 297
Query: 318 TGIV 321
T I+
Sbjct: 298 TPIM 301
>gi|194757447|ref|XP_001960976.1| GF11236 [Drosophila ananassae]
gi|190622274|gb|EDV37798.1| GF11236 [Drosophila ananassae]
Length = 388
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 134/244 (54%), Positives = 167/244 (68%), Gaps = 9/244 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NYMDAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S +
Sbjct: 61 LSNYMDAQYYGPISIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKS 120
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG I YG+G++SG+ S D V IG L +KDQ F EA EP L F+ AKFDGILGL
Sbjct: 121 YVKNGTEFAIQYGSGSLSGYLSTDTVSIGGLNIKDQTFAEALSEPGLVFVAAKFDGILGL 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV + P +Y M QGL++ P+FSF+ NR+ EGGEI+FGG DP HY G+ TY
Sbjct: 181 GYSSISVDRVKPPFYAMYEQGLISAPIFSFYLNRDPAGPEGGEIIFGGSDPKHYSGDFTY 240
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTG---FCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+PVT+K YWQ M D + G C GGC IAD+GTSL+A P + T +N IG
Sbjct: 241 LPVTRKAYWQIKM-----DAASIGDLELCKGGCQVIADTGTSLIAAPMSEATSINQKIGG 295
Query: 318 TGIV 321
T I+
Sbjct: 296 TPII 299
>gi|195429864|ref|XP_002062977.1| GK21682 [Drosophila willistoni]
gi|194159062|gb|EDW73963.1| GK21682 [Drosophila willistoni]
Length = 389
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 150/301 (49%), Positives = 188/301 (62%), Gaps = 27/301 (8%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
+ GL R+ LKK + A R + G + KY GDA L NYM
Sbjct: 21 SAGLLRVPLKKF------QSARRHFADVGTELQQLRIKYG-------GGDAP-EPLSNYM 66
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNG 145
DAQY+G I IGTP Q+F V+FDTGSSNLWVPS KC+F+ IAC H+KY + +S+TY KNG
Sbjct: 67 DAQYYGPISIGTPAQSFKVVFDTGSSNLWVPSKKCHFTNIACLMHNKYDATKSNTYAKNG 126
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
IHYG+G++SG+ S D V IG L +K Q F EA EP L F+ AKFDGILGLG+ I
Sbjct: 127 TEFAIHYGSGSLSGYLSTDTVGIGGLNIKGQTFAEALSEPGLVFVAAKFDGILGLGYSSI 186
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SV P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY G+ TY+PVT+
Sbjct: 187 SVDGVKPPFYAMYEQGLISSPVFSFYLNRDPSAPEGGEIIFGGSDPNHYTGDFTYLPVTR 246
Query: 266 KGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
K YWQ M GD+ + C GGC IAD+GTSL+A P + T +N IG T I
Sbjct: 247 KAYWQIKMDSASVGDLQV-------CQGGCQVIADTGTSLIAAPLSEATSINQKIGGTPI 299
Query: 321 V 321
+
Sbjct: 300 I 300
>gi|205289916|gb|ACI02330.1| aspartic protease 1 [Uncinaria stenocephala]
Length = 447
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 135/244 (55%), Positives = 169/244 (69%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLWVPS KC ++ IAC H +Y SG SST
Sbjct: 88 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLHHRYDSGASST 147
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D+V I + ++Q F EAT EP LTF+ AKFDGILG+
Sbjct: 148 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGM 207
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EISV PV++ + Q V P+F+FW NRN D E GGEI GGMDP Y T+
Sbjct: 208 AFPEISVLGVPPVFHTFIEQKKVPSPMFAFWLNRNPDSELGGEITLGGMDPRRYVEPLTW 267
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++GYWQF M D++ G ++ C GC AIAD+GTSL+AGP + + IGA +
Sbjct: 268 TPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPL 326
Query: 321 VSQE 324
+ E
Sbjct: 327 MRGE 330
>gi|27803878|gb|AAO22152.1| cathepsin D-like aspartic protease [Ancylostoma ceylanicum]
Length = 446
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 167/244 (68%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLWVPS KC ++ IAC H +Y SG SST
Sbjct: 87 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLHHRYDSGASST 146
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D+V I + +Q F EAT EP LTF+ AKFDGILG+
Sbjct: 147 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAVEQPFAEATSEPGLTFIAAKFDGILGM 206
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EISV PV++ + Q V PVF+FW NRN D E GGEI GGMDP Y T+
Sbjct: 207 AFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGMDPRRYVEPITW 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++GYWQF M D + G T+ C GC AIAD+GTSL+AGP + + IGA +
Sbjct: 267 TPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPL 325
Query: 321 VSQE 324
+ E
Sbjct: 326 MKGE 329
>gi|167524529|ref|XP_001746600.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774870|gb|EDQ88496.1| predicted protein [Monosiga brevicollis MX1]
Length = 381
Score = 280 bits (716), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 143/246 (58%), Positives = 168/246 (68%), Gaps = 4/246 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY DAQYFGEI IGTP Q F VIFDTGSSNLWVPSS+C +IAC H+KY S S
Sbjct: 53 VPLSNYEDAQYFGEISIGTPAQKFKVIFDTGSSNLWVPSSQCPKTNIACDVHAKYDSSAS 112
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STYK NG I YGTG++SGF S D IGDL VKDQ F EA EP +TF+ AKFDGIL
Sbjct: 113 STYKANGTKFAIQYGTGSLSGFLSTDTACIGDLCVKDQTFAEALEEPGVTFVAAKFDGIL 172
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF ISV VPVWYNMV Q +V + ++SF+ NRN + GGE+ GG D H+ G
Sbjct: 173 GMGFSTISVDHVVPVWYNMVQQQVVEQNMYSFYLNRNPNGVSGGELTLGGYDESHFAGPI 232
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ VT GYWQF M + I + T +C C AIAD+GTSLLAGPT ++ Q+N AIGAT
Sbjct: 233 HWTDVTVDGYWQFTMTGLSI--ENTPYCT-NCKAIADTGTSLLAGPTDVVKQINKAIGAT 289
Query: 319 GIVSQE 324
I + E
Sbjct: 290 TIAAGE 295
>gi|17981530|gb|AAL51056.1|AF454831_1 cathepsin D [Apriona germari]
Length = 386
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 152/326 (46%), Positives = 201/326 (61%), Gaps = 29/326 (8%)
Query: 22 VFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR-KYSLRGNLGESGDADIV 80
VF T N L R+ L++ K +AR +E + +R +Y + G E
Sbjct: 12 VFITVNCDLIRVPLERGK-------SARRTLQEVNTHVQQVRFRYGVGGPAPE------- 57
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSS 139
L NY+DAQYFG I IG PPQ F V+FDTGSSNLWVPS KC+++ IAC H+KY S +SS
Sbjct: 58 PLSNYLDAQYFGPISIGNPPQKFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSKSS 117
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF S D V +G L VKDQ F EA EP L F+ AKFD
Sbjct: 118 TYKKNGTDFSIKYGSGSLSGFLSTDVVTVGSLAVKDQTFAEAMSEPGLAFVAAKFDEYPW 177
Query: 200 LGFQE-ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G Q+ + ++P +YNM+ QGLV++PVFSF+ NR+ D EGGE+ GG DP +YKG
Sbjct: 178 HGLQQDLGSRASLPFFYNMITQGLVSQPVFSFYLNRDPDAAEGGELSLGGSDPKYYKGNF 237
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TY+ V ++ YWQF M + + G+T FC GC AIAD+GTSL+AGP +T +N IG T
Sbjct: 238 TYLSVDRQAYWQFKMDKIQL-GKTV-FCKSGCQAIADTGTSLVAGPVDEVTSINKLIGGT 295
Query: 319 GIVSQECKAVVSQYGEEIINMLLAKV 344
I+ GE +++ L+ K+
Sbjct: 296 PIIG----------GEYVVDCLIPKL 311
>gi|195581342|ref|XP_002080493.1| GD10217 [Drosophila simulans]
gi|194192502|gb|EDX06078.1| GD10217 [Drosophila simulans]
Length = 324
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 136/238 (57%), Positives = 164/238 (68%), Gaps = 5/238 (2%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKN 144
MDAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H+KY + +S TY KN
Sbjct: 1 MDAQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSKTYTKN 60
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G IHYG+G++SG+ S D V I L +KDQ F EA EP L F+ AKFDGILGLG+
Sbjct: 61 GTEFAIHYGSGSLSGYLSTDTVSIAGLDIKDQTFAEALSEPGLVFVAAKFDGILGLGYSS 120
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV K P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP+HY GE TY+PVT
Sbjct: 121 ISVDKVKPPFYAMYEQGLISAPVFSFYLNRDPASPEGGEIIFGGSDPNHYTGEFTYLPVT 180
Query: 265 QKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
+K YWQ M I D Q C GGC IAD+GTSL+A P T +N IG T I+
Sbjct: 181 RKAYWQIKMDAASIGDLQ---LCKGGCQVIADTGTSLIAAPLEEATSINQKIGGTPII 235
>gi|66911216|gb|AAH96630.1| LOC613063 protein, partial [Xenopus (Silurana) tropicalis]
Length = 395
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 146/297 (49%), Positives = 189/297 (63%), Gaps = 5/297 (1%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
P L RI LKK + A D KE + + KYS SG+ ALKNY
Sbjct: 14 PGSALIRIPLKKFPSIRHTFTEAGKDVKELLANEVPL-KYS--PGFPPSGEPTPEALKNY 70
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKN 144
+DAQY+GEIG+G+PPQNFTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KN
Sbjct: 71 LDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACWMHHKYDSSKSSTYVKN 130
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G + I YGTG++SG+ S+D V IG+L VK Q F EA ++P +TF+ AKFDGILG+ +
Sbjct: 131 GTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVTFVAAKFDGILGMAYPV 190
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV PV+ N++ Q LV +FSF+ NRN D + GGE++ GG DP +Y G+ Y+ VT
Sbjct: 191 ISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVT 250
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
+K YWQ M + + Q T C GGC I D+GTSL+ GP +T + AIGA ++
Sbjct: 251 RKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLI 306
>gi|116284100|gb|AAI23963.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 396
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 146/297 (49%), Positives = 189/297 (63%), Gaps = 5/297 (1%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
P L RI LKK + A D KE + + KYS SG+ ALKNY
Sbjct: 15 PGSALIRIPLKKFPSIRHTFTEAGKDVKELLANEVPL-KYS--PGFPPSGEPTPEALKNY 71
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKN 144
+DAQY+GEIG+G+PPQNFTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KN
Sbjct: 72 LDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACWMHHKYDSSKSSTYVKN 131
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G + I YGTG++SG+ S+D V IG+L VK Q F EA ++P +TF+ AKFDGILG+ +
Sbjct: 132 GTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVTFVAAKFDGILGMAYPV 191
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV PV+ N++ Q LV +FSF+ NRN D + GGE++ GG DP +Y G+ Y+ VT
Sbjct: 192 ISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSVT 251
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
+K YWQ M + + Q T C GGC I D+GTSL+ GP +T + AIGA ++
Sbjct: 252 RKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLI 307
>gi|225717994|gb|ACO14843.1| Lysosomal aspartic protease precursor [Caligus clemensi]
Length = 386
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 132/244 (54%), Positives = 170/244 (69%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ+F VIFDTGSSNLWVPS C+ + IAC H K+ +SS+
Sbjct: 59 LSNYLDAQYYGPITIGTPPQSFNVIFDTGSSNLWVPSKSCHITNIACLLHHKFDHSKSSS 118
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YG+G++ GF S D V +G + + Q F EA EP + F+ AKFDGILG+
Sbjct: 119 YVVNGTEFAIQYGSGSLFGFLSTDSVSMGGVEIGSQTFGEAMSEPGMAFVAAKFDGILGM 178
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V VP +YNM QGL+ EPVFSF+ NR+ + + GGEI+FGG DPDHYKG TY
Sbjct: 179 GYSNIAVDGVVPPFYNMFKQGLIQEPVFSFYLNRDPNAQVGGEIIFGGSDPDHYKGNITY 238
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+KGYWQF M + + +T FC GC AIAD+GTSL+AGP+ + +N +G I
Sbjct: 239 IPVTKKGYWQFKMDGMKVSSKT--FCQNGCQAIADTGTSLIAGPSVEVNALNQLLGGMPI 296
Query: 321 VSQE 324
V+ E
Sbjct: 297 VNGE 300
>gi|118429511|gb|ABK91803.1| aspartic protease precursor [Clonorchis sinensis]
Length = 425
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 202/305 (66%), Gaps = 13/305 (4%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
LLF V+ +T + RI L K ++ R+ E +F TSIR + G++ E
Sbjct: 6 LLFWVLLTTSECSIIRIPLTGFK-NVRRRLMEVGTPVEQLNF-TSIR-FVGNGSIPE--- 59
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRS 135
L NY+DAQY+GEIGIGTPPQ+F V+FDTGSSNLWVPS C FSIAC+ H KY S
Sbjct: 60 ----ILNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDS 115
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+SSTY NG I YG+G++SG S D+V +G + VK+Q F EA +EP + F+ AKFD
Sbjct: 116 AKSSTYMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFD 175
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+GF+ ISV ++ NM++QGLV+EPVFSF+ +RNA + GGE++ GG DP +YK
Sbjct: 176 GILGMGFKTISVDGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYK 235
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
GE + P+T + YWQF + D M G + C GC AIAD+GTSL+AGP+ + ++N A+
Sbjct: 236 GEILWAPLTHEAYWQFKV-DSMNVG-SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDAL 293
Query: 316 GATGI 320
GA I
Sbjct: 294 GAIKI 298
>gi|357627475|gb|EHJ77155.1| cathepsin D [Danaus plexippus]
Length = 358
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 133/238 (55%), Positives = 167/238 (70%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IG PPQ F V+FDTGSSNLWVPS KC+++ IAC H+KY S +S +
Sbjct: 32 LSNYLDAQYYGPISIGNPPQTFKVVFDTGSSNLWVPSKKCHYTNIACLLHNKYDSSKSKS 91
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG IHYG+G++SGF S D V +G + VK Q F EA EP L F+ AKFDGILG+
Sbjct: 92 YHKNGTEFAIHYGSGSLSGFLSVDDVTLGGMTVKSQTFAEAMSEPGLAFVAAKFDGILGM 151
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F I+V PV+ NMV QGLV PVFSF+ NR+A +GGE+V GG DP HY+G TY
Sbjct: 152 AFASIAVDGVTPVFDNMVKQGLV-APVFSFYLNRDASAAQGGELVLGGSDPAHYRGPLTY 210
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VP+++ YWQF M V+++G + FC GC AIAD+GTSL+ GP + +N IGAT
Sbjct: 211 VPLSKDTYWQFQMDGVLVNGSS--FCKRGCQAIADTGTSLIGGPVEEVAALNAKIGAT 266
>gi|71727523|gb|AAZ39883.1| cathepsin D-like aspartic protease [Opisthorchis viverrini]
Length = 425
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 130/237 (54%), Positives = 173/237 (72%), Gaps = 3/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NY+DAQY+GEIGIGTPPQ+F V+FDTGSSNLWVPS+ C F+IAC+ H KY S RSST
Sbjct: 61 LNNYLDAQYYGEIGIGTPPQSFQVVFDTGSSNLWVPSTHCSIFNIACWLHHKYDSARSST 120
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YG+G++SG S D+V +G ++VK+Q F EA +EP + F+ AKFDGILG+
Sbjct: 121 YYPNGTEFSIRYGSGSVSGILSTDYVSVGTVIVKNQTFGEAMKEPGIAFVAAKFDGILGM 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF+ ISV ++ NM++QGLV EPVFSF+ +RNA + GGE++ GG DP +YKGE +
Sbjct: 181 GFKSISVDGVPTLFDNMISQGLVPEPVFSFYLDRNASDPVGGELLLGGTDPKYYKGEILW 240
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
P+T + YWQF + + + G C GC AIAD+GTSL+AGP+ + ++N A+GA
Sbjct: 241 APLTHEAYWQFKVDSMSVGGMK--LCENGCQAIADTGTSLIAGPSEEVGKLNDALGA 295
>gi|358255149|dbj|GAA56870.1| cathepsin D [Clonorchis sinensis]
Length = 425
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 202/305 (66%), Gaps = 13/305 (4%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
LLF V+ +T + RI L K ++ R+ E +F TSIR + G++ E
Sbjct: 6 LLFWVLLTTSECSVIRIPLTGFK-NVRRRLMEVGTPVEQLNF-TSIR-FVGNGSIPE--- 59
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRS 135
L NY+DAQY+GEIGIGTPPQ+F V+FDTGSSNLWVPS C FSIAC+ H KY S
Sbjct: 60 ----ILNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDS 115
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+SSTY NG I YG+G++SG S D+V +G + VK+Q F EA +EP + F+ AKFD
Sbjct: 116 AKSSTYMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFD 175
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+GF+ ISV ++ NM++QGLV+EPVFSF+ +RNA + GGE++ GG DP +YK
Sbjct: 176 GILGMGFKTISVDGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYK 235
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
GE + P+T + YWQF + D M G + C GC AIAD+GTSL+AGP+ + ++N A+
Sbjct: 236 GEILWAPLTHEAYWQFKV-DSMNVG-SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDAL 293
Query: 316 GATGI 320
GA I
Sbjct: 294 GAIKI 298
>gi|205364148|gb|ACI04532.1| aspartic protease 1 precursor [Ancylostoma duodenale]
Length = 446
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 136/244 (55%), Positives = 167/244 (68%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLWVPS KC ++ IAC H +Y SG SST
Sbjct: 87 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLHRRYDSGASST 146
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D+V I + ++Q F EAT EP LTF+ AKFDGILG+
Sbjct: 147 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGM 206
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EISV PV++ + Q V PVF+FW NRN D E GGEI GGMD Y T+
Sbjct: 207 AFPEISVLGVPPVFHTFIEQKKVPSPVFAFWLNRNPDSELGGEITLGGMDTRRYVEPITW 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++GYWQF M D + G T+ C GC AIAD+GTSL+AGP + + IGA +
Sbjct: 267 TPVTRRGYWQFKM-DKVQGGSTSIACPNGCQAIADTGTSLIAGPKAQVEAIQKFIGAEPL 325
Query: 321 VSQE 324
+ E
Sbjct: 326 MKGE 329
>gi|125807245|ref|XP_001360320.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|195149648|ref|XP_002015768.1| GL11239 [Drosophila persimilis]
gi|54635492|gb|EAL24895.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|194109615|gb|EDW31658.1| GL11239 [Drosophila persimilis]
Length = 388
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 132/241 (54%), Positives = 165/241 (68%), Gaps = 3/241 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NYMDAQY+G I IG+PPQ+F V+FDTGSSNLWVPS KC+ + IAC H+KY + +SST
Sbjct: 61 LSNYMDAQYYGPISIGSPPQSFRVVFDTGSSNLWVPSKKCHLTNIACLMHNKYDASKSST 120
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG + I YG+G++SG+ S D + +G L +K Q F EA EP L F+ AKFDGILGL
Sbjct: 121 YAKNGTTFAIQYGSGSLSGYLSTDTLSMGGLDIKGQTFAEALSEPGLVFVAAKFDGILGL 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV P +Y M QGL++ PVFSF+ NR+ EGGEI+FGG DP HY G+ TY
Sbjct: 181 GYSSISVDGVKPPFYAMYEQGLISSPVFSFYLNRDPASPEGGEIIFGGSDPKHYTGDFTY 240
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+K YWQ M + C GGC IAD+GTSL+A P T T +N IG T I
Sbjct: 241 LPVTRKAYWQIKMDSAALGDLE--LCKGGCQVIADTGTSLIAAPMTEATSINQKIGGTPI 298
Query: 321 V 321
+
Sbjct: 299 I 299
>gi|346469557|gb|AEO34623.1| hypothetical protein [Amblyomma maculatum]
Length = 391
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 151/304 (49%), Positives = 196/304 (64%), Gaps = 20/304 (6%)
Query: 32 RIGLKKRKFDLNN--RVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
R+ LKK K + V ++ G SF ++ L G + E LKNY+DAQ
Sbjct: 23 RVPLKKMKSTREHLREVGTPINVTRGWSFGSA-----LGGPVPE-------PLKNYLDAQ 70
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSA 148
Y+G+I +GTPPQ F V+FDTGSSNLWVPSSKC F+ IAC H KY + +SSTY KNG
Sbjct: 71 YYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYYAKKSSTYVKNGTKF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G+++G S D +GD+ V+ Q F E E L F+ AKFDGILGLG+ +ISV
Sbjct: 131 EIRYGSGSVTGELSTDVFGLGDVRVQSQTFAEILHESGLAFIAAKFDGILGLGYPQISVL 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NMV QG+ +PVFS + +RNA + GGE++FGG+D HY G +YVPVT+KGY
Sbjct: 191 GVPPVFDNMVAQGVATKPVFSVYLDRNATDPNGGEVLFGGIDEAHYTGNISYVPVTRKGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQF M D + G FC GGC AIAD+GTSL+AGPT I ++N AIGA + E
Sbjct: 251 WQFHM-DGLKVGDNATFCNGGCEAIADTGTSLIAGPTEEIQKLNLAIGAAPFTAGEYLVS 309
Query: 325 CKAV 328
CK++
Sbjct: 310 CKSI 313
>gi|148229393|ref|NP_001085403.1| MGC82347 protein precursor [Xenopus laevis]
gi|48734644|gb|AAH72252.1| MGC82347 protein [Xenopus laevis]
Length = 401
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 148/300 (49%), Positives = 190/300 (63%), Gaps = 5/300 (1%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
P L RI LKK + A D+KE + KYS +G A AL NY
Sbjct: 20 PGSALIRIPLKKFPSIRHTLTEAGGDAKELLGNGMPL-KYST--GFPPNGKATPEALMNY 76
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKN 144
+DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C F IAC+ H KY S +SSTY KN
Sbjct: 77 LDAQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSMFDIACWMHHKYDSSKSSTYVKN 136
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G I YGTG++SG+ S+D V IG+L +K+Q F EA ++P +TF+ AKFDGILG+ +
Sbjct: 137 GTEFAIQYGTGSLSGYLSKDTVTIGNLGIKEQLFGEAIKQPGVTFIAAKFDGILGMAYPI 196
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV PV+ N++ Q LV VFSF+ NRN D + GGE++ GG DP +Y G+ Y+ VT
Sbjct: 197 ISVDGVSPVFDNIMAQKLVESNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLNVT 256
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+K YWQ M + + Q T C GGC AI D+GTSL+ GP +T + AIGA ++ +
Sbjct: 257 RKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLITGPLEEVTALQKAIGAVPLIQGQ 315
>gi|312097106|ref|XP_003148873.1| aspartic protease BmAsp-2 [Loa loa]
gi|307755962|gb|EFO15196.1| aspartic protease BmAsp-2 [Loa loa]
Length = 417
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 129/244 (52%), Positives = 171/244 (70%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
LKNYMDAQY+G+I IGTP QNF+V+FDTGSSNLW+PS KC FS IAC FH+KY+ +S+T
Sbjct: 63 LKNYMDAQYYGQISIGTPAQNFSVVFDTGSSNLWIPSVKCPFSDIACLFHNKYKGAQSTT 122
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G+ I YG G++ GF S D V I D+ V DQ F EAT EP +TF++AKFDGILG+
Sbjct: 123 YKPDGRKIKIQYGRGSMEGFISSDTVCIADICVTDQPFAEATSEPGVTFVMAKFDGILGM 182
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EI+V PV++ M+ Q V E +F+FW +RN ++E GGEI GG+D + + Y
Sbjct: 183 AFPEIAVLGLSPVFHTMIKQKTVKESLFAFWLDRNPNDEIGGEITLGGIDVNRFVAPLVY 242
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
P+++ GYWQF M + DG+ CA GC AIAD+GTSL+AGP + I ++ IGA +
Sbjct: 243 TPISKHGYWQFQMDSIQGDGKAIS-CANGCQAIADTGTSLIAGPKSQIDKIQKYIGAEHL 301
Query: 321 VSQE 324
+ E
Sbjct: 302 YADE 305
>gi|351712803|gb|EHB15722.1| Cathepsin D, partial [Heterocephalus glaber]
Length = 390
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 137/264 (51%), Positives = 181/264 (68%), Gaps = 15/264 (5%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSS 139
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPSS+C IAC+FH KY S +SS
Sbjct: 50 TLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSSRCNMLDIACWFHHKYHSDKSS 109
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIG---------DLVVKDQEFIEATREPSLTFL 190
TY KNG S DIHYG+G++SG+ S+D V + +L V+ Q F EAT++P +TF+
Sbjct: 110 TYVKNGSSFDIHYGSGSLSGYLSQDTVSVPCQSAESNPRNLRVEKQTFGEATKQPGITFI 169
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV +PV+ N+++Q LV++ VFSF+ NR+ + GGE++ GG+D
Sbjct: 170 AAKFDGILGMAYPRISVNNVLPVFDNLMSQKLVDKNVFSFYLNRDPSAQPGGELMLGGID 229
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+YKG TY+ VT+K YWQ M + + G C GGC AI D+GTSLL GP + +
Sbjct: 230 SKYYKGSFTYLNVTRKAYWQVHMDQLEV-GSGLNLCKGGCEAIVDTGTSLLVGPVDEVKE 288
Query: 311 VNHAIGATGIVSQE----CKAVVS 330
+ AIGA ++ E C+ V S
Sbjct: 289 LQKAIGAIPLIQGEYMVPCEKVSS 312
>gi|387915174|gb|AFK11196.1| cathepsin D1 [Callorhinchus milii]
Length = 394
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 151/320 (47%), Positives = 194/320 (60%), Gaps = 13/320 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
LLL + +T G L RI L + R R S G S + + + +
Sbjct: 7 LLLLTAITAT-GGALVRIPLTRF------RSIRRALSDSGRSVEDLLPENKYKTDSPGIN 59
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYR 134
LKNY+DAQY+GE+GIGTPPQ FTV+FDTGSSNLWVPS C F IAC H KY
Sbjct: 60 GPTPETLKNYLDAQYYGEVGIGTPPQPFTVVFDTGSSNLWVPSVHCSMFDIACLLHHKYN 119
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S +SS+Y +NG I YG+G++SG+ S+D V IG++ V+ Q F EA ++P L F+ AKF
Sbjct: 120 SDKSSSYVRNGTKFAIRYGSGSLSGYLSKDTVLIGNIKVQSQLFGEAIKQPGLAFIAAKF 179
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+G+ ISV +PV+ N+V Q LV VFSF+ NRN D GGE++ GG DP +Y
Sbjct: 180 DGILGMGYPLISVDGVIPVFDNIVTQKLVPNNVFSFYLNRNPDSLPGGELILGGTDPKYY 239
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G+ Y+ VT+K YWQ M +V I Q T C GGCAAI D+GTSL+ GP I + A
Sbjct: 240 TGDFHYLNVTRKAYWQVKMDEVSIGEQLT-LCKGGCAAIVDTGTSLITGPAQEIKALQKA 298
Query: 315 IGATGIVSQE----CKAVVS 330
IGA ++ E CK V S
Sbjct: 299 IGAIPLIQGEYLIDCKKVAS 318
>gi|348530268|ref|XP_003452633.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 396
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 193/314 (61%), Gaps = 12/314 (3%)
Query: 16 LLLFPVV-FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR-KYSLRGNLG- 72
L LF + + N L RI LKK R R + G+ + K+SL+ N G
Sbjct: 4 LFLFVIAALALTNDALVRIPLKKF------RSIRRELTDSGKGIEELVADKHSLKYNFGF 57
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
S LKNY+DAQY+GEI +GTPPQ FTV+FDTGSSNLWVPS C +F IAC+ H
Sbjct: 58 PSSNGPTPETLKNYLDAQYYGEITLGTPPQKFTVVFDTGSSNLWVPSVHCSFFDIACWLH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY S +SSTY KNG S I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F+
Sbjct: 118 HKYNSAKSSTYVKNGTSFAIQYGSGSLSGYLSQDTCSIGDISVEKQIFGEAIKQPGVAFI 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV VPV+ NM+NQ V + VFSF+ NRN D E GGE++ GG D
Sbjct: 178 AAKFDGILGMAYPSISVDGVVPVFDNMMNQKKVEKNVFSFYLNRNPDTEPGGELLLGGTD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P +Y G+ Y ++++ YWQ M D M G C GGC AI D+GTSL+ GP +
Sbjct: 238 PKYYDGDFHYANISRQAYWQVHM-DGMTVGSQLSLCKGGCEAIVDTGTSLITGPAAEVKA 296
Query: 311 VNHAIGATGIVSQE 324
+ AIGA ++ E
Sbjct: 297 LQKAIGAIPLIQGE 310
>gi|256072901|ref|XP_002572772.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043052|emb|CCD78464.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 428
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 133/249 (53%), Positives = 171/249 (68%), Gaps = 7/249 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +SST
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSST 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG +HYGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YIPNGTEFSVHYGTGSLSGFLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+ NM+ QG+V PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSISVDGVTPVFVNMIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T++ YW F M + I T C GC AIAD+GTS++AGPT I ++N +GAT
Sbjct: 238 VDLTEQSYWLFKMDKLTISDMTA--CPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRL 295
Query: 319 --GIVSQEC 325
GI + C
Sbjct: 296 PGGIYTVSC 304
>gi|226476812|emb|CAX72322.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 175/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS+ C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSTHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|256072903|ref|XP_002572773.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043053|emb|CCD78465.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 430
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 133/249 (53%), Positives = 171/249 (68%), Gaps = 7/249 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +SST
Sbjct: 60 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG +HYGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 120 YIPNGTEFSVHYGTGSLSGFLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGM 179
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+ NM+ QG+V PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 180 AYPSISVDGVTPVFVNMIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINY 239
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T++ YW F M + I T C GC AIAD+GTS++AGPT I ++N +GAT
Sbjct: 240 VDLTEQSYWLFKMDKLTISDMTA--CPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRL 297
Query: 319 --GIVSQEC 325
GI + C
Sbjct: 298 PGGIYTVSC 306
>gi|1778026|gb|AAB63442.1| aspartic proteinase [Schistosoma mansoni]
Length = 427
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 133/249 (53%), Positives = 171/249 (68%), Gaps = 7/249 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +SST
Sbjct: 57 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHRKYDSSKSST 116
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG +HYGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 117 YIPNGTEFSVHYGTGSLSGFLSTDSLQLGSLSVKGQTFGEATQQPGLVFVMAKFDGILGM 176
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV PV+ NM+ QG+V PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 177 AYPSISVDGVTPVFVNMIQQGIVESPVFSFYLSRNISAVLGGELMIGGIDKKYYSGEINY 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T++ YW F M + I T C GC AIAD+GTS++AGPT I ++N +GAT
Sbjct: 237 VDLTEQSYWLFKMDKLTISDMTA--CPDGCLAIADTGTSMIAGPTDEIQKINAKLGATRL 294
Query: 319 --GIVSQEC 325
GI + C
Sbjct: 295 PGGIYTVSC 303
>gi|315440803|gb|ADU20407.1| aspartic protease 1 [Clonorchis sinensis]
Length = 425
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 150/305 (49%), Positives = 201/305 (65%), Gaps = 13/305 (4%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
LLF V+ +T + RI L K ++ R+ E +F TSIR + G++ E
Sbjct: 6 LLFWVLLTTSECSVIRIPLTGFK-NVRRRLMEVGTPVEQLNF-TSIR-FVGNGSIPE--- 59
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRS 135
L NY+DAQY+GEIGIGTPPQ+F V+FDTGSSNLWVPS C FSIAC+ H KY S
Sbjct: 60 ----ILNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDS 115
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+ STY NG I YG+G++SG S D+V +G + VK+Q F EA +EP + F+ AKFD
Sbjct: 116 AKYSTYMANGTEFSIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFD 175
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+GF+ ISV ++ NM++QGLV+EPVFSF+ +RNA + GGE++ GG DP +YK
Sbjct: 176 GILGMGFKTISVDGVPTLFDNMISQGLVSEPVFSFYLDRNASDPVGGELLLGGTDPKYYK 235
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
GE + P+T + YWQF + D M G + C GC AIAD+GTSL+AGP+ + ++N A+
Sbjct: 236 GEILWAPLTHEAYWQFKV-DSMNVG-SMKLCENGCQAIADTGTSLIAGPSEEVGKLNDAL 293
Query: 316 GATGI 320
GA I
Sbjct: 294 GAIKI 298
>gi|226476838|emb|CAX72335.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|2347147|gb|AAC37302.1| aspartic proteinase precursor [Schistosoma japonicum]
gi|226476814|emb|CAX72323.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476816|emb|CAX72324.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476820|emb|CAX72326.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476822|emb|CAX72327.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476824|emb|CAX72328.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476826|emb|CAX72329.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476834|emb|CAX72333.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476836|emb|CAX72334.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476840|emb|CAX72336.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476842|emb|CAX72337.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476844|emb|CAX72338.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476846|emb|CAX72339.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476852|emb|CAX72342.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476880|emb|CAX72318.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476882|emb|CAX72317.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476886|emb|CAX72315.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476890|emb|CAX72313.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476892|emb|CAX72312.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476894|emb|CAX72311.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476896|emb|CAX72310.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476898|emb|CAX72309.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476900|emb|CAX72308.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226482870|emb|CAX79402.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|226476888|emb|CAX72314.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476904|emb|CAX72306.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|226476818|emb|CAX72325.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|2102722|gb|AAB63357.1| aspartic protease precursor, partial [Schistosoma japonicum]
Length = 428
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 57 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 116
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 117 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 176
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 177 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 237 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 294
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 295 PGGIYTVSCDVI 306
>gi|189502972|gb|ACE06867.1| unknown [Schistosoma japonicum]
Length = 429
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|387015018|gb|AFJ49628.1| Cathepsin D [Crotalus adamanteus]
Length = 399
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 135/242 (55%), Positives = 172/242 (71%), Gaps = 2/242 (0%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSS 139
ALKNYMDAQY+GEIGIGTP Q FTV+FDTGSSNLWVPSS C IAC H KY S +SS
Sbjct: 67 ALKNYMDAQYYGEIGIGTPQQRFTVVFDTGSSNLWVPSSHCTLLDIACLIHHKYDSSKSS 126
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG IHYGTG++SG+ S+D V IGD+ VK+Q F EAT++P +TF+ AKFDGILG
Sbjct: 127 TYVKNGTDFAIHYGTGSLSGYLSQDTVTIGDMCVKNQLFGEATKQPGITFIAAKFDGILG 186
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+ + EISV K P + N++ QGL+ + +FSF+ NR+ E GGE++FGG D +Y G+ +
Sbjct: 187 MAYPEISVDKVAPFFDNVMEQGLLEKNLFSFYLNRDPKGETGGELLFGGTDSQYYSGDFS 246
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+V V++K YWQ M V + T C GC AI D+GTSL+ GPT I ++ AIGA
Sbjct: 247 WVNVSRKAYWQVHMDKVDVANGLT-VCKDGCEAIVDTGTSLITGPTKEIKELQKAIGAKP 305
Query: 320 IV 321
I+
Sbjct: 306 II 307
>gi|226476902|emb|CAX72307.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|226476830|emb|CAX72331.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|224050910|ref|XP_002199093.1| PREDICTED: cathepsin D [Taeniopygia guttata]
Length = 396
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/266 (53%), Positives = 176/266 (66%), Gaps = 4/266 (1%)
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
T KY L E+G LKNYMDAQYFG IGIGTPPQNFTVIFDTGSSNLWVPS
Sbjct: 48 TEAIKYKL--GFAEAGKPTPEILKNYMDAQYFGVIGIGTPPQNFTVIFDTGSSNLWVPSV 105
Query: 120 KC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
C IAC H KY S +SSTY KNG I YGTG++SG+ S+D V +GDL + DQ F
Sbjct: 106 HCSLLDIACMVHHKYDSAKSSTYVKNGTKFAIRYGTGSLSGYLSQDIVTLGDLKIMDQIF 165
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
EAT++P +TF+ AKFDGILGL F +ISV A P + N++ Q LV + +FSF+ NR+
Sbjct: 166 GEATKQPGITFIAAKFDGILGLAFPKISVEGAEPFFDNVMKQKLVEKNMFSFYLNRDPSG 225
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
GGE+V GG DP +YKGE ++ VT+K YWQ M V + G C GGC AI D+GT
Sbjct: 226 VPGGEMVLGGTDPKYYKGEFSWFNVTRKAYWQIHMDSVDV-GNGPTVCEGGCEAIVDTGT 284
Query: 299 SLLAGPTTIITQVNHAIGATGIVSQE 324
SL+ GPT + ++ AIGA ++ E
Sbjct: 285 SLITGPTKEVKKIQEAIGAKPLIKGE 310
>gi|226476856|emb|CAX72344.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|412987808|emb|CCO19204.1| cathepsin D (lysosomal aspartyl protease) [Bathycoccus prasinos]
Length = 628
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 149/283 (52%), Positives = 192/283 (67%), Gaps = 16/283 (5%)
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
GE G V + NYMDAQY+G + IGTP Q F V FDTGSSNLWVPSSKC FS I C H
Sbjct: 133 GEGGATSSVPIANYMDAQYYGPVEIGTPGQKFQVCFDTGSSNLWVPSSKCKFSQIPCDAH 192
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLV-VKDQEFIEATREPSLTF 189
KY S +S +Y+ NG+ I YG+G++SGF S D V++G+ + +KDQ F EAT+EP LTF
Sbjct: 193 EKYDSEKSRSYEPNGEDFAIQYGSGSLSGFLSSDTVRLGNSIEIKDQTFAEATKEPGLTF 252
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE---EGGEIVF 246
L AKFDGILGLGF+EI+V PV+ N V Q V + FSFW NR+ D + +GGE+VF
Sbjct: 253 LFAKFDGILGLGFKEIAVDGVTPVFDNAVAQNQVEKDQFSFWLNRDQDGDGVVDGGELVF 312
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI----------DGQTTGFCAG-GCAAIAD 295
GG+D H+ GEH +V +T+KGYWQFD+ DV + D T F + AIAD
Sbjct: 313 GGVDEKHFVGEHVWVDLTKKGYWQFDLDDVKVGEFSFIDDKNDKTTVSFSSSTKHQAIAD 372
Query: 296 SGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIIN 338
+GTSLLAGP+ +I ++N AIGA ++ QECK + +YGEE ++
Sbjct: 373 TGTSLLAGPSAVIDKINDAIGAENLMIQECKIAIKRYGEEFLD 415
>gi|195120065|ref|XP_002004549.1| GI19550 [Drosophila mojavensis]
gi|193909617|gb|EDW08484.1| GI19550 [Drosophila mojavensis]
Length = 387
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/263 (54%), Positives = 178/263 (67%), Gaps = 7/263 (2%)
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
T + + ++ G+S + L NY+DAQY+G I IGTPPQNF V+FDTGSSNLWVPS
Sbjct: 42 TELHQLRIKYGAGDSPEP----LSNYLDAQYYGPISIGTPPQNFKVVFDTGSSNLWVPSK 97
Query: 120 KCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
KC+ + IAC H+KY + +SSTY KNG S DIHYG+G++SG+ S D V I L +K Q F
Sbjct: 98 KCHLTNIACLMHNKYDASKSSTYNKNGTSFDIHYGSGSLSGYLSSDTVNIAGLDIKGQTF 157
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
EA EP L F+ AKFDGILGLG+ ISV P +YNM Q L+ +PVFSF+ NR+
Sbjct: 158 AEALSEPGLVFVAAKFDGILGLGYSSISVDGVKPPFYNMFEQSLIAQPVFSFYLNRDPKA 217
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
EGGEI+FGG DP+HY G+ TY+PVT+KGYWQ M I+ C GGC IAD+GT
Sbjct: 218 PEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQIN--NVELCKGGCQVIADTGT 275
Query: 299 SLLAGPTTIITQVNHAIGATGIV 321
SL+A P T +N AIG T IV
Sbjct: 276 SLIAAPAAEATSINQAIGGTPIV 298
>gi|226476854|emb|CAX72343.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 435
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 64 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 123
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 124 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 183
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 184 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 244 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 301
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 302 PGGIYTVSCDVI 313
>gi|321461134|gb|EFX72169.1| hypothetical protein DAPPUDRAFT_189045 [Daphnia pulex]
Length = 391
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/294 (49%), Positives = 188/294 (63%), Gaps = 12/294 (4%)
Query: 47 AARLDSKEGESFRTSIRKYSL-------RGNLGESGDADIVALKNYMDAQYFGEIGIGTP 99
AA+L S E T+ SL N SG L+N+ D+QYFG I +GTP
Sbjct: 16 AAKLLSIPLERLPTARSSMSLVEQSMERTRNRYSSGKILTEDLRNFQDSQYFGPITLGTP 75
Query: 100 PQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
PQ+FTVIFDTGS+NLWVPSS+C ++AC H++Y S S TYK NG I YGTGA+
Sbjct: 76 PQDFTVIFDTGSANLWVPSSQCSEENLACKVHNQYNSSLSDTYKPNGTEFSIQYGTGAMD 135
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
GF S D + + V DQ F EA EP +TF+ +FDGILG+ + I+V VP++ NM+
Sbjct: 136 GFLSTDILGVAGAQVMDQTFAEAVNEPGVTFVAGRFDGILGMSYPNIAVQGVVPMFQNMM 195
Query: 219 NQGLVNEPVFSFWFNRNA-DEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
QGLV+EPVFSFW NR+A D GGEIVFGG +PDHY GE Y+PVT+K YWQF +M
Sbjct: 196 AQGLVDEPVFSFWLNRDASDPVNGGEIVFGGTNPDHYVGEINYIPVTRKAYWQFRADGLM 255
Query: 278 IDG-QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVS 330
I+G FC GGC I+D+GTS++AGP + +N +GA I++ E AV+S
Sbjct: 256 IEGIPEYPFCDGGCEMISDTGTSVIAGPAEEVNLLNRLLGAINIINGE--AVIS 307
>gi|213625094|gb|AAI69806.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/305 (46%), Positives = 191/305 (62%), Gaps = 16/305 (5%)
Query: 26 PNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
P L RI LKK R ++ + +L E + KYS +S +
Sbjct: 19 PGSSLVRIPLKKFTSIRRAMSDTDKDSLKLSGNEAAT------KYS---AFPKSNNPTPE 69
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSS 139
L NY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPS C F IAC+ H KY S +SS
Sbjct: 70 TLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSS 129
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY NG + I YG+G+++G+ S+D V IGDL VK Q F EA ++P +TF+ AKFDGILG
Sbjct: 130 TYVNNGTAFAIQYGSGSLTGYLSKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILG 189
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+G+ ISV PV+ +++ Q LV+ +FSF+ NRN D + GGE++ GG DP +Y G+ +
Sbjct: 190 MGYPRISVDGVPPVFDDIMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFS 249
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+ VT+K YWQ M + + Q T C GGC AI D+GTSL+ GP +T + AIGA
Sbjct: 250 YMNVTRKAYWQIRMDQLSVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVTALQRAIGAIP 308
Query: 320 IVSQE 324
++ E
Sbjct: 309 LIRGE 313
>gi|226476832|emb|CAX72332.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|226476810|emb|CAX72321.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 131/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|443723962|gb|ELU12180.1| hypothetical protein CAPTEDRAFT_225009 [Capitella teleta]
Length = 364
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 130/238 (54%), Positives = 170/238 (71%), Gaps = 1/238 (0%)
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGK 146
AQY+G I IGTP Q F V+FDTGSSNLWVPS KC ++ IAC+ H++Y S +S++YKKNG
Sbjct: 23 AQYYGAITIGTPAQTFKVVFDTGSSNLWVPSQKCKWTDIACWLHNRYDSTKSTSYKKNGT 82
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YG+G++SGF S D V IGD+ V Q F EAT +P +TF+ AKFDGILG+G+ IS
Sbjct: 83 EFKIQYGSGSLSGFLSTDIVTIGDVSVTAQTFAEATAQPGITFVAAKFDGILGMGYPTIS 142
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
V PV+ NMV Q V+ PVFSF+ NR+ EGGE++ GG DP +Y+G TY+PV++K
Sbjct: 143 VDGVTPVFNNMVKQKSVSSPVFSFFLNRDPSASEGGELILGGSDPKYYEGNFTYLPVSKK 202
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
GYWQF M + + G +T +C GGC AIAD+GTSLLAGP+ + ++N +G T I E
Sbjct: 203 GYWQFKMDGMKLAGSSTSYCDGGCQAIADTGTSLLAGPSAEVQKLNQELGGTAIPGGE 260
>gi|226476906|emb|CAX72305.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I +GTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITVGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|326920173|ref|XP_003206349.1| PREDICTED: cathepsin D-like [Meleagris gallopavo]
Length = 397
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 208/332 (62%), Gaps = 18/332 (5%)
Query: 33 IGLKKR----KFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA--LKNYM 86
+GL+ R KF R+ + S+ + +I ++ L+ LG S A+ LKNYM
Sbjct: 17 VGLENRIPLTKFTSTRRMLTEVGSEIPD--MNAITQF-LKFKLGFSDLAEPTPEILKNYM 73
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNG 145
DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C+ IAC H KY + +SSTY +NG
Sbjct: 74 DAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDASKSSTYVENG 133
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
IHYGTG++SGF S+D V +G+L +K+Q F EA ++P +TF+ AKFDGILG+ F I
Sbjct: 134 TEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAKFDGILGMAFPRI 193
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SV K P + N++ Q L+ + +FSF+ NR+ + GGE++ GG DP +Y+G+ ++V VT+
Sbjct: 194 SVDKVTPFFDNVMKQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYRGDFSWVNVTR 253
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE- 324
K YWQ M V + T C GGC AI D+GTSL+ GPT + ++ AIGA ++ +
Sbjct: 254 KAYWQVHMDSVNVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIKGQY 312
Query: 325 ---CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
C + S ++ ++L KL+ Y
Sbjct: 313 IIPCDKISSL---PVVTLMLGGKPYKLTGEQY 341
>gi|66815097|ref|XP_641645.1| cathepsin D [Dictyostelium discoideum AX4]
gi|74960832|sp|O76856.1|CATD_DICDI RecName: Full=Cathepsin D; AltName: Full=Ddp44; Flags: Precursor
gi|3288145|emb|CAA76563.1| preprocathepsin D [Dictyostelium discoideum]
gi|6010025|emb|CAB57223.1| cathepsin D [Dictyostelium discoideum]
gi|60469656|gb|EAL67644.1| cathepsin D [Dictyostelium discoideum AX4]
Length = 383
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/263 (53%), Positives = 175/263 (66%), Gaps = 4/263 (1%)
Query: 56 ESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLW 115
ES R +K+S R + +G + + ++ DAQY+G I IGTP Q F V+FDTGSSNLW
Sbjct: 31 ESRRRVPQKWSNRLSALNAGTT--IPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLW 88
Query: 116 VPSSKCYFSI-ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVK 174
+PS KC ++ AC H+KY SG SSTY NG I YG+GA+SGF S+D V +G L VK
Sbjct: 89 IPSKKCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVK 148
Query: 175 DQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR 234
DQ F EAT EP + F AKFDGILGL FQ ISV PV+YNM++QGLV+ +FSFW +R
Sbjct: 149 DQLFAEATAEPGIAFDFAKFDGILGLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSR 208
Query: 235 NADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIA 294
GGE+ FG +D Y G+ TYVP+T + YW+F M D IDGQ+ GFC C AI
Sbjct: 209 TPG-ANGGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAIC 267
Query: 295 DSGTSLLAGPTTIITQVNHAIGA 317
DSGTSL+AGP IT +N +GA
Sbjct: 268 DSGTSLIAGPMADITALNEKLGA 290
>gi|328869722|gb|EGG18099.1| cathepsin D [Dictyostelium fasciculatum]
Length = 476
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/264 (51%), Positives = 176/264 (66%), Gaps = 6/264 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRS 138
+ L ++ DAQY+G I IGTP Q F V+FDTGSSNLW+PS KC + IAC H+KY S +S
Sbjct: 142 IPLSDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCPITVIACDLHNKYDSTKS 201
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++ +NG I YG+GA+SGF SED V++G L VK+Q F EAT EP + F AKFDGIL
Sbjct: 202 SSFVQNGTDFSIQYGSGAMSGFVSEDTVQVGSLSVKNQLFAEATAEPGIAFDFAKFDGIL 261
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GL FQ ISV PV+YNM++QGLV +P+F+FW ++ A GGE+ FG +D + G
Sbjct: 262 GLAFQSISVNNIPPVFYNMMDQGLVAQPLFAFWLSKTASPTNGGELSFGSIDNSKFTGAI 321
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCA-GGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TYVP+T + YW+F M DV DG + G+C GC AIADSGTSLLAGPT I +N +GA
Sbjct: 322 TYVPLTNRTYWEFSMDDVQYDGNSLGYCGKTGCRAIADSGTSLLAGPTEQIEAINTKLGA 381
Query: 318 TGI----VSQECKAVVSQYGEEII 337
+ + C + S +I+
Sbjct: 382 VSVNGEAIFPSCNVISSLPDVQIV 405
>gi|226822856|gb|ACO83090.1| cathepsin D preproprotein (predicted) [Dasypus novemcinctus]
Length = 410
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 155/346 (44%), Positives = 204/346 (58%), Gaps = 24/346 (6%)
Query: 24 STPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGESGDADIVAL 82
+ P+ L RI L KF R + E + KYS +G +G L
Sbjct: 15 AAPSNALLRIPL--HKFTSVRRTLTEMQGPMEELLPQGPPSKYS-QGAPALAGGPIPEML 71
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTY 141
+NYMDAQY+GEIGIGTP Q F V+FDTGSSNLWVPS C AC+ H KY SGRSSTY
Sbjct: 72 RNYMDAQYYGEIGIGTPAQCFRVVFDTGSSNLWVPSIHCRLLDFACWLHRKYNSGRSSTY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLV---------VKDQEFIEATREPSLTFLLA 192
KNG + DIHYG+G++SG+ S+D V + LV V Q F EAT++P +TFL+A
Sbjct: 132 VKNGSAFDIHYGSGSLSGYLSQDTVSVSPLVPCSAPVGVSVGKQVFGEATKQPGITFLMA 191
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+ + ISVG +PV+ N++ Q LV++ VFSF+ NR+ + GGE+V GGMDP
Sbjct: 192 KFDGILGMAYPSISVGGVLPVFDNLMQQKLVDKNVFSFYLNRDPTAQPGGELVLGGMDPR 251
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY G Y+ +T+K YWQ M + + DG T C GC AI D+GTSL+ GP + ++
Sbjct: 252 HYTGSVDYLNITRKAYWQVHMDRLEVGDGLT--LCKQGCEAIVDTGTSLMVGPVAEVREL 309
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S I ++L +LS Y
Sbjct: 310 QKAIGAVPLIQGEYMISCEKVASL---PPITLMLGNRGYRLSGEDY 352
>gi|226476876|emb|CAX72320.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 174/252 (69%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q + EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTYGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|146286061|sp|O93428.2|CATD_CHIHA RecName: Full=Cathepsin D; Flags: Precursor
Length = 396
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 193/324 (59%), Gaps = 16/324 (4%)
Query: 17 LLFPVVFST---PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG- 72
+L VFS N L RI LKK + + +L + +SL+ NL
Sbjct: 3 MLLLCVFSALALTNDALVRIPLKKFR-----SIRRQLTDSGKRAEELLADHHSLKYNLSF 57
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
S LKNY+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C IAC H
Sbjct: 58 PASNAPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY SG+SSTY KNG + I YG+G++SG+ S+D IGDL + Q F EA ++P + F+
Sbjct: 118 HKYNSGKSSTYVKNGTAFAIQYGSGSLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFI 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV PV+ N+++Q V + VFSF+ NRN D E GGE++ GG D
Sbjct: 178 AAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P +Y G+ YV VT++ YWQ + D M G C GGC AI DSGTSL+ GP+ +
Sbjct: 238 PKYYTGDFNYVNVTRQAYWQIRV-DSMAVGDQLSLCTGGCEAIVDSGTSLITGPSVEVKA 296
Query: 311 VNHAIGATGIVSQE----CKAVVS 330
+ AIGA ++ E C V S
Sbjct: 297 LQKAIGAFPLIQGEYMVNCDTVPS 320
>gi|237874218|ref|NP_001153867.1| cathepsin D [Acyrthosiphon pisum]
Length = 393
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 200/347 (57%), Gaps = 23/347 (6%)
Query: 14 LCLLL-----FPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
LCLLL V +G L R+ L K + V +L + F R+Y
Sbjct: 7 LCLLLTTVFTITVTLVQCDGDLLRVKLHKI-----DSVRNQLRGRTSNLFGFVQRRYD-- 59
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIAC 127
+ L NY+DAQY+G I IGTPPQ F V+FDTGSSNLWVPS +C +IAC
Sbjct: 60 -------PLNAEPLSNYLDAQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSVLNIAC 112
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
H+KY +S+TY KNG IHYG+G++SG+ S D + + + +Q F EA +EP L
Sbjct: 113 MLHNKYNMAKSTTYXKNGTEFSIHYGSGSLSGYLSTDVMSMDGTSIVNQTFAEAIQEPGL 172
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
F+ AKFDGILGLG+ I+V VP +YNMVNQG++ +FSF+ NR+ GGEI+FG
Sbjct: 173 AFVAAKFDGILGLGYNTIAVDGVVPPFYNMVNQGIIKSAIFSFYLNRDPSSTPGGEIIFG 232
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G DP+ Y G TYVPVT+ GYWQF + +V++ T +G AIAD+GTSL+AGP
Sbjct: 233 GSDPEKYTGPFTYVPVTRHGYWQFGLDEVIVG--NTSIVSGALQAIADTGTSLIAGPVDN 290
Query: 308 ITQVNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
I Q+N +G T I E Q +++ ++ + KL + Y
Sbjct: 291 IKQINELLGGTAIPGGEYIIACDQIDNLPVLSFVIGSTTFKLEGKDY 337
>gi|326433118|gb|EGD78688.1| cathepsin D [Salpingoeca sp. ATCC 50818]
Length = 385
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 148/316 (46%), Positives = 194/316 (61%), Gaps = 30/316 (9%)
Query: 29 GLYRI---GLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
GL+R+ G+ + + L N AA L+ KYSL G+ V + N+
Sbjct: 24 GLHRVPLTGMPRSRDTLRNAGAALLN------------KYSL-------GNGTNVPIYNF 64
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
DAQY+GEI IGTPPQ F V+FDTGSSNLWVPS +C S+AC H KY S +SSTY NG
Sbjct: 65 EDAQYYGEITIGTPPQRFKVVFDTGSSNLWVPSKQCK-SLACDLHHKYDSSQSSTYFPNG 123
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YG+G+++GF S D +GDL V+ Q F EAT EP +TF+ AKFDGILG+GF EI
Sbjct: 124 TKFAIEYGSGSLTGFLSGDKTCVGDLCVEKQLFAEATNEPGITFVAAKFDGILGMGFVEI 183
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SV + VP WYN+V+ G V +++FW NR GGE+ GG DP H G +VP+T+
Sbjct: 184 SVDQVVPYWYNLVSAGKVESNMYTFWLNRVQGAPSGGELTLGGYDPKHMSGPIQWVPLTR 243
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE- 324
GYWQF M + ++G + +C+ C AIAD+GTSLLAGPT I ++N IGA I E
Sbjct: 244 DGYWQFAMDSLSVNGDS--YCS-NCQAIADTGTSLLAGPTDAIKKLNKQIGAIPIAQGEY 300
Query: 325 ---CKAVVSQYGEEII 337
CK + + +I+
Sbjct: 301 MVDCKKIPTMPNVDIV 316
>gi|307203870|gb|EFN82801.1| Lysosomal aspartic protease [Harpegnathos saltator]
Length = 374
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 142/244 (58%), Positives = 174/244 (71%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IGTPPQ F VIFDTGSSNLWVPS KC + IAC H KY S +SST
Sbjct: 47 LSNYLDAQYYGVITIGTPPQEFRVIFDTGSSNLWVPSKKCSITNIACLLHHKYDSRKSST 106
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+KNG I YG+G++SGF S D V IG L V+ Q F EA +EP L F+ AKFDGILG+
Sbjct: 107 YQKNGTEFAIRYGSGSLSGFLSSDVVNIGGLNVQGQTFAEAVKEPGLVFVAAKFDGILGM 166
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V PV+YNMV Q LV + VFSF+ NR+ D + GGE++ GG D DHY+GE TY
Sbjct: 167 GYSTIAVDGVTPVFYNMVKQDLVPKAVFSFYLNRDPDAKVGGEMLLGGSDSDHYEGEFTY 226
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV++KGYWQF M + + G T CA GC AIAD+GTSL+AGP + +N IGAT I
Sbjct: 227 VPVSRKGYWQFAMDSIQVHGHT--LCASGCQAIADTGTSLIAGPVEEVAVINSLIGATTI 284
Query: 321 VSQE 324
++ E
Sbjct: 285 IAGE 288
>gi|112983576|ref|NP_001037351.1| cathepsin D precursor [Bombyx mori]
gi|66269351|gb|AAY43135.1| CathD [Bombyx mori]
Length = 384
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 191/311 (61%), Gaps = 25/311 (8%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGN 70
FFL L+ V+ LYR+ L + K AR E G KY + G
Sbjct: 7 FFLALIASSVM------ALYRVPLHRMK-------TARTHFHEVGTELELLRLKYDVTGP 53
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYF 129
E L NY+DAQY+G I IGTPPQ+F V+FDTGSSNLWVPS KC+++ IAC
Sbjct: 54 SPEP-------LSNYLDAQYYGVISIGTPPQSFKVVFDTGSSNLWVPSKKCHYTNIACLL 106
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H+KY S +S TY NG I YG+G++SGF S D V +G L V+ Q F EA EP L F
Sbjct: 107 HNKYDSRKSKTYVANGTQFAIQYGSGSLSGFLSTDDVTVGGLKVRRQTFAEAVSEPGLAF 166
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ F I+V PV+ NMV QGLV +PVFSF+ NR+ GGE++ GG
Sbjct: 167 VAAKFDGILGMAFSTIAVDHVTPVFDNMVAQGLV-QPVFSFYLNRDPGATTGGELLLGGS 225
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP HY+G+ VP+ + YW+F M V ++ + FCA GC+AIAD+GTSL+AGP+ +
Sbjct: 226 DPAHYRGDLVRVPLLRDTYWEFHMDSVNVNA--SRFCAQGCSAIADTGTSLIAGPSKEVE 283
Query: 310 QVNHAIGATGI 320
+N A+GAT I
Sbjct: 284 ALNAAVGATAI 294
>gi|148231809|ref|NP_001085308.1| cathepsin D precursor [Xenopus laevis]
gi|62739292|gb|AAH94178.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 190/305 (62%), Gaps = 16/305 (5%)
Query: 26 PNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
P L RI LKK R ++ + +L E + KYS +S +
Sbjct: 19 PGSSLVRIPLKKFTSIRRAMSDTDKDSLKLSGNEAAT------KYS---AFPKSNNPTPE 69
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSS 139
L NY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPS C F IAC+ H KY S +SS
Sbjct: 70 TLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSS 129
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY NG + I YG+G+++G+ S+D V IGDL VK Q F EA ++P +TF+ AKFDGILG
Sbjct: 130 TYVNNGTAFAIQYGSGSLTGYLSKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILG 189
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+G+ ISV PV+ +++ Q LV+ +FSF+ NRN D + GGE++ GG DP +Y G+ +
Sbjct: 190 MGYPRISVDGVPPVFDDIMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFS 249
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+ VT+K YWQ M + + Q T C GGC AI D+GTSL+ GP + + AIGA
Sbjct: 250 YMNVTRKAYWQIRMDQLSVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIP 308
Query: 320 IVSQE 324
++ E
Sbjct: 309 LIRGE 313
>gi|195380081|ref|XP_002048799.1| GJ21122 [Drosophila virilis]
gi|194143596|gb|EDW59992.1| GJ21122 [Drosophila virilis]
Length = 391
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 153/311 (49%), Positives = 195/311 (62%), Gaps = 17/311 (5%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F +CL L V + P L R+ L KF R A + ++ + LR
Sbjct: 8 FAVCLALAWAVAAEPK--LLRVPL--NKFQSARRHFADVGTELQQ----------LRIKY 53
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
G +G L NY+DAQY+G I IG+PPQNF V+FDTGSSNLWVPS KC+ + IAC H
Sbjct: 54 GGAGGVSPEPLSNYLDAQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIACLMH 113
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+KY + +SS+Y KNG IHYG+G++SG+ S D V I L +KDQ F EA EP L F+
Sbjct: 114 NKYDASKSSSYSKNGTEFAIHYGSGSLSGYLSSDTVNIAGLDIKDQTFAEALSEPGLVFV 173
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILGLG+ ISV P +Y+M QGL+++PVFSF+ NR+ EGGEI+FGG D
Sbjct: 174 AAKFDGILGLGYSSISVDGVKPPFYSMFEQGLISQPVFSFYLNRDPKAPEGGEIIFGGSD 233
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P+HY G+ TY+PVT+KGYWQ M ++ C GGC IAD+GTSL+A P T
Sbjct: 234 PNHYTGDFTYLPVTRKGYWQIKMDSAQLN--NLELCKGGCQIIADTGTSLIAAPVAEATS 291
Query: 311 VNHAIGATGIV 321
+N AIG T IV
Sbjct: 292 INQAIGGTPIV 302
>gi|49522906|gb|AAH75134.1| LOC443721 protein, partial [Xenopus laevis]
Length = 398
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/305 (46%), Positives = 190/305 (62%), Gaps = 16/305 (5%)
Query: 26 PNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
P L RI LKK R ++ + +L E + KYS +S +
Sbjct: 18 PGSSLVRIPLKKFTSIRRAMSDTDKDSLKLSGNEAAT------KYS---AFPKSNNPTPE 68
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSS 139
L NY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPS C F IAC+ H KY S +SS
Sbjct: 69 TLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIACWLHHKYDSSKSS 128
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY NG + I YG+G+++G+ S+D V IGDL VK Q F EA ++P +TF+ AKFDGILG
Sbjct: 129 TYVNNGTAFAIQYGSGSLTGYLSKDTVTIGDLAVKGQLFAEAVKQPGITFVAAKFDGILG 188
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+G+ ISV PV+ +++ Q LV+ +FSF+ NRN D + GGE++ GG DP +Y G+ +
Sbjct: 189 MGYPRISVDGVPPVFDDIMEQKLVDSNLFSFYLNRNPDTQPGGELLLGGTDPTYYTGDFS 248
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
Y+ VT+K YWQ M + + Q T C GGC AI D+GTSL+ GP + + AIGA
Sbjct: 249 YMNVTRKAYWQIRMDQLSVGDQLT-LCKGGCEAIVDTGTSLITGPVEEVAALQRAIGAIP 307
Query: 320 IVSQE 324
++ E
Sbjct: 308 LIRGE 312
>gi|449280808|gb|EMC88033.1| Cathepsin D, partial [Columba livia]
Length = 387
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 203/328 (61%), Gaps = 14/328 (4%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGE-SFRTSIRKYSLRGNLGESGDADIVALKNYMDAQY 90
RI L KF RV + + S+ + + T + K+ L + G+ LKNYMDAQY
Sbjct: 12 RIPL--TKFPSMRRVLSEVGSEIPDMNAMTQLLKFKL--GFADVGEPTPEILKNYMDAQY 67
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKSAD 149
+GEIGIGTPPQ FTV+FDTGSSNLWVPS C+ IAC H KY S +SSTY +NG
Sbjct: 68 YGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDSSKSSTYVENGTDFA 127
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
IHYGTG++SG+ S+D V +G+L +K+Q F EA ++P +TF+ AKFDGILG+ F ISV K
Sbjct: 128 IHYGTGSLSGYLSQDTVTLGNLKIKNQIFGEALKQPGITFIAAKFDGILGMAFPRISVDK 187
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
P + N++ Q L+ + +FSF+ NR+ + GGE++ GG DP +Y G+ ++V VT+K YW
Sbjct: 188 VTPFFDNIMQQKLIEKNIFSFYLNRDPSAQPGGELLLGGTDPKYYSGDFSWVNVTRKAYW 247
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----C 325
Q M V + T C GGC AI D+GTSL+ GPT + ++ AIGA ++ + C
Sbjct: 248 QVHMDAVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIKGQYVIPC 306
Query: 326 KAVVSQYGEEIINMLLAKVSAKLSLRHY 353
V S +I + L +L+ Y
Sbjct: 307 DKVSSL---PVITLTLGGKPYQLTGEQY 331
>gi|45384002|ref|NP_990508.1| cathepsin D precursor [Gallus gallus]
gi|461696|sp|Q05744.1|CATD_CHICK RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|259835|gb|AAB24157.1| prepro-cathepsin D [Gallus gallus]
Length = 398
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 207/335 (61%), Gaps = 16/335 (4%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA--LK 83
P L RI L KF R+ + S+ + +I ++ L+ LG + A+ LK
Sbjct: 17 PCAALIRIPL--TKFTSTRRMLTEVGSEIPD--MNAITQF-LKFKLGFADLAEPTPEILK 71
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYK 142
NYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C+ IAC H KY + +SSTY
Sbjct: 72 NYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDASKSSTYV 131
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
+NG IHYGTG++SGF S+D V +G+L +K+Q F EA ++P +TF+ AKFDGILG+ F
Sbjct: 132 ENGTEFAIHYGTGSLSGFLSQDTVTLGNLKIKNQIFGEAVKQPGITFIAAKFDGILGMAF 191
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
ISV K P + N++ Q L+ + +FSF+ NR+ + GGE++ GG DP +Y G+ ++V
Sbjct: 192 PRISVDKVTPFFDNVMQQKLIEKNIFSFYLNRDPTAQPGGELLLGGTDPKYYSGDFSWVN 251
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
VT+K YWQ M V + T C GGC AI D+GTSL+ GPT + ++ AIGA ++
Sbjct: 252 VTRKAYWQVHMDSVDVANGLT-LCKGGCEAIVDTGTSLITGPTKEVKELQTAIGAKPLIK 310
Query: 323 QE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ C + S ++ ++L +L+ Y
Sbjct: 311 GQYVISCDKISSL---PVVTLMLGGKPYQLTGEQY 342
>gi|226476848|emb|CAX72340.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 173/252 (68%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ Q +V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQRVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|355681641|gb|AER96810.1| cathepsin D [Mustela putorius furo]
Length = 410
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 195/310 (62%), Gaps = 26/310 (8%)
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
R I KYS +G +GD L+NYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 49 RGPISKYS-QGVPSVAGDPVPEVLRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPS 107
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI--------- 168
C IAC+ H KY SG+SSTY KNG S DIHYG+G++SG+ S+D V +
Sbjct: 108 IHCKLLDIACWIHHKYNSGKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSGLSSL 167
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
+ V+ Q F EAT++P +TF+ AKFDGILG+ + ISV +PV+ N++ Q LV + +F
Sbjct: 168 AGVKVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIF 227
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG 288
SF+ NR+ + GGE++ GG D +YKG +Y+ VT+K YWQ M V + G + C G
Sbjct: 228 SFYLNRDPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEXVDV-GSSLTLCKG 286
Query: 289 GCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKV 344
GC AI D+GTSL+ GP + ++ AIGA ++ E C+ V + L +V
Sbjct: 287 GCEAIVDTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST----------LPEV 336
Query: 345 SAKLSLRHYK 354
+ KL + YK
Sbjct: 337 TLKLGGKPYK 346
>gi|9581805|emb|CAC00543.1| necepsin II [Necator americanus]
Length = 446
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 132/244 (54%), Positives = 168/244 (68%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQY+G I IGTP QNFTVIFDTGSSNLWVPS KC ++ IAC H +Y SG SST
Sbjct: 87 LRNYMDAQYYGVIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLHHRYDSGASST 146
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D V I + ++Q F EAT EP LTF+ AKFDGILG+
Sbjct: 147 YKEDGRKMAIQYGTGSMKGFISKDIVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGM 206
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EI+V PV++ + Q V PVF+FW NRN + E GGEI FGG+D Y T+
Sbjct: 207 AFPEIAVLGVTPVFHTFIEQKKVPSPVFAFWLNRNPESEIGGEITFGGVDTRRYVEPITW 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++GYWQF M D++ G ++ C GC AIAD+GTSL+AGP + + IGA +
Sbjct: 267 TPVTRRGYWQFKM-DMVQGGSSSIACPNGCQAIADTGTSLIAGPKAQVEAIQKYIGAEPL 325
Query: 321 VSQE 324
+ E
Sbjct: 326 MKGE 329
>gi|31559113|gb|AAP50847.1| cathepsin D [Bombyx mori]
gi|90992734|gb|ABE03014.1| aspartic protease [Bombyx mori]
Length = 385
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 190/311 (61%), Gaps = 25/311 (8%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGN 70
FFL L+ V+ LYR+ L + K AR E G KY + G
Sbjct: 7 FFLALIASSVM------ALYRVPLHRMK-------TARTHFHEVGTELELLRLKYDVTGP 53
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYF 129
E L NY+DAQY+G I IGTPPQ+F V+FDTGSSNLWVPS KC+++ IAC
Sbjct: 54 SPEP-------LSNYLDAQYYGVISIGTPPQSFKVVFDTGSSNLWVPSKKCHYTNIACLL 106
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H+KY S +S +Y NG I YG+G++SGF S D V +G L V+ Q F EA EP L F
Sbjct: 107 HNKYDSRKSKSYVANGTQFAIQYGSGSLSGFLSTDDVTVGGLKVRRQTFAEAVSEPGLAF 166
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ F I+V PV+ NMV QGLV +PVFSF+ NR+ GGE++ GG
Sbjct: 167 VAAKFDGILGMAFSTIAVDHVTPVFDNMVAQGLV-QPVFSFYLNRDPGATTGGELLLGGS 225
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP HY+G+ VP+ + YW+F M V ++ FCA GC+AIAD+GTSL+AGP+ +
Sbjct: 226 DPAHYRGDLVRVPLLRDTYWEFHMDSVNVNASR--FCAQGCSAIADTGTSLIAGPSKEVE 283
Query: 310 QVNHAIGATGI 320
+N A+GAT I
Sbjct: 284 ALNAAVGATAI 294
>gi|315440805|gb|ADU20408.1| aspartic protease 2 [Clonorchis sinensis]
Length = 385
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 127/240 (52%), Positives = 169/240 (70%), Gaps = 3/240 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L NYMD+QY+GEI IGTPPQ F V+FDTGSSNLWVPS++C ++ AC H +Y +SST
Sbjct: 60 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK NGK I YGTG++SG S D V + V+DQ F EA EP L F++AKFDGILGL
Sbjct: 120 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 179
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+V VPV+ NM++QGLV +P+FS W +RN ++ GGEI+FGG++ +HY G+ +
Sbjct: 180 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 239
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VP++ + YWQ D+ + + T CA GC AI D+GT+L+ GPT + Q+N A+GA I
Sbjct: 240 VPLSSETYWQIDLDGIQVTSLT--LCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 297
>gi|348565205|ref|XP_003468394.1| PREDICTED: cathepsin D-like [Cavia porcellus]
Length = 407
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 195/313 (62%), Gaps = 14/313 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L RI L K K + + + S E R + KYS + + +G L NYMDA
Sbjct: 20 ALIRIPLHKFK-SIRHTMTEAGGSVENLIARDPLTKYSPQLSTKATGPVP-EPLSNYMDA 77
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKS 147
QY+GEIGIGTPPQ FTV+FDTGSSNLWVPS+KC IAC+FH KY +SSTY KNG S
Sbjct: 78 QYYGEIGIGTPPQCFTVVFDTGSSNLWVPSAKCKMLDIACWFHHKYHGDKSSTYVKNGTS 137
Query: 148 ADIHYGTGAISGFFSEDHVKIG------DLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
DIHYG+G++SG+ S+D V + + V Q F EAT++P + F+ AKFDGILGL
Sbjct: 138 FDIHYGSGSLSGYLSQDTVSVPCKSSNSSVKVSKQTFGEATKQPGIVFVAAKFDGILGLA 197
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV + +FSF+ NR+ + GGE+V GG+D +YKG TY+
Sbjct: 198 YPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRDPTAQPGGELVLGGIDSKYYKGSFTYL 257
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 258 NVTRKAYWQVHMDQLQVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGALPLI 316
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 317 QGEYMIPCEKVSS 329
>gi|1585311|prf||2124395A Asp protease
Length = 380
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 130/252 (51%), Positives = 172/252 (68%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY G+I IGTPPQ F+ +FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYHGDITIGTPPQTFSAVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +T+K YW F M ++ I + C GC AIAD+GTS++AGPT + Q+N +GAT
Sbjct: 238 VNLTEKSYWLFKMDNLTISDLS--ICTDGCQAIADTGTSMIAGPTDEVKQINQKLGATHL 295
Query: 319 --GIVSQECKAV 328
GI + C +
Sbjct: 296 PGGIYTVSCDVI 307
>gi|432850599|ref|XP_004066827.1| PREDICTED: cathepsin D-like isoform 1 [Oryzias latipes]
Length = 396
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/298 (47%), Positives = 185/298 (62%), Gaps = 9/298 (3%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG--ESGDADIVALKNYMD 87
L RI LKK + + L E+ K+SL+ NLG S LKNY+D
Sbjct: 19 LIRIPLKKFR-----SIRRELTDSGREAHELLADKHSLKYNLGFPSSNGPTPETLKNYLD 73
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
AQY+GEI +GTPPQ FTV+FDTGSSNLWVPS C IAC KY S +SSTY KNG
Sbjct: 74 AQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYNSAKSSTYVKNGT 133
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
S I YG+G++SG+ S+D IGD+ V++Q F EA ++P + F+ AKFDGILG+ + IS
Sbjct: 134 SFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAAKFDGILGMAYPRIS 193
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
V VPV+ N++ Q V+ VFSF+ NRN D E GGE++ GG DP +Y G+ YV ++++
Sbjct: 194 VDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQ 253
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
YWQ M D M G C GGC AI D+GTSLL GP+ + + AIGA ++ E
Sbjct: 254 AYWQIHM-DGMAVGSQLSLCKGGCEAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 310
>gi|358333762|dbj|GAA52230.1| cathepsin D [Clonorchis sinensis]
Length = 408
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 127/240 (52%), Positives = 169/240 (70%), Gaps = 3/240 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L NYMD+QY+GEI IGTPPQ F V+FDTGSSNLWVPS++C ++ AC H +Y +SST
Sbjct: 83 LDNYMDSQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRLHHRYDCEKSST 142
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK NGK I YGTG++SG S D V + V+DQ F EA EP L F++AKFDGILGL
Sbjct: 143 YKANGKPFSIQYGTGSVSGVLSTDVVTVSSAKVQDQTFGEAINEPGLVFVVAKFDGILGL 202
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+V VPV+ NM++QGLV +P+FS W +RN ++ GGEI+FGG++ +HY G+ +
Sbjct: 203 AFQSIAVDNVVPVFDNMISQGLVEKPLFSVWLDRNDVQDIGGEIMFGGINKEHYMGDMYF 262
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VP++ + YWQ D+ + + T CA GC AI D+GT+L+ GPT + Q+N A+GA I
Sbjct: 263 VPLSSETYWQIDLDGIQVTSLT--LCAQGCQAIVDTGTTLIVGPTADVNQLNEALGAVSI 320
>gi|86278345|gb|ABC88426.1| cathepsin D-like aspartic proteinase preproprotein [Meloidogyne
incognita]
Length = 454
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 133/244 (54%), Positives = 162/244 (66%), Gaps = 1/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQY+G I IG+PPQNF+VIFDTGSSNLWVPS KC ++ IAC H KY S +SS+
Sbjct: 82 LRNYMDAQYYGPISIGSPPQNFSVIFDTGSSNLWVPSKKCPFYDIACLLHHKYDSTKSSS 141
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G+ I YGTG++ GF S+D V I ++ V QEF EA EP LTF+ AKFDGILG+
Sbjct: 142 YKDDGRKMQIQYGTGSMKGFVSKDTVCIANICVAGQEFAEAVSEPGLTFVAAKFDGILGM 201
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EISV PV+ M++Q V EPVFSFW NR+ + GGEI GG D Y Y
Sbjct: 202 AFPEISVLGVQPVFQQMISQQKVPEPVFSFWLNRDPYSKVGGEITIGGTDKRRYVEPLNY 261
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT+K YWQF M V C GC AIAD+GTSL+AGP I ++ H IGA +
Sbjct: 262 TPVTRKAYWQFKMEGVHNSKGEKIACQNGCEAIADTGTSLIAGPKAQIEEIQHYIGAVPL 321
Query: 321 VSQE 324
+ E
Sbjct: 322 MHGE 325
>gi|354496335|ref|XP_003510282.1| PREDICTED: cathepsin D [Cricetulus griseus]
gi|344248735|gb|EGW04839.1| Cathepsin D [Cricetulus griseus]
Length = 408
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 203/335 (60%), Gaps = 25/335 (7%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS + G + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSNQSPAETKGPVSEL-LKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SST+ KNG S DI
Sbjct: 81 GEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTFVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHVKI-------GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
HYG+G++SG+ S+D V + G L V+ Q F EA ++P +TF+ AKFDGILG+G+
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSEQPGGLKVEKQIFGEAIKQPGITFIAAKFDGILGMGYP 200
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
ISV VPV+ N++ Q LV + +FSF+ NR+ + GGE++ GG+D +Y+GE +Y+ V
Sbjct: 201 SISVNNVVPVFDNLMQQKLVEKNIFSFFLNRDPTGQPGGELMLGGIDSKYYEGELSYLNV 260
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
T+K YWQ M + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 TRKAYWQVHMDQLDVANGLT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQG 319
Query: 324 E----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
E C+ V S L V+ KL + Y+
Sbjct: 320 EYMIPCEKVSS----------LPSVTLKLGGKDYE 344
>gi|158523297|gb|ABW70789.1| cathepsin D [Scophthalmus maximus]
Length = 396
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 192/318 (60%), Gaps = 19/318 (5%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLR- 68
CLL+ V + L RI LKK R+ + R A L + K+SL+
Sbjct: 4 CLLVVFVSLALSGDALVRIPLKKFHSVRRELTDSGRKAEELLAD----------KHSLKY 53
Query: 69 -GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIA 126
G S LKN++DAQY+G+I +G+PPQ F+V+FDTGSSNLWVPS C IA
Sbjct: 54 SGGFPSSNGPTPEMLKNFLDAQYYGDIALGSPPQTFSVVFDTGSSNLWVPSVHCSLLDIA 113
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H KY S +SSTY KNG + I YG+G++SGF S+D IGD+ V++Q F EAT++P
Sbjct: 114 CLLHHKYNSAKSSTYVKNGTAFAIQYGSGSLSGFLSQDTCTIGDVTVENQVFGEATKQPG 173
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
+ F+ AKFDGILG+ F ISV VPV+ N+++Q V + VFSF+ NRN D GGE++
Sbjct: 174 VAFIAAKFDGILGMAFPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLL 233
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG DP +Y G+ Y+ +T+K YWQ M + + Q T C GGC I D+GTSL+ GP
Sbjct: 234 GGTDPKYYTGDFNYINITRKAYWQIHMDGLAVGSQLT-LCNGGCEVIVDTGTSLITGPAA 292
Query: 307 IITQVNHAIGATGIVSQE 324
+ + AIGA ++ E
Sbjct: 293 EVKALQKAIGAVPLIQGE 310
>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
Length = 396
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 190/319 (59%), Gaps = 19/319 (5%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
L LL + P+ L RI L K R+ + R A L + K SL+
Sbjct: 3 LFLLGVFAALALPSDALIRIPLTKFRSIRRELTDSGRTAEELLAD----------KNSLK 52
Query: 69 GNLG--ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSI 125
NLG S LKNY+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C I
Sbjct: 53 YNLGFPSSNGPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSILDI 112
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC H KY S +SSTY KNG + I YGTG++SG+ S+D IGD+ V Q F EA ++P
Sbjct: 113 ACLLHHKYNSAKSSTYVKNGTAFAIQYGTGSLSGYLSQDTCTIGDISVDKQLFGEAIKQP 172
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
+ F+ AKFDGILG+ + ISV PV+ N+++Q V + VFSF+ NRN D E GGE++
Sbjct: 173 GVAFIAAKFDGILGMAYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTEPGGELL 232
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
GG DP +Y G+ YV +T++ YWQ M + + GQ C GC AI D+GTSL+ GP+
Sbjct: 233 LGGTDPKYYSGDFHYVNITRQAYWQIHMDGMAVGGQLN-LCTSGCEAIVDTGTSLITGPS 291
Query: 306 TIITQVNHAIGATGIVSQE 324
+ + AIGA + E
Sbjct: 292 AEVRSLQKAIGAIPFIQGE 310
>gi|25452827|sp|Q9DEX3.1|CATD_CLUHA RecName: Full=Cathepsin D; Flags: Precursor
gi|11037777|gb|AAG27733.1|AF312364_1 muscular cathepsin D [Clupea harengus]
Length = 396
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 195/324 (60%), Gaps = 16/324 (4%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR-KYSLRGNL 71
FL L LF VF+ + + RI LKK R R S G + + SL+ N
Sbjct: 3 FLYLFLF-AVFAWTSDAIVRIPLKKF------RSIRRTLSDSGLNVEQLLAGTNSLQHNQ 55
Query: 72 G--ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACY 128
G S LKNYMDAQY+GEIG+GTP Q FTV+FDTGSSNLW+PS C F+ IAC
Sbjct: 56 GFPSSNAPTPETLKNYMDAQYYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIACL 115
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H KY +SSTY KNG I YG+G++SG+ S+D IGD+VV+ Q F EA ++P +
Sbjct: 116 LHHKYNGAKSSTYVKNGTEFAIQYGSGSLSGYLSQDSCTIGDIVVEKQLFGEAIKQPGVA 175
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ AKFDGILG+ + ISV PV+ M++Q V + VFSF+ NRN D E GGE++ GG
Sbjct: 176 FIAAKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEQNVFSFYLNRNPDTEPGGELLLGG 235
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
DP +Y G+ YVPVT++ YWQ M + I Q T C GC AI D+GTSL+ GP +
Sbjct: 236 TDPKYYTGDFNYVPVTRQAYWQIHMDGMSIGSQLT-LCKDGCEAIVDTGTSLITGPPAEV 294
Query: 309 TQVNHAIGATGIVSQE----CKAV 328
+ AIGA ++ E CK V
Sbjct: 295 RALQKAIGAIPLIQGEYMIDCKKV 318
>gi|3378161|emb|CAA07719.1| cathepsin D precursor [Chionodraco hamatus]
Length = 396
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 140/299 (46%), Positives = 184/299 (61%), Gaps = 9/299 (3%)
Query: 38 RKFDLNNRVAARL-DSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGI 96
++ + A RL + G R S+ + L + S LKNY+DAQY+GEIG+
Sbjct: 25 KEIPFHQTSADRLWEESRGAPGRPSLPEVQL--SFPASNAPTPETLKNYLDAQYYGEIGL 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
GTPPQ FTV+FDTGSSNLWVPS C IAC H KY SG+SSTY KNG + I YG+G
Sbjct: 83 GTPPQPFTVVFDTGSSNLWVPSIHCSLLDIACLLHHKYNSGKSSTYVKNGTAFAIQYGSG 142
Query: 156 AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWY 215
++SG+ S+D IGDL + Q F EA ++P + F+ AKFDGILG+ + ISV PV+
Sbjct: 143 SLSGYLSQDTCTIGDLAIDSQLFGEAIKQPGVAFIAAKFDGILGMAYPRISVDGVAPVFD 202
Query: 216 NMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGD 275
N+++Q V + VFSF+ NRN D E GGE++ GG DP +Y G+ YV VT++ YWQ + D
Sbjct: 203 NIMSQKKVEQNVFSFYLNRNPDTEPGGELLLGGTDPKYYTGDFNYVNVTRQAYWQIRV-D 261
Query: 276 VMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVS 330
M G C GGC AI DSGTSL+ GP+ + + AIGA ++ E C V S
Sbjct: 262 SMAVGDQLSLCTGGCEAIVDSGTSLITGPSVEVKALQKAIGAFPLIQGEYMVNCDTVPS 320
>gi|115720|sp|P24268.1|CATD_RAT RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D 12
kDa light chain; Contains: RecName: Full=Cathepsin D 9
kDa light chain; Contains: RecName: Full=Cathepsin D 34
kDa heavy chain; Contains: RecName: Full=Cathepsin D 30
kDa heavy chain; Flags: Precursor
gi|55882|emb|CAA38349.1| preprocathepsin D [Rattus norvegicus]
Length = 407
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 199/316 (62%), Gaps = 16/316 (5%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
+ L RI L RKF R + S E + I KYS++ + + + LKNY
Sbjct: 18 SSALIRIPL--RKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PRTKEPVSELLKNY 74
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKN 144
+DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KN
Sbjct: 75 LDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSSTYVKN 134
Query: 145 GKSADIHYGTGAISGFFSEDHVKI------GDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
G S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P + F+ AKFDGIL
Sbjct: 135 GTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKFDGIL 194
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+G+ ISV K +PV+ N++ Q LV + +FSF+ NR+ + GGE++ GG D +Y GE
Sbjct: 195 GMGYPFISVNKVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGEL 254
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA
Sbjct: 255 SYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAV 313
Query: 319 GIVSQE----CKAVVS 330
++ E C+ V S
Sbjct: 314 PLIQGEYMIPCEKVSS 329
>gi|403305561|ref|XP_003943328.1| PREDICTED: cathepsin D [Saimiri boliviensis boliviensis]
Length = 522
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 192/312 (61%), Gaps = 28/312 (8%)
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+ I KYS G + LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 14 KGPISKYSQAMPTAPGGPVPEI-LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPS 72
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHV----------- 166
C IAC+ H KY S +SSTY KNG S DIHYG+G++SG+ S+D V
Sbjct: 73 IHCKLLDIACWIHHKYNSAKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVLVPCRPSSSAS 132
Query: 167 KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEP 226
+G + V+ Q F EAT++P +TF+ AKFDGILG+ + ISV +PV+ N++ Q LV++
Sbjct: 133 ALGGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQN 192
Query: 227 VFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC 286
+FSF+ NR+ D + GGE++ GG D +YKG +Y+ VT+K YWQ M V + T C
Sbjct: 193 IFSFYLNRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHMDQVEVASGLT-LC 251
Query: 287 AGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLA 342
GGC AI D+GTSL+ GP + ++ AIGA ++ E C+ V + L
Sbjct: 252 KGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST----------LP 301
Query: 343 KVSAKLSLRHYK 354
++ KL + YK
Sbjct: 302 TITLKLGGKDYK 313
>gi|157644743|gb|ABV59077.1| cathepsin D [Lates calcarifer]
gi|396084116|gb|AFN84539.1| cathepsin D [Lates calcarifer]
Length = 396
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 144/314 (45%), Positives = 192/314 (61%), Gaps = 14/314 (4%)
Query: 18 LFPVVFST---PNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR-KYSLRGNLG- 72
LF VVF+ + L RI LKK R R + G + K+SL+ N G
Sbjct: 4 LFLVVFAALALSSDALVRIPLKKF------RSIRRELTDSGTRLEELLADKHSLKYNFGF 57
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
S LKNY+DAQY+G+I +GTPPQ F+V+FDTGSSNLWVPS C IAC H
Sbjct: 58 PSSNGPTPETLKNYLDAQYYGDISLGTPPQTFSVVFDTGSSNLWVPSVHCSLLDIACLLH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY S +SSTY KNG + I YG+G++SG+ SED IGD+ V+ Q F EA ++P + F+
Sbjct: 118 HKYNSAKSSTYVKNGTAFAIQYGSGSLSGYLSEDTCTIGDISVEKQLFGEAIKQPGVAFI 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV VPV+ N+++Q V + VFSF+ NRN D GGE++ GG D
Sbjct: 178 AAKFDGILGMAYPRISVDGVVPVFDNIMSQKKVEQNVFSFYLNRNPDTAPGGELLLGGTD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P +Y G+ YV +T++ YWQ M D ++ G C GGC AI D+GTSL+ GP+ +
Sbjct: 238 PKYYTGDFNYVNITRQAYWQIHM-DELVVGTQLSLCKGGCEAIVDTGTSLITGPSAEVKA 296
Query: 311 VNHAIGATGIVSQE 324
+ AIGA ++ E
Sbjct: 297 LQKAIGAIPLIQGE 310
>gi|27503926|gb|AAH42316.1| Ctsd protein [Danio rerio]
gi|38571742|gb|AAH62824.1| Ctsd protein [Danio rerio]
gi|197247273|gb|AAI64814.1| Ctsd protein [Danio rerio]
Length = 398
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 15/323 (4%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLG 72
+ LL V F + + RI LKK R R S G S + SL+ NLG
Sbjct: 3 IAFLLLVVAFFCTSDAIVRIPLKKF------RTLRRTLSDSGRSLEELVSSSNSLKYNLG 56
Query: 73 --ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYF 129
S D LKNY+DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C + IAC
Sbjct: 57 FPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLL 116
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H KY G+SSTY KNG I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F
Sbjct: 117 HHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPGVAF 176
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + I+V PV+ M++Q V + VFSF+ NRN D + GGE++ GG
Sbjct: 177 IAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGT 236
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP +Y G+ YV ++++ YWQ M D M G C GGC AI D+GTSL+ GP +
Sbjct: 237 DPKYYTGDFNYVDISRQAYWQIHM-DGMSIGSGLSLCKGGCEAIVDTGTSLITGPAAEVK 295
Query: 310 QVNHAIGATGIVSQE----CKAV 328
+ AIGA ++ E CK V
Sbjct: 296 ALQKAIGAIPLMQGEYMVDCKKV 318
>gi|290561455|gb|ADD38128.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 133/244 (54%), Positives = 174/244 (71%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IG+PPQ+F VIFDTGSSNLW+PS C+ + IAC H KY +SST
Sbjct: 57 LSNYLDAQYYGPITIGSPPQSFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSST 116
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YG+G++SGF S D V +G++ + Q F EA EP + F+ AKFDGILG+
Sbjct: 117 YVANGTEFAIQYGSGSLSGFLSSDSVSMGEVEIGSQTFGEAMSEPGMAFVAAKFDGILGM 176
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V VP +YNM QGL+ EP+FSF+ NRN D + GGEI+FGG DPDHYKG TY
Sbjct: 177 GYSNIAVDGVVPPFYNMFKQGLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITY 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+KGYWQF M + ++ ++ FC GC AIAD+GTSL+AGP+ + +N +G T I
Sbjct: 237 IPVTKKGYWQFKMDKMEVNSKS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPI 294
Query: 321 VSQE 324
++ E
Sbjct: 295 INGE 298
>gi|311324976|gb|ADP89523.1| cathepsin D [Miichthys miiuy]
Length = 396
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/301 (46%), Positives = 188/301 (62%), Gaps = 9/301 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG--ESGDADIVALKN 84
N L RI LKK + + R K E ++SL+ N G S LKN
Sbjct: 16 NDALVRIPLKKFR---SIRRELTDSGKRAEELLAD--RHSLKYNFGFPSSNGPTPELLKN 70
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKK 143
Y+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C IAC H KY S +SSTY K
Sbjct: 71 YLDAQYYGEIGLGTPPQLFTVVFDTGSSNLWVPSVHCQILDIACLLHHKYNSAKSSTYVK 130
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG + I YG+G++SGF S+D IGD+ V++Q F EAT++P + F+ AKFDGILG+ +
Sbjct: 131 NGTAFAIQYGSGSLSGFLSQDTCTIGDISVQNQLFGEATKQPGVAFIAAKFDGILGMAYP 190
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
ISV PV+ N+++Q V + VFSF+ NRN D + GGE++ GG DP +Y G+ YV +
Sbjct: 191 RISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYSGDFHYVNI 250
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
T++ YWQ + + + Q T C GC AI D+GTSL+ GP+ + + AIGA ++
Sbjct: 251 TRQAYWQIHVDGMAVGSQLT-LCKSGCEAIVDTGTSLITGPSAEVRSLQKAIGAIPLIQG 309
Query: 324 E 324
E
Sbjct: 310 E 310
>gi|60678793|gb|AAX33731.1| Blo t allergen [Blomia tropicalis]
Length = 402
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 132/244 (54%), Positives = 173/244 (70%), Gaps = 6/244 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY DAQY+GEI IG+PPQ F VIFDTGSSNLWVPS KC F+ +AC H KY S +SS+
Sbjct: 62 LSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLACLLHHKYDSSKSSS 121
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG S +I YGTG+++GF S D V + + +++Q F EA EP +TF+ AKFDGILGL
Sbjct: 122 YVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITFVFAKFDGILGL 181
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF ISV V+ +MV QGLV +PVFSF+ NR+ + + GGEI+FGG DP +YKG+ TY
Sbjct: 182 GFNTISVDGVPTVFDSMVKQGLVQQPVFSFYLNRDTNGKVGGEIIFGGSDPAYYKGDFTY 241
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTG-----FCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
P+T+ GYWQF M ++++ ++ C GC AIAD+GTSL+AGP+ + +N A+
Sbjct: 242 APLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAGPSDQVEHLNRAL 301
Query: 316 GATG 319
GA G
Sbjct: 302 GAIG 305
>gi|147906891|ref|NP_001082550.1| cathepsin D precursor [Xenopus laevis]
gi|28436104|dbj|BAC57431.1| cathepsin D [Xenopus laevis]
Length = 409
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 190/317 (59%), Gaps = 21/317 (6%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
LC L+F P L RI LKK R ++ + +L E + KYS
Sbjct: 12 LCCLVF-----QPGSSLVRIPLKKFTSIRRAMSDTDKDSLKLSGNEAAT------KYS-- 58
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIAC 127
+S + L NY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWV S C F IAC
Sbjct: 59 -AFPKSNNPTPETLLNYLDAQYYGEISIGTPPQPFTVVFDTGSSNLWVASVHCSMFDIAC 117
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
+ H KY S +SSTY KNG I YGTG+ISG+ S+D V IG+L K+Q F EA ++P +
Sbjct: 118 WMHRKYDSSKSSTYVKNGTEFAIQYGTGSISGYLSKDTVTIGNLGYKEQIFGEAIKQPGV 177
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
TF+ AKFDGILG+ + ISV P + N++ Q LV VFSF+ NRN D + GGE++ G
Sbjct: 178 TFIAAKFDGILGMAYPIISVDGVSPCFDNIMAQKLVESNVFSFYLNRNPDTQPGGELLLG 237
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G DP +Y G+ Y+ VT+K YWQ M + + Q T C GGC AI D+GTSL+ GP
Sbjct: 238 GTDPKYYTGDFHYLNVTRKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLITGPVEE 296
Query: 308 ITQVNHAIGATGIVSQE 324
+ + AIGA ++ E
Sbjct: 297 VAALQRAIGAIPLIRGE 313
>gi|4099023|gb|AAD00524.1| aspartic protease [Onchocerca volvulus]
Length = 422
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 131/244 (53%), Positives = 168/244 (68%), Gaps = 5/244 (2%)
Query: 63 RKYSLRGNLGES--GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
KY G+S G+ D V LKNYMDAQY+GEI IGTPPQNF+VIFDTGSSNLW+PS K
Sbjct: 72 HKYEFGSRSGKSIAGETDEV-LKNYMDAQYYGEISIGTPPQNFSVIFDTGSSNLWIPSIK 130
Query: 121 C-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
C + IAC H+KY+ S TYK +G+ +I YG G++ GF S D V I D+ V DQ F
Sbjct: 131 CPFLDIACLLHNKYKGTESKTYKSDGRKIEIQYGRGSMKGFVSMDTVCIADVCVTDQPFA 190
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT EP +TF++AKFDGILG+ F EI+V PV+ M++Q ++ +PVF+FW +RN +E
Sbjct: 191 EATSEPGVTFIMAKFDGILGMAFPEIAVLGLSPVFNTMISQKVLQQPVFAFWLDRNPSDE 250
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
GGEI GG+D + + TY PV++ GYWQF M + + G CA GC AIAD+GTS
Sbjct: 251 VGGEITLGGIDTNRFVSPITYTPVSRHGYWQFKMDSIQGKDEAIG-CANGCQAIADTGTS 309
Query: 300 LLAG 303
L+AG
Sbjct: 310 LIAG 313
>gi|94732449|emb|CAK11131.1| cathepsin D [Danio rerio]
gi|94733132|emb|CAK05390.1| cathepsin D [Danio rerio]
gi|158253911|gb|AAI54316.1| Ctsd protein [Danio rerio]
Length = 398
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 192/321 (59%), Gaps = 16/321 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLG-- 72
LLL F T + + RI LKK R R S G S + SL+ NLG
Sbjct: 6 LLLVAAFFCTSDA-IVRIPLKKF------RTLRRTLSDSGRSLEELVSSSNSLKYNLGFP 58
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHS 131
S D LKNY+DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C + IAC H
Sbjct: 59 ASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHH 118
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY G+SSTY KNG I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F+
Sbjct: 119 KYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPGVAFIA 178
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV PV+ M++Q V + VFSF+ NRN D + GGE++ GG DP
Sbjct: 179 AKFDGILGMAYPRISVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDP 238
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y G+ YV ++++ YWQ M D M G C GGC AI D+GTSL+ GP + +
Sbjct: 239 KYYTGDFNYVDISRQAYWQIHM-DGMSIGSGLSLCKGGCEAIVDTGTSLITGPAAEVKAL 297
Query: 312 NHAIGATGIVSQE----CKAV 328
AIGA ++ E CK V
Sbjct: 298 QKAIGAIPLMQGEYMVDCKKV 318
>gi|320163747|gb|EFW40646.1| cathepsin D [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 136/251 (54%), Positives = 164/251 (65%), Gaps = 6/251 (2%)
Query: 58 FRTSIRKYSLRGNLGES---GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNL 114
F ++ R + GN ES G I NY DAQY+G+I IGTP Q FTV+FDTGS+NL
Sbjct: 27 FESARRSLANVGNFIESRYLGVGAIEPQHNYQDAQYYGDITIGTPGQKFTVVFDTGSANL 86
Query: 115 WVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVV 173
WVPS KC + IAC H+KY S +SSTYK NG S I YG+G +SGF S D V L V
Sbjct: 87 WVPSKKCPVTDIACQLHNKYDSTKSSTYKVNGTSFAIQYGSGKLSGFLSTDSVSFAGLTV 146
Query: 174 KDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFN 233
Q F EAT EP L+F+ AKFDGILGLGF +I+V PVW N + QG+ P+F FW N
Sbjct: 147 TGQTFAEATAEPGLSFVAAKFDGILGLGFPQIAVDGVTPVWNNAILQGVAAAPLFGFWLN 206
Query: 234 RNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAI 293
R+ +GGEI FG +D HY G Y PVT++GYWQF +G V + G+ +CA GC AI
Sbjct: 207 RDPTAADGGEIDFGAIDDSHYTGPILYTPVTRQGYWQFALGAVTVSGKN--YCASGCQAI 264
Query: 294 ADSGTSLLAGP 304
ADSGTSLL GP
Sbjct: 265 ADSGTSLLVGP 275
>gi|148232796|ref|NP_001083566.1| napsin A aspartic peptidase precursor [Xenopus laevis]
gi|38197533|gb|AAH61685.1| MGC68767 protein [Xenopus laevis]
Length = 392
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 151/351 (43%), Positives = 208/351 (59%), Gaps = 25/351 (7%)
Query: 8 ITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
+ A F+ LLLF G+ RI LKK F R+ + DS E + + ++
Sbjct: 1 MKANIFILLLLF-----WDTDGVIRIPLKK--FPSIRRMLS--DSMTAEELKGATKENLQ 51
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIA 126
+ E L NY+DAQY+GEI IGTPPQ F VIFDTGSSNLWVPS KC +F A
Sbjct: 52 QQMFPEK-------LTNYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSVKCSFFDFA 104
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C+ H KYRS SSTY++N + I YGTG++SGF S+D V IG + V +Q F EA ++P
Sbjct: 105 CWVHKKYRSQNSSTYRQNNTAFAIQYGTGSLSGFLSQDTVSIGSIEVANQTFAEAIKQPG 164
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
+ F+ A FDGILG+G+ +ISV VPV+ NM+ Q L+ E VFSF+ +R+ GGE++
Sbjct: 165 IVFVFAHFDGILGMGYPDISVDGVVPVFDNMMQQNLLEENVFSFYLSRDPMATVGGELIL 224
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG DP++Y G+ Y+ VT+ YWQ +V ++ Q C GGC AI D+GTSL+ GP
Sbjct: 225 GGTDPNYYTGDFHYLNVTRMAYWQIKADEVRVNNQLV-LCKGGCQAIVDTGTSLITGPKE 283
Query: 307 IITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
I ++ AIGA + + E CK + S ++ +L V+ L+ Y
Sbjct: 284 EIRALHKAIGAFPLFAGEYFINCKRIQSL---PTVSFILGGVAYNLTGEQY 331
>gi|71043798|ref|NP_001020792.1| cathepsin D precursor [Canis lupus familiaris]
gi|85540968|sp|Q4LAL9.1|CATD_CANFA RecName: Full=Cathepsin D; Flags: Precursor
gi|70561318|emb|CAJ14973.1| cathepsin D [Canis lupus familiaris]
Length = 410
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 194/310 (62%), Gaps = 26/310 (8%)
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+ I KY+ +G +G L+NYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 49 KGPISKYA-QGAPAVTGGPIPEMLRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPS 107
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI--------- 168
C IAC+ H KY SG+SSTY KNG S DIHYG+G++SG+ S+D V +
Sbjct: 108 IHCKLLDIACWIHHKYNSGKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSALSGL 167
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
+ V+ Q F EAT++P +TF+ AKFDGILG+ + ISV +PV+ N++ Q LV + +F
Sbjct: 168 AGIKVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVEKNIF 227
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG 288
SF+ NR+ + + GGE++ GG D +YKG +Y+ VT+K YWQ M V + G + C G
Sbjct: 228 SFYLNRDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQVDV-GSSLTLCKG 286
Query: 289 GCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKV 344
GC AI D+GTSL+ GP + ++ AIGA ++ E C+ V + L V
Sbjct: 287 GCEAIVDTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVST----------LPDV 336
Query: 345 SAKLSLRHYK 354
+ KL + YK
Sbjct: 337 TLKLGGKLYK 346
>gi|184185542|gb|ACC68942.1| cathepsin D (predicted) [Rhinolophus ferrumequinum]
Length = 410
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/287 (48%), Positives = 186/287 (64%), Gaps = 25/287 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCNSALLGLGGVKVERQVFGEATKQPGITFIA 190
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDS 250
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+YKG +Y+ VT+K YWQ M V + G + C GC AI D+GTSL+ GP + ++
Sbjct: 251 RYYKGALSYLNVTRKAYWQVHMDQVDV-GNSLTLCKAGCEAIVDTGTSLIVGPVEEVREL 309
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
AIGA ++ E C+ V S L +V KL + YK
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVSS----------LPEVILKLGGKDYK 346
>gi|225713714|gb|ACO12703.1| Lysosomal aspartic protease precursor [Lepeophtheirus salmonis]
gi|290462953|gb|ADD24524.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 133/244 (54%), Positives = 173/244 (70%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY+DAQY+G I IG+PPQ+F VIFDTGSSNLW+PS C+ + IAC H KY +SST
Sbjct: 57 LSNYLDAQYYGPITIGSPPQSFKVIFDTGSSNLWIPSKSCHITNIACLLHHKYDHSKSST 116
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YG+G++SGF S D V +G + + Q F EA EP + F+ AKFDGILG+
Sbjct: 117 YVANGTEFAIQYGSGSLSGFLSSDSVSMGGVEIGSQTFGEAMSEPGMAFVAAKFDGILGM 176
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V VP +YNM QGL+ EP+FSF+ NRN D + GGEI+FGG DPDHYKG TY
Sbjct: 177 GYSNIAVDGVVPPFYNMFKQGLIQEPIFSFYLNRNPDAKVGGEIIFGGSDPDHYKGNITY 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PVT+KGYWQF M + ++ ++ FC GC AIAD+GTSL+AGP+ + +N +G T I
Sbjct: 237 IPVTKKGYWQFKMDKMEVNSKS--FCQNGCQAIADTGTSLIAGPSIEVNALNQLLGGTPI 294
Query: 321 VSQE 324
++ E
Sbjct: 295 INGE 298
>gi|17549909|ref|NP_510191.1| Protein ASP-4 [Caenorhabditis elegans]
gi|3879202|emb|CAA90633.1| Protein ASP-4 [Caenorhabditis elegans]
Length = 444
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 133/244 (54%), Positives = 165/244 (67%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLW+PS KC ++ IAC H +Y S SST
Sbjct: 86 LRNYMDAQYFGTISIGTPAQNFTVIFDTGSSNLWIPSKKCPFYDIACMLHHRYDSKSSST 145
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D V + + +DQ F EAT EP +TF+ AKFDGILG+
Sbjct: 146 YKEDGRKMAIQYGTGSMKGFISKDSVCVAGVCAEDQPFAEATSEPGITFVAAKFDGILGM 205
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ EI+V PV+ + Q V +FSFW NRN D E GGEI FGG+D Y TY
Sbjct: 206 AYPEIAVLGVQPVFNTLFEQKKVPSNLFSFWLNRNPDSEIGGEITFGGIDSRRYVEPITY 265
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT+KGYWQF M V+ G C+ GC AIAD+GTSL+AGP I + + IGA +
Sbjct: 266 VPVTRKGYWQFKMDKVV--GSGVLGCSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPL 323
Query: 321 VSQE 324
+ E
Sbjct: 324 IKGE 327
>gi|42476045|ref|NP_599161.2| cathepsin D precursor [Rattus norvegicus]
gi|38303993|gb|AAH62032.1| Cathepsin D [Rattus norvegicus]
gi|149061703|gb|EDM12126.1| cathepsin D, isoform CRA_c [Rattus norvegicus]
Length = 407
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 198/316 (62%), Gaps = 16/316 (5%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNY 85
+ L RI L RKF R + S E + I KYS++ + + + LKNY
Sbjct: 18 SSALIRIPL--RKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PRTKEPVSELLKNY 74
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKN 144
+DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KN
Sbjct: 75 LDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWVHHKYNSDKSSTYVKN 134
Query: 145 GKSADIHYGTGAISGFFSEDHVKI------GDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
G S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P + F+ AKFDGIL
Sbjct: 135 GTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIFGEATKQPGVVFIAAKFDGIL 194
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+G+ ISV +PV+ N++ Q LV + +FSF+ NR+ + GGE++ GG D +Y GE
Sbjct: 195 GMGYPFISVNNVLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGEL 254
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA
Sbjct: 255 SYLNVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAV 313
Query: 319 GIVSQE----CKAVVS 330
++ E C+ V S
Sbjct: 314 PLIQGEYMIPCEKVSS 329
>gi|170649686|gb|ACB21270.1| cathepsin D preproprotein (predicted) [Callicebus moloch]
Length = 412
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 131/255 (51%), Positives = 173/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSAKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHV-----------KIGDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V +G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVLVPCRSSSSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ M V + T C GGC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHMDQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|339237491|ref|XP_003380300.1| lysosomal aspartic protease [Trichinella spiralis]
gi|316976887|gb|EFV60084.1| lysosomal aspartic protease [Trichinella spiralis]
Length = 405
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 127/241 (52%), Positives = 171/241 (70%), Gaps = 2/241 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NYMDAQY+GEI IGTPPQNFTVIFDTGSSNLWVPSSKC +F IAC+ H++Y S +SST
Sbjct: 73 LHNYMDAQYYGEISIGTPPQNFTVIFDTGSSNLWVPSSKCSFFDIACWLHNRYNSKKSST 132
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+ +G++ +I YG+G++ GF S+D V I L VK Q F EAT +P L F+ A FDGILG+
Sbjct: 133 YEASGETIEIRYGSGSMRGFKSKDTVCIASLCVKGQGFAEATSQPGLAFIFAHFDGILGM 192
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F I+VG PV+ M+ Q L++E VF+FW NRN +++ GG I FG +D +Y G T+
Sbjct: 193 AFPSIAVGGIQPVFQAMIEQNLISEAVFAFWLNRNPEDDLGGLISFGTVDEKYYIGNITW 252
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VP+ + YW+F+M + + + G C GC IAD+GTSL+AGP + ++ AIGA +
Sbjct: 253 VPLVNQRYWEFNMETIKVGDEHVG-CIDGCTTIADTGTSLIAGPKDEVERLQEAIGAKPL 311
Query: 321 V 321
+
Sbjct: 312 I 312
>gi|22651403|gb|AAL61540.1| cathepsin D precursor [Danio rerio]
Length = 398
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 191/323 (59%), Gaps = 15/323 (4%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLG 72
+ LL F + + RI LKK R R S G S + SL+ NLG
Sbjct: 3 IAFLLLVAAFFCTSDAIVRIPLKKF------RTLRRTLSDSGRSLEELVSSSNSLKYNLG 56
Query: 73 --ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYF 129
S D LKNY+DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C + IAC
Sbjct: 57 FPASNDPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLL 116
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H KY G+SSTY KNG I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F
Sbjct: 117 HHKYNGGKSSTYVKNGTQFAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPGVAF 176
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + I+V PV+ M++Q V + VFSF+ NRN D + GGE++ GG
Sbjct: 177 IAAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGT 236
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP +Y G+ YV ++++ YWQ M D M G C GGC AI D+GTSL+ GP +
Sbjct: 237 DPKYYTGDFNYVDISRQAYWQIHM-DGMSIGSGLSLCKGGCEAIVDTGTSLITGPAAEVK 295
Query: 310 QVNHAIGATGIVSQE----CKAV 328
+ AIGA ++ E CK V
Sbjct: 296 ALQKAIGAIPLMQGEYMVDCKKV 318
>gi|417400425|gb|JAA47158.1| Putative cathepsin d [Desmodus rotundus]
Length = 409
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 138/286 (48%), Positives = 186/286 (65%), Gaps = 24/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C AC+ H KY SG+S+T
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDFACWIHHKYNSGKSTT 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI--------GDLVVKDQEFIEATREPSLTFLLA 192
Y KNG + DIHYG+G++SG+ S+D V + + V+ Q F EAT++P +TF+ A
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNSAASGSGVKVERQVFGEATKQPGVTFIAA 190
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+ + ISV +PV+ N++ Q LV+E VFSF+ NR+ + + GGE++ GG+D
Sbjct: 191 KFDGILGMAYPRISVNNVLPVFDNLMQQKLVDENVFSFYLNRDPNAQPGGELMLGGVDSK 250
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+YKG TY+ VT+K YWQ M +V + G C GC AI D+GTSLL GP + ++
Sbjct: 251 YYKGPITYLNVTRKAYWQVHMDEVAV-GSGLTLCKEGCEAIVDTGTSLLVGPVEEVRELQ 309
Query: 313 HAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
AIGA ++ E C+ V S L +V+ KL + Y+
Sbjct: 310 KAIGAVPLIQGEYMVPCEKVSS----------LPEVTLKLGGKAYR 345
>gi|56417363|gb|AAV90625.1| cathepsin D protein [Sus scrofa]
Length = 395
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 139/287 (48%), Positives = 184/287 (64%), Gaps = 25/287 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SST
Sbjct: 56 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 115
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + IHYG+G++SG++S+D V + G + V+ Q F EAT++P LTF+
Sbjct: 116 YVKNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFIA 175
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV VPV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG+D
Sbjct: 176 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGIDS 235
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+YKG Y VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 236 KYYKGSLDYHNVTRKAYWQIHMDQVAV-GSSLTLCKGGCEAIVDTGTSLIVGPVEEVREL 294
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
AIGA ++ E C+ V S L V+ L + YK
Sbjct: 295 QKAIGAVPLIQGEYMIPCEKVPS----------LPDVTVTLGGKKYK 331
>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
Length = 396
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/299 (47%), Positives = 186/299 (62%), Gaps = 11/299 (3%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSI-RKYSLRGNLGE--SGDADIVALKNYM 86
L RI LKK R R + G S + + K SL+ NLG S LKNY+
Sbjct: 19 LVRIPLKKF------RSIRRGLTDSGRSVQDLLAEKNSLKYNLGFPFSKGPTPETLKNYL 72
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNG 145
DAQY+G+I +GTPPQ F+V+FDTGSSNLWVPS C IAC H KY S +SSTY KNG
Sbjct: 73 DAQYYGDITLGTPPQTFSVVFDTGSSNLWVPSIHCSLLDIACLLHKKYNSAKSSTYVKNG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+ I YG+G++SG+ S+D IG L V++Q F EA ++P + F+ AKFDGILG+ + I
Sbjct: 133 TAFAIQYGSGSLSGYLSQDTCSIGGLTVENQLFGEAIKQPGIAFIAAKFDGILGMAYPRI 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SV +PV+ N++ Q V VFSF+ NRN D GGE++ GG DP +Y GE YV VT+
Sbjct: 193 SVDGVLPVFDNIMQQKKVESNVFSFYLNRNPDTAPGGELLLGGTDPTYYTGEFNYVNVTR 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+ YWQ M ++ + Q T C GGC AI D+GTSLL GP+ + + AIGA ++ E
Sbjct: 253 QAYWQVSMDELAVGSQLT-LCKGGCQAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQGE 310
>gi|224460527|gb|ACN43675.1| cathepsin D [Paralichthys olivaceus]
Length = 396
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/303 (46%), Positives = 186/303 (61%), Gaps = 19/303 (6%)
Query: 30 LYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG--ESGDADIVAL 82
L RI LKK R+ + R L + E +SL+ N G S L
Sbjct: 19 LIRIPLKKFRSIRRELTDSGRPVEELLANE----------HSLKYNTGFPSSNGPTPETL 68
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTY 141
KNY+DAQY+G+I +GTPPQ F+V+FDTGSSNLWVPS C IAC+ H KY S +SSTY
Sbjct: 69 KNYLDAQYYGDIALGTPPQTFSVVFDTGSSNLWVPSVHCSILDIACWLHHKYNSAKSSTY 128
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
KNG + I YG+G++SGF S+D IGDL V+ Q F EAT++P + F+ AKFDGILG+
Sbjct: 129 VKNGTTFAIQYGSGSLSGFLSQDTCTIGDLTVEKQVFGEATKQPGVAFIAAKFDGILGMA 188
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV PV+ N+++Q V E VFSF+ NRN D GGE++ GG DP +Y G+ YV
Sbjct: 189 YPRISVDGVAPVFDNIMSQKKVEENVFSFYLNRNPDMAPGGELLLGGTDPKYYSGDFNYV 248
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT++ YWQ MG + Q T C GC AI D+GTSL+ GP+ + + AIGA ++
Sbjct: 249 NVTRQAYWQIHMGGMGAGSQLT-LCKDGCEAIVDTGTSLITGPSAEVKALQKAIGAVPLI 307
Query: 322 SQE 324
E
Sbjct: 308 QGE 310
>gi|26354406|dbj|BAC40831.1| unnamed protein product [Mus musculus]
Length = 445
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 199/317 (62%), Gaps = 19/317 (5%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
+ RI L RKF R + S E + I KYS++ + ++ + LKNY+D
Sbjct: 20 AIIRIPL--RKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLD 76
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
AQY+G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG
Sbjct: 77 AQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGT 136
Query: 147 SADIHYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
S DIHYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGI
Sbjct: 137 SFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGI 196
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LG+G+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE
Sbjct: 197 LGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGE 256
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA
Sbjct: 257 LSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGA 315
Query: 318 TGIVSQE----CKAVVS 330
++ E C+ V S
Sbjct: 316 VPLIQGEYMIPCEKVSS 332
>gi|315274244|gb|ADU03674.1| cathepsin D2 [Ixodes ricinus]
Length = 387
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/295 (47%), Positives = 186/295 (63%), Gaps = 13/295 (4%)
Query: 39 KFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGT 98
+ L+ +AR + + T ++ RG + E LKNY+DAQY+GEI +GT
Sbjct: 23 RMPLHKMQSARAHLLDATTPLTRPAVHATRGPIPE-------PLKNYLDAQYYGEITLGT 75
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
PPQ+F V+FDTGSSNLWVPS+KC F+ IAC H KY S +SSTY KNG +I YG+G++
Sbjct: 76 PPQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSSTYVKNGTQFEIRYGSGSV 135
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
G S D + +GD V Q F E E L FL AKFDGILGLG+ EISV V+ M
Sbjct: 136 RGELSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGLGYPEISVLGVPTVFDTM 195
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
V QG+ +PVFS + +RNA + GGE++FGG+D HY G +YVPV+++GYWQ M
Sbjct: 196 VAQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYTGNISYVPVSKRGYWQVHMDGTR 255
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAV 328
+ G FC+GGC AI D+GTSL+AGP+ I ++N IGA S E CK++
Sbjct: 256 V-GNNGSFCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPFASGEYIVSCKSI 309
>gi|74198620|dbj|BAE39786.1| unnamed protein product [Mus musculus]
Length = 410
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNYMDAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYMDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSGQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ NM+ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNMMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVGEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|74207446|dbj|BAE30902.1| unnamed protein product [Mus musculus]
Length = 410
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 206/330 (62%), Gaps = 19/330 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGES 74
+LL + ++ + + RI L RKF R + S E + I KYS++ + ++
Sbjct: 7 MLLILGLLASSSFAIIRIPL--RKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKT 63
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
+ LKNY+DAQY+G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY
Sbjct: 64 TEPVSELLKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKY 123
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATRE 184
S +SSTY KNG S DIHYG+G++SG+ S+D V K + V+ Q F EAT++
Sbjct: 124 NSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQ 183
Query: 185 PSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEI 244
P + F+ AKFDGILG+G+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE+
Sbjct: 184 PGIVFVAAKFDGILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGEL 243
Query: 245 VFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
+ GG D +Y GE +Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP
Sbjct: 244 MLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGP 302
Query: 305 TTIITQVNHAIGATGIVSQE----CKAVVS 330
+ ++ AIGA ++ E C+ V S
Sbjct: 303 VEEVKELQKAIGAVPLIQGEYMIPCEKVSS 332
>gi|195997419|ref|XP_002108578.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589354|gb|EDV29376.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 383
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 195/318 (61%), Gaps = 16/318 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
LL+ +V S L RI L K K + A + + E + K+S G
Sbjct: 5 LLVLALVLSCA-AALQRIKLYKMKTIRQTLLDAGITA---EMLKAKYSKFS-----ASRG 55
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D +L NY+DAQY+G I IGTPPQNF ++FDTGSS+LWVPS+KC + AC H KY
Sbjct: 56 DE---SLSNYLDAQYYGPITIGTPPQNFKILFDTGSSDLWVPSTKCNGNAACESHDKYDH 112
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+SSTY NG+ I YG+GA SGF SED V + + V++Q F EA EP L+F+ AKFD
Sbjct: 113 TKSSTYVSNGQQWSIQYGSGAASGFLSEDVVTVAGISVRNQTFGEAVGEPGLSFVAAKFD 172
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+G++++S + PV+ NMV QGLV +PVFSF+ NR GGE++ GG DP++Y
Sbjct: 173 GILGMGYKQLSAERTNPVFVNMVQQGLVRKPVFSFYLNRKQGGAVGGELILGGSDPNYYS 232
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+ YVP++++ YWQF M + TT C GGC AIAD+GT+L+ GP + ++ AI
Sbjct: 233 GQFNYVPLSRESYWQFAMDGGKVATGTT-VCNGGCQAIADTGTTLIVGPPEDVQRIQQAI 291
Query: 316 GAT---GIVSQECKAVVS 330
GA G + +C + S
Sbjct: 292 GAQNAGGQYTVDCSTISS 309
>gi|268581165|ref|XP_002645565.1| C. briggsae CBR-ASP-4 protein [Caenorhabditis briggsae]
Length = 446
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 134/244 (54%), Positives = 164/244 (67%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLWVPS KC ++ IAC H +Y S SST
Sbjct: 87 LRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWVPSKKCPFYDIACMLHHRYDSKSSST 146
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D V + + +DQ F EAT EP +TF+ AKFDGILG+
Sbjct: 147 YKEDGRKMAIQYGTGSMKGFISKDSVCVAGICAEDQPFAEATSEPGITFVAAKFDGILGM 206
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ EI+V PV+ + Q V VFSFW NRN D E GGEI FGG+D Y TY
Sbjct: 207 AYPEIAVLGVQPVFNTLFEQKKVPSNVFSFWLNRNPDSELGGEITFGGIDARRYVEPITY 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT+KGYWQF M V+ G C+ GC AIAD+GTSL+AGP I + + IGA +
Sbjct: 267 TPVTRKGYWQFKMDKVV--GSGVLGCSNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPL 324
Query: 321 VSQE 324
+ E
Sbjct: 325 IKGE 328
>gi|60678795|gb|AAX33732.1| Blo t allergen isoform 2 [Blomia tropicalis]
Length = 402
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 131/244 (53%), Positives = 171/244 (70%), Gaps = 6/244 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L NY DAQY+GEI IG+PPQ F VIFDTGSSNLWVPS KC F+ + C H KY S +SS+
Sbjct: 62 LSNYADAQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLVCLLHHKYDSSKSSS 121
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG S +I YGTG+++GF S D V + + +++Q F EA EP +TF+ AKFDGILGL
Sbjct: 122 YVNNGTSFEIRYGTGSMTGFLSTDVVTVANQQIQNQTFAEAVSEPGITFVFAKFDGILGL 181
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF ISV V+ +MV QGLV PVFSF+ NR+ + + GGEI+FGG DP +YKG+ TY
Sbjct: 182 GFNTISVDGVPTVFDSMVKQGLVQHPVFSFYLNRDTNGKVGGEIIFGGSDPAYYKGDFTY 241
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTG-----FCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
P+T+ GYWQF M ++++ ++ C GC AIAD+GTSL+AGP+ + +N A+
Sbjct: 242 APLTKIGYWQFQMHGILLENKSNNKTVGHVCESGCEAIADTGTSLIAGPSDQVEHLNRAL 301
Query: 316 GATG 319
GA G
Sbjct: 302 GAIG 305
>gi|54020914|ref|NP_001005701.1| napsin A aspartic peptidase precursor [Xenopus (Silurana)
tropicalis]
gi|49522956|gb|AAH75272.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 402
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 153/346 (44%), Positives = 206/346 (59%), Gaps = 25/346 (7%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
F+ LLL V++T L RI LKK F R + DS E F + +++ + +
Sbjct: 6 FILLLL---VWTT--DALIRIPLKK--FPSIRRTLS--DSMTKEEFNGATKEFLKQQTIP 56
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
E L NY+DAQY+GEI IGTPPQ F VIFDTGSSNLWVPS KC +F AC+ H
Sbjct: 57 EK-------LTNYLDAQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSIKCSFFDFACWLHK 109
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KYRS SSTY++N I YGTG++SGF S+D V +G + V +Q F EA ++P + F+
Sbjct: 110 KYRSKDSSTYQQNNTEFAIQYGTGSLSGFLSQDTVTVGSIDVANQTFAEAVKQPGIVFVF 169
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILG+G+ ISV VPV+ NM+ Q L+ E VFSF+ +R+ GGE+V GG DP
Sbjct: 170 AHFDGILGMGYPNISVDGVVPVFDNMMEQKLLEENVFSFYLSRDPMAMVGGELVLGGTDP 229
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
++Y G+ Y+ VT+ YWQ +V + Q C GGC AI D+GTSL+ GP I +
Sbjct: 230 NYYTGDFHYLNVTRMAYWQIKADEVRVANQLV-LCKGGCQAIVDTGTSLITGPREEIRAL 288
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ AIGA + S E CK + S ++ +L V+ L+ Y
Sbjct: 289 HKAIGAFPLFSGEYFVNCKRIQSL---PTVSFILGGVAYNLTGEQY 331
>gi|344307517|ref|XP_003422427.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin D-like [Loxodonta
africana]
Length = 419
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 191/310 (61%), Gaps = 27/310 (8%)
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+T I KY+ G G + L+NYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 57 KTPISKYTQGGPAVPGGPVPEI-LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPS 115
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI--------- 168
C IAC+ H KY S +SSTY KNG + DIHYG+G++SG+ S+D V +
Sbjct: 116 VHCKLLDIACWIHHKYNSAKSSTYVKNGTTFDIHYGSGSLSGYLSQDTVSVPCSSASASA 175
Query: 169 -GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPV 227
G + V+ Q F EAT++P +TF+ AKFDGILG+ + ISV K VPV+ N++ Q LV + +
Sbjct: 176 LGGVRVERQTFGEATKQPGITFIAAKFDGILGMAYPRISVNKVVPVFDNLMAQKLVEKNM 235
Query: 228 FSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA 287
FSF+ NR+ + GGE++ GG+D +Y G + VT++ YWQ M V + G C
Sbjct: 236 FSFYLNRDPTAQPGGELMLGGIDSKYYTGTLNFNKVTREAYWQIHMDRVDV-GNGLTLCK 294
Query: 288 GGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAK 343
GGC AI D+GTSL+ GP IT++ A+GA ++ E C+ V S L
Sbjct: 295 GGCEAIVDTGTSLMVGPVEEITELQKALGAIPLIQGEYMIPCEKVSS----------LPP 344
Query: 344 VSAKLSLRHY 353
VS +L R Y
Sbjct: 345 VSLQLGGRSY 354
>gi|241275826|ref|XP_002406708.1| aspartic protease, putative [Ixodes scapularis]
gi|215496940|gb|EEC06580.1| aspartic protease, putative [Ixodes scapularis]
Length = 345
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 134/252 (53%), Positives = 171/252 (67%), Gaps = 6/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
LKNY+DAQY+GEI +GTPPQ+F V+FDTGSSNLWVPS+KC F+ IAC H KY S +SST
Sbjct: 60 LKNYLDAQYYGEITLGTPPQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYYSRKSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG +I YG+G++ G S D + +GD V Q F E E L FL AKFDGILGL
Sbjct: 120 YVKNGTQFEIRYGSGSVRGELSTDTMGVGDSSVTGQTFAEILHESGLAFLAAKFDGILGL 179
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ EISV V+ MV QG+ +PVFS + +RNA + GGE++FGG+D HY G +Y
Sbjct: 180 GYPEISVLGVPTVFDTMVAQGVAAKPVFSVFLDRNASDPAGGEVLFGGIDESHYIGNISY 239
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV+++GYWQ M + G FC+GGC AI D+GTSL+AGP+ I ++N IGA
Sbjct: 240 VPVSKRGYWQVHMDGTRV-GNNGSFCSGGCEAILDTGTSLIAGPSDEIEKLNLLIGAAPF 298
Query: 321 VSQE----CKAV 328
S E CK++
Sbjct: 299 ASGEYIVSCKSI 310
>gi|118344558|ref|NP_001072052.1| cathepsin D1 precursor [Takifugu rubripes]
gi|55771082|dbj|BAD69801.1| cathepsin D1 [Takifugu rubripes]
Length = 396
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 187/313 (59%), Gaps = 6/313 (1%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
L L +F + T N L RI LKK + + R ++ E R
Sbjct: 3 LLILCVFAALALT-NDALVRIPLKKFR---SIRRELTDSGRKIEELLADRRINKYNYGFP 58
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
+G LKNY+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C IAC H
Sbjct: 59 TAGAPTPETLKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHH 118
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
KY S +SS+Y KNG + I YG+G++SG+ S+D +GDL V+ Q F EA ++P + F+
Sbjct: 119 KYNSAKSSSYVKNGTAFAIRYGSGSLSGYLSQDTCTLGDLAVEKQLFGEAIKQPGIAFIA 178
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV PV+ N+++Q V + VFSF+ NRN D + GGE++ GG DP
Sbjct: 179 AKFDGILGMAYPRISVDGVTPVFDNIMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDP 238
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y G+ YV VT++ YWQ M D M G C GC AI D+GTSLL GP+ + +
Sbjct: 239 KYYTGDFDYVNVTRQAYWQIHM-DGMSVGSQLSLCKSGCEAIVDTGTSLLTGPSEEVKAL 297
Query: 312 NHAIGATGIVSQE 324
AIGA ++ E
Sbjct: 298 QKAIGAMPLIQGE 310
>gi|147743007|sp|P85138.1|CARDG_CYNCA RecName: Full=Cardosin-G; Contains: RecName: Full=Cardosin-G heavy
chain; Contains: RecName: Full=Cardosin-G light chain
Length = 266
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/239 (58%), Positives = 158/239 (66%), Gaps = 46/239 (19%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
+VAL N D YFGEIGIGTPPQ FTVIFDTGSS LWVPSSK HS Y S S
Sbjct: 6 VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSYLWVPSSKA--------HSMYESSDS 57
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
STYK +Q+FIEAT E FL FDGIL
Sbjct: 58 STYK--------------------------------EQDFIEATEEADNVFLNRLFDGIL 85
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GL FQ ISV PVWYNMVNQGLV FSFW NRN DEEEGGE+VFGG+DP+H++G+H
Sbjct: 86 GLSFQTISV----PVWYNMVNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPNHFRGDH 139
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TYVPVT + YWQF +GDV+I ++TGFCA GC A ADSGTSLL+GPT I+TQ+NHAIGA
Sbjct: 140 TYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAIGA 198
>gi|74198040|dbj|BAE35200.1| unnamed protein product [Mus musculus]
Length = 410
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 199/317 (62%), Gaps = 19/317 (5%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
+ RI L RKF R + S E + I KYS++ + ++ + LKNY+D
Sbjct: 20 AIMRIPL--RKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLD 76
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
AQY+G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG
Sbjct: 77 AQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGT 136
Query: 147 SADIHYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
S DIHYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGI
Sbjct: 137 SFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGI 196
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LG+G+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE
Sbjct: 197 LGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGE 256
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA
Sbjct: 257 LSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGA 315
Query: 318 TGIVSQE----CKAVVS 330
++ E C+ V S
Sbjct: 316 VPLIQGEYMIPCEKVSS 332
>gi|74220304|dbj|BAE31329.1| unnamed protein product [Mus musculus]
Length = 410
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTPIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|6753556|ref|NP_034113.1| cathepsin D precursor [Mus musculus]
gi|115718|sp|P18242.1|CATD_MOUSE RecName: Full=Cathepsin D; Flags: Precursor
gi|50299|emb|CAA37067.1| cathepsin D [Mus musculus]
gi|50301|emb|CAA37423.1| unnamed protein product [Mus musculus]
gi|817945|emb|CAA48453.1| cathepsin d [Mus musculus]
gi|32452040|gb|AAH54758.1| Cathepsin D [Mus musculus]
gi|34785578|gb|AAH57931.1| Cathepsin D [Mus musculus]
gi|74139562|dbj|BAE40918.1| unnamed protein product [Mus musculus]
gi|74139905|dbj|BAE31791.1| unnamed protein product [Mus musculus]
gi|74151769|dbj|BAE29674.1| unnamed protein product [Mus musculus]
gi|74177956|dbj|BAE29773.1| unnamed protein product [Mus musculus]
gi|74178091|dbj|BAE29834.1| unnamed protein product [Mus musculus]
gi|74181413|dbj|BAE29980.1| unnamed protein product [Mus musculus]
gi|74184920|dbj|BAE39078.1| unnamed protein product [Mus musculus]
gi|74185047|dbj|BAE39131.1| unnamed protein product [Mus musculus]
gi|74185557|dbj|BAE30245.1| unnamed protein product [Mus musculus]
gi|74186716|dbj|BAE34813.1| unnamed protein product [Mus musculus]
gi|74189047|dbj|BAE39288.1| unnamed protein product [Mus musculus]
gi|74191359|dbj|BAE30262.1| unnamed protein product [Mus musculus]
gi|74191542|dbj|BAE30346.1| unnamed protein product [Mus musculus]
gi|74197068|dbj|BAE35086.1| unnamed protein product [Mus musculus]
gi|74197198|dbj|BAE35144.1| unnamed protein product [Mus musculus]
gi|74199016|dbj|BAE30724.1| unnamed protein product [Mus musculus]
gi|74204247|dbj|BAE39883.1| unnamed protein product [Mus musculus]
gi|74207294|dbj|BAE30833.1| unnamed protein product [Mus musculus]
gi|74207430|dbj|BAE30895.1| unnamed protein product [Mus musculus]
gi|74212520|dbj|BAE31001.1| unnamed protein product [Mus musculus]
gi|74212556|dbj|BAE31018.1| unnamed protein product [Mus musculus]
gi|74212558|dbj|BAE31019.1| unnamed protein product [Mus musculus]
gi|74213416|dbj|BAE35523.1| unnamed protein product [Mus musculus]
gi|74214708|dbj|BAE31193.1| unnamed protein product [Mus musculus]
gi|74217133|dbj|BAE31236.1| unnamed protein product [Mus musculus]
gi|74219445|dbj|BAE29499.1| unnamed protein product [Mus musculus]
gi|74220283|dbj|BAE31319.1| unnamed protein product [Mus musculus]
gi|74220373|dbj|BAE31412.1| unnamed protein product [Mus musculus]
gi|74220638|dbj|BAE31529.1| unnamed protein product [Mus musculus]
gi|74220740|dbj|BAE31342.1| unnamed protein product [Mus musculus]
gi|74222921|dbj|BAE42305.1| unnamed protein product [Mus musculus]
gi|74225262|dbj|BAE31566.1| unnamed protein product [Mus musculus]
gi|74225282|dbj|BAE31575.1| unnamed protein product [Mus musculus]
gi|148686195|gb|EDL18142.1| cathepsin D, isoform CRA_a [Mus musculus]
Length = 410
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|74192771|dbj|BAE34900.1| unnamed protein product [Mus musculus]
Length = 410
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGSELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|122114359|gb|AAY42145.2| cathepsin D [Sus scrofa]
Length = 410
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/287 (48%), Positives = 182/287 (63%), Gaps = 25/287 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + IHYG+G++SG+ S+D V + G + V+ Q F EAT++P LTF+
Sbjct: 131 YVKNGTTFAIHYGSGSLSGYLSQDTVSVPCNSASSGVGGIKVERQTFGEATKQPGLTFIA 190
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV VPV+ N++ Q LV++ +FSF+ NR+ + G E++ GG+D
Sbjct: 191 AKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGSELMLGGIDS 250
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+YKG Y VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 251 KYYKGSLDYHNVTRKAYWQIHMDQVAV-GSSLTLCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
AIGA ++ E C+ V S L V+ L + YK
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVPS----------LPDVTVTLGGKKYK 346
>gi|332264729|ref|XP_003281384.1| PREDICTED: cathepsin D [Nomascus leucogenys]
Length = 412
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGSVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|281344446|gb|EFB20030.1| hypothetical protein PANDA_008874 [Ailuropoda melanoleuca]
Length = 345
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 202/343 (58%), Gaps = 29/343 (8%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSK-EGESFRTSIRKYSLRGNLGESGDADIVALKN 84
P L RI L KF R + L E + I KY+ +G +G LKN
Sbjct: 2 PLPALDRIPL--HKFTSIRRTMSELGGPVEDLIAKGPISKYA-QGVPSVAGGPIPEILKN 58
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKK 143
YMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY K
Sbjct: 59 YMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSSTYVK 118
Query: 144 NGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLLAKF 194
NG S DIHYG+G++SG+ S+D V + + V+ Q F EA ++P +TF+ AKF
Sbjct: 119 NGTSFDIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIAAKF 178
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ + ISV +PV+ N++ Q LV + +FSF+ NRN + GGE++ GG D +Y
Sbjct: 179 DGILGMAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDSKYY 238
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
KG +Y+ VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + ++ A
Sbjct: 239 KGPLSYLNVTRKAYWQVHMEQVDV-GSSLTLCKGGCEAILDTGTSLIVGPVDEVRELQKA 297
Query: 315 IGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
IGA ++ E C+ V + L +V+ KL R Y
Sbjct: 298 IGAVPLIQGEYMIPCEKVST----------LPEVTLKLGGRAY 330
>gi|391329068|ref|XP_003738999.1| PREDICTED: lysosomal aspartic protease-like [Metaseiulus
occidentalis]
Length = 384
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/249 (50%), Positives = 167/249 (67%), Gaps = 2/249 (0%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSG 136
++ + NYMDAQY+G I IG PPQ F V+FDTGSSNLWVPS+ C + +AC H+KY S
Sbjct: 52 NVEPIANYMDAQYYGPISIGNPPQPFQVVFDTGSSNLWVPSANCPITNVACLLHNKYHSS 111
Query: 137 RSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDG 196
+S++Y NG + I YG+GA+SG S D V + + + Q F E +E L F+ KFDG
Sbjct: 112 KSTSYLANGTTFSIQYGSGAVSGLLSADDVSVNGVNITRQTFAEILKESGLGFIAGKFDG 171
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
ILG+G+ +ISV +PV+ MV Q + P+FSF+ R+ D G E+V GG+DP H+KG
Sbjct: 172 ILGMGYPQISVLGVLPVFDQMVAQNAIAAPIFSFYLTRDNDHPTGSELVIGGIDPKHHKG 231
Query: 257 EHTYVPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
E TY+PV++KGYWQF M V I D T CA GC AIAD+GTSL+AGPT+ + +N AI
Sbjct: 232 EITYIPVSRKGYWQFKMDSVKIGDVSKTTLCANGCQAIADTGTSLIAGPTSEVKALNKAI 291
Query: 316 GATGIVSQE 324
GA ++ E
Sbjct: 292 GAAPFLNGE 300
>gi|301769501|ref|XP_002920177.1| PREDICTED: cathepsin D-like [Ailuropoda melanoleuca]
Length = 371
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/286 (47%), Positives = 182/286 (63%), Gaps = 25/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG S DIHYG+G++SG+ S+D V + + V+ Q F EA ++P +TF+
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSALSSLAGVKVERQTFGEAIKQPGITFIA 190
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV + +FSF+ NRN + GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMEQKLVEKNIFSFYLNRNPGAQPGGELMLGGTDS 250
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+YKG +Y+ VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 251 KYYKGPLSYLNVTRKAYWQVHMEQVDV-GSSLTLCKGGCEAILDTGTSLIVGPVDEVREL 309
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V + L +V+ KL R Y
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVST----------LPEVTLKLGGRAY 345
>gi|431920733|gb|ELK18506.1| Napsin-A [Pteropus alecto]
Length = 760
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 129/246 (52%), Positives = 166/246 (67%), Gaps = 2/246 (0%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRS 138
V L N+M+AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +CYF S+ C+FH ++ S S
Sbjct: 54 VPLSNFMNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCYFFSLPCWFHHRFDSKAS 113
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K NG I YGTG +SG SED + IG + F EA EPSLTF+ A+FDGIL
Sbjct: 114 SSFKPNGTKFAIQYGTGRLSGVLSEDKLTIGGITGASVTFGEALWEPSLTFIFARFDGIL 173
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P +V QGL+++PVFSF+ R+ +E +GGE+V GG DP HY
Sbjct: 174 GLGFPALAVEGVRPPLDMLVAQGLLDKPVFSFYLTRDPEEADGGELVLGGSDPTHYIPPL 233
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVPVT YWQ M V + G CA GCAAI D+GTSL+ GP+ I ++ AIG
Sbjct: 234 TYVPVTVPAYWQIHMERVQV-GTGLTLCAHGCAAILDTGTSLITGPSEEIRALHRAIGGI 292
Query: 319 GIVSQE 324
++ E
Sbjct: 293 SLLVGE 298
>gi|74191361|dbj|BAE30263.1| unnamed protein product [Mus musculus]
Length = 410
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNTFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|315274255|gb|ADU03675.1| putative cathepsin D3 [Ixodes ricinus]
Length = 398
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 128/237 (54%), Positives = 163/237 (68%), Gaps = 1/237 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NY+DAQY+G I IG+PPQ F V+FDTGSSNLWVPS +C + +IAC H KY RS +
Sbjct: 66 LSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSRS 125
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+KNG + + YGTG+++GF S D V + + V +Q F EA EP LTF+ AKFDGILGL
Sbjct: 126 YRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILGL 185
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF I+V A V+ NMV Q LV PVFSF+ NRN GGEI FGG D Y G+ +Y
Sbjct: 186 GFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDISY 245
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
VPV+ KGYWQF + ++++ + CA GC AIAD+GTSL+AGP+ I ++ IGA
Sbjct: 246 VPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 302
>gi|147743015|sp|P85139.1|CARDH_CYNCA RecName: Full=Cardosin-H; Contains: RecName: Full=Cardosin-H heavy
chain; Contains: RecName: Full=Cardosin-H light chain
Length = 265
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/245 (58%), Positives = 162/245 (66%), Gaps = 47/245 (19%)
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+SG A +VAL N D YFGEIGIGTPPQ FTVIFDTGSS LWVPSSK HS
Sbjct: 1 DSGSA-VVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKA--------HSM 51
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S SSTYK+ Q+FIEAT E FL
Sbjct: 52 YESSGSSTYKE--------------------------------QDFIEATDETDNVFLHR 79
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILGL FQ ISV PVWYNM+NQGLV FSFW NRN DEEEGGE+VFGG+DP+
Sbjct: 80 LFDGILGLSFQTISV----PVWYNMLNQGLVKR--FSFWLNRNVDEEEGGELVFGGLDPN 133
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
H++G+HTYVPVT + YWQF +GDV+I ++TGFCA GC A ADSGTSLL+GPT I+TQ+N
Sbjct: 134 HFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQIN 193
Query: 313 HAIGA 317
HAIGA
Sbjct: 194 HAIGA 198
>gi|83523775|ref|NP_001032810.1| cathepsin D precursor [Sus scrofa]
gi|65330113|gb|AAY42144.1| cathepsin D [Sus scrofa]
Length = 410
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/288 (47%), Positives = 183/288 (63%), Gaps = 25/288 (8%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSS 139
LKNYMDAQ +GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SS
Sbjct: 70 VLKNYMDAQNYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSS 129
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFL 190
TY KNG + IHYG+G++SG++S+D V + G + V+ Q F EAT++P LTF+
Sbjct: 130 TYVKNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQTFGEATKQPGLTFI 189
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV VPV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG+D
Sbjct: 190 AAKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKNIFSFYLNRDPGAQPGGELMLGGID 249
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+YKG Y VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + +
Sbjct: 250 SKYYKGSLDYHNVTRKAYWQIHMDQVAV-GSSLTLCKGGCEAIVDTGTSLIVGPVEEVRE 308
Query: 311 VNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
+ AIGA ++ E C+ V S L V+ L + YK
Sbjct: 309 LQKAIGAVPLIQGEYMIPCEKVPS----------LPDVTVTLGGKKYK 346
>gi|432850601|ref|XP_004066828.1| PREDICTED: cathepsin D-like isoform 2 [Oryzias latipes]
Length = 398
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/301 (46%), Positives = 184/301 (61%), Gaps = 11/301 (3%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG--ESGDADIVALKNYM 86
L RI LKK + + L E+ K+SL+ NLG S LKNY+
Sbjct: 18 ALIRIPLKKFR-----SIRRELTDSGREAHELLADKHSLKYNLGFPSSNGPTPETLKNYL 72
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACY--FHSKYRSGRSSTYKK 143
DAQY+GEI +GTPPQ FTV+FDTGSSNLWVPS C IAC Y S +SSTY K
Sbjct: 73 DAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACRECPPPSYNSAKSSTYVK 132
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG S I YG+G++SG+ S+D IGD+ V++Q F EA ++P + F+ AKFDGILG+ +
Sbjct: 133 NGTSFSIQYGSGSLSGYLSQDTCTIGDISVENQVFGEAIKQPGVAFIAAKFDGILGMAYP 192
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
ISV VPV+ N++ Q V+ VFSF+ NRN D E GGE++ GG DP +Y G+ YV +
Sbjct: 193 RISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLLGGTDPKYYSGDFHYVNI 252
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
+++ YWQ M D M G C GGC AI D+GTSLL GP+ + + AIGA ++
Sbjct: 253 SRQAYWQIHM-DGMAVGSQLSLCKGGCEAIVDTGTSLLTGPSAEVKALQKAIGAIPLIQG 311
Query: 324 E 324
E
Sbjct: 312 E 312
>gi|241813645|ref|XP_002416518.1| aspartic protease, putative [Ixodes scapularis]
gi|215510982|gb|EEC20435.1| aspartic protease, putative [Ixodes scapularis]
Length = 392
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 128/237 (54%), Positives = 163/237 (68%), Gaps = 1/237 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NY+DAQY+G I IG+PPQ F V+FDTGSSNLWVPS +C + +IAC H KY RS +
Sbjct: 60 LSNYLDAQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKKYDHTRSRS 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+KNG + + YGTG+++GF S D V + + V +Q F EA EP LTF+ AKFDGILGL
Sbjct: 120 YRKNGTAISLRYGTGSMTGFLSVDTVSLAGIDVHNQTFAEAVTEPGLTFVAAKFDGILGL 179
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
GF I+V A V+ NMV Q LV PVFSF+ NRN GGEI FGG D Y G+ +Y
Sbjct: 180 GFSNIAVMGAPTVFDNMVAQLLVPRPVFSFFLNRNTTSPTGGEITFGGTDDRFYSGDISY 239
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
VPV+ KGYWQF + ++++ + CA GC AIAD+GTSL+AGP+ I ++ IGA
Sbjct: 240 VPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAIADTGTSLMAGPSLEIMKLQKLIGA 296
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 193/310 (62%), Gaps = 26/310 (8%)
Query: 59 RTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
+ I KY+ +G SG LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 14 KGPISKYA-QGAPAVSGGPIPEMLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPS 72
Query: 119 SKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI--------- 168
C IAC+ H KY SG+SSTY +NG + DIHYG+G++SG+ S+D V +
Sbjct: 73 IHCKLLDIACWIHHKYNSGKSSTYVRNGTAFDIHYGSGSLSGYLSQDTVSVPCKSAPSPP 132
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
+ V+ Q F EAT++P +TF+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +F
Sbjct: 133 SSVKVERQIFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNIF 192
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG 288
SF+ NR+ + + GGE++ GG D +Y G +Y+ VT+K YWQ M V + G + C
Sbjct: 193 SFYLNRDPNAQPGGELMLGGTDSKYYTGSLSYLNVTRKAYWQVHMEQVDV-GNSLTLCKA 251
Query: 289 GCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKV 344
GC AI D+GTSL+ GP + + AIGA ++ E C+ V S L +V
Sbjct: 252 GCEAIVDTGTSLVVGPVEEVRALQKAIGAVPLIQGEYMIPCEKVSS----------LPEV 301
Query: 345 SAKLSLRHYK 354
+ KL + YK
Sbjct: 302 TLKLGGKGYK 311
>gi|74219443|dbj|BAE29498.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+G SLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGASLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|74142218|dbj|BAE31874.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGCVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|74204520|dbj|BAE35336.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 197/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST+ KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTHVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|74191270|dbj|BAE39462.1| unnamed protein product [Mus musculus]
gi|74204799|dbj|BAE35462.1| unnamed protein product [Mus musculus]
Length = 410
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ A GA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKATGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|190576608|gb|ACE79095.1| cathepsin D precursor (predicted) [Sorex araneus]
Length = 405
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 127/247 (51%), Positives = 170/247 (68%), Gaps = 5/247 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS KC IAC+ H KY S +SST
Sbjct: 72 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVKCQLLDIACWLHHKYNSAKSST 131
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---GDLVVKDQEFIEATREPSLTFLLAKFDGI 197
Y KNG + DIHYG+G++SG+ S+D V + + V Q F EAT++P +TF+ AKFDGI
Sbjct: 132 YVKNGTAFDIHYGSGSLSGYLSQDTVSVPCNSGIQVARQLFGEATKQPGVTFIAAKFDGI 191
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LG+ + ISV PV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG+D ++KG
Sbjct: 192 LGMAYPRISVNNVPPVFDNLMQQKLVDKNIFSFYLNRDPTAQPGGELMLGGIDSKYFKGS 251
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TY VT++ YWQ M + + G C GGC AI D+GTSLL GP + ++ AIGA
Sbjct: 252 MTYHNVTRQAYWQVHMDQIDV-GNGLTLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGA 310
Query: 318 TGIVSQE 324
++ E
Sbjct: 311 VPLIQGE 317
>gi|432102593|gb|ELK30160.1| Napsin-A [Myotis davidii]
Length = 357
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 131/258 (50%), Positives = 169/258 (65%), Gaps = 5/258 (1%)
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
G + V L NYM+AQY+G+IG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH ++
Sbjct: 64 GKSAFVPLSNYMNAQYYGKIGLGTPPQNFSVVFDTGSSNLWVPSRRCSFFSLPCWFHHRF 123
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SST+K NG I YG+G +SG SED + IG + F EA EPSL F+ A
Sbjct: 124 DPKASSTFKPNGTKFAIQYGSGQLSGILSEDKLTIGGIKNASVVFGEALWEPSLVFVFAH 183
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGLGF ++VG P MV+QGL+++PVFSF+ NR+ + EGGE+V GG DP H
Sbjct: 184 FDGILGLGFPVLAVGGVRPPLDTMVDQGLLDKPVFSFYLNRDPEAAEGGELVLGGSDPAH 243
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y TYVPVT YWQ M V + G CA GC AI D+GTSL+ GPT I ++
Sbjct: 244 YIPPLTYVPVTVPAYWQVHMERVTV-GPGLTLCAQGCPAILDTGTSLITGPTEEIRALHR 302
Query: 314 AIGATGIVSQ---ECKAV 328
AIG ++ + EC +
Sbjct: 303 AIGGFPLLGKYIIECSVI 320
>gi|74151850|dbj|BAE29712.1| unnamed protein product [Mus musculus]
gi|74151877|dbj|BAE29725.1| unnamed protein product [Mus musculus]
Length = 410
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 17/313 (5%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ G D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGDTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 319
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 320 QGEYMIPCEKVSS 332
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 132/252 (52%), Positives = 168/252 (66%), Gaps = 2/252 (0%)
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSK 132
+ +A L NY+DAQY+G+I IGTPPQ FTV+FDTGSSNLWVPS C + IAC H+K
Sbjct: 52 AANAAPEPLTNYLDAQYYGKIYIGTPPQPFTVVFDTGSSNLWVPSVHCAITDIACLIHNK 111
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y++ SS+YK NG S I YG+G++SG+ S D V I + K+Q F EAT+EP LTF+ A
Sbjct: 112 YKASESSSYKSNGTSFAIQYGSGSLSGYVSSDIVSIAGVKSKNQLFAEATKEPGLTFVAA 171
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+G+ EISV PV+ M Q + FSF+ NR+A+ GGE+ GG+D
Sbjct: 172 KFDGILGMGYPEISVNGITPVFNQMFKQEALAHNQFSFYLNRDANASSGGELYLGGVDTK 231
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+ G +Y PVT KGYWQ M V + G +T C GC AI DSGTSLLAGPT I ++N
Sbjct: 232 KFTGSFSYHPVTVKGYWQISMDSVSV-GSSTSACVSGCKAIVDSGTSLLAGPTDEIEKIN 290
Query: 313 HAIGATGIVSQE 324
IGAT ++ E
Sbjct: 291 KLIGATKFLNGE 302
>gi|197099366|ref|NP_001125492.1| cathepsin D precursor [Pongo abelii]
gi|55728229|emb|CAH90861.1| hypothetical protein [Pongo abelii]
Length = 412
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHRKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|30584113|gb|AAP36305.1| Homo sapiens cathepsin D (lysosomal aspartyl protease) [synthetic
construct]
gi|60653917|gb|AAX29651.1| cathepsin D [synthetic construct]
Length = 413
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|397490270|ref|XP_003816129.1| PREDICTED: cathepsin D [Pan paniscus]
Length = 603
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|60654209|gb|AAX29797.1| cathepsin D [synthetic construct]
Length = 413
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIEGE 324
>gi|341884635|gb|EGT40570.1| CBN-ASP-4 protein [Caenorhabditis brenneri]
Length = 447
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 135/262 (51%), Positives = 170/262 (64%), Gaps = 7/262 (2%)
Query: 64 KYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-Y 122
KY LGE + L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLW+PS KC +
Sbjct: 73 KYISEKPLGEIDEL----LRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWIPSKKCPF 128
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
+ IAC H +Y S SSTYK++G+ I YGTG++ GF S+D V + + +DQ F EAT
Sbjct: 129 YDIACMLHHRYDSKASSTYKEDGRKMAIQYGTGSMKGFISKDSVCLAGICAEDQPFAEAT 188
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
EP +TF+ AKFDGILG+ + EI+V PV+ + Q V +F+FW NRN D + GG
Sbjct: 189 SEPGITFVAAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPANLFAFWLNRNPDSDLGG 248
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
EI FGG+D Y TY PVT+KGYWQF M V+ G C+ GC AIAD+GTSL+A
Sbjct: 249 EITFGGIDSRRYVEPITYAPVTRKGYWQFKMDKVV--GSGVLGCSNGCQAIADTGTSLIA 306
Query: 303 GPTTIITQVNHAIGATGIVSQE 324
GP I + + IGA ++ E
Sbjct: 307 GPKAQIEAIQNFIGAEPLIKGE 328
>gi|395851770|ref|XP_003798425.1| PREDICTED: cathepsin D [Otolemur garnettii]
Length = 405
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 132/283 (46%), Positives = 186/283 (65%), Gaps = 22/283 (7%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NYMDAQY+GEIGIGTPPQ FTV+FDTGS+NLWVPSSKC IAC+ H++Y S RS+T
Sbjct: 69 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSSKCKMLDIACWLHNRYHSDRSTT 128
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIG------DLVVKDQEFIEATREPSLTFLLAKF 194
Y KNG + DIHYG+G++SG+ S+D V + ++ V+ Q F EAT++P +TF+ AKF
Sbjct: 129 YVKNGTAFDIHYGSGSLSGYLSQDTVLMPCKSVSVNVKVEKQVFGEATKQPGITFIAAKF 188
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ + ISV +P + N++ Q LV + +FSF+ NR+ + + GGE++ GG+D +Y
Sbjct: 189 DGILGMAYPRISVDNVLPFFDNLMEQKLVEKNIFSFYLNRDPNAQPGGELMLGGVDSKYY 248
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G +Y+ VT+K YW+ M V + T C GGC AI D+GTSL+ GP + ++ A
Sbjct: 249 TGSLSYLNVTRKAYWEVHMEQVEVASGLT-LCKGGCEAIVDTGTSLMVGPVDEVRELQKA 307
Query: 315 IGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
IGA ++ E C+ V S L V+ KL+ + Y
Sbjct: 308 IGAIPLIQGEYMIPCEKVSS----------LPSVTLKLAGKDY 340
>gi|156039363|ref|XP_001586789.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980]
gi|154697555|gb|EDN97293.1| hypothetical protein SS1G_11818 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 396
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 132/250 (52%), Positives = 172/250 (68%), Gaps = 14/250 (5%)
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
E GD V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIACY H+K
Sbjct: 70 EGGD-HTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSEC-GSIACYLHTK 127
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S SSTY+KNG S +I YG+G++SGF S D + IGDL +KDQ F EAT EP L F
Sbjct: 128 YDSSSSSTYEKNGTSFEIRYGSGSLSGFTSRDVMSIGDLEIKDQVFAEATEEPGLAFAFG 187
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGLG+ ISV + VP +YNM+NQGL++EPVF+F+ + DE + E +FGG++ D
Sbjct: 188 RFDGILGLGYDTISVNQIVPPFYNMINQGLLDEPVFAFYLGDSKDEGDESEAIFGGVNKD 247
Query: 253 HYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
HY+G+ T +P+ +K YW+ D+ GD D TG I D+GTSL+A P+T+
Sbjct: 248 HYEGKITEIPLRRKAYWEVDLDAISFGDAKADLDNTG-------VILDTGTSLIAVPSTL 300
Query: 308 ITQVNHAIGA 317
+N IGA
Sbjct: 301 AELLNKEIGA 310
>gi|4503143|ref|NP_001900.1| cathepsin D preproprotein [Homo sapiens]
gi|115717|sp|P07339.1|CATD_HUMAN RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|29678|emb|CAA28955.1| cathepsin D [Homo sapiens]
gi|179948|gb|AAA51922.1| cathepsin D [Homo sapiens]
gi|181180|gb|AAB59529.1| preprocathepsin D [Homo sapiens]
gi|16740920|gb|AAH16320.1| Cathepsin D [Homo sapiens]
gi|30582659|gb|AAP35556.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|48146011|emb|CAG33228.1| CTSD [Homo sapiens]
gi|54697170|gb|AAV38957.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|61356567|gb|AAX41260.1| cathepsin D [synthetic construct]
gi|61362282|gb|AAX42193.1| cathepsin D [synthetic construct]
gi|119622866|gb|EAX02461.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622867|gb|EAX02462.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622868|gb|EAX02463.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|123994405|gb|ABM84804.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
gi|261860344|dbj|BAI46694.1| cathepsin D [synthetic construct]
Length = 412
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|123993743|gb|ABM84473.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
Length = 412
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|60820131|gb|AAX36524.1| cathepsin D [synthetic construct]
gi|61363243|gb|AAX42359.1| cathepsin D [synthetic construct]
Length = 412
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIEGE 324
>gi|426366854|ref|XP_004050458.1| PREDICTED: cathepsin D [Gorilla gorilla gorilla]
Length = 412
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 172/255 (67%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF
Sbjct: 131 YVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASAPGGVKVERQVFGEATKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ +R+ D + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +YKG +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|407728652|gb|AFU24355.1| cathepsin D [Ctenopharyngodon idella]
Length = 398
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 191/322 (59%), Gaps = 16/322 (4%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR-KYSLRGNLG- 72
CLLL F T + + RI L K R R S G + + L+ NLG
Sbjct: 5 CLLLAAAFFWTSDA-IVRIPLTKF------RSIRRTLSDSGRAVEELVAGSVPLKYNLGF 57
Query: 73 -ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
S LKNY+DAQY+GEIG+GTP Q+FTV+FDTGSSNLWVPS C IAC H
Sbjct: 58 PASNGPTPGTLKNYLDAQYYGEIGLGTPVQSFTVVFDTGSSNLWVPSVHCSLMDIACLLH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY G+SSTY KNG I YG+G++SG+ S+D +GD+ V+ Q F EA ++P + F+
Sbjct: 118 HKYNGGKSSTYVKNGTEFAIQYGSGSLSGYLSQDTCTVGDIAVEKQIFGEAIKQPGVAFI 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + I+V PV+ M++Q V + +FSF+ NRN D + GGE++ GG D
Sbjct: 178 AAKFDGILGMAYPRIAVDGVPPVFDMMMSQKKVEKNIFSFYLNRNPDTQPGGELLLGGTD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P +Y G+ YV ++++ YWQ M D M G C GGC AI D+GTSL+ GP T I
Sbjct: 238 PKYYTGDFNYVDISRQAYWQIHM-DGMSIGSELTLCKGGCEAIVDTGTSLITGPATEIKA 296
Query: 311 VNHAIGATGIVSQE----CKAV 328
+ AIGA ++ E CK V
Sbjct: 297 LQKAIGAIPLIQGEYMVDCKKV 318
>gi|301619112|ref|XP_002938948.1| PREDICTED: cathepsin D-like [Xenopus (Silurana) tropicalis]
Length = 355
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 125/238 (52%), Positives = 164/238 (68%), Gaps = 2/238 (0%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKK 143
Y+ AQY+GEIG+G+PPQNFTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY K
Sbjct: 30 YLQAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIACWMHHKYDSSKSSTYVK 89
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG + I YGTG++SG+ S+D V IG+L VK Q F EA ++P +TF+ AKFDGILG+ +
Sbjct: 90 NGTAFAIQYGTGSLSGYLSKDTVTIGNLAVKGQIFGEAVKQPGVTFVAAKFDGILGMAYP 149
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
ISV PV+ N++ Q LV +FSF+ NRN D + GGE++ GG DP +Y G+ Y+ V
Sbjct: 150 VISVDGVPPVFDNIMAQKLVESNIFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFHYLSV 209
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
T+K YWQ M + + Q T C GGC I D+GTSL+ GP +T + AIGA ++
Sbjct: 210 TRKAYWQIHMDQLGVGDQLT-LCKGGCEVIVDTGTSLITGPLEEVTALQKAIGAVPLI 266
>gi|121543617|gb|ABM55520.1| putative cathepsin D [Maconellicoccus hirsutus]
Length = 391
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/241 (56%), Positives = 172/241 (71%), Gaps = 5/241 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+NY+DAQY+G I IGTPPQ F V+FDTGSSNLWVPS +C +IAC H+KY S SST
Sbjct: 64 LRNYLDAQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSILNIACLIHNKYNSKTSST 123
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+ NG IHYG+G++SGF S D V IG L ++ Q F EA +EP + F+ AKFDGILGL
Sbjct: 124 YQANGTEFAIHYGSGSLSGFLSSDTVSIGGLDIEKQTFAEAVKEPGIAFIAAKFDGILGL 183
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G++EISVG P +YNMV+QGLV + VFSF+ NRN +GGEI+FGG+DP ++G TY
Sbjct: 184 GYKEISVGGIPPPFYNMVDQGLVKDSVFSFYLNRNTSAADGGEIIFGGVDPSKFRGNFTY 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV+ KGYWQF M + + G+ AIAD+GTSL+AGP+ I +N AIGA I
Sbjct: 244 VPVSVKGYWQFGMEKISLGGKDI----QTSQAIADTGTSLIAGPSEDIAAINKAIGAVEI 299
Query: 321 V 321
+
Sbjct: 300 L 300
>gi|330800100|ref|XP_003288077.1| preprocathepsin D [Dictyostelium purpureum]
gi|325081901|gb|EGC35401.1| preprocathepsin D [Dictyostelium purpureum]
Length = 386
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/294 (48%), Positives = 183/294 (62%), Gaps = 14/294 (4%)
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
KY++ N G + + + ++ DAQY+G I IGTP Q F V+FDTGSSNLW+PS KC
Sbjct: 41 NKYTIAANGGTN-----IPISDFEDAQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCS 95
Query: 123 FSI-ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
++ AC H KY S +SS+Y NG S I YG+GA+SGF S+D V +G L VK+Q F EA
Sbjct: 96 ITVPACDLHEKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVTVGSLSVKNQLFAEA 155
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T EP + F AKFDGILGL FQ ISV PV+YNM++QGLV + +FSFW ++ G
Sbjct: 156 TAEPGIAFDFAKFDGILGLAFQSISVNDIPPVFYNMIDQGLVGQNLFSFWLSKTPG-SNG 214
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG-GCAAIADSGTSL 300
GE+ FG +D Y G TYVP+T YW+F M D I GQ+ GFC GC AIADSGTSL
Sbjct: 215 GELSFGSIDSSKYTGPITYVPLTNTTYWEFKMDDFAIGGQSAGFCGSQGCPAIADSGTSL 274
Query: 301 LAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
+AGP IT +N +GA I + S +IN L V+ L+ R +
Sbjct: 275 IAGPIDFITALNQKLGAVVISGEAIFPDCS-----VINT-LPNVTVTLAGRQFN 322
>gi|185132376|ref|NP_001118183.1| cathepsin D precursor [Oncorhynchus mykiss]
gi|1858020|gb|AAC60301.1| cathepsin D [Oncorhynchus mykiss]
Length = 398
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/299 (45%), Positives = 183/299 (61%), Gaps = 13/299 (4%)
Query: 32 RIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
RI L+K R + R A L + + + KY+ G S LKN+M
Sbjct: 21 RIPLRKFRSIRRTLTDSGRAAEELLAGQEHT------KYNNLGFPSSSNGPTPETLKNFM 74
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNG 145
DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C F+ IAC H KY +SSTY KNG
Sbjct: 75 DAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSFTDIACLLHHKYNGAKSSTYVKNG 134
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+ I YG+G++SG+ S+D IG L ++DQ F EA ++P + F+ AKFDGILG+ + I
Sbjct: 135 TAFAIQYGSGSLSGYLSQDTCTIGGLSIEDQGFGEAIKQPGVAFIAAKFDGILGMAYPRI 194
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
SV P + N+++Q V + VFSF+ NRN D E GGE++ GG DP +Y G+ Y+ V++
Sbjct: 195 SVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPDSEPGGELLLGGTDPKYYSGDFQYLDVSR 254
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+ YWQ M D M G C GGC AI D+GTSL+ GP + + AIGAT ++ E
Sbjct: 255 QAYWQIHM-DGMGVGSQLSLCKGGCEAIVDTGTSLITGPAAEVKALQRAIGATPLIQGE 312
>gi|121705756|ref|XP_001271141.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
gi|119399287|gb|EAW09715.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
Length = 398
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 137/300 (45%), Positives = 185/300 (61%), Gaps = 7/300 (2%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L ++ L+++ + N R ++ R +I K L N V + N+++AQ
Sbjct: 25 LNKVPLEEQLYTHNIDAHVRALGQKYMGIRPNIHKELLEENSFNDMSRHDVLVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SSTYKKNG
Sbjct: 85 YFSEIELGTPPQKFKVVLDTGSSNLWVPSSEC-GSIACYLHTKYDSSASSTYKKNGTEFA 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G +SGF S+D++KIGDL ++ Q+F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 IRYGSGELSGFVSQDNLKIGDLKIEKQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +YNM+NQGL++EPVF+F+ E + FGG+D DH+ GE T +P+ +K YW
Sbjct: 204 IVPPFYNMLNQGLLDEPVFAFYLGDANKEGDSSVATFGGIDKDHFTGELTKIPLRRKAYW 263
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQEC 325
+ D+ + + G I D+GTSL+A P+T+ +N IGA TG S EC
Sbjct: 264 EVDLDAIALGDNVAELDNTGV--ILDTGTSLIALPSTLADLLNKEIGAKKGFTGQYSVEC 321
>gi|310796316|gb|EFQ31777.1| eukaryotic aspartyl protease [Glomerella graminicola M1.001]
Length = 399
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 128/250 (51%), Positives = 167/250 (66%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPS +C SIACY H+KY S SS
Sbjct: 75 VPVSNFMNAQYFSEIQIGTPPQTFKVVLDTGSSNLWVPSQQC-GSIACYLHTKYDSSASS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG S +IHYG+G+++GF S+D V IGDL +K Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKSNGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +YN+VNQ ++EPVF+F+ +E + E FGG+D HY+G+ T
Sbjct: 194 LGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGGLDESHYEGKVT 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
Y+P+ +K YW+ D+ + + +T G AI D+GTSL P+ + +N IGA
Sbjct: 254 YIPLRRKAYWEVDLDAISLGDETADL--EGHGAILDTGTSLNVLPSALAELLNKEIGAKK 311
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 312 GYNGQYSVEC 321
>gi|321461133|gb|EFX72168.1| hypothetical protein DAPPUDRAFT_227643 [Daphnia pulex]
Length = 394
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 129/240 (53%), Positives = 164/240 (68%), Gaps = 3/240 (1%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSS 139
L NY DAQYFG + +GTPPQ F +IFDTGS+NLWVPSS+C ++AC H++Y S SS
Sbjct: 58 PLTNYQDAQYFGPLTLGTPPQEFDIIFDTGSANLWVPSSECAPTNLACRNHNQYNSSLSS 117
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY NG I YGTGA++GF S D + I V DQ F EA EP + F+ +FDGILG
Sbjct: 118 TYTPNGTEFSIQYGTGAMTGFLSTDVLGIAGAQVIDQTFAEAVEEPGVVFVAGRFDGILG 177
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE-EEGGEIVFGGMDPDHYKGEH 258
+ + ISV VP++ NM+ QGLV+EPVFSFW NRN + E GGEI+FGG +P HY+GE
Sbjct: 178 MSYPSISVQGVVPMFQNMMAQGLVDEPVFSFWLNRNLNNPENGGEILFGGTNPTHYEGEI 237
Query: 259 TYVPVTQKGYWQFDMGDVMIDG-QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+YVPV++K YWQF + V + G FC GGC I+D+GTSL+ GP+ IT + IGA
Sbjct: 238 SYVPVSRKAYWQFSVDGVNLAGYDEYPFCNGGCEMISDTGTSLITGPSEEITLFHKLIGA 297
>gi|18858489|ref|NP_571785.1| cathepsin D [Danio rerio]
gi|12053845|emb|CAC20111.1| cathepsin D enzyme [Danio rerio]
Length = 399
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 185/307 (60%), Gaps = 17/307 (5%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY-SLRGNLG--ESGDADIVALKNYMDA 88
RI LKK R R S G S + SL+ NLG S D LKNY+DA
Sbjct: 20 RIPLKKF------RTLRRTLSDSGRSLEELVSSSNSLKYNLGFPASNDPTPETLKNYLDA 73
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKS 147
QY+GEIG+GTP Q FTV+FDTGSSNLWVPS C + IAC H KY G+SSTY KNG
Sbjct: 74 QYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHHKYNGGKSSTYVKNGTQ 133
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F+ AKFDGILG+ + ISV
Sbjct: 134 FAIQYGSGSLSGYLSQDTCTIGDIAVEKQIFGEAIKQPGVAFIAAKFDGILGMAYPRISV 193
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
PV+ M++Q V + VFSF+ NRN D + GGE++ GG DP +Y G+ YV ++++
Sbjct: 194 DGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQA 253
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSG--TSLLAGPTTIITQVNHAIGATGIVSQE- 324
YWQ M D M G C GGC AI D+G TSL+ GP + + AIGA ++ E
Sbjct: 254 YWQIHM-DGMSIGSGLSLCKGGCEAIVDTGTSTSLITGPAAEVKALQKAIGAIPLMQGEY 312
Query: 325 ---CKAV 328
CK V
Sbjct: 313 MVDCKKV 319
>gi|432850603|ref|XP_004066829.1| PREDICTED: cathepsin D-like isoform 3 [Oryzias latipes]
Length = 416
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 185/318 (58%), Gaps = 29/318 (9%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG--ESGDADIVALKNYMD 87
L RI LKK + + L E+ K+SL+ NLG S LKNY+D
Sbjct: 19 LIRIPLKKFR-----SIRRELTDSGREAHELLADKHSLKYNLGFPSSNGPTPETLKNYLD 73
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
AQY+GEI +GTPPQ FTV+FDTGSSNLWVPS C IAC KY S +SSTY KNG
Sbjct: 74 AQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACXXXHKYNSAKSSTYVKNGT 133
Query: 147 SADIHYGTGAISGFFSEDHVK--------------------IGDLVVKDQEFIEATREPS 186
S I YG+G++SG+ S+D IGD+ V++Q F EA ++P
Sbjct: 134 SFSIQYGSGSLSGYLSQDTCTVSVGGAVTPPTTHSVETAKAIGDISVENQVFGEAIKQPG 193
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
+ F+ AKFDGILG+ + ISV VPV+ N++ Q V+ VFSF+ NRN D E GGE++
Sbjct: 194 VAFIAAKFDGILGMAYPRISVDGVVPVFDNIMQQKKVDSNVFSFYLNRNPDTEPGGELLL 253
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG DP +Y G+ YV ++++ YWQ M D M G C GGC AI D+GTSLL GP+
Sbjct: 254 GGTDPKYYSGDFHYVNISRQAYWQIHM-DGMAVGSQLSLCKGGCEAIVDTGTSLLTGPSA 312
Query: 307 IITQVNHAIGATGIVSQE 324
+ + AIGA ++ E
Sbjct: 313 EVKALQKAIGAIPLIQGE 330
>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
Length = 398
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 128/262 (48%), Positives = 171/262 (65%), Gaps = 2/262 (0%)
Query: 64 KYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF 123
KY+ G S LKN+MDAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C F
Sbjct: 52 KYNNLGFPSSSNGPTPETLKNFMDAQYYGEIGLGTPAQTFTVVFDTGSSNLWVPSVHCSF 111
Query: 124 S-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
+ IAC H KY +SSTY KNG + I YG+G++SG+ S+D IG L +++Q F EA
Sbjct: 112 TDIACLLHHKYNGAKSSTYVKNGTAFAIQYGSGSLSGYLSQDTCTIGGLSIEEQVFGEAI 171
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
++P + F+ AKFDGILG+ + ISV P + N+++Q V + VFSF+ NRN + E GG
Sbjct: 172 KQPGVAFIAAKFDGILGMAYPRISVDGVAPPFDNIMSQKKVEQNVFSFYLNRNPESEPGG 231
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
E++ GG DP +Y G+ Y+ V+++ YWQ M D M G C GGC AI D+GTSL+
Sbjct: 232 ELLLGGTDPKYYSGDFQYLNVSRQAYWQVHM-DGMGVGSQLSLCKGGCEAIVDTGTSLIT 290
Query: 303 GPTTIITQVNHAIGATGIVSQE 324
GPT + + AIGAT ++ E
Sbjct: 291 GPTAEVKALQKAIGATPLIQGE 312
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 133/253 (52%), Positives = 169/253 (66%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI +GTPPQNF VI DTGSSNLWVPSSKC SIAC+ H+KY S SS
Sbjct: 85 VPLTNFMNAQYFTEITLGTPPQNFKVILDTGSSNLWVPSSKCT-SIACFLHAKYDSSASS 143
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG I YG+G++ GF S+D + IGDL + Q+F EA +EP LTF KFDGILG
Sbjct: 144 TYKQNGTEFSIQYGSGSMEGFVSQDVLTIGDLTIPGQDFAEAVKEPGLTFAFGKFDGILG 203
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP YNM+N+GL++EPVFSF ++ EE+GGE +FGG+D YKG+ T
Sbjct: 204 LGYDTISVNHIVPPHYNMINKGLLDEPVFSFRLGKS--EEDGGEAIFGGVDKSAYKGDLT 261
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YVPV +K YW+ ++ + + + G A D+GTSL+A PT + +N IGA
Sbjct: 262 YVPVRRKAYWEVELEKISFGSEELELESTGAA--IDTGTSLIALPTDMAEMINAEIGAKK 319
Query: 318 --TGIVSQECKAV 328
G EC V
Sbjct: 320 SWNGQYQVECSKV 332
>gi|4927648|gb|AAD33219.1| cathepsin D [Hynobius leechii]
Length = 397
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 128/244 (52%), Positives = 163/244 (66%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+GEI IGTPPQ FTV+FDTGSSNLWVPS C IAC H KY S SST
Sbjct: 68 LKNYLDAQYYGEICIGTPPQCFTVVFDTGSSNLWVPSVHCSLLDIACLVHPKYDSSSSST 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG I YGTG++SG+ +D V +G L V Q F EA ++P + F+ AKFDGILG+
Sbjct: 128 YVKNGTEFSIQYGTGSLSGYLRQDTVSVGGLGVLKQVFGEAIKQPGVAFIAAKFDGILGM 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV V+ N+++Q LV + VFSF+ NRN D GGE++ GG DP++Y G+ TY
Sbjct: 188 AYPRISVDGVTTVFDNIMSQKLVEKNVFSFYLNRNPDTRPGGELLLGGTDPNYYTGDFTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ VT K YWQ M + + Q T C GGC AI D+GTSL+ GP+ +T + AIGA +
Sbjct: 248 LNVTPKAYWQIHMDQLGVGDQLT-LCKGGCEAIVDTGTSLIIGPSAEVTALQKAIGAIPL 306
Query: 321 VSQE 324
+ E
Sbjct: 307 IQGE 310
>gi|380483026|emb|CCF40872.1| vacuolar protease A [Colletotrichum higginsianum]
Length = 399
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 128/250 (51%), Positives = 166/250 (66%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IG PPQ F V+ DTGSSNLWVPS +C SIACY H+KY S SS
Sbjct: 75 VPVSNFMNAQYFSEIEIGNPPQTFKVVLDTGSSNLWVPSQQC-GSIACYLHTKYDSSASS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG S +IHYG+G+++GF S+D V IGDL +K Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKANGSSFEIHYGSGSLTGFVSQDDVSIGDLKIKKQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +YN+VNQ ++EPVF+F+ +E + E FGG+D HY+G+ T
Sbjct: 194 LGYDTISVNKIVPPFYNLVNQKAIDEPVFAFYLGDTNEEGDESEATFGGLDDSHYEGKIT 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
Y+P+ +K YW+ D+ + + QT G AI D+GTSL P+ + +N IGA
Sbjct: 254 YIPLRRKAYWEVDLDAISLGDQTAEL--EGHGAILDTGTSLNVLPSALAELLNKEIGAKK 311
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 312 GYNGQYSVEC 321
>gi|361128953|gb|EHL00878.1| putative Vacuolar protease A [Glarea lozoyensis 74030]
Length = 399
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/287 (48%), Positives = 185/287 (64%), Gaps = 26/287 (9%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFD 108
R ++ E E F+ S SL G A V + N+++AQYF EI IGTPPQ+F V+ D
Sbjct: 53 RPEAHEQEMFKDS----SLHTEKG----AHPVPVSNFLNAQYFSEITIGTPPQSFKVVLD 104
Query: 109 TGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI 168
TGSSNLWVPSS+C SIACY H+KY S SSTYKKNG +I YG+G++SGF S+D + I
Sbjct: 105 TGSSNLWVPSSEC-GSIACYLHTKYDSSSSSTYKKNGSDFEIRYGSGSLSGFVSQDTMTI 163
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
GDL +KDQ F EAT EP L F +FDGILGLGF ISV K VP +Y+M+NQGL++EPVF
Sbjct: 164 GDLKIKDQIFAEATEEPGLAFAFGRFDGILGLGFDTISVNKIVPPFYSMINQGLLDEPVF 223
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTT 283
+F+ + EE E FGG++ DHY G+ T +P+ +K YW+ D+ GD + + T
Sbjct: 224 AFYLGDTNNGEE-SEATFGGVNEDHYTGKMTTIPLRRKAYWEVDLDAITFGDATAELENT 282
Query: 284 GFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECK 326
G I D+GTSL+A P+T+ +N +GA G + EC+
Sbjct: 283 G-------VILDTGTSLIALPSTLAELLNKEMGAKKGYNGQYTVECE 322
>gi|73947914|ref|XP_533610.2| PREDICTED: napsin-A [Canis lupus familiaris]
Length = 422
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/284 (48%), Positives = 179/284 (63%), Gaps = 4/284 (1%)
Query: 73 ESGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
SGD + V L NYM+ QY+GEIG+GTPPQNF+VIFDTGSSNLWVPS +C +FS+ C+FH
Sbjct: 58 SSGDNPVFVPLSNYMNVQYYGEIGLGTPPQNFSVIFDTGSSNLWVPSIRCHFFSLPCWFH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+Y S SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 118 HRYNSKASSSFQPNGTKFAIQYGTGRLDGILSEDKLTIGGVKSASVIFGEALWEPSLVFT 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
LA FDGILGLGF ++VG P +V+QGL+++PVFSF+ NR+ + +GGE+V GG D
Sbjct: 178 LAHFDGILGLGFPILAVGGVQPPLDLLVDQGLLDKPVFSFYLNRDPEAVDGGELVLGGSD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY T++PVT YWQ M V + G CA GCAAI D+GTSL+ GPT I
Sbjct: 238 PAHYIPPLTFLPVTVPAYWQIHMERVKV-GTGLILCAQGCAAILDTGTSLITGPTEEIQA 296
Query: 311 VNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+N AIG ++ E S+ I+ LL V L+ + Y
Sbjct: 297 LNAAIGGFSLLLGEYLIQCSEIPTLPPISFLLGGVWFNLTAQDY 340
>gi|308483047|ref|XP_003103726.1| CRE-ASP-4 protein [Caenorhabditis remanei]
gi|308259744|gb|EFP03697.1| CRE-ASP-4 protein [Caenorhabditis remanei]
Length = 462
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/278 (49%), Positives = 172/278 (61%), Gaps = 23/278 (8%)
Query: 64 KYSLRGNLGESGDADIVALKNYMD----------------AQYFGEIGIGTPPQNFTVIF 107
KY+ LGE + L+NYMD AQYFG I IGTP QNFTVIF
Sbjct: 73 KYASVKPLGEIDEL----LRNYMDVRAQRLCCLKSKIIFQAQYFGTISIGTPGQNFTVIF 128
Query: 108 DTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHV 166
DTGSSNLWVPS KC ++ IAC H +Y S SSTYK++G+ I YGTG++ GF S+D V
Sbjct: 129 DTGSSNLWVPSKKCPFYDIACMLHHRYDSKSSSTYKEDGRKMAIQYGTGSMKGFISKDSV 188
Query: 167 KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEP 226
+ + ++Q F EAT EP +TF+ AKFDGILG+ + EI+V PV+ + Q V
Sbjct: 189 CVAGVCAEEQPFAEATSEPGITFVAAKFDGILGMAYPEIAVLGVQPVFNTLFEQKKVPSN 248
Query: 227 VFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC 286
VFSFW NRN D + GGEI FGG+DP Y TY PVT+KGYWQF M V+ G C
Sbjct: 249 VFSFWLNRNPDSDLGGEITFGGIDPRRYVEPITYTPVTRKGYWQFKMDKVV--GSGVLGC 306
Query: 287 AGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+ GC AIAD+GTSL+AGP I + + IGA ++ E
Sbjct: 307 SNGCQAIADTGTSLIAGPKAQIEAIQNFIGAEPLIKGE 344
>gi|149757990|ref|XP_001490885.1| PREDICTED: napsin-A [Equus caballus]
Length = 401
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 168/254 (66%), Gaps = 3/254 (1%)
Query: 73 ESGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
GD I V L +YM+AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH
Sbjct: 55 SPGDKPIFVPLSDYMNAQYYGEIGLGTPPQNFSVLFDTGSSNLWVPSVRCHFFSLPCWFH 114
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
++ SS++K NG I YGTG ++G SED + IG + F EA EPSL F
Sbjct: 115 HRFNPKASSSFKPNGTKFAIQYGTGRLNGILSEDKLTIGGITGASVVFGEALSEPSLIFT 174
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+A FDGILGLGF ++V P +V+QGL+++PVFSF+ NR+ + +GGE+V GG D
Sbjct: 175 IAHFDGILGLGFPILAVEGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSD 234
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I
Sbjct: 235 PSHYIPPLTFVPVTIPAYWQIHMKRVKV-GTGLTLCAQGCAAILDTGTSLITGPTEEIRA 293
Query: 311 VNHAIGATGIVSQE 324
++ AIG +++ E
Sbjct: 294 LHAAIGGIPLLAGE 307
>gi|6978973|dbj|BAA90785.1| aspartic proteinase family member similar to renin [Mus musculus]
Length = 419
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 181/290 (62%), Gaps = 16/290 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE--GESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
L R+ L++ L +R+ L+ E E RTS G+ V L +M+
Sbjct: 23 LIRVPLQR--IHLGHRILNPLNGWEQLAELSRTST----------SGGNPSFVPLSKFMN 70
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
QYFG IG+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++ SS+++ NG
Sbjct: 71 TQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKASSSFRPNGT 130
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG +SG S+D++ IG + F EA EPSL F LA FDGILGLGF ++
Sbjct: 131 KFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHFDGILGLGFPTLA 190
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG P +MV QGL+ +PVFSF+ NR+++ +GGE+V GG DP HY T++PVT
Sbjct: 191 VGGVQPPLDSMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIP 250
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YWQ M V + G CA GC+AI D+GTSL+ GP+ I +N AIG
Sbjct: 251 AYWQVHMESVKV-GTGLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIG 299
>gi|346973691|gb|EGY17143.1| vacuolar protease A [Verticillium dahliae VdLs.17]
Length = 398
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 169/255 (66%), Gaps = 7/255 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPS +C SIACY H+KY S SS
Sbjct: 74 VPVSNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQQCS-SIACYLHTKYDSSDSS 132
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G+++GF S+D V IGD+ +K+Q+F EAT EP L F +FDGILG
Sbjct: 133 TYKANGSEFEIHYGSGSLTGFVSQDTVTIGDIKIKNQDFAEATSEPGLAFAFGRFDGILG 192
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +Y MVNQ V+EPVF+F+ ++ + E+VFGG+D HY+G+ T
Sbjct: 193 LGYDTISVNKIVPPFYQMVNQKAVDEPVFAFYLGDTNEQGDESEVVFGGVDESHYEGKIT 252
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + + T G AI D+GTSL P+T+ +N+ IGA
Sbjct: 253 TIPLRRKAYWEVDLDSISLGDNTAEL--DGHGAILDTGTSLNVLPSTLADMLNNEIGAKK 310
Query: 318 --TGIVSQECKAVVS 330
G S EC S
Sbjct: 311 GYNGQWSVECDKRAS 325
>gi|74199699|dbj|BAE41511.1| unnamed protein product [Mus musculus]
Length = 419
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 180/290 (62%), Gaps = 16/290 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE--GESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
L R+ L++ L +R+ L+ E E RTS G+ V L +M+
Sbjct: 23 LIRVPLQR--IHLGHRILNPLNGWEQLAELSRTST----------SGGNPSFVPLSKFMN 70
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
QYFG IG+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++ SS+++ NG
Sbjct: 71 TQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKASSSFRPNGT 130
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG +SG S+D++ IG + F EA EPSL F LA FDGILGLGF ++
Sbjct: 131 KFAIQYGTGRLSGILSQDNLTIGGIHDAFATFGEALWEPSLIFALAHFDGILGLGFPTLA 190
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG P MV QGL+ +PVFSF+ NR+++ +GGE+V GG DP HY T++PVT
Sbjct: 191 VGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIP 250
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YWQ M V + G CA GC+AI D+GTSL+ GP+ I +N AIG
Sbjct: 251 AYWQVHMESVKV-GTGLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIG 299
>gi|195134378|ref|XP_002011614.1| GI11124 [Drosophila mojavensis]
gi|193906737|gb|EDW05604.1| GI11124 [Drosophila mojavensis]
Length = 373
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 195/317 (61%), Gaps = 17/317 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSL-RGNLGES 74
+ + VV + + L+R+ + K + N V R + K +++ + KY+L L E
Sbjct: 5 ITVLAVVLAVASAELHRVPILKHE----NFVKTRENVKAEKAYLRA--KYNLPNARLNEE 58
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
L N ++ Y+G I IGTPPQ+F V+FD+GSSNLWVPSS C +F +AC H++Y
Sbjct: 59 ------ELSNSINMAYYGTISIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACMNHNQY 112
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
+SSTY+ NG+S I YG+G++SGF S D V + LV+K Q F EAT EP +F +K
Sbjct: 113 DHDKSSTYEANGESFSIQYGSGSLSGFLSTDTVDVNGLVIKKQTFAEATSEPGNSFTNSK 172
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILG+ +Q ++V VP +YNMV+QGLV+E VFSF+ R+ EGGE++FGG D
Sbjct: 173 FDGILGMAYQSLAVDNVVPPFYNMVSQGLVDESVFSFYLARDGTSNEGGELIFGGSDSSL 232
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y GE TYVP++Q+GYWQF + + IDGQT C C AIAD+GTSLL +N+
Sbjct: 233 YTGELTYVPISQQGYWQFAVDSISIDGQT--LC-DNCQAIADTGTSLLVVSEDAYDILNN 289
Query: 314 AIGATGIVSQECKAVVS 330
+ +C AV S
Sbjct: 290 LLNVDEDGLVDCSAVDS 306
>gi|119592255|gb|EAW71849.1| napsin A aspartic peptidase, isoform CRA_c [Homo sapiens]
Length = 328
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 132/262 (50%), Positives = 166/262 (63%), Gaps = 3/262 (1%)
Query: 74 SGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
GD I V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H
Sbjct: 61 PGDKPIFVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHH 120
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 121 RFDPKASSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAF 180
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILGLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP
Sbjct: 181 AHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDP 240
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +
Sbjct: 241 AHYIPPLTFVPVTVPAYWQIHMERVKV-GPGLTLCAKGCAAILDTGTSLITGPTEEIRAL 299
Query: 312 NHAIGATGIVSQECKAVVSQYG 333
+ AIG +++ E ++ YG
Sbjct: 300 HAAIGGIPLLAGEVRSQSGGYG 321
>gi|332241362|ref|XP_003269849.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 133/276 (48%), Positives = 174/276 (63%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFGEIG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 69 VPLSNYRDVQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKAS 128
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG DI YGTG + G SED + IG + F EA EPSL F A FDGIL
Sbjct: 129 SSFQANGTKFDIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGIL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++P+FSF+ NR+ +E +GGE+V GG DP HY
Sbjct: 189 GLGFPILSVEGVRPPVDVLVEQGLLDKPIFSFYLNRDPEEPDGGELVLGGSDPAHYIPPL 248
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 249 TFVPVTVPAYWQIHMERVKV-GPGLTLCARGCAAILDTGTSLITGPTEEIRALHAAIGGY 307
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL V L+ + Y
Sbjct: 308 PLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAQDY 343
>gi|307166067|gb|EFN60339.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 370
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 134/279 (48%), Positives = 182/279 (65%), Gaps = 17/279 (6%)
Query: 40 FDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTP 99
F L + R+ ++ +S R +++K + D +L Y+DAQY+G I IGTP
Sbjct: 12 FALIDAKTQRVLLQKTDSVRKTLKKVGI--------DLTSESLFKYLDAQYYGVISIGTP 63
Query: 100 PQNFTVIFDTGSSNLWVPSSK---CYFSIACY---FHSKYRSGRSSTYKKNGKSADIHYG 153
PQNFTV+FDTGSSNLWVPS K ++ ++C+ +H KY + +S TY+ N I YG
Sbjct: 64 PQNFTVLFDTGSSNLWVPSIKSEITFYKLSCWTAPYHHKYNNSKSITYQANSAPFAIEYG 123
Query: 154 TGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPV 213
+G +SGF S D V + L V++Q F EAT E S+ F+L +FDGILG+G+ ISV P+
Sbjct: 124 SGDLSGFLSTDVVNVAGLNVRNQTFAEATHESSI-FILMQFDGILGMGYPTISVDGVTPI 182
Query: 214 WYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM 273
+ NM+ Q LV++P+FSF+ NRN EEGGE++ GG DP+HY GE TYVPVT +GYWQF M
Sbjct: 183 FQNMIQQRLVSQPIFSFYLNRNPSAEEGGELILGGCDPNHYVGEFTYVPVTVEGYWQFTM 242
Query: 274 GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
D +I G CA GC AIAD+GTSL+ GP+ I +N
Sbjct: 243 -DSVIAGNYI-LCAQGCQAIADTGTSLIVGPSEDIDVIN 279
>gi|296417651|ref|XP_002838466.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634405|emb|CAZ82657.1| unnamed protein product [Tuber melanosporum]
Length = 396
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 194/318 (61%), Gaps = 34/318 (10%)
Query: 28 GGLYRIGLKK-------RKFDLNNRVAA--------RLDSKEGESFRTSIRKYSLRGNLG 72
G++R LKK D+N ++ A R + + E F+T K
Sbjct: 18 AGVHRAPLKKVPLTEQLSHHDINTQMRALGQKYMGIRPEKIDEEMFKTQEIK-------- 69
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ D V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIACY HSK
Sbjct: 70 -TDDGHPVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCG-SIACYLHSK 127
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S SSTY+ NG S +I YG+G++SGF S+D+++IG+L +KDQ F EAT EP L F
Sbjct: 128 YDSSTSSTYRPNGTSFEIRYGSGSLSGFVSQDNIEIGNLKIKDQTFAEATSEPGLAFAFG 187
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGLG+ ISV VP +Y MV+QGL++EPVF+F+ D+++ E +FGG+D
Sbjct: 188 RFDGILGLGYDSISVNHIVPPFYQMVDQGLLDEPVFAFYL---GDKDDQSEAIFGGIDKA 244
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
HY+G+ +PV +K YW+ + + T F G AI D+GTSL+A P+T+ +N
Sbjct: 245 HYQGKLIKLPVRRKAYWEVEFEAITFGKSTAQFENTG--AIVDTGTSLIALPSTLAELLN 302
Query: 313 HAIGA----TGIVSQECK 326
IGA G S EC+
Sbjct: 303 KEIGAKKGFNGQYSVECE 320
>gi|12832561|dbj|BAB22158.1| unnamed protein product [Mus musculus]
Length = 419
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 180/290 (62%), Gaps = 16/290 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE--GESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
L R+ L++ L +R+ L+ E E RTS G+ V L +M+
Sbjct: 23 LIRVPLQR--IHLGHRILNPLNGWEQLAELSRTST----------SGGNPSFVPLSKFMN 70
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
QYFG IG+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++ SS+++ NG
Sbjct: 71 TQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKASSSFRPNGT 130
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG +SG S+D++ IG + F EA EPSL F LA FDGILGLGF ++
Sbjct: 131 KFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHFDGILGLGFPTLA 190
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG P MV QGL+ +PVFSF+ NR+++ +GGE+V GG DP HY T++PVT
Sbjct: 191 VGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIP 250
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YWQ M V + G CA GC+AI D+GTSL+ GP+ I +N AIG
Sbjct: 251 AYWQVHMESVKV-GTGLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIG 299
>gi|311258028|ref|XP_003127411.1| PREDICTED: napsin-A [Sus scrofa]
Length = 416
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 139/303 (45%), Positives = 177/303 (58%), Gaps = 12/303 (3%)
Query: 24 STPNGG-LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
S P G L RI L++ L R K E R GD V L
Sbjct: 16 SEPAGAMLIRIPLRRVHAGLRTLNPLRAWEKSAEPPRLGAPS---------PGDKTFVPL 66
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTY 141
NY++ QY+GEIG+GTPPQNF+VIFDTGSSNLWVPS +C+F S+ C+ H +Y S SS++
Sbjct: 67 SNYLNVQYYGEIGLGTPPQNFSVIFDTGSSNLWVPSGRCHFLSLPCWLHHRYHSKASSSF 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
N I YGTG ++G SED + IG L F EA EPSL F A FDGILGLG
Sbjct: 127 HSNETKFAIQYGTGRLNGILSEDKLTIGGLTGASVIFGEALWEPSLVFAFAHFDGILGLG 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
F ++VG P ++V+QGL+++PVFSF+ NR+ + +GGE+V GG DP HY T+V
Sbjct: 187 FPVLAVGGVRPPLDSLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHYIPPLTFV 246
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT YWQ + V + G CA GCAAI D+GTSL+ GPT I + AIG ++
Sbjct: 247 PVTVPAYWQVHVERVHV-GTGLTLCAQGCAAILDTGTSLITGPTEEIQALQAAIGGIPLL 305
Query: 322 SQE 324
E
Sbjct: 306 MGE 308
>gi|6680552|ref|NP_032463.1| napsin-A precursor [Mus musculus]
gi|6016430|sp|O09043.1|NAPSA_MOUSE RecName: Full=Napsin-A; AltName: Full=KDAP-1; AltName:
Full=Kidney-derived aspartic protease-like protein;
Short=KAP; Flags: Precursor
gi|1906810|dbj|BAA19004.1| kidney-derived aspartic protease-like protein [Mus musculus]
gi|7340352|emb|CAB82907.1| Napsin [Mus musculus]
gi|15928694|gb|AAH14813.1| Napsin A aspartic peptidase [Mus musculus]
gi|74220342|dbj|BAE31398.1| unnamed protein product [Mus musculus]
Length = 419
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 180/290 (62%), Gaps = 16/290 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE--GESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
L R+ L++ L +R+ L+ E E RTS G+ V L +M+
Sbjct: 23 LIRVPLQR--IHLGHRILNPLNGWEQLAELSRTST----------SGGNPSFVPLSKFMN 70
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGK 146
QYFG IG+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++ SS+++ NG
Sbjct: 71 TQYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKASSSFRPNGT 130
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG +SG S+D++ IG + F EA EPSL F LA FDGILGLGF ++
Sbjct: 131 KFAIQYGTGRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHFDGILGLGFPTLA 190
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG P MV QGL+ +PVFSF+ NR+++ +GGE+V GG DP HY T++PVT
Sbjct: 191 VGGVQPPLDAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIP 250
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YWQ M V + G CA GC+AI D+GTSL+ GP+ I +N AIG
Sbjct: 251 AYWQVHMESVKV-GTGLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIG 299
>gi|406861956|gb|EKD15008.1| aspartic endopeptidase Pep2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 401
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 203/319 (63%), Gaps = 27/319 (8%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVA-ARLDS---KEGESFRTSIRKYS---- 66
L+ + S+ + G++++ LKK L+ ++A A +D+ G+ + IR S
Sbjct: 4 ALVTAATLLSSASAGIHKLPLKK--VSLSEQLATANIDAHVKNLGQKYM-GIRPQSHADE 60
Query: 67 --LRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 124
++ E G V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C S
Sbjct: 61 MFKETSVHEDGSDHTVPVSNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GS 119
Query: 125 IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATRE 184
IACY H+KY S SSTYKKNG + +I YG+G++SGF SED + IGDL +K+Q F EAT+E
Sbjct: 120 IACYLHTKYDSSSSSTYKKNGTAFEIRYGSGSLSGFTSEDTMSIGDLKIKNQIFAEATQE 179
Query: 185 PSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGE 243
P L F +FDGILGLG+ ISV K P +YNMVNQ L++EPVF+F+ + + EE+ E
Sbjct: 180 PGLAFAFGRFDGILGLGYDTISVNKIPPPFYNMVNQELLDEPVFAFYLGSTDKGEEDQSE 239
Query: 244 IVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGT 298
+FGG++ DH+ G+ T +P+ +K YW+ D+ GD + + TG I D+GT
Sbjct: 240 AIFGGVNKDHFTGKITEIPLRRKAYWEVDLDAITFGDATAELENTG-------VILDTGT 292
Query: 299 SLLAGPTTIITQVNHAIGA 317
SL+A P+T+ +N +GA
Sbjct: 293 SLIALPSTLAELLNKEMGA 311
>gi|426244096|ref|XP_004015868.1| PREDICTED: napsin-A [Ovis aries]
Length = 443
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 125/246 (50%), Positives = 163/246 (66%), Gaps = 2/246 (0%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY++AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 68 VPLSNYLNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCRFFSLPCWLHHRFNPKAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG ++G SED + IG + F EA EPSL F A FDGIL
Sbjct: 128 SSFRFNGTKFAIQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++VG P +V+QGL+++PVFSF+ NRN + +GGE+V GG DP HY
Sbjct: 188 GLGFPVLAVGGVQPPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT +WQ M V + G CA GCAAI D+GTSL+ GPT I + AIGA
Sbjct: 248 TFVPVTIPAFWQIHMERVQV-GTGLTLCARGCAAILDTGTSLITGPTEEIRALQKAIGAV 306
Query: 319 GIVSQE 324
++ E
Sbjct: 307 PLLMGE 312
>gi|58258949|ref|XP_566887.1| endopeptidase [Cryptococcus neoformans var. neoformans JEC21]
gi|134107071|ref|XP_777848.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260546|gb|EAL23201.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223024|gb|AAW41068.1| endopeptidase, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 438
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 139/260 (53%), Positives = 173/260 (66%), Gaps = 19/260 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQYF + IGTP Q F VI DTGSSNLWVPS KC SIAC+ HSKY S +SS
Sbjct: 117 VPLSNYMNAQYFATMEIGTPFQTFKVILDTGSSNLWVPSVKCT-SIACFLHSKYDSSQSS 175
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G++ GF S+D V IGDLVVK Q+F EAT+EP L F KFDGILG
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTVSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+NQ L++EPVFS F + +E+GGE +FGG+D Y GE
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFS--FRLGSSDEDGGEAIFGGIDDSAYSGELQ 293
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +KGYW+ ++ GD ++ + TG AAI D+GTSL+ PT + +N
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTG------AAI-DTGTSLIVMPTDVAELLNKE 346
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + +C V S
Sbjct: 347 IGAEKSWNGQYTVDCSTVSS 366
>gi|125858582|gb|AAI29608.1| Ce1-A protein [Xenopus laevis]
Length = 394
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 190/302 (62%), Gaps = 5/302 (1%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
+L ++F+T GL R+ LK++K + R + K + + +
Sbjct: 1 ILVLLLFATLVYGLIRVPLKRQK---SIRKTLKEKGKLSHIWTQQGIDMVQYTDSCSNDQ 57
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSG 136
A L NYMD +YFGEI +GTPPQNFTVIFDTGSSNLWVPS C S AC H +++
Sbjct: 58 APSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-ISQACAQHDRFQPQ 116
Query: 137 RSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDG 196
SSTY+ NG + + YGTG++SG D V + ++V++Q+F E+ EP TF+ A+FDG
Sbjct: 117 LSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVSEPGSTFVDAEFDG 176
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
ILGLG+ I+VG PV+ NM+ Q LV P+FS + +RN + GGE+VFGG D + G
Sbjct: 177 ILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSIYMSRNPNSAVGGELVFGGFDASRFSG 236
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+ +VPVT +GYWQ + +V I+G+ FC+GGC AI D+GTSL+ GP++ I Q+ + IG
Sbjct: 237 QLNWVPVTNQGYWQIQLDNVQINGEVL-FCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 295
Query: 317 AT 318
A+
Sbjct: 296 AS 297
>gi|18203300|sp|Q9MZS8.1|CATD_SHEEP RecName: Full=Cathepsin D; Flags: Precursor
gi|8886526|gb|AAF80494.1|AF164143_1 cathepsin D [Ovis aries]
Length = 365
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 196/331 (59%), Gaps = 27/331 (8%)
Query: 38 RKFDLNNRVAAR-LDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGI 96
KF N R + + E + I KY+ R G + L NYMDAQY+GEIGI
Sbjct: 2 HKFTSNRRTMSEAMGPVEHLIAKGPISKYATREPAVRQGPIPEL-LTNYMDAQYYGEIGI 60
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
GTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SSTY KNG + DIHYG+G
Sbjct: 61 GTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWVHHKYNSDKSSTYVKNGTTFDIHYGSG 120
Query: 156 AISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
++SG+ S+D V + G + V+ Q F EA ++P + F+ AKFDGILG+ + IS
Sbjct: 121 SLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIAAKFDGILGMAYPRIS 180
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
V +PV+ N++ Q LV++ VFSF+ NR+ + G E++ GG D +Y+G TY VT++
Sbjct: 181 VNNVLPVFDNLMRQKLVDKNVFSFFLNRDPKAQPGEELMLGGTDSKYYRGSLTYHNVTRQ 240
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE-- 324
YWQ M D + G + C GGC AI D+GTSL+ GP + +++ AIGA ++ E
Sbjct: 241 AYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLMVGPVDEVRELHKAIGAVPLIQGEYM 299
Query: 325 --CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
C+ V S L +V+ KL + Y
Sbjct: 300 IPCEKVSS----------LPQVTLKLGGKDY 320
>gi|281182624|ref|NP_001162374.1| cathepsin D precursor [Papio anubis]
gi|160904227|gb|ABX52210.1| cathepsin D (predicted) [Papio anubis]
Length = 412
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 169/255 (66%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHRKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S IHYG+G++SG+ S+D V + G + V+ Q F EA ++P +TF
Sbjct: 131 YVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +Y+G +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYRGSLSYLNVTRKAYWQVHLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|380036056|ref|NP_001244039.1| cathepsin D1 precursor [Ictalurus punctatus]
gi|330689904|gb|AEC33270.1| cathepsin D1 [Ictalurus punctatus]
Length = 396
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 167/254 (65%), Gaps = 6/254 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
LKNY+DAQY+GEIG+G+P Q FTV+FDTGSSNLWVPS C + IAC H KY +SST
Sbjct: 68 LKNYLDAQYYGEIGLGSPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLHHKYNGAKSST 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KNG + I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F+ AKFDGILG+
Sbjct: 128 YVKNGTAFAIQYGSGSLSGYLSQDVCTIGDIAVEKQIFGEAIKQPGVAFIAAKFDGILGM 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ I+V PV+ M++Q V + VFSF+ NRN D + GGE++ GG DP Y G+ Y
Sbjct: 188 AYPRIAVDGVPPVFDMMMSQKKVEKNVFSFYLNRNPDTQPGGELLLGGTDPKFYTGDFHY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V +T++ YWQ M + I Q T C GGC AI D+GTSL+ GP + + AIGA +
Sbjct: 248 VNITRQAYWQIHMDGMTIGSQLT-LCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPL 306
Query: 321 VSQE----CKAVVS 330
+ E CK V S
Sbjct: 307 IQGEYMVDCKKVPS 320
>gi|313219527|emb|CBY30450.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 123/247 (49%), Positives = 170/247 (68%), Gaps = 4/247 (1%)
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKY 133
GD + NYMDAQY+G I IGTPPQ F+VIFDTGSSNLWVPS+KC F+ +AC+ H KY
Sbjct: 57 GDGHSEPITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNVACFLHRKY 116
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
S S+++K +G+ I YG+G++SGF S D V++ + V+DQ+F EA EP +TF+ AK
Sbjct: 117 DSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEPGITFVAAK 176
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGI+GLG+ I+V K P NM+ QGL+++ +FSF+ NR A+ E+GGE+ GG+D
Sbjct: 177 FDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELTIGGVDNSR 236
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG---GCAAIADSGTSLLAGPTTIITQ 310
+ G+ ++ VT++ YWQ M + + G+ C G GC I DSGTSLLA P + +
Sbjct: 237 FTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLAVPKNLAEE 296
Query: 311 VNHAIGA 317
+NHAIGA
Sbjct: 297 INHAIGA 303
>gi|148227998|ref|NP_001079043.1| cathepsin E-A precursor [Xenopus laevis]
gi|46395761|sp|Q805F3.1|CATEA_XENLA RecName: Full=Cathepsin E-A; Flags: Precursor
gi|28460653|dbj|BAC57453.1| cathepsin E1 [Xenopus laevis]
gi|213625998|gb|AAI69692.1| Cathepsin E1 [Xenopus laevis]
gi|213627772|gb|AAI69694.1| Cathepsin E1 [Xenopus laevis]
Length = 397
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 190/302 (62%), Gaps = 5/302 (1%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
+L ++F+T GL R+ LK++K + R + K + + +
Sbjct: 4 ILVLLLFATLVYGLIRVPLKRQK---SIRKTLKEKGKLSHIWTQQGIDMVQYTDSCSNDQ 60
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSG 136
A L NYMD +YFGEI +GTPPQNFTVIFDTGSSNLWVPS C S AC H +++
Sbjct: 61 APSEPLINYMDVEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC-ISQACAQHDRFQPQ 119
Query: 137 RSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDG 196
SSTY+ NG + + YGTG++SG D V + ++V++Q+F E+ EP TF+ A+FDG
Sbjct: 120 LSSTYESNGNNFSLQYGTGSLSGVIGIDAVTVEGILVQNQQFGESVSEPGSTFVDAEFDG 179
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
ILGLG+ I+VG PV+ NM+ Q LV P+FS + +RN + GGE+VFGG D + G
Sbjct: 180 ILGLGYPSIAVGDCTPVFDNMIAQNLVELPMFSVYMSRNPNSAVGGELVFGGFDASRFSG 239
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+ +VPVT +GYWQ + +V I+G+ FC+GGC AI D+GTSL+ GP++ I Q+ + IG
Sbjct: 240 QLNWVPVTNQGYWQIQLDNVQINGEVL-FCSGGCQAIVDTGTSLITGPSSDIVQLQNIIG 298
Query: 317 AT 318
A+
Sbjct: 299 AS 300
>gi|386869594|ref|NP_001247483.1| cathepsin D precursor [Macaca mulatta]
gi|67971186|dbj|BAE01935.1| unnamed protein product [Macaca fascicularis]
gi|384939322|gb|AFI33266.1| cathepsin D preproprotein [Macaca mulatta]
Length = 412
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 169/255 (66%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S IHYG+G++SG+ S+D V + G + V+ Q F EA ++P +TF
Sbjct: 131 YVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +Y+G +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|410982348|ref|XP_003997519.1| PREDICTED: napsin-A [Felis catus]
Length = 422
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 130/283 (45%), Positives = 179/283 (63%), Gaps = 9/283 (3%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYR 134
+ ++ L NYM+ QY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H ++
Sbjct: 62 NPTVIPLSNYMNVQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFFSLPCWLHHRFN 121
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
SS+++ NG DI YGTG ++G SED + IG ++ F EA E SL F LA+F
Sbjct: 122 PKASSSFQPNGTKFDIQYGTGRLAGILSEDKLTIGGMMNASVIFGEALWESSLVFTLARF 181
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILGL F ++VG P +V+QGL+++PVFSF+ NR+ + +GGE+V GG DP HY
Sbjct: 182 DGILGLAFPVLAVGGVRPPLDVLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSDPAHY 241
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
T+VPVT YWQ M + M G CA GCAAI D+GTSL+ GPT I +N A
Sbjct: 242 IPPLTFVPVTIPAYWQIHM-ERMKVGTGLTLCAQGCAAILDTGTSLITGPTEEIRALNTA 300
Query: 315 IGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
IG ++ E C+ + + ++ LL V L+ + Y
Sbjct: 301 IGGISLLVGEYLIQCETIPTL---PPVSFLLGGVWFNLTAQDY 340
>gi|90076280|dbj|BAE87820.1| unnamed protein product [Macaca fascicularis]
Length = 412
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 125/255 (49%), Positives = 169/255 (66%), Gaps = 13/255 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S IHYG+G++SG+ S+D V + G + V+ Q F EA ++P +TF
Sbjct: 131 YVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSAPSTAALGGVKVERQVFGEAIKQPGITF 190
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG
Sbjct: 191 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGT 250
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +Y+G +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 251 DSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 309
Query: 310 QVNHAIGATGIVSQE 324
++ AIGA ++ E
Sbjct: 310 ELQKAIGAVPLIQGE 324
>gi|115279794|gb|ABI85390.1| cathepsin D [Hippoglossus hippoglossus]
Length = 399
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 123/252 (48%), Positives = 168/252 (66%), Gaps = 6/252 (2%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYK 142
N+MDAQY+GEIGIGTPPQ FTV+FDTGSSNLW+PS C F++AC+ H +Y S +SSTY
Sbjct: 73 NFMDAQYYGEIGIGTPPQPFTVLFDTGSSNLWIPSIHCNLFNVACWLHHRYNSKKSSTYV 132
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
KNG I YG G+++G+ SED V + L V Q+F EA ++P +TF +A+FDG+LG+G+
Sbjct: 133 KNGTEFSIQYGRGSLTGYISEDTVSLAGLSVPGQQFAEAVKQPGITFAVARFDGVLGMGY 192
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
ISV K PV+ + + L+ + VFSF+ +R+A GGE++ GG DP +Y G+ YV
Sbjct: 193 PSISVDKVKPVFDSAMAAKLLPQNVFSFYISRDASATVGGELILGGTDPQYYTGDLHYVN 252
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
VT+K YWQ M V + Q T C GC AI D+GTSL+ GP + ++ AIGA ++
Sbjct: 253 VTRKAYWQIKMDGVEVGTQLT-LCKAGCQAIVDTGTSLIVGPREEVRALHRAIGALPLIM 311
Query: 323 QE----CKAVVS 330
E CK + S
Sbjct: 312 GEYLIDCKKIPS 323
>gi|148690790|gb|EDL22737.1| napsin A aspartic peptidase, isoform CRA_a [Mus musculus]
Length = 393
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 133/282 (47%), Positives = 176/282 (62%), Gaps = 14/282 (4%)
Query: 38 RKFDLNNRVAARLDSKE--GESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIG 95
++ L +R+ L+ E E RTS G+ V L +M+ QYFG IG
Sbjct: 4 QRIHLGHRILNPLNGWEQLAELSRTST----------SGGNPSFVPLSKFMNTQYFGTIG 53
Query: 96 IGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++ SS+++ NG I YGT
Sbjct: 54 LGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRFNPKASSSFRPNGTKFAIQYGT 113
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G +SG S+D++ IG + F EA EPSL F LA FDGILGLGF ++VG P
Sbjct: 114 GRLSGILSQDNLTIGGIHDAFVTFGEALWEPSLIFALAHFDGILGLGFPTLAVGGVQPPL 173
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
MV QGL+ +PVFSF+ NR+++ +GGE+V GG DP HY T++PVT YWQ M
Sbjct: 174 DAMVEQGLLEKPVFSFYLNRDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHME 233
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
V + G CA GC+AI D+GTSL+ GP+ I +N AIG
Sbjct: 234 SVKV-GTGLSLCAQGCSAILDTGTSLITGPSEEIRALNKAIG 274
>gi|21552717|gb|AAM62283.1|AF396662_1 cathepsin D preproprotein [Silurus asotus]
Length = 395
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 191/324 (58%), Gaps = 17/324 (5%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSI-RKYSLRGNLGE 73
CLLL + T + L RI LKK R R S G + S + + NLG
Sbjct: 5 CLLLLVFIAWTADA-LVRIPLKKF------RSIRRTMSDSGRAVEESRGNSQNTKYNLGV 57
Query: 74 SGD--ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFH 130
+ LKNY+DAQY+GEIG+GTP Q FTV+FDTGSSNLWVPS C + IAC H
Sbjct: 58 TNKFGPTPETLKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIACLLH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY +SSTY KNG + I YG+G++SG+ S+D IGD+ V+ Q F EA ++P + F+
Sbjct: 118 HKYNGAKSSTYVKNGTAFAIQYGSGSLSGYLSQDVCSIGDIAVEKQIFGEAIKQPGVAFI 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + I+V VP ++M++Q + VFSF+ NRN D + GGE++ GG D
Sbjct: 178 AAKFDGILGMAYPRIAV-DGVPPVFDMMSQKKFEKNVFSFYLNRNPDTQPGGELLLGGTD 236
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P Y G+ YV +T++ YWQ M D M G C GGC AI D+GTSL+ GP +
Sbjct: 237 PKFYTGDFHYVNITRQAYWQIHM-DGMSIGSQLSLCNGGCEAIVDTGTSLITGPAAEVKA 295
Query: 311 VNHAIGATGIVSQE----CKAVVS 330
+ AIGA ++ E CK V S
Sbjct: 296 LQKAIGAIPLIQGEYMVDCKKVPS 319
>gi|16119024|gb|AAL14708.1|AF420068_1 aspartic protease [Clonorchis sinensis]
Length = 419
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 193/305 (63%), Gaps = 19/305 (6%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD 76
LLF V+ +T + RI L K ++ R+ E +F TSIR + G++ E
Sbjct: 6 LLFWVLLTTSECSVIRIPLTGFK-NVRRRLMEVGTPVEQLNF-TSIR-FVGNGSIPE--- 59
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRS 135
L NY+DAQY+GEIGIGTPPQ+F V+FDTGSSNLWVPS C FSIAC+ H KY S
Sbjct: 60 ----ILNNYLDAQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIACWLHHKYDS 115
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+SSTY NG +I YG+G++SG S D+V +G + VK+Q F EA +EP + F+ AKFD
Sbjct: 116 AKSSTYMANGTEFNIRYGSGSVSGILSTDYVSVGTVTVKNQTFGEAMKEPGIAFVAAKFD 175
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+GF+ ISV ++ NM++QG F F +RN + GGE++ GG DP +YK
Sbjct: 176 GILGMGFKTISVDGVPTLFDNMISQG------FGFRLDRNRSDPVGGELLLGGTDPKYYK 229
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
GE + P+T + YWQF + D M G C GC AIAD+GTSL+AGP+ + ++N A+
Sbjct: 230 GEILWAPLTHEAYWQFKV-DSMNVGSMK-LCENGCQAIADTGTSLIAGPSEEVGKLNDAL 287
Query: 316 GATGI 320
GA I
Sbjct: 288 GAINI 292
>gi|13637914|sp|P80209.2|CATD_BOVIN RecName: Full=Cathepsin D; Flags: Precursor
Length = 390
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 181/286 (63%), Gaps = 25/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SST
Sbjct: 51 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 110
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + DIHYG+G++SG+ S+D V + G + V+ Q F EA ++P + F+
Sbjct: 111 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 170
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV++ VFSF+ NR+ + GGE++ GG D
Sbjct: 171 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 230
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y+G + VT++ YWQ M D + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 231 KYYRGSLMFHNVTRQAYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLIVGPVEEVREL 289
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S L +V+ KL + Y
Sbjct: 290 QKAIGAVPLIQGEYMIPCEKVSS----------LPEVTVKLGGKDY 325
>gi|169770745|ref|XP_001819842.1| vacuolar protease A [Aspergillus oryzae RIB40]
gi|238486794|ref|XP_002374635.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|21392388|dbj|BAC00850.1| pepsinogen [Aspergillus oryzae]
gi|83767701|dbj|BAE57840.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699514|gb|EED55853.1| aspartic endopeptidase Pep2 [Aspergillus flavus NRRL3357]
gi|391867458|gb|EIT76704.1| aspartyl protease [Aspergillus oryzae 3.042]
Length = 397
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/277 (49%), Positives = 180/277 (64%), Gaps = 19/277 (6%)
Query: 59 RTSIRKYSLRGN-LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVP 117
R +I++ L N + + G D++ + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP
Sbjct: 53 RPNIKQDLLNENPINDMGRHDVL-VDNFLNAQYFSEIEIGTPPQKFKVVLDTGSSNLWVP 111
Query: 118 SSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQE 177
SS+C SIACY H+KY S SSTY+KNG I YG+G++SGF S+D +KIGDL VKDQ
Sbjct: 112 SSECG-SIACYLHNKYDSSSSSTYQKNGSEFAIKYGSGSLSGFVSQDTLKIGDLKVKDQL 170
Query: 178 FIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD 237
F EAT EP L F +FDGILGLGF ISV K P +Y+M++QGL++EPVF+F+
Sbjct: 171 FAEATSEPGLAFAFGRFDGILGLGFDTISVNKIPPPFYSMLDQGLLDEPVFAFYLGDTNK 230
Query: 238 EEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAA 292
E + FGG+D DHY GE +P+ +K YW+ D +GD + + TG
Sbjct: 231 EGDDSVATFGGVDKDHYTGELVKIPLRRKAYWEVDLDAIALGDSVAELDNTG-------V 283
Query: 293 IADSGTSLLAGPTTIITQVNHAIGA----TGIVSQEC 325
I D+GTSL+A PTT+ +N IGA TG S +C
Sbjct: 284 ILDTGTSLIALPTTLAELINKEIGAKKGFTGQYSVDC 320
>gi|440899428|gb|ELR50729.1| Cathepsin D, partial [Bos grunniens mutus]
Length = 394
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 181/286 (63%), Gaps = 25/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SST
Sbjct: 55 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 114
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + DIHYG+G++SG+ S+D V + G + V+ Q F EA ++P + F+
Sbjct: 115 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 174
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV++ VFSF+ NR+ + GGE++ GG D
Sbjct: 175 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 234
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y+G + VT++ YWQ M D + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 235 KYYRGSLMFHNVTRQAYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLIVGPVEEVREL 293
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S L +V+ KL + Y
Sbjct: 294 QKAIGAVPLIQGEYMIPCEKVSS----------LPQVTVKLGGKDY 329
>gi|13928928|ref|NP_113858.1| napsin A aspartic peptidase precursor [Rattus norvegicus]
gi|6689137|emb|CAB65392.1| napsin [Rattus norvegicus]
gi|51260062|gb|AAH78790.1| Napsin A aspartic peptidase [Rattus norvegicus]
gi|149056039|gb|EDM07470.1| napsin A aspartic peptidase, isoform CRA_a [Rattus norvegicus]
Length = 420
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 123/251 (49%), Positives = 166/251 (66%), Gaps = 2/251 (0%)
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKY 133
G V L +M+ QYFG+IG+GTPPQNFTV+FDTGSSNLWVPS++C +FS+AC+FH ++
Sbjct: 59 GKTAFVPLSKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLACWFHHRF 118
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SS+++ NG I YGTG +SG S D++ IG + F EA EPSL F LA+
Sbjct: 119 NPKASSSFRPNGTKFAIQYGTGRLSGILSRDNLTIGGIHNVSVTFGEALWEPSLVFALAR 178
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGLGF ++VG P +V Q L+ +PVFSF+ NR+++ +GGE+V GG DPDH
Sbjct: 179 FDGILGLGFPTLAVGGVQPPLDALVEQRLLEKPVFSFYLNRDSEGSDGGELVLGGSDPDH 238
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y T++PVT YWQ M V + G CA GC AI D+GTSL+ GP+ I +N
Sbjct: 239 YVPPLTFIPVTIPAYWQVHMQSVKV-GTGLNLCAQGCGAILDTGTSLITGPSEEIRALNK 297
Query: 314 AIGATGIVSQE 324
A+G +++ +
Sbjct: 298 AVGGFPLLTGQ 308
>gi|301764903|ref|XP_002917936.1| PREDICTED: napsin-A-like [Ailuropoda melanoleuca]
Length = 406
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 166/254 (65%), Gaps = 3/254 (1%)
Query: 73 ESGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFH 130
GD I V L NYM+AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C+F S+ C+FH
Sbjct: 58 SPGDKPIFVPLSNYMNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFH 117
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
++ S SS++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 118 HRFNSKASSSFHPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFT 177
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
A FDG+LGLGF ++VG P +V+QGL+++PVFSF+ NR+ + +GGE+V GG D
Sbjct: 178 FAHFDGVLGLGFPILAVGGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSD 237
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY T++PVT YWQ M V + G CA GCAAI D+GTSL+ GPT I
Sbjct: 238 PAHYVPPLTFLPVTIPAYWQIHMERVNV-GTGLTLCAQGCAAILDTGTSLITGPTEEIQA 296
Query: 311 VNHAIGATGIVSQE 324
++ AIG ++ E
Sbjct: 297 LHAAIGGVSLLVGE 310
>gi|1507725|gb|AAB06575.1| aspartic protease, partial [Ancylostoma caninum]
Length = 442
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/227 (57%), Positives = 155/227 (68%), Gaps = 5/227 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSST 140
L+NYMDAQYFG I IGTP QNFTVIFDTGSSNLWVPS K F IAC +Y SG SST
Sbjct: 80 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSEKMPFHDIACMLRHRYDSGASST 139
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK++G+ I YGTG++ GF S+D+V I + ++Q F EAT EP LTF+ AKFDGILG+
Sbjct: 140 YKEDGRKMAIQYGTGSMKGFISKDNVCIAGICAEEQPFAEATSEPGLTFIAAKFDGILGI 199
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F EISV PV++ + Q V PVF+ W NRN D E GGEI GGMD Y T+
Sbjct: 200 TFPEISVLGVPPVFHTFIEQKKVPSPVFALWLNRNPDSELGGEITLGGMDTRRYVEPITW 259
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCA---GGCAAIADSGTSLLAGP 304
PVT++GYWQF M D + G T+ C GC AIAD+GTSL+AGP
Sbjct: 260 TPVTRRGYWQFKM-DKVQGGSTSIACPNEFSGCQAIADTGTSLIAGP 305
>gi|297462061|ref|XP_001790669.2| PREDICTED: napsin-A [Bos taurus]
gi|297485858|ref|XP_002695173.1| PREDICTED: napsin-A [Bos taurus]
gi|296477597|tpg|DAA19712.1| TPA: napsin A aspartic peptidase [Bos taurus]
Length = 408
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/279 (46%), Positives = 174/279 (62%), Gaps = 9/279 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L +YM+ QY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 68 VPLSDYMNVQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG ++G SED + IG + F EA EPSL F A FDGIL
Sbjct: 128 SSFRSNGTKFAIQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++VG P +V+QGL+++PVFSF+ NRN + +GGE+V GG DP HY
Sbjct: 188 GLGFPVLAVGGVRPPLDRLVDQGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT +WQ M V + G CA GCAAI D+GTSL+ GPT I + AIGA
Sbjct: 248 TFVPVTIPAFWQIHMERVQV-GTGLTLCARGCAAILDTGTSLITGPTEEIRALQKAIGAV 306
Query: 319 GIVSQ----ECKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
++ EC + + ++ LL V L+ + Y
Sbjct: 307 PLLMGKYYIECSKIPTL---PPVSFLLGGVWFNLTAQDY 342
>gi|115396430|ref|XP_001213854.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
gi|114193423|gb|EAU35123.1| vacuolar protease A precursor [Aspergillus terreus NIH2624]
Length = 397
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/251 (50%), Positives = 165/251 (65%), Gaps = 7/251 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 74 VLVDNFMNAQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACYLHNKYDSSASS 132
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF SED +KIGDL +K+Q F EAT EP L F +FDGILG
Sbjct: 133 TYKKNGTEFSIRYGSGSLSGFVSEDTLKIGDLTIKEQLFAEATNEPGLAFAFGRFDGILG 192
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF ISV + P +Y MVNQGL++EPVF+F+ E + FGG+D HY GE
Sbjct: 193 LGFDTISVNRIEPPFYKMVNQGLLDEPVFAFYLGDANKEGDESVATFGGVDKSHYTGELI 252
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + + +T G I D+GTSL+A P+ + +N IGA
Sbjct: 253 KIPLRRKAYWEVDLDAITLGDETADLENTGV--ILDTGTSLIALPSNLAEMINAQIGAKK 310
Query: 318 --TGIVSQECK 326
TG S +C+
Sbjct: 311 GFTGQYSVDCE 321
>gi|17389633|gb|AAH17842.1| Napsin A aspartic peptidase [Homo sapiens]
gi|123982255|gb|ABM82919.1| napsin A aspartic peptidase [synthetic construct]
gi|123997015|gb|ABM86109.1| napsin A aspartic peptidase [synthetic construct]
Length = 420
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/283 (48%), Positives = 174/283 (61%), Gaps = 4/283 (1%)
Query: 74 SGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
GD I V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H
Sbjct: 61 PGDKPIFVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHH 120
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 121 RFDPKASSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAF 180
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILGLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP
Sbjct: 181 AHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDP 240
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +
Sbjct: 241 AHYIPPLTFVPVTVPAYWQIHMERVKV-GPGLTLCAKGCAAILDTGTSLITGPTEEIRAL 299
Query: 312 NHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+ AIG +++ E + S+ + ++ LL V L+ Y
Sbjct: 300 HAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 342
>gi|302899226|ref|XP_003048007.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728939|gb|EEU42294.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 396
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/255 (50%), Positives = 170/255 (66%), Gaps = 20/255 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IG PPQ+F V+ DTGSSNLWVPS +C SIACY HSKY S SS
Sbjct: 76 VPISNFMNAQYFSEITIGNPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG +IHYG+G++SGF S D V IGDL +K Q+F EAT+EP L F +FDGILG
Sbjct: 135 TYKQNGSEFEIHYGSGSLSGFISNDDVSIGDLKIKGQDFAEATKEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y MVNQ L+++PVF+F+ AD+E E+VFGG+D HY+G+
Sbjct: 195 LGYDTISVNHIVPPFYQMVNQKLLDDPVFAFYL---ADQEGESEVVFGGVDKSHYEGDIE 251
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D +GD + + + TG AI D+GTSL P+ + +N
Sbjct: 252 YIPLRRKAYWEVDLDAIALGDEVAEQENTG-------AILDTGTSLNVLPSALAELLNKE 304
Query: 315 IGA----TGIVSQEC 325
IGA G + EC
Sbjct: 305 IGAKKGYNGQYTVEC 319
>gi|4758754|ref|NP_004842.1| napsin-A preproprotein [Homo sapiens]
gi|6225749|sp|O96009.1|NAPSA_HUMAN RecName: Full=Napsin-A; AltName: Full=Aspartyl protease 4;
Short=ASP4; Short=Asp 4; AltName: Full=Napsin-1;
AltName: Full=TA01/TA02; Flags: Precursor
gi|4154287|gb|AAD04917.1| napsin A [Homo sapiens]
gi|4235425|gb|AAD13215.1| napsin 1 precursor [Homo sapiens]
gi|6561818|gb|AAF17081.1| aspartyl protease 4 [Homo sapiens]
gi|119592253|gb|EAW71847.1| napsin A aspartic peptidase, isoform CRA_a [Homo sapiens]
Length = 420
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/283 (48%), Positives = 174/283 (61%), Gaps = 4/283 (1%)
Query: 74 SGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
GD I V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H
Sbjct: 61 PGDKPIFVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHH 120
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 121 RFDPKASSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAF 180
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILGLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP
Sbjct: 181 AHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDP 240
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +
Sbjct: 241 AHYIPPLTFVPVTVPAYWQIHMERVKV-GPGLTLCAKGCAAILDTGTSLITGPTEEIRAL 299
Query: 312 NHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+ AIG +++ E + S+ + ++ LL V L+ Y
Sbjct: 300 HAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 342
>gi|299522|gb|AAB26186.1| cathepsin D {EC 3.4.23.5} [cattle, Peptide Partial, 346 aa]
Length = 346
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 181/286 (63%), Gaps = 25/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SST
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 66
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + DIHYG+G++SG+ S+D V + G + V+ Q F EA ++P + F+
Sbjct: 67 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 126
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV++ VFSF+ NR+ + GGE++ GG D
Sbjct: 127 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 186
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y+G + VT++ YWQ M D + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 187 KYYRGSLMFHNVTRQAYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLIVGPVEEVREL 245
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S L +V+ KL + Y
Sbjct: 246 QKAIGAVPLIQGEYMIPCEKVSS----------LPEVTVKLGGKDY 281
>gi|313226363|emb|CBY21507.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 123/247 (49%), Positives = 169/247 (68%), Gaps = 4/247 (1%)
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKY 133
GD + NYMDAQY+G I IGTPPQ F+VIFDTGSSNLWVPS+KC F+ +AC H KY
Sbjct: 57 GDGHSEPITNYMDAQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNVACLLHRKY 116
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
S S+++K +G+ I YG+G++SGF S D V++ + V+DQ+F EA EP +TF+ AK
Sbjct: 117 DSQSSTSWKADGQEFAIQYGSGSLSGFCSTDAVEVAGVWVQDQKFAEAVEEPGITFVAAK 176
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGI+GLG+ I+V K P NM+ QGL+++ +FSF+ NR A+ E+GGE+ GG+D
Sbjct: 177 FDGIMGLGYPSIAVNKITPPVNNMIEQGLLSDGMFSFFLNRTANAEDGGELTIGGVDNSR 236
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG---GCAAIADSGTSLLAGPTTIITQ 310
+ G+ ++ VT++ YWQ M + + G+ C G GC I DSGTSLLA P + +
Sbjct: 237 FTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGCQVIVDSGTSLLAVPKNLAEE 296
Query: 311 VNHAIGA 317
+NHAIGA
Sbjct: 297 INHAIGA 303
>gi|355566182|gb|EHH22561.1| Cathepsin D [Macaca mulatta]
Length = 450
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 132/289 (45%), Positives = 181/289 (62%), Gaps = 27/289 (9%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SST
Sbjct: 109 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHHKYNSDKSST 168
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI-----------GDLVVKDQEFIEATREPSLTF 189
Y KNG S IHYG+G++SG+ S+D V + G + V+ Q F EA ++P +TF
Sbjct: 169 YVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVFGEAIKQPGITF 228
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ AKFDGILG+ + ISV +PV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG
Sbjct: 229 IAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIFSFYLNRDPTAQPGGELMLGGT 288
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D +Y+G +Y+ VT+K YWQ + V + T C GC AI D+GTSL+ GP +
Sbjct: 289 DSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLT-LCKEGCEAIVDTGTSLMVGPVDEVR 347
Query: 310 QVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
++ AIGA ++ E C+ V + L ++ KL + YK
Sbjct: 348 ELQKAIGAVPLIQGEYMIPCEKVST----------LPTITLKLGGKGYK 386
>gi|261194088|ref|XP_002623449.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239588463|gb|EEQ71106.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239606974|gb|EEQ83961.1| aspartyl proteinase [Ajellomyces dermatitidis ER-3]
gi|327354563|gb|EGE83420.1| aspartyl proteinase [Ajellomyces dermatitidis ATCC 18188]
Length = 398
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 125/238 (52%), Positives = 162/238 (68%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQY+ EI IGTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 75 VLVDNFLNAQYYSEITIGTPPQTFKVVLDTGSSNLWVPSSEC-GSIACYLHNKYDSSTSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+KNG I YG+G++SGF S+D V+IGDL +K Q F EAT EP L F +FDGILG
Sbjct: 134 TYQKNGSEFAIRYGSGSLSGFVSQDTVRIGDLTIKSQLFAEATNEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K P +Y MVNQGL++EPVFSF+ E++ E VFGG++ DHY GE
Sbjct: 194 LGYDTISVNKIPPPFYEMVNQGLLDEPVFSFYLGDANIEDDDSEAVFGGINKDHYTGELV 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+P+ +K YW+ D+ + +T G I D+GTSL+A P+T+ +N IGA
Sbjct: 254 MIPLRRKAYWEVDLDAITFGKETAQLENTG--VILDTGTSLIALPSTLAELLNKEIGA 309
>gi|326523981|dbj|BAJ97001.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 119/240 (49%), Positives = 168/240 (70%), Gaps = 3/240 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L +YM+AQY+ EIGIGTPPQ F V+ DTGSSNLWVPS++C SIAC+ H ++ + +SS
Sbjct: 93 VPLTDYMNAQYYAEIGIGTPPQPFGVVMDTGSSNLWVPSTRCS-SIACWLHRRFDATKSS 151
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T+K+NG I YG+G++ G S D V IGDL + + +F E+T+EP + F L KFDGI+G
Sbjct: 152 TFKENGTDFAIRYGSGSLEGVISTDTVTIGDLELTETDFGESTKEPGIAFALGKFDGIMG 211
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ I+V + VP +Y M+NQ L+++P+F+FW + N D E GGE+VFG +D DHY+G+
Sbjct: 212 LGYDTIAVQQVVPPFYQMINQKLIDKPLFTFWLGDTNKDAENGGELVFGEIDKDHYEGDI 271
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
Y PV +KGYW+ +++I+ + F AAI D+GTSL+A PT +N +GAT
Sbjct: 272 VYAPVVRKGYWEVKFNELLINDEPADFLGNATAAI-DTGTSLIACPTEAAETINTMLGAT 330
>gi|30575834|gb|AAP32823.1| aspartyl proteinase [Paracoccidioides brasiliensis]
Length = 400
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 123/238 (51%), Positives = 162/238 (68%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIACY HSKY S SS
Sbjct: 76 VLVDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACYLHSKYDSSASS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T++KNG I YG+G++SGF S+D ++IGD+ V+ Q+F EAT EP L F +FDGILG
Sbjct: 135 THRKNGTEFAIRYGSGSLSGFVSQDVLRIGDMTVESQDFAEATSEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV + VP +Y MVNQGL++EPVFSF+ + + + E FGG+D DHY G T
Sbjct: 195 LGYDTISVNRIVPTFYLMVNQGLLDEPVFSFYLGNSDTDGDDSEATFGGIDKDHYTGNLT 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ + +K YW+ D+ + +T G I D+GTSLLA P+T+ +N IGA
Sbjct: 255 MISLRRKAYWEVDLDAITFGSETAELENTGV--ILDTGTSLLALPSTVAEILNQKIGA 310
>gi|281348334|gb|EFB23918.1| hypothetical protein PANDA_006240 [Ailuropoda melanoleuca]
Length = 379
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 166/254 (65%), Gaps = 3/254 (1%)
Query: 73 ESGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFH 130
GD I V L NYM+AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C+F S+ C+FH
Sbjct: 33 SPGDKPIFVPLSNYMNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFH 92
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
++ S SS++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 93 HRFNSKASSSFHPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFT 152
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
A FDG+LGLGF ++VG P +V+QGL+++PVFSF+ NR+ + +GGE+V GG D
Sbjct: 153 FAHFDGVLGLGFPILAVGGVRPPLDTLVDQGLLDKPVFSFYLNRDPEAADGGELVLGGSD 212
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY T++PVT YWQ M V + G CA GCAAI D+GTSL+ GPT I
Sbjct: 213 PAHYVPPLTFLPVTIPAYWQIHMERVNV-GTGLTLCAQGCAAILDTGTSLITGPTEEIQA 271
Query: 311 VNHAIGATGIVSQE 324
++ AIG ++ E
Sbjct: 272 LHAAIGGVSLLVGE 285
>gi|114678580|ref|XP_524345.2| PREDICTED: napsin-A isoform 4 [Pan troglodytes]
Length = 420
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/283 (48%), Positives = 174/283 (61%), Gaps = 4/283 (1%)
Query: 74 SGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
GD I V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H
Sbjct: 61 PGDKTIFVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHH 120
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 121 RFDPKASSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAF 180
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILGLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP
Sbjct: 181 AHFDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDP 240
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +
Sbjct: 241 AHYIPPLTFVPVTVPAYWQIHMERVKV-GPGLTLCAQGCAAILDTGTSLITGPTEEIRAL 299
Query: 312 NHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+ AIG +++ E + S+ + ++ LL V L+ Y
Sbjct: 300 HAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 342
>gi|74220823|dbj|BAE31380.1| unnamed protein product [Mus musculus]
Length = 404
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 195/313 (62%), Gaps = 23/313 (7%)
Query: 33 IGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
I + RKF R + S E + I KYS++ + ++ + LKNY+DAQY+
Sbjct: 22 IRIPLRKFTSIRRTMTEVGGSVEDLILKGPITKYSMQSS-PKTTEPVSELLKNYLDAQYY 80
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG S DI
Sbjct: 81 GDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSFDI 140
Query: 151 HYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
HYG+G++S + S+D V K + V+ Q F EAT++P + F+ AKFDGILG+G
Sbjct: 141 HYGSGSLSRYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDGILGMG 200
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y GE +Y+
Sbjct: 201 YPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHGELSYL 260
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
VT+K YW ++G+ + C GGC AI D+GTSLL GP + ++ AIGA ++
Sbjct: 261 NVTRKAYW-LEVGNELT------LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLI 313
Query: 322 SQE----CKAVVS 330
E C+ V S
Sbjct: 314 QGEYMIPCEKVSS 326
>gi|157423181|gb|AAI53793.1| Cathepsin E2 [Xenopus laevis]
Length = 397
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 189/306 (61%), Gaps = 9/306 (2%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
L LLLF V F GL R+ LK++K + R + K + +
Sbjct: 4 ILVLLLF-VTFVY---GLIRVPLKRQK---SIRKTLKEKGKLSHVWTQQGIDMVQYTDSC 56
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ A L NYMD QYFGEI IGTPPQNFTVIFDTGSSNLWVPS C S AC H++
Sbjct: 57 NNDQAPSEPLINYMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNR 115
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
++ SSTY+ NG + + YGTG++SG D V + ++V++Q+F E+ EP TF+ A
Sbjct: 116 FQPQLSSTYESNGNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDA 175
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILGLG+ I+VG PV+ NM+ Q LV P+FS + +R+ + GGE+VFGG D
Sbjct: 176 SFDGILGLGYPSIAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDAS 235
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+ G+ +VPVT +GYWQ + ++ I+G+ FC+GGC AI D+GTS++ GP++ I Q+
Sbjct: 236 RFSGQLNWVPVTNQGYWQIQLDNIQINGEVV-FCSGGCQAIVDTGTSMITGPSSDIVQLQ 294
Query: 313 HAIGAT 318
IGA+
Sbjct: 295 SIIGAS 300
>gi|47213062|emb|CAF91576.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 124/256 (48%), Positives = 168/256 (65%), Gaps = 13/256 (5%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSS 139
AL NY+DAQY+GEIG+GTPPQ FTV+FDTGSSNLWVPS C IAC H KY S +SS
Sbjct: 56 ALTNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHRKYNSAKSS 115
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHV-----------KIGDLVVKDQEFIEATREPSLT 188
TY KNG + I YG+G++SG+ S+D ++G L V+ Q F EA ++P +
Sbjct: 116 TYVKNGTAFAIRYGSGSLSGYLSQDTCTVRACDPCPFFQVGGLAVEKQLFGEAIKQPGIA 175
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ AKFDGILG+G+ ISV PV+ N+++Q V + VFSF+ NRN + GGE++ GG
Sbjct: 176 FIAAKFDGILGMGYPRISVDGVAPVFDNIMSQKKVEKNVFSFYLNRNPQTQPGGELLLGG 235
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
DP +Y G+ +YV VT++ YWQ + ++ + Q T C GC AI D+GTSLL GP+ +
Sbjct: 236 TDPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQLT-LCKSGCEAIVDTGTSLLTGPSEEV 294
Query: 309 TQVNHAIGATGIVSQE 324
+ AIGA ++ E
Sbjct: 295 RSLQKAIGALPLIQGE 310
>gi|407924694|gb|EKG17726.1| Peptidase A1 [Macrophomina phaseolina MS6]
Length = 378
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 124/238 (52%), Positives = 161/238 (67%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF E+ +GTPPQ F VI DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 55 VPVTNFLNAQYFSEVSLGTPPQTFKVILDTGSSNLWVPSSECG-SIACYLHTKYDSSASS 113
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG + +I YG+G++SGF S D IGDL VKDQ+F EAT EP L F +FDGILG
Sbjct: 114 TYSKNGSTFEIRYGSGSLSGFVSNDVFTIGDLTVKDQDFAEATSEPGLAFAFGRFDGILG 173
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM++QGL++EPVF+F+ + DE FGG+D HY G+ T
Sbjct: 174 LGYDTISVNHIVPPFYNMIDQGLLDEPVFAFYLSDTNDEGSESVATFGGIDESHYTGKLT 233
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+P+ +K YW+ D+ + T G AI D+GTSL+A P+T+ +N IGA
Sbjct: 234 KIPLRRKAYWEVDLDSITFGDATAELDNTG--AILDTGTSLIALPSTLAELLNKEIGA 289
>gi|387915422|gb|AFK11320.1| cathepsin E-A-like protein [Callorhinchus milii]
Length = 401
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 131/278 (47%), Positives = 177/278 (63%), Gaps = 9/278 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMDAQY+GEIGIGTP Q FTV+FDTGSSNLWVPS+ C S AC H +++S S+TY
Sbjct: 74 LSNYMDAQYYGEIGIGTPLQKFTVVFDTGSSNLWVPSAYC-ISEACKMHEQFKSFHSTTY 132
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
G I YGTG ++G +D V+IG++ ++ QEF E+ EP TF +A+FDGILGLG
Sbjct: 133 APRGNQFSIRYGTGQLAGVLGKDMVRIGNITIRAQEFGESVFEPGSTFAVAQFDGILGLG 192
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+ G A+PV+ M++Q LV EP+FS NR D + GGE++ GG++ + Y G +V
Sbjct: 193 YPSIAEGGALPVFDRMMHQNLVVEPIFSVLINREMDSDYGGELLLGGINHECYTGSINWV 252
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT++GYWQ M +V IDG T C GCAAI D+GTSL+ GP I +++ +GA +
Sbjct: 253 PVTERGYWQIRMDNVKIDGMLT-LCINGCAAIVDTGTSLITGPEKEIRKLHKQLGAMSVG 311
Query: 322 SQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYKK 355
E CK + S + + +V LS Y K
Sbjct: 312 DGEYVVDCKRISSMAS---VTFTIGEVEFSLSPNDYVK 346
>gi|440475206|gb|ELQ43907.1| vacuolar protease A [Magnaporthe oryzae Y34]
Length = 395
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 191/318 (60%), Gaps = 30/318 (9%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKK-------RKFDLNNR--------VAARLDSKEGESFR 59
++ V+ T G++++ +KK + FDLN + + R +S + F
Sbjct: 4 AMMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFS 63
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
+ S GN V + N+M+AQYF EI IGTPPQNF VI DTGSSNLWVPSS
Sbjct: 64 NDAVQAS--GN-------HPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSS 114
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
C SIACY H+KY S SSTYKKNG I YG+G++ GF S D + IGDL +K+ +F
Sbjct: 115 SCG-SIACYLHNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDFMTIGDLKIKNLDFA 173
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP L F +FDGILG+GF +SV K VP +Y MV+Q L++EPVF+F+ ADE+
Sbjct: 174 EATKEPGLAFAFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEK 230
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
E+VFGG++ DH G+ T +P+ +K YW+ D+ + + + G I D+GTS
Sbjct: 231 SESEVVFGGVNKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTS 288
Query: 300 LLAGPTTIITQVNHAIGA 317
L+A P+ + +N IGA
Sbjct: 289 LIALPSQLAELLNSQIGA 306
>gi|397485038|ref|XP_003813670.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/283 (48%), Positives = 173/283 (61%), Gaps = 4/283 (1%)
Query: 74 SGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHS 131
GD I V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H
Sbjct: 61 PGDKTIFVPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHH 120
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 121 RFDPKASSSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAF 180
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A FDGILGLGF +SV P +V QGL+ +PVFSF+ NR+ +E +GGE+V GG DP
Sbjct: 181 AHFDGILGLGFPILSVEGVRPPMDVLVEQGLLEKPVFSFYLNRDPEEPDGGELVLGGSDP 240
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +
Sbjct: 241 AHYIPPLTFVPVTVPAYWQIHMERVKV-GPGLTLCAQGCAAILDTGTSLITGPTEEIRAL 299
Query: 312 NHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+ AIG +++ E + S+ + ++ LL V L+ Y
Sbjct: 300 HAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 342
>gi|358372259|dbj|GAA88863.1| aspartic protease (PepE) [Aspergillus kawachii IFO 4308]
Length = 398
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 134/300 (44%), Positives = 181/300 (60%), Gaps = 7/300 (2%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L ++ L+++ + N R ++ R SI K + N V + N+++AQ
Sbjct: 25 LNKVPLEEQLYTHNIDAHVRALGQKYMGIRPSIHKELVEENPINDMSRHDVLVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SSTY KNG
Sbjct: 85 YFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACYLHNKYDSSASSTYHKNGSEFA 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G++SGF S+D +KIGDL VK Q+F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 IKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +YNM++QGL++EPVF+F+ E + FGG+D DHY GE +P+ +K YW
Sbjct: 204 IVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDDSVATFGGVDKDHYTGELIKIPLRRKAYW 263
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQEC 325
+ D+ + + G I D+GTSL+A P + +N IGA TG + +C
Sbjct: 264 EVDLDAIALGDDVAELDNTGV--ILDTGTSLIALPADLAEMINAQIGAKKGWTGQYTVDC 321
>gi|327278613|ref|XP_003224055.1| PREDICTED: cathepsin E-like [Anolis carolinensis]
Length = 396
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 135/307 (43%), Positives = 188/307 (61%), Gaps = 14/307 (4%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGE 73
C +LF GL R+ LK+ K L N + R + SK +S++ +Y+ + +
Sbjct: 9 CFILF-------VSGLQRVPLKRHK-SLRNILRERGELSKFWKSYKVDNIQYTQDCSAFQ 60
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
+ L NY D +YFGEI IGTPPQNFTV+FDTGSSNLWVPS C S AC HS++
Sbjct: 61 EANE---PLLNYFDVEYFGEISIGTPPQNFTVLFDTGSSNLWVPSVYCA-SKACVEHSRF 116
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SSTY + G S IHYGTG+++G D V + + V +Q+F E+ EP TFL ++
Sbjct: 117 HPTESSTYNEVGTSFSIHYGTGSLTGIIGMDSVTVEGITVTNQQFAESVSEPGKTFLDSE 176
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGL + ++V PV+ NM+ Q LV P+FS + +RN D GGE++FGG DP
Sbjct: 177 FDGILGLAYPSLAVDGVTPVFDNMMAQNLVELPLFSVYLSRNPDSSIGGELIFGGYDPSL 236
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
+ G ++PV++KGYWQ + ++ + G T FCA GC AI D+GTSL+ GP+ I Q+ +
Sbjct: 237 FSGNLNWIPVSKKGYWQIQLDNIQVGG-TIAFCAEGCQAIVDTGTSLITGPSDDIKQMQN 295
Query: 314 AIGATGI 320
IGA +
Sbjct: 296 LIGAQPV 302
>gi|440898030|gb|ELR49612.1| Napsin-A, partial [Bos grunniens mutus]
Length = 406
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 129/279 (46%), Positives = 174/279 (62%), Gaps = 9/279 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L +YM+ QY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 68 VPLSDYMNVQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPCWLHHRFNPKAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG ++G SED + IG + F EA EPSL F A FDGIL
Sbjct: 128 SSFRSNGTKFAIQYGTGRLAGILSEDKLTIGGITGATVTFGEALWEPSLVFTFAHFDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++VG P +V++GL+++PVFSF+ NRN + +GGE+V GG DP HY
Sbjct: 188 GLGFPVLAVGGVRPPLDRLVDRGLLDKPVFSFYLNRNPEAADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT +WQ M V + G CA GCAAI D+GTSL+ GPT I + AIGA
Sbjct: 248 TFVPVTIPAFWQIHMERVQV-GTGLTLCARGCAAILDTGTSLITGPTEEIRALQKAIGAV 306
Query: 319 GIVSQ----ECKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
++ EC + + ++ LL V L+ + Y
Sbjct: 307 PLLMGKYYIECSKIPTL---PPVSFLLGGVWFNLTAQDY 342
>gi|148236737|ref|NP_001079044.1| cathepsin E-B precursor [Xenopus laevis]
gi|46395760|sp|Q805F2.1|CATEB_XENLA RecName: Full=Cathepsin E-B; Flags: Precursor
gi|28460655|dbj|BAC57454.1| cathepsin E2 [Xenopus laevis]
Length = 397
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 121/238 (50%), Positives = 165/238 (69%), Gaps = 2/238 (0%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L NYMD QYFGEI IGTPPQNFTVIFDTGSSNLWVPS C S AC H++++ SST
Sbjct: 65 PLINYMDVQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQLSST 123
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+ NG + + YGTG++SG D V + ++V++Q+F E+ EP TF+ A FDGILGL
Sbjct: 124 YESNGNNFSLQYGTGSLSGVIGIDSVTVEGILVQNQQFGESVSEPGSTFVDASFDGILGL 183
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+VG PV+ NM+ Q LV P+FS + +R+ + GGE+VFGG D + G+ +
Sbjct: 184 GYPSIAVGGCTPVFDNMIAQNLVELPMFSVYMSRDPNSPVGGELVFGGFDASRFSGQLNW 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPVT +GYWQ + ++ I+G+ FC+GGC AI D+GTS++ GP++ I Q+ IGA+
Sbjct: 244 VPVTNQGYWQIQLDNIQINGEVV-FCSGGCQAIVDTGTSMITGPSSDIVQLQSIIGAS 300
>gi|281207795|gb|EFA81975.1| cathepsin D [Polysphondylium pallidum PN500]
Length = 390
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 134/249 (53%), Positives = 172/249 (69%), Gaps = 5/249 (2%)
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI-ACYF 129
LG SG + + +Y DAQY+G I IGTP Q+F V+FDTGSSNLW+PS KC ++ AC
Sbjct: 50 LGASGTT--IPISDYEDAQYYGAITIGTPAQSFKVVFDTGSSNLWIPSKKCPVTVVACDL 107
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HSKY S +SS+Y NG S I YG+GA+SGF S+D V++G L V++Q F EAT EP + F
Sbjct: 108 HSKYDSSKSSSYVANGTSFSIQYGSGAMSGFVSQDTVQVGSLTVQNQLFAEATAEPGIAF 167
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
LAKFDGILGL FQ ISV PV+YNM+ QGLV +PVF+FW ++ GGE+ FG +
Sbjct: 168 DLAKFDGILGLAFQSISVNSIPPVFYNMMAQGLVQQPVFAFWLSK-VPGANGGELTFGSI 226
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC-AGGCAAIADSGTSLLAGPTTII 308
D Y G TYVP+T + YW+F M D ++G + G+C A GC AI DSGTSL+AGP+ I
Sbjct: 227 DTTRYTGPITYVPLTNETYWEFKMDDFALNGNSLGYCGADGCHAICDSGTSLIAGPSAQI 286
Query: 309 TQVNHAIGA 317
+N +GA
Sbjct: 287 NALNTKLGA 295
>gi|403299328|ref|XP_003940441.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 421
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 131/276 (47%), Positives = 173/276 (62%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFGEIG+G PPQNFTV+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 69 VPLSNYRDVQYFGEIGLGMPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKAS 128
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YG+G + G SED + IG + F EA EPSL F A FDGIL
Sbjct: 129 SSFQPNGTKFAIQYGSGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFTFAHFDGIL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P +V QGL+++PVFSF+FNR+ ++ +GGE+V GG DP HY
Sbjct: 189 GLGFPVLAVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEKPDGGELVLGGSDPAHYIPPL 248
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I +N AIG
Sbjct: 249 TFVPVTVPAYWQIHMERVKV-GSGLTLCARGCAAILDTGTSLITGPTEEIQALNAAIGGF 307
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL V L+ + Y
Sbjct: 308 PLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAQDY 343
>gi|384498765|gb|EIE89256.1| endopeptidase [Rhizopus delemar RA 99-880]
Length = 401
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 193/319 (60%), Gaps = 22/319 (6%)
Query: 12 FFLCLLLFPVVFST-PNGGLYRIGLKK------RKFDLNNRVAARLDSKEGESFRTSIRK 64
FF L+ FS + + +I +KK K + L K S R +
Sbjct: 3 FFALSLVVSAAFSVFTDAAITKIPIKKVHETATEKLSRYSHTGEYLTQKYFNSQRNNQPM 62
Query: 65 YSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS 124
+ + N S + V L NY++AQY+GEI IGTPPQ FTV+FDTGSSNLWVPS+ C S
Sbjct: 63 ETFKLNPDGSANHG-VPLSNYLNAQYYGEIEIGTPPQPFTVVFDTGSSNLWVPSTHCT-S 120
Query: 125 IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATRE 184
IAC+ H +Y S S TY +NG I YGTG++ GF S+D + +G + V+DQ F E+T+E
Sbjct: 121 IACFLHKRYDSASSRTYSENGTEFAIQYGTGSLEGFISQDTLSVGGIQVEDQGFAESTKE 180
Query: 185 PSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR-NADEEEGGE 243
P LTF AKFDGI GLG+ ISV +P +Y+MVN+ LV+EP+FSFW N N D++ GGE
Sbjct: 181 PGLTFAFAKFDGIFGLGYDTISVKHTIPPFYHMVNRDLVDEPLFSFWLNDANKDQDNGGE 240
Query: 244 IVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGT 298
++FGG+D DH++G+ + V +KGYW+ M GD +D G AAI D+G+
Sbjct: 241 LIFGGVDEDHFEGDIHWSDVRRKGYWEITMENIKFGDDYVDIDPVG------AAI-DTGS 293
Query: 299 SLLAGPTTIITQVNHAIGA 317
SLL PTT+ +N +GA
Sbjct: 294 SLLVAPTTVAALINKELGA 312
>gi|389640809|ref|XP_003718037.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|58257401|gb|AAW69322.1| vacuolar protease A-like protein [Magnaporthe grisea]
gi|351640590|gb|EHA48453.1| vacuolar protease A [Magnaporthe oryzae 70-15]
gi|440487134|gb|ELQ66940.1| vacuolar protease A [Magnaporthe oryzae P131]
Length = 395
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 191/318 (60%), Gaps = 30/318 (9%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKK-------RKFDLNNR--------VAARLDSKEGESFR 59
++ V+ T G++++ +KK + FDLN + + R +S + F
Sbjct: 4 AMMTAAVLLGTAEAGVHKLKMKKIPLEDQLKTFDLNAQMRGLGQKYLGIRPESHQQAVFS 63
Query: 60 TSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
+ S GN V + N+M+AQYF EI IGTPPQNF VI DTGSSNLWVPSS
Sbjct: 64 NDAVQAS--GN-------HPVPISNFMNAQYFSEITIGTPPQNFKVILDTGSSNLWVPSS 114
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
C SIACY H+KY S SSTYKKNG I YG+G++ GF S D + IGDL +K+ +F
Sbjct: 115 SC-GSIACYLHNKYESSSSSTYKKNGTEFKIQYGSGSMEGFVSNDVMTIGDLKIKNLDFA 173
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
EAT+EP L F +FDGILG+GF +SV K VP +Y MV+Q L++EPVF+F+ ADE+
Sbjct: 174 EATKEPGLAFAFGRFDGILGMGFDRLSVNKIVPPFYAMVDQKLIDEPVFAFYL---ADEK 230
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
E+VFGG++ DH G+ T +P+ +K YW+ D+ + + + G I D+GTS
Sbjct: 231 SESEVVFGGVNKDHIDGKITEIPLRRKAYWEVDLDAIALGDEVAELDNTGV--ILDTGTS 288
Query: 300 LLAGPTTIITQVNHAIGA 317
L+A P+ + +N IGA
Sbjct: 289 LIALPSQLAELLNSQIGA 306
>gi|426389739|ref|XP_004061277.1| PREDICTED: napsin-A-like [Gorilla gorilla gorilla]
Length = 420
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 133/276 (48%), Positives = 171/276 (61%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 68 VPLLNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHDRFDPKAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG + G SED + IG + F EA EPSL F A FDGIL
Sbjct: 128 SSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP HY
Sbjct: 188 GLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 248 TFVPVTVPAYWQIHMERVKV-GPGLTLCAQGCAAILDTGTSLITGPTEEIRALHAAIGGI 306
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL V L+ Y
Sbjct: 307 PLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 342
>gi|426198518|gb|EKV48444.1| hypothetical protein AGABI2DRAFT_192052 [Agaricus bisporus var.
bisporus H97]
Length = 413
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 130/255 (50%), Positives = 169/255 (66%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI IG+PPQ F VI DTGSSNLWVPS KC SIAC+ H+KY SG+SS
Sbjct: 92 VPLSNFMNAQYFTEIQIGSPPQTFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSS 150
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG + +I YG+GA+ GF S+D ++IGDL +K Q+F EAT+EP L F KFDGILG
Sbjct: 151 TYKANGSTFEIQYGSGAMEGFVSQDQLQIGDLTIKGQDFAEATKEPGLAFAFGKFDGILG 210
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y M+ Q L++E VFS F + +E+GGE VFGG+D YKG+
Sbjct: 211 LGYDTISVNHIVPPFYKMIEQNLLDERVFS--FRLGSSDEDGGEAVFGGIDESAYKGKMH 268
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YVP+ QK YW+ + + + G+ G A D+GTSL+A P+ + +N IGA
Sbjct: 269 YVPIRQKAYWEVQLDKISLGGEELELENTGAA--IDTGTSLIALPSDMAEMLNTQIGAKK 326
Query: 318 --TGIVSQECKAVVS 330
G + +C V S
Sbjct: 327 SWNGQYTIDCAKVAS 341
>gi|119592254|gb|EAW71848.1| napsin A aspartic peptidase, isoform CRA_b [Homo sapiens]
Length = 357
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 133/276 (48%), Positives = 171/276 (61%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 5 VPLSNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKAS 64
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG + G SED + IG + F EA EPSL F A FDGIL
Sbjct: 65 SSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPSLVFAFAHFDGIL 124
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP HY
Sbjct: 125 GLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPL 184
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 185 TFVPVTVPAYWQIHMERVKV-GPGLTLCAKGCAAILDTGTSLITGPTEEIRALHAAIGGI 243
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL V L+ Y
Sbjct: 244 PLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAHDY 279
>gi|242781757|ref|XP_002479865.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
gi|218720012|gb|EED19431.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
Length = 395
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 126/244 (51%), Positives = 162/244 (66%), Gaps = 13/244 (5%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQNF V+ DTGSSNLWVPS+ C SIACY H+KY S SS
Sbjct: 72 VLVDNFLNAQYFSEITIGTPPQNFKVVLDTGSSNLWVPSASCN-SIACYLHNKYDSSSSS 130
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++ GF S D V IGD+ +KDQ+F EAT EP L F +FDGILG
Sbjct: 131 TYKKNGSEFAIQYGSGSLEGFVSRDVVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILG 190
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF ISV K VP +YNM+NQ ++EPVF+F+ + E + E FGG+D HY GE
Sbjct: 191 LGFDTISVNKIVPPFYNMLNQKTLDEPVFAFYLGDSNKEGDNSEATFGGIDKSHYTGELV 250
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
+P+ +K YW+ D GD + + + TG I D+GTSL+A P+T+ +N
Sbjct: 251 KIPLRRKAYWEVDFDAVAFGDNVAELENTG-------VILDTGTSLIALPSTLAELLNKE 303
Query: 315 IGAT 318
IGA+
Sbjct: 304 IGAS 307
>gi|453084572|gb|EMF12616.1| aspartyl proteinase [Mycosphaerella populorum SO2202]
Length = 396
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 125/252 (49%), Positives = 172/252 (68%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
VA+ N+++AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY HSKY G S+
Sbjct: 74 VAVSNFLNAQYFSEIAVGTPPQEFKVVLDTGSSNLWVPSSEC-GSIACYLHSKYNHGDSN 132
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG I YG+G++ G+ S+D V+IGDL +KDQ F EAT EP L F +FDGI+G
Sbjct: 133 TYKQNGSEFAIRYGSGSLEGYVSQDTVQIGDLKIKDQLFAEATSEPGLAFAFGRFDGIMG 192
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P +YNM++QGL++E VF+F+ + + D+ + E +FGG++ DHY G+ T
Sbjct: 193 LGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLS-STDKGDESEAIFGGVNKDHYTGDMT 251
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + QT A G AI D+GTSL+A P+T+ +N IGA
Sbjct: 252 KIPLRRKAYWEVDLDAITFGKQTAEIDATG--AILDTGTSLIALPSTLAELLNKEIGAKK 309
Query: 318 --TGIVSQECKA 327
G + +C A
Sbjct: 310 SYNGQYTVDCSA 321
>gi|145232965|ref|XP_001399855.1| vacuolar protease A [Aspergillus niger CBS 513.88]
gi|134056777|emb|CAK37685.1| aspartic protease pepE-Aspergillus niger
Length = 398
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 185/305 (60%), Gaps = 17/305 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L ++ L+++ + N R ++ R SI K + N V + N+++AQ
Sbjct: 25 LNKVPLEEQLYTHNIDAHVRALGQKYMGIRPSIHKELVEENPINDMSRHDVLVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SSTY KNG
Sbjct: 85 YFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACYLHNKYDSSASSTYHKNGSEFA 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G++SGF S+D +KIGDL VK Q+F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 IKYGSGSLSGFISQDTLKIGDLKVKGQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +YNM++QGL++EPVF+F+ E + FGG+D DHY GE +P+ +K YW
Sbjct: 204 IVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDESVATFGGVDKDHYTGELIKIPLRRKAYW 263
Query: 270 QFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGI 320
+ + +GD + + + TG I D+GTSL+A P + +N IGA TG
Sbjct: 264 EVELDAIALGDDVAEMENTG-------VILDTGTSLIALPADLAEMINAQIGAKKGWTGQ 316
Query: 321 VSQEC 325
+ +C
Sbjct: 317 YTVDC 321
>gi|325087547|gb|EGC40857.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H88]
Length = 398
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 132/300 (44%), Positives = 188/300 (62%), Gaps = 7/300 (2%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L +I L ++ ++N R ++ + + + G+ ++ + + N+++AQ
Sbjct: 25 LQKIPLSEQFANVNIDAHVRALGQKYMGVKPNQNVQDVFGDPAKASGGHSLPVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EIGIGTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SST+KKNG
Sbjct: 85 YFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEFS 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G+++GF S+D + IGDLVV++Q F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 ITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +Y M+N+ L++EP+FSF+ +++ E+VFGGM+ D + GE T +P+ +K YW
Sbjct: 204 IVPPFYEMLNKDLLDEPMFSFYLGDANIDDDQSEVVFGGMNKDRFTGELTKIPLRRKAYW 263
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQEC 325
+ D+ + QT G I D+GTSL+A P+TI +N IGA G + EC
Sbjct: 264 EVDLDSITFGKQTAMMTNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVEC 321
>gi|196123668|gb|ACG70181.1| cathepsin D-like protein [Homarus americanus]
Length = 386
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 181/282 (64%), Gaps = 7/282 (2%)
Query: 49 RLDSKEGESFRT--SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVI 106
R+ K+ E RT +R+ + N +D++ L NY DAQY+G I IGTP Q F VI
Sbjct: 20 RIPLKKIEKSRTLQDLRRTRVFLNHRYGVGSDVIDLDNYEDAQYYGPITIGTPGQGFDVI 79
Query: 107 FDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDH 165
FDTGSSNLW+PS KC+ ++A H++Y S +SSTY +NG + DI YG+GA+ GF S D+
Sbjct: 80 FDTGSSNLWIPSEKCFILNLARRLHNRYDSTKSSTYIENGTAFDIQYGSGALHGFLSSDN 139
Query: 166 VKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNE 225
V++G + Q F EAT+EP L F++ K DGILG+ F EISV V+ MV QG V++
Sbjct: 140 VEMGGVNAMGQTFAEATQEPGLAFIMGKLDGILGMAFTEISVMGIPTVFDTMVAQGAVDQ 199
Query: 226 PVFSFWFNRNA---DEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT 282
P+FSF+ N + +E GGE+V GG DP+HY+GE YVPV++ GYWQ + +
Sbjct: 200 PIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKVGYWQVTAEAIKVGDNV 259
Query: 283 TGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
TGFC C AI D+GTSL+AGP + ++ H +G G ++ E
Sbjct: 260 TGFC-NPCEAIVDTGTSLIAGPNAEVKEIVHMLGGYGFIAGE 300
>gi|334562337|gb|AEG79714.1| cathepsin D [Apostichopus japonicus]
Length = 372
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 191/315 (60%), Gaps = 15/315 (4%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
FL LL P+ L RI L K + + R + E+ + ++ + G+
Sbjct: 4 LFLVGLLLPIA-----SALQRIPLFKVESARQRLIRTRSSKSDLEAIGSGLQVKEVNGSP 58
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFH 130
+ LK+Y+DAQY+G I +GTPPQ+F V+FDTGSSNLWVPSS C + IAC F
Sbjct: 59 --------IILKDYLDAQYYGPITLGTPPQDFVVVFDTGSSNLWVPSSTCSWKDIACSFT 110
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
KY SSTY N + I YG+G +GF S D + +G++ VK Q F EAT EP L+++
Sbjct: 111 KKYDHSVSSTYVANDTAFAIPYGSGNCAGFLSYDTLMMGNVAVKSQLFGEATAEPGLSWI 170
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+A+FDGILG+G+ ISV +P + N++N+ L++ +FSF+ +++ GGE++ GG D
Sbjct: 171 MAQFDGILGMGYPTISVDGVIPPFDNIMNRKLISNNIFSFYLSKDPSAAVGGELLLGGTD 230
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG-GCAAIADSGTSLLAGPTTIIT 309
+Y G TYV V++KGYWQF M V I G+ G+C G C+AI D+GTSL+AGPT I
Sbjct: 231 SKYYTGNFTYVKVSKKGYWQFAMDKVSIGGKDAGYCTGKNCSAICDTGTSLIAGPTADIN 290
Query: 310 QVNHAIGATGIVSQE 324
+N IGA ++ E
Sbjct: 291 DLNKKIGAIPLIKGE 305
>gi|530795|gb|AAA20876.1| pepsinogen [Aspergillus niger]
gi|350634685|gb|EHA23047.1| extracellular aspartic protease [Aspergillus niger ATCC 1015]
Length = 398
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 185/305 (60%), Gaps = 17/305 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L ++ L+++ + N R ++ R SI K + N V + N+++AQ
Sbjct: 25 LNKVPLEEQLYTHNIDAHVRALGQKYMGIRPSIHKELVEENPINDMSRHDVLVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SSTY KNG
Sbjct: 85 YFSEIELGTPPQKFKVVLDTGSSNLWVPSSECS-SIACYLHNKYDSSASSTYHKNGSEFA 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G++SGF S+D +KIGDL VK Q+F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 IKYGSGSLSGFVSQDTLKIGDLKVKGQDFAEATNEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +YNM++QGL++EPVF+F+ E + FGG+D DHY GE +P+ +K YW
Sbjct: 204 IVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDESVATFGGVDKDHYTGELIKIPLRRKAYW 263
Query: 270 QFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGI 320
+ + +GD + + + TG I D+GTSL+A P + +N IGA TG
Sbjct: 264 EVELDAIALGDDVAEMENTG-------VILDTGTSLIALPADLAEMINAQIGAKKGWTGQ 316
Query: 321 VSQEC 325
+ +C
Sbjct: 317 YTVDC 321
>gi|367047895|ref|XP_003654327.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
gi|347001590|gb|AEO67991.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
Length = 396
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 128/238 (53%), Positives = 161/238 (67%), Gaps = 6/238 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + NYM+AQYF EI +GTPPQ+F V+ DTGSSNLWVPS +C SIACY HSKY S SS
Sbjct: 76 VPISNYMNAQYFSEITLGTPPQSFKVVLDTGSSNLWVPSVEC-GSIACYLHSKYDSSASS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG S DI YG+G++SGF S+D + IGD+ VK Q+F EAT EP L F +FDGILG
Sbjct: 135 TYKKNGTSFDIRYGSGSLSGFVSQDTLSIGDITVKGQDFAEATSEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y MV Q LV+EPVF+F+ AD E+VFGG+D D YKG+ T
Sbjct: 195 LGYDTISVNGIVPPFYKMVEQKLVDEPVFAFYL---ADTNGESEVVFGGVDKDRYKGKIT 251
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+P+ +K YW+ D + T F G AI D+GTSL+ P+ + +N +GA
Sbjct: 252 TIPLRRKAYWEVDFESLSYGDDTADFENTG--AILDTGTSLITLPSQLAEMLNAQLGA 307
>gi|451992127|gb|EMD84649.1| hypothetical protein COCHEDRAFT_1189444 [Cochliobolus
heterostrophus C5]
gi|452004574|gb|EMD97030.1| hypothetical protein COCHEDRAFT_1189956 [Cochliobolus
heterostrophus C5]
Length = 399
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 128/250 (51%), Positives = 168/250 (67%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + NY++AQYF EI +GTPPQ+F VI DTGSSNLWVPS++C SIAC+ H KY S SS
Sbjct: 77 VPVSNYLNAQYFSEISLGTPPQSFKVILDTGSSNLWVPSTQCT-SIACFLHDKYDSSSSS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+KNG +I YG+G++ GF S D ++IGDL VK+Q+F EAT EP L F KFDGILG
Sbjct: 136 TYQKNGSDFEIRYGSGSMKGFVSNDVLQIGDLKVKNQDFAEATSEPGLAFAFGKFDGILG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+NQGL++EPVF+F+ AD ++G E FGG+D HY G+
Sbjct: 196 LGYDTISVNHIVPPFYNMINQGLLDEPVFAFYLGDVAD-KQGSEATFGGIDESHYTGKLI 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ D + G+ T I D+GTSL+A P+ + +N IGA
Sbjct: 255 KLPLRRKAYWEVDL-DAITFGKETAETE-NVGVILDTGTSLIALPSAMAELLNKEIGAKK 312
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 313 GFNGQYSVEC 322
>gi|187608619|ref|NP_001120469.1| cathepsin E precursor [Xenopus (Silurana) tropicalis]
gi|170284872|gb|AAI61297.1| LOC100145572 protein [Xenopus (Silurana) tropicalis]
Length = 397
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 121/238 (50%), Positives = 163/238 (68%), Gaps = 2/238 (0%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L NYMD +YFGEI IGTPPQNFTVIFDTGSSNLWVPS C S AC H++++ SST
Sbjct: 65 PLINYMDVEYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC-ISPACAQHNRFQPQFSST 123
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y+ NG + + YGTG++SG D V + ++V+ Q+F E+ EP TF+ A+FDGILGL
Sbjct: 124 YQSNGNNFSLQYGTGSLSGIIGTDSVSVEGILVQSQQFGESVSEPGSTFVDAEFDGILGL 183
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+VG PV+ NM+ Q LV P+FS + +RN + GGE+VFGG D + G+ +
Sbjct: 184 GYPSIAVGDCTPVFDNMMTQNLVELPMFSVYMSRNPNSPVGGELVFGGFDASRFSGQLNW 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
V VT +GYWQ + ++ I+G+ FC GGC AI D+GTSL+ GP++ I Q+ IGA+
Sbjct: 244 VSVTNQGYWQIQLDNIQINGEVV-FCTGGCQAIVDTGTSLITGPSSDIVQLQSIIGAS 300
>gi|451853159|gb|EMD66453.1| hypothetical protein COCSADRAFT_34972 [Cochliobolus sativus ND90Pr]
Length = 399
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 128/250 (51%), Positives = 168/250 (67%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF +I +GTPPQ+F VI DTGSSNLWVPS++C SIACY H+KY S SS
Sbjct: 77 VPVSNFLNAQYFSDISLGTPPQSFKVILDTGSSNLWVPSTECS-SIACYLHTKYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +I YG+G++SGF S D +IGDL VK+Q+F EAT EP L F +FDGI+G
Sbjct: 136 TYKKNGSEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+NQGL++EPVF+F+ D +E E FGG+D HY G+ T
Sbjct: 196 LGYDTISVNGIVPPFYNMLNQGLLDEPVFAFYLGDTKDGKE-SEATFGGIDESHYTGKLT 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + +T G AI D+GTSL+A P+ I +N IGA
Sbjct: 255 KLPLRRKAYWEVDLDAITFGKETAEMENIG--AILDTGTSLIALPSAIAELLNKEIGAKK 312
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 313 GFNGQYSVEC 322
>gi|195399277|ref|XP_002058247.1| GJ15982 [Drosophila virilis]
gi|194150671|gb|EDW66355.1| GJ15982 [Drosophila virilis]
Length = 374
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/289 (46%), Positives = 183/289 (63%), Gaps = 18/289 (6%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG--NLGE 73
+ + VV + + L+R+ + K + N V R + K S+ LRG NL
Sbjct: 5 ITVLAVVLALASAELHRVPIYKHE----NFVKTRENVKAEVSY--------LRGKYNLPS 52
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSK 132
+ + L N ++ Y+G I IGTPPQ+F V+FD+GSSNLWVPSS C +F +AC H++
Sbjct: 53 ARSENEEELSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACMNHNQ 112
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y +SSTY NG+S I YG+G++SGF S D V + LV+K Q F EAT EP +F A
Sbjct: 113 YDHDKSSTYTSNGESFSIQYGSGSLSGFLSTDTVDVNGLVIKSQTFAEATSEPGTSFNNA 172
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILG+ +Q ++V VP +YNMV+QGLV++ VFSF+ R+ +GGE++FGG D
Sbjct: 173 KFDGILGMAYQSLAVDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSS 232
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
Y G+ TYVP++++GYWQF M IDGQ+ C C AIAD+GTSLL
Sbjct: 233 LYSGDLTYVPISEQGYWQFTMAGASIDGQS--LC-DNCQAIADTGTSLL 278
>gi|12697815|dbj|BAB21620.1| cathepsin D [Bos taurus]
Length = 386
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 180/286 (62%), Gaps = 25/286 (8%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SST
Sbjct: 47 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 106
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + DIHYG+G++SG+ S+D V + G + V+ Q F EA ++P + F+
Sbjct: 107 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 166
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKF GILG+ + ISV +PV+ N++ Q LV++ VFSF+ NR+ + GGE++ GG D
Sbjct: 167 AKFGGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 226
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y+G + VT++ YWQ M D + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 227 KYYRGSLMFHNVTRQAYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLIVGPVEEVREL 285
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S L +V+ KL + Y
Sbjct: 286 QKAIGAVPLIQGEYMIPCEKVSS----------LPQVTVKLGGKDY 321
>gi|347836229|emb|CCD50801.1| similar to vacuolar protease A (secreted protein) [Botryotinia
fuckeliana]
Length = 398
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/277 (48%), Positives = 182/277 (65%), Gaps = 23/277 (8%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTV 105
+ R +S E F+ + ++ ++GD V + N+++AQYF EI IGTPPQ+F V
Sbjct: 50 MGVRPESHASEMFKET--------SVHDAGD-HTVPVSNFLNAQYFSEITIGTPPQSFKV 100
Query: 106 IFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDH 165
+ DTGSSNLWVPSS+C SIACY H+KY S SSTYK+NG S +I YG+G++SGF S+D
Sbjct: 101 VLDTGSSNLWVPSSQC-GSIACYLHTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDV 159
Query: 166 VKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNE 225
+ IGDL +KDQ F EAT EP L F +FDGILGLG+ ISV VP +Y+MV+QGL++E
Sbjct: 160 MTIGDLKIKDQVFAEATEEPGLAFAFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDE 219
Query: 226 PVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDG 280
PVF+F+ N DE + E +FGG++ DHY G+ T +P+ +K YW+ D+ GD +
Sbjct: 220 PVFAFYLGSN-DESDPSEAIFGGVNKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAEL 278
Query: 281 QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ TG I D+GTSL+A P + +N IGA
Sbjct: 279 ENTG-------VILDTGTSLIALPADLAGLLNAEIGA 308
>gi|67524891|ref|XP_660507.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|40744298|gb|EAA63474.1| hypothetical protein AN2903.2 [Aspergillus nidulans FGSC A4]
gi|259486160|tpe|CBF83780.1| TPA: vacuolar aspartyl protease (proteinase A) (Eurofung)
[Aspergillus nidulans FGSC A4]
Length = 394
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 125/244 (51%), Positives = 165/244 (67%), Gaps = 13/244 (5%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H K+ S SS
Sbjct: 71 VLVDNFMNAQYFSEIQLGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHQKFDSSASS 129
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF S D+++IGDL VK Q+F EAT EP L F +FDGILG
Sbjct: 130 TYKKNGSEFAIKYGSGSLSGFVSRDNLQIGDLKVKGQDFAEATSEPGLAFAFGRFDGILG 189
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF ISV + VP +YNM++QGL++EPVF+F+ + + FGG+D DHY+GE
Sbjct: 190 LGFDTISVNRIVPPFYNMIHQGLLDEPVFAFYLGDANKDGDSSVATFGGIDKDHYEGELI 249
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
+P+ +K YW+ D +GD + + + TG I D+GTSL+A P+ + +N
Sbjct: 250 KIPLRRKAYWEVDLDAIALGDEVAELENTG-------VILDTGTSLIALPSNLAEMINTE 302
Query: 315 IGAT 318
IGAT
Sbjct: 303 IGAT 306
>gi|260837471|ref|XP_002613727.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
gi|229299116|gb|EEN69736.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
Length = 392
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 127/277 (45%), Positives = 177/277 (63%), Gaps = 13/277 (4%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
+LKN+MD QY+G I +GTPPQ+F VIFDTGSSNLWVPS KC AC H +Y +S T
Sbjct: 65 SLKNFMDVQYYGVISLGTPPQDFNVIFDTGSSNLWVPSVKCE-GAACANHQRYNHSKSCT 123
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK +G+ I YG+G++SGF S+D V IG +V+K+Q F EAT EP F KFDGILGL
Sbjct: 124 YKADGRPLKITYGSGSLSGFLSQDVVMIGSIVIKNQTFGEATNEPGSAFATGKFDGILGL 183
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ +I+V PV+ +++Q LV++ VFSF+ +R+ GGE++ GG DP +Y G TY
Sbjct: 184 AYPQIAVDHIRPVFDMIMDQKLVDKNVFSFYLDRDPSRAPGGELLLGGTDPTYYTGNFTY 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+PV+ +GYWQ +M V + Q CAGGC AI D+GTSL+AGP+ I ++ AIG+ I
Sbjct: 244 IPVSYQGYWQLNMDGVHVGDQK--LCAGGCQAIVDTGTSLIAGPSEEIHKLQAAIGSQQI 301
Query: 321 VS----------QECKAVVSQYGEEIINMLLAKVSAK 347
+ V Q+G+++ N+ + + K
Sbjct: 302 SPGQYLVDCGRLDDLPVVSFQFGDKLFNLTGQEYTVK 338
>gi|154309857|ref|XP_001554261.1| hypothetical protein BC1G_06849 [Botryotinia fuckeliana B05.10]
gi|38195404|gb|AAR13364.1| aspartic proteinase precursor [Botryotinia fuckeliana]
Length = 398
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/277 (48%), Positives = 182/277 (65%), Gaps = 23/277 (8%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTV 105
+ R +S E F+ + ++ ++GD V + N+++AQYF EI IGTPPQ+F V
Sbjct: 50 MGVRPESHASEMFKET--------SVHDAGD-HTVPVSNFLNAQYFSEITIGTPPQSFKV 100
Query: 106 IFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDH 165
+ DTGSSNLWVPSS+C SIACY H+KY S SSTYK+NG S +I YG+G++SGF S+D
Sbjct: 101 VLDTGSSNLWVPSSQC-GSIACYLHTKYDSSSSSTYKQNGTSFEIRYGSGSLSGFTSKDV 159
Query: 166 VKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNE 225
+ IGDL +KDQ F EAT EP L F +FDGILGLG+ ISV VP +Y+MV+QGL++E
Sbjct: 160 MTIGDLKIKDQVFAEATEEPGLAFAFGRFDGILGLGYDTISVNSIVPPFYSMVDQGLLDE 219
Query: 226 PVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDG 280
PVF+F+ N DE + E +FGG++ DHY G+ T +P+ +K YW+ D+ GD +
Sbjct: 220 PVFAFYLGSN-DESDPSEAIFGGVNKDHYDGKITEIPLRRKAYWEVDLDSIAFGDSEAEL 278
Query: 281 QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ TG I D+GTSL+A P + +N IGA
Sbjct: 279 ENTG-------VILDTGTSLIALPADLAGLLNAEIGA 308
>gi|402906424|ref|XP_003916002.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 421
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 130/276 (47%), Positives = 172/276 (62%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFG+IG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 69 VPLSNYRDVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKAS 128
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG + G SED + IG + F EA EP L F A FDGIL
Sbjct: 129 SSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGIL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP HY
Sbjct: 189 GLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPL 248
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G C GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 249 TFVPVTVPAYWQIHMERVKV-GPGLTLCVPGCAAILDTGTSLITGPTEEIRALHAAIGGY 307
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL +V L+ + Y
Sbjct: 308 PLLAGEYIILCSEIPKLPAVSFLLGEVWFNLTAQDY 343
>gi|396499231|ref|XP_003845423.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
gi|21914374|gb|AAM81358.1|AF522873_1 aspartyl proteinase [Leptosphaeria maculans]
gi|312222004|emb|CBY01944.1| similar to Vacuolar aspartyl protease (proteinase A) [Leptosphaeria
maculans JN3]
Length = 397
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 126/250 (50%), Positives = 168/250 (67%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 75 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSASS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG S +I YG+G +SGF S D +IGDL VK+Q+F EAT EP L F +FDGI+G
Sbjct: 134 TYKKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM++QGL++EPVF+F+ + +E++ E FGG+D HY G+
Sbjct: 194 LGYDTISVNHIVPPFYNMLDQGLLDEPVFAFYLG-DTNEQQESEATFGGIDESHYSGKLI 252
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + +T G I D+GTSL+A P+T+ +N IGA
Sbjct: 253 KLPLRRKAYWEVDLDAITFGKETAEMDNTGV--ILDTGTSLIALPSTMAELLNREIGAKK 310
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 311 GFNGQYSVEC 320
>gi|344269496|ref|XP_003406588.1| PREDICTED: LOW QUALITY PROTEIN: napsin-A-like [Loxodonta africana]
Length = 396
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/301 (44%), Positives = 179/301 (59%), Gaps = 12/301 (3%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGD-ADIVALKNYMDA 88
L RI L + D + R ++R + SL + GD + V L NYM+
Sbjct: 26 LIRIPLHRVHPDPRTLNSPR-------AWRKAAEHMSLVAS--SPGDKSTFVPLSNYMNV 76
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKS 147
QYFGEIG+GTPPQNF+V+FDTGSSNLWVPS +C+F S+ C+ H ++ SS+++ NG
Sbjct: 77 QYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFLSLPCWVHHRFNPNASSSFQPNGTK 136
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YGTG ++G SED + IG + F EA EPSL F A FDGILGLGF ++V
Sbjct: 137 FAIQYGTGRLTGILSEDKLTIGGIEGTSVVFGEALWEPSLVFTFAPFDGILGLGFPILAV 196
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
P +V QGLV++PVFSF+ NR+ + +GGE+V GG DP HY ++PVT
Sbjct: 197 DGVRPPLDILVEQGLVDKPVFSFYLNRDPEAPDGGELVLGGSDPAHYIPPLNFMPVTIPA 256
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQ M V + G CA GCAAI D+GTSL+ GP I +N AIG +++ + +
Sbjct: 257 YWQIHMERVKV-GTGLNLCAQGCAAILDTGTSLITGPAEEIQALNSAIGGVALLTGQVRT 315
Query: 328 V 328
+
Sbjct: 316 L 316
>gi|45360583|ref|NP_988964.1| cathepsin D precursor [Xenopus (Silurana) tropicalis]
gi|38174445|gb|AAH61433.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 398
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 193/328 (58%), Gaps = 22/328 (6%)
Query: 3 MVFKSITAGFFLCLLLFPVVFSTPNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGES 57
M + A LC ++ P L RI LKK R ++ A +L E +
Sbjct: 1 MASAPVWALLALCCVM------QPGSSLVRIPLKKFTSIRRAMSETDQDALKLSGNEAAT 54
Query: 58 FRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVP 117
KYS N S + L NY+DAQY+GEIGIGTPPQ FTV+FDTGSSNLWVP
Sbjct: 55 ------KYSAFLN---SKNPTPETLLNYLDAQYYGEIGIGTPPQPFTVVFDTGSSNLWVP 105
Query: 118 SSKCYF-SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQ 176
S C F +AC+ H KY S +S+TY NG I YG+G+++G+ S+D V IGDL V Q
Sbjct: 106 SIHCSFWDLACWLHHKYDSSKSTTYINNGTEFAIQYGSGSLTGYLSKDTVTIGDLAVNGQ 165
Query: 177 EFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA 236
F EA ++P +TF+ AKFDGILG+G+ +ISV PV+ +++ Q LV+ +FSF+ NRN
Sbjct: 166 FFAEAIKQPGITFVAAKFDGILGMGYPKISVDGVPPVFDDIMEQKLVDSNIFSFYLNRNP 225
Query: 237 DEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADS 296
D GGE++ GG DP Y G+ Y+ VT+K YWQ M + + G C GC AI D+
Sbjct: 226 DTLPGGELLLGGTDPAFYTGDFNYMNVTRKAYWQIHMDQLSV-GDRLSLCKDGCEAIVDT 284
Query: 297 GTSLLAGPTTIITQVNHAIGATGIVSQE 324
GTSL+ GP +T + AIGA ++ E
Sbjct: 285 GTSLITGPVEEVTALQRAIGAIPLICGE 312
>gi|46138535|ref|XP_390958.1| hypothetical protein FG10782.1 [Gibberella zeae PH-1]
gi|408391598|gb|EKJ70970.1| hypothetical protein FPSE_08829 [Fusarium pseudograminearum CS3096]
Length = 396
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/261 (50%), Positives = 170/261 (65%), Gaps = 22/261 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS +C SIACY HSKY S SS
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQEC-GSIACYLHSKYDSSASS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +IHYG+G++SGF S D V IGDL +KDQ+F EAT+EP L F +FDGILG
Sbjct: 135 TYKKNGSEFEIHYGSGSLSGFVSNDVVSIGDLKIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG-GEIVFGGMDPDHYKGEH 258
LG+ I+V VP +Y MVNQ L++EPVF+F+ D +EG E FGG+D Y G+
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYL----DGQEGQSEATFGGVDKSKYTGDL 250
Query: 259 TYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ + +N
Sbjct: 251 EYIPLRRKAYWEVDLDAIAFGDEVAEQENTG-------AILDTGTSLNVLPSALAELLNK 303
Query: 314 AIGA----TGIVSQECKAVVS 330
IGA G + EC V S
Sbjct: 304 EIGAKKGYNGQYTIECDKVSS 324
>gi|70999520|ref|XP_754479.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|74675969|sp|O42630.1|CARP_ASPFU RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase pep2; AltName: Full=Aspartic protease
pep2; Flags: Precursor
gi|2664292|emb|CAA75754.1| cellular aspartic protease [Aspergillus fumigatus]
gi|4200293|emb|CAA10674.1| aspartic protease [Aspergillus fumigatus]
gi|66852116|gb|EAL92441.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|159127496|gb|EDP52611.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus A1163]
Length = 398
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 190/324 (58%), Gaps = 22/324 (6%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFD-----LNNRVAARLDSKEGESFRTSIRKYSLRGN 70
LL V+ + + ++++ L K D N R ++ R ++ + L N
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMGIRPNVHQELLEEN 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
V + N+++AQYF EI +GTPPQ F V+ DTGSSNLWVP S C SIAC+ H
Sbjct: 66 SLNDMSRHDVLVDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIACFLH 124
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+KY S SSTYK NG I YG+G +SGF S+D ++IGDL V Q+F EAT EP L F
Sbjct: 125 NKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPGLAFA 184
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+FDGILGLG+ ISV K VP +YNM++QGL++EPVF+F+ E + E FGG+D
Sbjct: 185 FGRFDGILGLGYDTISVNKIVPPFYNMLDQGLLDEPVFAFYLGDTNKEGDNSEASFGGVD 244
Query: 251 PDHYKGEHTYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
+HY GE T +P+ +K YW+ D +GD + + + TG I D+GTSL+A P+
Sbjct: 245 KNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGI-------ILDTGTSLIALPS 297
Query: 306 TIITQVNHAIGA----TGIVSQEC 325
T+ +N IGA TG S EC
Sbjct: 298 TLADLLNKEIGAKKGFTGQYSIEC 321
>gi|449299914|gb|EMC95927.1| hypothetical protein BAUCODRAFT_34686 [Baudoinia compniacensis UAMH
10762]
Length = 376
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 126/250 (50%), Positives = 166/250 (66%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
VA+ N+++AQYF +I IGTPPQ+F V+ DTGSSNLWVPS C SIACY HSKY SS
Sbjct: 54 VAVSNFLNAQYFSDISIGTPPQDFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDHSDSS 112
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G + G+ S+D V+IGDL +K+Q F EAT EP L F +FDGI+G
Sbjct: 113 TYKKNGSDFQIRYGSGELEGYISQDTVRIGDLSIKNQLFAEATSEPGLAFAFGRFDGIMG 172
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+NQGL++E VF+F+ + +E E FGG+D HY+G+ T
Sbjct: 173 LGYDTISVNHIVPPFYNMINQGLIDEQVFAFYLSDTNKGDE-SEATFGGIDESHYEGKMT 231
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D+ + QT + G AI D+GTSL+A PTT+ +N IGA
Sbjct: 232 KIPLRRKAYWEVDLDAITFGDQTAEIDSTG--AILDTGTSLIALPTTLAELLNREIGAKK 289
Query: 318 --TGIVSQEC 325
G + EC
Sbjct: 290 SYNGQYTIEC 299
>gi|198457045|ref|XP_001360531.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
gi|198135836|gb|EAL25106.2| GA10074 [Drosophila pseudoobscura pseudoobscura]
Length = 399
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 134/324 (41%), Positives = 185/324 (57%), Gaps = 14/324 (4%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRV--AARLDSKEGESFRTSIRKYSLRG 69
FL L+L P+V +P+ LYR+ L++ N V R+D FR + R
Sbjct: 4 LFLSLILPPLVAPSPSTELYRVPLRRFPSARNRFVQFGIRMDR-----FRLKYSRVDGRS 58
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACY 128
+ L NY+DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+ C + +AC
Sbjct: 59 RPRGGWEVRSEPLSNYLDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPSTSCAPTMVACM 118
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
HS+Y + +SS++++NG IHYG+G+++G+ S D V++ L +++Q F E T P
Sbjct: 119 VHSRYNARQSSSHRRNGVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTFAEVTTMPGPI 178
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
FL AKFDGI GL +Q IS+ P +Y ++ Q L++ PVFS + NR + EGG + FGG
Sbjct: 179 FLAAKFDGIFGLAYQSISMQDVKPPFYAIMEQKLLSNPVFSVYLNRQQEHPEGGALFFGG 238
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+P +Y+G TYVPV+ + YWQ M I+ C GC I D+GTS LA P
Sbjct: 239 SNPRYYRGNFTYVPVSHRAYWQVRMEAATIN--DLRLCQHGCEVIIDTGTSFLALPYDQA 296
Query: 309 TQVNHAIGAT----GIVSQECKAV 328
+N +IG T G S C V
Sbjct: 297 ILINESIGGTPSEYGQYSVPCDQV 320
>gi|409079719|gb|EKM80080.1| hypothetical protein AGABI1DRAFT_113304 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 413
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 168/255 (65%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI IG+PPQ F VI DTGSSNLWVPS KC SIAC+ H+KY SG+SS
Sbjct: 92 VPLSNFMNAQYFTEIQIGSPPQTFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSGQSS 150
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG + +I YG+GA+ GF S+D ++IGDL + Q+F EAT+EP L F KFDGILG
Sbjct: 151 TYKANGSTFEIQYGSGAMEGFVSQDQLQIGDLTINGQDFAEATKEPGLAFAFGKFDGILG 210
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y M+ Q L++E VFS F + +E+GGE VFGG+D YKG+
Sbjct: 211 LGYDTISVNHIVPPFYKMIEQNLLDERVFS--FRLGSSDEDGGEAVFGGIDESAYKGKMH 268
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YVP+ QK YW+ + + + G+ G A D+GTSL+A P+ + +N IGA
Sbjct: 269 YVPIRQKAYWEVQLDKISLGGEELELENTGAA--IDTGTSLIALPSDMAEMLNTQIGAKK 326
Query: 318 --TGIVSQECKAVVS 330
G + +C V S
Sbjct: 327 SWNGQYTIDCAKVAS 341
>gi|74198157|dbj|BAE35255.1| unnamed protein product [Mus musculus]
Length = 335
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 127/258 (49%), Positives = 172/258 (66%), Gaps = 15/258 (5%)
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNG 145
DAQY+G+IGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY S +SSTY KNG
Sbjct: 1 DAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNG 60
Query: 146 KSADIHYGTGAISGFFSEDHV---------KIGDLVVKDQEFIEATREPSLTFLLAKFDG 196
S DIHYG+G++SG+ S+D V K + V+ Q F EAT++P + F+ AKFDG
Sbjct: 61 TSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIFGEATKQPGIVFVAAKFDG 120
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
ILG+G+ ISV +PV+ N++ Q LV++ +FSF+ NR+ + + GGE++ GG D +Y G
Sbjct: 121 ILGMGYPHISVNNVLPVFDNLMQQKLVDKNIFSFYLNRDPEGQPGGELMLGGTDSKYYHG 180
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
E +Y+ VT+K YWQ M + + + T C GGC AI D+GTSLL GP + ++ AIG
Sbjct: 181 ELSYLNVTRKAYWQVHMDQLEVGNELT-LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIG 239
Query: 317 ATGIVSQE----CKAVVS 330
A ++ E C+ V S
Sbjct: 240 AMPLIQGEYMIPCEKVSS 257
>gi|395531206|ref|XP_003767673.1| PREDICTED: cathepsin E [Sarcophilus harrisii]
Length = 395
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 179/295 (60%), Gaps = 19/295 (6%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGDADIV--------------ALKNYMDAQYFGEI 94
RL K +S R ++R+ E+ D++ L NY+D +Y+G I
Sbjct: 22 RLPLKRHKSLRKTLRERGQLSQFWETHKLDMLQFTDFCSQDQSENEPLINYLDMEYYGVI 81
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
IG+PPQNFTVIFDTGSSNLWVPS C S AC H+++ +SSTY +NG S I YGT
Sbjct: 82 SIGSPPQNFTVIFDTGSSNLWVPSVYC-VSPACKNHNRFYPSQSSTYVENGNSFSIQYGT 140
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G++SG D V + + V +Q+F E+ EP TF+ A+FDGILGL + ++VG PV+
Sbjct: 141 GSLSGIIGMDQVSVEGITVANQQFGESVSEPGSTFVNAEFDGILGLAYPSLAVGGVTPVF 200
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
NM+ Q LV+ P+FS + RN D G E+VFGG D H+ G +VPVT++GYWQ +
Sbjct: 201 DNMIAQNLVDMPIFSVYMTRNPDSPTGSELVFGGYDHAHFTGSLNWVPVTKQGYWQIALD 260
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT---GIVSQECK 326
++ + G T FCA GC AI D+GTSL+ GP+ I Q+ +AIGA G + EC
Sbjct: 261 NIQVGG-TIMFCAEGCQAIVDTGTSLITGPSDKIKQLQNAIGAVLTDGEYAMECN 314
>gi|119491657|ref|XP_001263323.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
gi|119411483|gb|EAW21426.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
Length = 398
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 189/324 (58%), Gaps = 22/324 (6%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFD-----LNNRVAARLDSKEGESFRTSIRKYSLRGN 70
LL V+ + + ++++ L K D N R ++ R ++ + L N
Sbjct: 6 LLTASVLLGSASAAVHKLKLNKVPLDEQLYTHNIDAHVRALGQKYMGIRPNVHQELLEEN 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
V + N+++AQYF EI +GTPPQ F V+ DTGSSNLWVP S C SIAC+ H
Sbjct: 66 SLNDMSRHDVLVDNFLNAQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS-SIACFLH 124
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+KY S SSTYK NG I YG+G +SGF S+D ++IGDL V Q+F EAT EP L F
Sbjct: 125 NKYDSSASSTYKANGTEFAIKYGSGELSGFVSQDTLQIGDLKVVKQDFAEATNEPGLAFA 184
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+FDGILGLG+ ISV K VP +YNM+ QGL++EPVF+F+ E + E FGG+D
Sbjct: 185 FGRFDGILGLGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLGDTNKEGDNSEASFGGVD 244
Query: 251 PDHYKGEHTYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
+HY GE T +P+ +K YW+ D +GD + + + TG I D+GTSL+A P+
Sbjct: 245 KNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTG-------VILDTGTSLIALPS 297
Query: 306 TIITQVNHAIGA----TGIVSQEC 325
T+ +N IGA TG S EC
Sbjct: 298 TLADLLNKEIGAKKGFTGQYSIEC 321
>gi|169600915|ref|XP_001793880.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
gi|111068923|gb|EAT90043.1| hypothetical protein SNOG_03312 [Phaeosphaeria nodorum SN15]
Length = 347
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 129/251 (51%), Positives = 168/251 (66%), Gaps = 10/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 25 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSSECN-SIACYLHTKYDSSSSS 83
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG S +I YG+G +SGF S D +IGDL VK+Q+F EAT EP L F +FDGI+G
Sbjct: 84 TYKKNGTSFEIRYGSGELSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 143
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YNM+ QGL++EPVF+F+ + NA +E E FGG+D HY G+
Sbjct: 144 LGYDTISVNKIVPPFYNMLEQGLLDEPVFAFYLGDTNAQQES--EATFGGIDESHYSGKL 201
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
+P+ +K YW+ D+ + +T G I D+GTSL+A P+TI +N IGA
Sbjct: 202 IKLPLRRKAYWEVDLDAITFGKETAEMDDTGV--ILDTGTSLIALPSTIAELLNKEIGAK 259
Query: 318 ---TGIVSQEC 325
G + EC
Sbjct: 260 KGFNGQYTVEC 270
>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
Length = 408
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 130/255 (50%), Positives = 164/255 (64%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI IG PPQ+F VI DTGSSNLWVPS KC SIAC+ H+KY S SS
Sbjct: 87 VPLSNFMNAQYFTEISIGNPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSASSS 145
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T+K NG IHYG+G++ GF S D + IGD+ +K Q+F EA +EP L F KFDGILG
Sbjct: 146 TFKANGSEFSIHYGSGSMEGFVSNDLLSIGDITIKGQDFAEAVKEPGLAFAFGKFDGILG 205
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV +P +Y+M+NQGL++ PVFS F + EE+GGE VFGG+D YKG+ T
Sbjct: 206 LGYDTISVNHIIPPFYSMINQGLIDSPVFS--FRLGSSEEDGGEAVFGGIDESAYKGKIT 263
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YVPV +K YW+ ++ V + G A D+GTSL+ PT I +N IGA
Sbjct: 264 YVPVRRKAYWEVELEKVSFGNDDLELESTGAA--IDTGTSLIVLPTDIAEMLNTQIGAKK 321
Query: 318 --TGIVSQECKAVVS 330
G +C V S
Sbjct: 322 SWNGQYQVDCAKVPS 336
>gi|225556537|gb|EEH04825.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus G186AR]
Length = 398
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 131/289 (45%), Positives = 186/289 (64%), Gaps = 5/289 (1%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L +I L ++ ++N R ++ + + + G+ ++ + + N+++AQ
Sbjct: 25 LQKIPLSEQFANVNIDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EIGIGTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SST+KKNG
Sbjct: 85 YFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEFS 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G+++GF S+D + IGDLVV++Q F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 ITYGSGSLTGFVSQDCLTIGDLVVENQVFAEATSEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
VP +Y M+N+ L++EP+FSF+ + N D +E E+VFGGM+ + + GE T +P+ +K Y
Sbjct: 204 IVPPFYEMLNKNLLDEPMFSFYLGDANVDGDE-SEVVFGGMNKNRFMGELTKIPLRRKAY 262
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
W+ D+ + QT G I D+GTSL+A P+TI +N IGA
Sbjct: 263 WEVDLDSITFGKQTAMMANTGV--ILDTGTSLIALPSTIAELLNKEIGA 309
>gi|440633873|gb|ELR03792.1| vacuolar protease A [Geomyces destructans 20631-21]
Length = 395
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 130/257 (50%), Positives = 166/257 (64%), Gaps = 20/257 (7%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRS 138
+V + N+++AQYF EI IG PPQ F V+ DTGSSNLWVPS C SIACY HSKY S S
Sbjct: 74 LVPVSNFLNAQYFSEITIGNPPQTFKVVLDTGSSNLWVPSQSC-GSIACYLHSKYDSSES 132
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
TY+KNG I YG+G++SG+ S+D V IGDLV+KDQ F EA EP L F +FDGIL
Sbjct: 133 KTYEKNGTEFAIQYGSGSVSGYISQDQVTIGDLVIKDQLFGEAVEEPGLAFAFGRFDGIL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ISV K VP +Y+M++QGL++E VFSF+ AD++ E VFGG+D HY G+
Sbjct: 193 GLGFDTISVNKVVPPFYSMIDQGLLDEKVFSFYL---ADDKSQSEAVFGGIDKSHYTGDL 249
Query: 259 TYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
TY+P+ +K YW+ D GDV D TG I D+GTSL P+++ +N
Sbjct: 250 TYIPLRRKAYWEVDFDAISFGDVKADLDNTG-------VILDTGTSLNTLPSSLAELLNK 302
Query: 314 AIGA----TGIVSQECK 326
IGA G + +CK
Sbjct: 303 EIGAKKGYNGQYTIDCK 319
>gi|116203505|ref|XP_001227563.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
gi|88175764|gb|EAQ83232.1| vacuolar protease A precursor [Chaetomium globosum CBS 148.51]
Length = 396
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 187/315 (59%), Gaps = 23/315 (7%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKK-------RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
LL V+ + G ++++ L+K LN ++ G R S
Sbjct: 4 ALLTAAVLLGSAQGAVHKMKLQKVPLSEQLEAVPLNTQLEQLGQKYMGVRPRQSHANAVF 63
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
G + E V + N+M+AQYF EI IG+PPQ F V+ DTGSSNLWVPS +C SIAC
Sbjct: 64 NGMVAEVKGNHPVPISNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSVEC-GSIAC 122
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
Y H+KY S SSTYKKNG + +I YG+G++SGF S+D + IGD+ +K Q+F EAT EP L
Sbjct: 123 YLHTKYDSSASSTYKKNGTNFEIRYGSGSLSGFVSQDTMTIGDITIKGQDFAEATSEPGL 182
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
F +FDGILGLG+ ISV VP +Y M+ Q L++EPVF+F+ ADE+ E+VFG
Sbjct: 183 AFAFGRFDGILGLGYDTISVNGIVPPFYKMLEQKLIDEPVFAFYL---ADEKGQSEVVFG 239
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
G+D D YKG+ T +P+ +K YW+ D GD + + TG I D+GTSL+A
Sbjct: 240 GVDSDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTG-------VILDTGTSLIA 292
Query: 303 GPTTIITQVNHAIGA 317
P+ + +N IGA
Sbjct: 293 LPSQLAEMLNAQIGA 307
>gi|109125662|ref|XP_001116026.1| PREDICTED: napsin-A-like [Macaca mulatta]
Length = 421
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 130/276 (47%), Positives = 171/276 (61%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFG+IG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+ C+ H ++ S
Sbjct: 69 VPLSNYRDVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPCWLHHRFDPKAS 128
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG + G SED + IG + F EA EP L F A FDGIL
Sbjct: 129 SSFQANGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWEPGLVFTFAHFDGIL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+ NR+ +E +GGE+V GG DP HY
Sbjct: 189 GLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDPEEPDGGELVLGGSDPAHYIPPL 248
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G C GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 249 TFVPVTVPAYWQIHMERVKV-GPGLTLCVRGCAAILDTGTSLITGPTEEIRALHAAIGGY 307
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E + S+ + ++ LL V L+ + Y
Sbjct: 308 PLLAGEYIILCSEIPKLPAVSFLLGGVWFNLTAQDY 343
>gi|443927046|gb|ELU45582.1| endopeptidase [Rhizoctonia solani AG-1 IA]
Length = 934
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 194/323 (60%), Gaps = 33/323 (10%)
Query: 14 LCLLLF-------PVVFSTPNGGLYRIGLKK----RKFD-------LNNRVAARLDSKEG 55
LC LL PV P G++R+ L+K FD L + A+
Sbjct: 47 LCSLLIAGLAGATPVKRDPP--GVHRMKLQKLAHPASFDPGLQAVALGQKYGAQTPLTTT 104
Query: 56 ESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLW 115
E T+ LRG G + V L NY++AQY+ +I +G+PPQ+F V+ DTGSSNLW
Sbjct: 105 EDLFTT---QELRGEGGHN-----VPLHNYLNAQYYADITLGSPPQSFKVVLDTGSSNLW 156
Query: 116 VPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKD 175
VP C SIAC+ H+KY S S+TYK NG I YG+G++SGF S+D + IGD+ VK
Sbjct: 157 VPGKSCT-SIACFLHAKYDSSASNTYKANGTEFAIQYGSGSLSGFMSQDTLTIGDIAVKH 215
Query: 176 QEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN 235
Q+F EAT+EP L F KFDGILGL F ISV AVP YNM++QGL+ EP+F+F
Sbjct: 216 QDFAEATKEPGLAFAFGKFDGILGLAFPRISVNGAVPPVYNMIDQGLIKEPLFTFRV--G 273
Query: 236 ADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIAD 295
+ E++GGE VFGG+D HYKG+ YVPV ++ YW+ ++ V + G+ T AAI D
Sbjct: 274 SSEQDGGEAVFGGIDESHYKGKIHYVPVRRQAYWEVELSSVSL-GEDTLELENTGAAI-D 331
Query: 296 SGTSLLAGPTTIITQVNHAIGAT 318
+GTSL+A PT I +N IGA+
Sbjct: 332 TGTSLIALPTDIAEMINAQIGAS 354
>gi|395858453|ref|XP_003801583.1| PREDICTED: napsin-A [Otolemur garnettii]
Length = 419
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 130/277 (46%), Positives = 173/277 (62%), Gaps = 3/277 (1%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGR 137
+V L +++D QYFGEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH ++
Sbjct: 66 LVPLSDFLDVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNA 125
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SS+++ NG I YG+G ++G S+D + IG L F EA EPSLTF A FDGI
Sbjct: 126 SSSFQPNGTKFAIEYGSGRLNGILSKDKLTIGGLKGASVVFGEALWEPSLTFTFAPFDGI 185
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGLGF ++V P +V QGL+++PVFSF+ NR+ D +GGE+V GG DP HY
Sbjct: 186 LGLGFPILAVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPDVADGGELVLGGSDPAHYIPP 245
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GPT I ++ AIG
Sbjct: 246 LTFVPVTIPAYWQIHMERVKV-GTGLTLCAQGCAAILDTGTSLITGPTEEIRALHAAIGG 304
Query: 318 TGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+ E S+ ++ LL V L+ + Y
Sbjct: 305 IPLPPGEHLIECSEIPRLPPVSFLLGGVWFNLTGKDY 341
>gi|195399279|ref|XP_002058248.1| GJ15983 [Drosophila virilis]
gi|194150672|gb|EDW66356.1| GJ15983 [Drosophila virilis]
Length = 372
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 183/315 (58%), Gaps = 14/315 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
+ + VV + + L+R+ + K++ + R R E R R + R E
Sbjct: 5 ITVLAVVLALASAELHRVPVLKQENFVKTRANVR---AEVSHLRAKYRLPTARSVNEED- 60
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
L N M+ Y+G I IGTPPQ+F V+FD+GSSNLWVPS C S AC H++Y S
Sbjct: 61 ------LSNSMNMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSKTCS-SYACEVHNQYDS 113
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTY+ NG+S I YGTG++SG + D V + L V+ Q F EAT EP F A FD
Sbjct: 114 SASSTYQANGESFSIQYGTGSLSGILATDIVNVNGLSVESQTFAEATNEPGTNFNDANFD 173
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+G+Q I+ VP +YNMV+QGLV++ VFSF+ R+ +GGE++FGG D Y
Sbjct: 174 GILGMGYQSIAQDNVVPPFYNMVSQGLVDQSVFSFYLARDGTSSQGGELIFGGSDSSLYS 233
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+ TYVP++++GYWQF M IDGQ+ C C AIAD+GTSL+ P Q+N +
Sbjct: 234 GDLTYVPISEQGYWQFTMAGASIDGQS--LC-DNCQAIADTGTSLIVAPANAYMQLNDIL 290
Query: 316 GATGIVSQECKAVVS 330
+C +V S
Sbjct: 291 NVDDQGLVDCSSVSS 305
>gi|405117936|gb|AFR92711.1| endopeptidase [Cryptococcus neoformans var. grubii H99]
Length = 438
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/260 (51%), Positives = 173/260 (66%), Gaps = 19/260 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF + +GTP Q F V+ DTGSSNLWVPS KC SIAC+ H+KY S +SS
Sbjct: 117 VPLSNFMNAQYFATVELGTPFQTFKVVLDTGSSNLWVPSVKCT-SIACFLHNKYDSSQSS 175
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G++ GF S+D + IGDLVVK Q+F EAT+EP L F KFDGILG
Sbjct: 176 TYKANGSDFEIHYGSGSLEGFISQDTLSIGDLVVKKQDFAEATKEPGLAFAFGKFDGILG 235
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+NQ L++EPVFS F + +E+GGE +FGG+D Y G+
Sbjct: 236 LGYDTISVNHIVPPFYNMLNQHLLDEPVFS--FRLGSSDEDGGEAIFGGIDDSAYSGKLA 293
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +KGYW+ ++ GD ++ + TG AAI D+GTSL+ PT + +N
Sbjct: 294 YVPVRRKGYWEVELESISFGDEELELENTG------AAI-DTGTSLIVMPTDVAELLNKE 346
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + +C V S
Sbjct: 347 IGAEKSWNGQYTVDCNTVSS 366
>gi|390601248|gb|EIN10642.1| endopeptidase [Punctularia strigosozonata HHB-11173 SS5]
Length = 412
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/258 (51%), Positives = 171/258 (66%), Gaps = 19/258 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI +GTPPQ+F VI DTGSSNLWVPS KC SIAC+ H KY S +SS
Sbjct: 91 VPLSNFMNAQYFSEITLGTPPQSFKVILDTGSSNLWVPSVKCT-SIACFLHQKYDSSQSS 149
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ GF S D + IGDL +K Q+F EAT+EP L F KFDGILG
Sbjct: 150 SYKANGSEFSIQYGSGSMEGFVSRDTLTIGDLTIKGQDFAEATKEPGLAFAFGKFDGILG 209
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P +Y+M+N L+++PVFS F + EE+GGE VFGG+D Y+G+ T
Sbjct: 210 LGYDTISVNHITPPFYSMINAALLDDPVFS--FRLGSSEEDGGEAVFGGIDSSAYEGKIT 267
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +K YW+ ++ GD ++ + TG AAI D+GTSL+A PT + +N
Sbjct: 268 YVPVRRKAYWEVELEKIKFGDDELELENTG------AAI-DTGTSLIALPTDLAEMLNAQ 320
Query: 315 IGAT----GIVSQECKAV 328
IGAT G + EC V
Sbjct: 321 IGATKSWNGQYTVECSKV 338
>gi|432870116|ref|XP_004071815.1| PREDICTED: cathepsin D-like [Oryzias latipes]
Length = 397
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 185/310 (59%), Gaps = 16/310 (5%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLR-GNLGESGDADIVA--LKNY 85
L R+ L K R RL S G S +R +R G+L S ++ L N+
Sbjct: 20 ALIRVPLHK------TRSLRRLMSDNGMSL-DDLRALGMRVGSLDSSASPELPVERLTNF 72
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKN 144
MDAQY+G I IGTPPQNF+V+FDTGSSNLWVPS C F +AC+ H +Y S +SS+Y KN
Sbjct: 73 MDAQYYGLISIGTPPQNFSVLFDTGSSNLWVPSIHCSFLDVACWVHRRYNSKKSSSYVKN 132
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G I YG G++SGF S+D V + L V Q+F EA ++P +TF +A+FDG+LG+ +
Sbjct: 133 GTEFSIRYGRGSLSGFISQDTVSVAGLSVPGQQFGEAVKQPGITFAVARFDGVLGMAYPS 192
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
ISV PV+ + L+ + +FS + +R+ E GGE++ GG+DP ++ G+ YV VT
Sbjct: 193 ISVANVTPVFDTAMAAKLLPQNIFSVYISRDTAAEVGGELILGGIDPQYFSGDLHYVNVT 252
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
+K YWQ M V + Q T C GC +I D+GTSL+ GP I ++ AIGA ++ E
Sbjct: 253 RKAYWQIQMDRVDVGNQLT-LCKAGCQSIVDTGTSLMVGPAEEIRALHKAIGALPLLMGE 311
Query: 325 ----CKAVVS 330
CK + S
Sbjct: 312 YFIDCKKIPS 321
>gi|367031892|ref|XP_003665229.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
gi|347012500|gb|AEO59984.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
Length = 397
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 140/326 (42%), Positives = 192/326 (58%), Gaps = 27/326 (8%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKK-------RKFDLNNRVAARLDSKEGESFRTSIRKYSLR 68
LL V+ + G ++++ L+K +N ++ G R S +
Sbjct: 6 LLTAAVLLGSAQGAVHKMKLQKIPLSEQLEAVPINTQLEHLGQKYMGLRPRESQADAIFK 65
Query: 69 GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
G + + + + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS +C SIACY
Sbjct: 66 GMVADVKGNHPIPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVEC-GSIACY 124
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
HSKY S SSTYKKNG S +I YG+G++SGF S+D V IGD+ ++ Q+F EAT EP L
Sbjct: 125 LHSKYDSSASSTYKKNGTSFEIRYGSGSLSGFVSQDTVSIGDITIQGQDFAEATSEPGLA 184
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F +FDGILGLG+ ISV VP +Y MV Q L++EPVF+F+ AD E+VFGG
Sbjct: 185 FAFGRFDGILGLGYDRISVNGIVPPFYKMVEQKLIDEPVFAFYL---ADTNGQSEVVFGG 241
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAG 303
+D D YKG+ T +P+ +K YW+ D GD + + TG I D+GTSL+A
Sbjct: 242 VDHDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGI-------ILDTGTSLIAL 294
Query: 304 PTTIITQVNHAIGA----TGIVSQEC 325
P+ + +N IGA TG + +C
Sbjct: 295 PSQLAEMLNAQIGAKKSYTGQYTIDC 320
>gi|351702766|gb|EHB05685.1| Napsin-A [Heterocephalus glaber]
Length = 417
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 128/277 (46%), Positives = 171/277 (61%), Gaps = 3/277 (1%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGR 137
+V L +M+ QYFGEIG+GTPPQNF+V+FDTGSSNLWVPS +C +FS+ C+FH +Y
Sbjct: 64 LVPLSKFMNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFFSVPCWFHHRYDPKA 123
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SS+++ NG I YGTG +SG SED + IG + F EA EPSL F A FDGI
Sbjct: 124 SSSFRPNGTKFAIQYGTGRLSGILSEDKLNIGGISNASVTFGEALWEPSLVFAFASFDGI 183
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
GLGF ++V + P +V QGL+ +P+FSF+ NR+ +GGE+V GG DP HY
Sbjct: 184 FGLGFPTLAVDRVPPPLDVLVEQGLLEKPIFSFYLNRDFAGADGGELVLGGADPAHYIPP 243
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
T+VPVT YWQ M V + G CA GCAAI D+GTSL+ GP+ I ++ AIG
Sbjct: 244 LTFVPVTVPAYWQIHMERVKV-GTGLTLCAQGCAAIVDTGTSLITGPSEEIRALHRAIGG 302
Query: 318 TGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
++ E + S+ ++ LL V L+ + Y
Sbjct: 303 LPWLAGEHFILCSKIPTLPPVSFLLGGVWFNLTAQDY 339
>gi|432099182|gb|ELK28547.1| Cathepsin D [Myotis davidii]
Length = 351
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 126/258 (48%), Positives = 169/258 (65%), Gaps = 15/258 (5%)
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNG 145
+AQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SSTY +NG
Sbjct: 34 EAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSSTYVENG 93
Query: 146 KSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLLAKFDG 196
+ DIHYG+G++SG+ S+D V + G + V+ Q F EAT++P +TF+ AKFDG
Sbjct: 94 TTFDIHYGSGSLSGYLSQDTVSVPCNSGLASLGGVKVERQVFGEATKQPGITFIAAKFDG 153
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
ILG+ + ISV VPV+ N++ Q LV + +FSF+ NR+ + GGE++ GG D +YKG
Sbjct: 154 ILGMAYPRISVNNVVPVFDNLMQQKLVEKNIFSFYLNRDPSAQPGGELMLGGTDSKYYKG 213
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ VT+K YWQ M V + G C GC AI D+GTSL+ GP + ++ AIG
Sbjct: 214 PIAYLNVTRKAYWQVHMDQVDV-GNGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIG 272
Query: 317 ATGIVSQE----CKAVVS 330
A ++ E C+ V S
Sbjct: 273 AVPLIQGEYMIPCEKVSS 290
>gi|154284392|ref|XP_001542991.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
gi|150406632|gb|EDN02173.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
Length = 398
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 131/300 (43%), Positives = 185/300 (61%), Gaps = 7/300 (2%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L +I L ++ ++N R ++ + + + G+ ++ + + N+++AQ
Sbjct: 25 LQKIPLSEQFANVNIDAHVRALGQKYMGVKPNQNGQDVFGDPAKASGGHSLPVDNFLNAQ 84
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YF EIGIGTPPQ F V+ DTGSSNLWVPSS+C SIACY H+KY S SST+KKNG
Sbjct: 85 YFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASSTHKKNGSEFS 143
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YG+G+++GF S+D + IGDLVV+ Q F EAT EP L F +FDGILGLG+ ISV K
Sbjct: 144 ITYGSGSLTGFVSQDCLTIGDLVVESQVFAEATSEPGLAFAFGRFDGILGLGYDTISVNK 203
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
VP +Y M+N L++EP+FSF+ + + E+VFGGM+ D + G+ T +P+ +K YW
Sbjct: 204 IVPPFYEMLNNNLLDEPMFSFYLGDANVDSDDSEVVFGGMNEDRFTGKLTKIPLRRKAYW 263
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQEC 325
+ D+ + QT G I D+GTSL+A P+TI +N IGA G + EC
Sbjct: 264 EVDLDSITFGKQTALMSNTGV--ILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVEC 321
>gi|393246119|gb|EJD53628.1| aspartic peptidase A1 [Auricularia delicata TFB-10046 SS5]
Length = 415
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 130/249 (52%), Positives = 167/249 (67%), Gaps = 15/249 (6%)
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
+GD V L N+ +AQYF EI +G+P QNF V+ DTGSSNLWVPSS C SIAC+ H+KY
Sbjct: 88 AGDGHKVPLSNFANAQYFAEISLGSPAQNFKVVLDTGSSNLWVPSSGCT-SIACFLHAKY 146
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
S SSTYKKNG S +IHYG+G++ GF S+D +KIGD+ + Q+F EA +EP L F K
Sbjct: 147 DSSASSTYKKNGSSFEIHYGSGSMEGFISQDTLKIGDISIPGQDFAEAMKEPGLAFAFGK 206
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGL + I+V P +YNMVN+ L+++PVFSF A E +GG VFGG+D H
Sbjct: 207 FDGILGLAYDTIAVNHITPPFYNMVNKKLLDQPVFSFRL--GASESDGGSAVFGGVDSSH 264
Query: 254 YKGEHTYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
YKG+ TYVPV +K YW+ + +GD +D + TG AAI D+GTSL+ P I
Sbjct: 265 YKGQITYVPVRRKAYWEVELEGIKLGDDEVDFENTG------AAI-DTGTSLIVLPVDIG 317
Query: 309 TQVNHAIGA 317
+N IGA
Sbjct: 318 EMINAQIGA 326
>gi|342882947|gb|EGU83511.1| hypothetical protein FOXB_05921 [Fusarium oxysporum Fo5176]
Length = 396
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 129/260 (49%), Positives = 169/260 (65%), Gaps = 20/260 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS +C SIACY HSKY S SS
Sbjct: 76 VPVSNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACYLHSKYDSSASS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG +IHYG+G++SGF S D V IGDL +KDQ+F EAT+EP L F +FDGILG
Sbjct: 135 TYKENGTEFEIHYGSGSLSGFVSNDVVSIGDLEIKDQDFAEATKEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ I+V VP +Y MVNQ L++EPVF+F+ + D+E E FGG+D + G+
Sbjct: 195 LGYDRIAVNGMVPPFYQMVNQKLLDEPVFAFYLD---DQEGESEATFGGIDKSKFTGDIE 251
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ + +N
Sbjct: 252 YIPLRRKAYWEVDLEAIAFGDEVAEQENTG-------AILDTGTSLNVLPSALAELLNKE 304
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + EC S
Sbjct: 305 IGAKKGYNGQYTIECDKRAS 324
>gi|301618285|ref|XP_002938556.1| PREDICTED: cathepsin E-A-like [Xenopus (Silurana) tropicalis]
Length = 402
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 116/237 (48%), Positives = 165/237 (69%), Gaps = 2/237 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+GEI +GTPPQNF+V+FDTGSSN WVPSS C S AC H +++S S++Y
Sbjct: 73 LVDYMNAQYYGEISVGTPPQNFSVVFDTGSSNFWVPSSYC-LSEACQVHERFKSFESTSY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G+ IHYGTG + G D ++I ++ ++ Q+F E+ EP TF+LA+FDG+LGLG
Sbjct: 132 EHGGRPFSIHYGTGQLVGVTGRDTLRISNMSIEGQDFGESILEPGRTFVLAQFDGVLGLG 191
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++V AVPV+ +VNQ LV + +FSF NR+ D E GGE++FGG+D YKG+ ++
Sbjct: 192 YPSLAVAGAVPVFDRIVNQKLVEQQLFSFHLNRDYDSEYGGELIFGGIDHSLYKGQIHWI 251
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
P+T+KGYWQ + +V +DG+ FC C I DSGTSL+ GP I ++ +GAT
Sbjct: 252 PLTEKGYWQIRLDNVKVDGEAM-FCQSSCQVIVDSGTSLITGPKAEIKKLQELLGAT 307
>gi|452840489|gb|EME42427.1| hypothetical protein DOTSEDRAFT_73302 [Dothistroma septosporum
NZE10]
Length = 398
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 131/269 (48%), Positives = 172/269 (63%), Gaps = 13/269 (4%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFD 108
R S+ GE F+ + S+ G VA+ N+++AQYF EI IGTPPQ F V+ D
Sbjct: 54 RPQSRLGEMFKET----SVHAEKGHP-----VAVDNFLNAQYFSEIAIGTPPQEFKVVLD 104
Query: 109 TGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI 168
TGSSNLWVPS C SIACY HSKY SSTYKKNG I YG+G++ G+ S+D V+I
Sbjct: 105 TGSSNLWVPSQDC-GSIACYLHSKYDHSESSTYKKNGSDFAIRYGSGSLEGYVSKDTVQI 163
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVF 228
GDL +KDQ F EAT EP L F +FDGILGLG+ ISV P +YNM++Q L++E VF
Sbjct: 164 GDLKIKDQLFAEATSEPGLAFAFGRFDGILGLGYDTISVNGIPPPFYNMIDQDLLDEKVF 223
Query: 229 SFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG 288
+F+ + +E E +FGG++ DHY GE T +P+ +K YW+ D+ + QT +
Sbjct: 224 AFYLSDTNKGDE-SEAIFGGVNKDHYTGEMTKIPLRRKAYWEVDLDAITFGDQTAEIDST 282
Query: 289 GCAAIADSGTSLLAGPTTIITQVNHAIGA 317
G AI D+GTSLLA P+T+ +N IGA
Sbjct: 283 G--AILDTGTSLLALPSTLAELLNKEIGA 309
>gi|195150257|ref|XP_002016071.1| GL10692 [Drosophila persimilis]
gi|194109918|gb|EDW31961.1| GL10692 [Drosophila persimilis]
Length = 399
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 134/324 (41%), Positives = 185/324 (57%), Gaps = 14/324 (4%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRV--AARLDSKEGESFRTSIRKYSLRG 69
FL L+L P+V +P LYR+ L++ N V R+D FR + R
Sbjct: 4 LFLSLILPPLVAPSPLTELYRVPLRRFPSARNRFVQFGIRMDR-----FRLKYSRVDGRS 58
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACY 128
+ L NY+DAQYFG I IG+PPQ F VIFDTGSSNLWVPS+ C + +AC
Sbjct: 59 RPRGGWEVRSEPLSNYLDAQYFGPITIGSPPQTFKVIFDTGSSNLWVPSTSCAPTMVACM 118
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
HS+Y + +SS++++NG IHYG+G+++G+ S D V++ L +++Q F E T P
Sbjct: 119 VHSRYNARQSSSHRRNGVRFAIHYGSGSLAGYLSSDTVRVAGLEIQNQTFAEVTTMPGPI 178
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
FL AKFDGI GL +Q IS+ P +Y ++ Q L++ PVFS + NR + EGG + FGG
Sbjct: 179 FLAAKFDGIFGLAYQSISMQGVKPPFYAIMEQKLLSNPVFSVYLNREQEHPEGGALFFGG 238
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+P +Y+G TYVPV+++ YWQ M I+ C GC I D+GTS LA P
Sbjct: 239 SNPRYYRGNFTYVPVSRRAYWQVRMEAATIN--DLRLCQHGCEVIIDTGTSFLALPYDQA 296
Query: 309 TQVNHAIGAT----GIVSQECKAV 328
+N +IG T G S C V
Sbjct: 297 ILINESIGGTPSEYGQYSVPCDQV 320
>gi|118102416|ref|XP_001235024.1| PREDICTED: cathepsin E [Gallus gallus]
Length = 397
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 134/298 (44%), Positives = 184/298 (61%), Gaps = 9/298 (3%)
Query: 25 TPNGGLYRIGL-KKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL-GESGDADIVAL 82
TP GL R+ L + R + R +L S+ ++ R + +YS +L GE+ + L
Sbjct: 17 TPCNGLKRVTLTRHRSLRKSLRDRGQL-SQFWKAHRLDMVQYSQDCSLFGEANEP----L 71
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYK 142
NY+D +YFG+I IGTPPQNFTV+FDTGSSNLWVPS C S AC H++++ SSTY+
Sbjct: 72 INYLDMEYFGQISIGTPPQNFTVVFDTGSSNLWVPSIYCT-SKACTKHARFQPSHSSTYQ 130
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
G I YGTG+++G D V + + V +Q F E+ EP TF ++FDGILGL +
Sbjct: 131 PLGIPVSIQYGTGSLTGIIGSDQVTVEGMTVYNQPFAESVSEPGKTFQDSEFDGILGLAY 190
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
++V PV+ NM+ Q LV P+FS + + N D GGE++FGG DP + G +VP
Sbjct: 191 PSLAVDGVTPVFDNMMAQDLVEMPIFSVYMSANPDSSLGGEVLFGGFDPSRFLGTLHWVP 250
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VTQ+GYWQ + +V + G T FCA GC AI D+GTSLL GPT I ++ IGAT +
Sbjct: 251 VTQQGYWQIQLDNVQVGG-TVAFCADGCQAIVDTGTSLLTGPTKDIKEMQRYIGATAM 307
>gi|449481456|ref|XP_002189698.2| PREDICTED: cathepsin E-A-like [Taeniopygia guttata]
Length = 405
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 125/277 (45%), Positives = 181/277 (65%), Gaps = 16/277 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+G + +GTPPQ+FTV+FDTGSSN WVPS+ C S AC H K++S +S +Y
Sbjct: 73 LYDYMNAQYYGVVSVGTPPQSFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFKSDSY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G++ + YG+G + G +D ++I ++ +K Q+F E+ EP TF+LA FDG+LGLG
Sbjct: 132 EHGGEAFSLQYGSGQLLGIAGKDTLQISNISIKGQDFGESVFEPGATFVLAHFDGVLGLG 191
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG A+PV+ +++NQ LV EPVFSF+ R D E GGE++ GG+D YKG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQHLVEEPVFSFYLKRGEDTENGGELILGGIDHSLYKGSIHWV 251
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA---- 317
PVT+K YWQ M ++ I G+ T FC+ GC AI DSGTSL+ GP++ I ++ IGA
Sbjct: 252 PVTEKSYWQIHMNNIKIQGRVT-FCSHGCEAIVDSGTSLITGPSSQIRRLQAYIGASPSN 310
Query: 318 TGIVSQECKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
TG +C+ + S L +S + R YK
Sbjct: 311 TGEFLVDCRRLSS----------LPHISFTIGHREYK 337
>gi|318977821|ref|NP_001187407.1| cathepsin D precursor [Ictalurus punctatus]
gi|308322929|gb|ADO28602.1| cathepsin D [Ictalurus punctatus]
Length = 398
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 119/244 (48%), Positives = 164/244 (67%), Gaps = 2/244 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L N+MDAQY+G I IGTPPQ FTV+FDTGSSNLWVPS C +F +AC+ H +Y S +SST
Sbjct: 70 LSNFMDAQYYGVISIGTPPQEFTVLFDTGSSNLWVPSIHCAFFDLACWLHHRYDSKKSST 129
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +NG I YG G++SGFFS+D V + L V++Q F EA ++P + F LAKFDG+LG+
Sbjct: 130 YVQNGTQFSIQYGRGSLSGFFSQDTVTLAGLGVQNQMFAEAVKQPGVVFALAKFDGVLGM 189
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ +SVGK P++ +++ L+ + +FSF+ NR+ E GGE++ GG D ++ G+ Y
Sbjct: 190 AYPILSVGKVRPIFDSIMAGKLLQQNIFSFYINRDPKAEVGGELMLGGCDKQYFDGDLHY 249
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ VT+K YWQ M V + G T C GC AI DSGTS++ GP I +N AIGA +
Sbjct: 250 LNVTRKAYWQIKMDTVEV-GSTLTLCKDGCQAIVDSGTSMITGPVEEIRALNKAIGAVPL 308
Query: 321 VSQE 324
+ E
Sbjct: 309 IMGE 312
>gi|355703800|gb|EHH30291.1| hypothetical protein EGK_10923 [Macaca mulatta]
Length = 423
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 179/289 (61%), Gaps = 6/289 (2%)
Query: 70 NLGESGDAD---IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSI 125
LG D +V L ++DAQYFGEIG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+
Sbjct: 56 RLGAPSPGDKPALVPLSKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSV 115
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
C+FH ++ SS+++ NG I YGTG + G SED + IG + F EA E
Sbjct: 116 PCWFHHRFNPNASSSFQPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWES 175
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
SL F +++ DGILGLGF ++V P +V QGL+++PVFSF+ NR+++ +GGE+V
Sbjct: 176 SLVFTISRPDGILGLGFPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELV 235
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
GG DP HY T+VPVT YWQ M VM+ G CA GCAAI D+GT ++ GPT
Sbjct: 236 LGGSDPAHYIPPLTFVPVTVPAYWQIHMERVMV-GSGLTLCARGCAAILDTGTPVIIGPT 294
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
I ++ AIG +++ E S+ + +++L+ V L+ + Y
Sbjct: 295 EEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLTAQDY 343
>gi|345568347|gb|EGX51242.1| hypothetical protein AOL_s00054g478 [Arthrobotrys oligospora ATCC
24927]
Length = 392
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 189/317 (59%), Gaps = 30/317 (9%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIR----KYSLRGNLGESGDADI--- 79
+ G++++ LKK + D+ G++F+T ++ KY R ++ D+
Sbjct: 17 SAGVHKMSLKKIPVE---------DTMLGQNFQTQVQALAQKYINRAGNQQAFTNDVNAD 67
Query: 80 ----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
V + N+++AQY+ EI +GTPPQ F V+ DTGSSNLWVPS C SIAC+ H+KY S
Sbjct: 68 GGHSVPVNNFLNAQYYSEITLGTPPQTFKVVLDTGSSNLWVPSKSCS-SIACFLHTKYDS 126
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTYK NG I YG+G++ GF S+D + IGDL +K+Q F EAT+EP L F KFD
Sbjct: 127 SESSTYKANGTEFSIQYGSGSMEGFISQDTLTIGDLTIKNQLFAEATKEPGLAFAFGKFD 186
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILGLG+ ISV K P +Y M++Q LV+EPVF+F+ R DE E VFGG+D HY
Sbjct: 187 GILGLGYDTISVNKIPPPFYQMISQKLVDEPVFAFYLGREEDESEA---VFGGIDKSHYT 243
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+ T+V V +K YW+ + QT + G A+ D+GTSL+ P+ +N AI
Sbjct: 244 GDITWVDVRRKAYWEVPFDSISFGDQTAELDSWG--AVLDTGTSLITLPSDYAEMLNSAI 301
Query: 316 GAT----GIVSQECKAV 328
GAT G S C+ V
Sbjct: 302 GATKGWNGQYSVPCEKV 318
>gi|195997417|ref|XP_002108577.1| hypothetical protein TRIADDRAFT_19349 [Trichoplax adhaerens]
gi|190589353|gb|EDV29375.1| hypothetical protein TRIADDRAFT_19349, partial [Trichoplax
adhaerens]
Length = 370
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 128/264 (48%), Positives = 174/264 (65%), Gaps = 8/264 (3%)
Query: 62 IRKYSL---RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPS 118
I KY+L R +L ++G D + L NY+DA+YFG I IGTPPQ+F V+FDTGSS+ WVPS
Sbjct: 25 ISKYNLETGRDDL-KTGRGDEI-LNNYLDAEYFGPITIGTPPQDFLVLFDTGSSDFWVPS 82
Query: 119 SKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF 178
S+C S AC H +Y +SSTY+ NGK I YG+G+ GF S D VK+ + V++ F
Sbjct: 83 SECT-SQACEMHHRYDHSKSSTYRPNGKRWSIEYGSGSAEGFLSTDVVKVAGITVQNVTF 141
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE 238
E T P F AKFDGILGLGF +SV ++ M+ QGL+ +PVFS + NR +
Sbjct: 142 GEVTNLPGPIFAAAKFDGILGLGFASLSVEGVKTIFDLMLQQGLIQKPVFSVYLNRQGTQ 201
Query: 239 EEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGT 298
GGE+VFGG DP++Y G +YVP++++GYWQF++ I+ + FC GGC A+ D+GT
Sbjct: 202 NVGGELVFGGSDPNYYTGAFSYVPLSKEGYWQFELDGGTIENEF--FCEGGCQAVIDTGT 259
Query: 299 SLLAGPTTIITQVNHAIGATGIVS 322
SL+ GP + ++NH IGA I S
Sbjct: 260 SLIVGPNEEVAKINHLIGADSIQS 283
>gi|402857430|ref|XP_003893258.1| PREDICTED: cathepsin E [Papio anubis]
Length = 396
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 128/295 (43%), Positives = 189/295 (64%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L+ R L ++ AR E +S + +++ ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLR-RHPSLKKKLRARSQLSELWKSQNLDMIQFTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G ++VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLSWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+GYWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|353234557|emb|CCA66581.1| probable PEP4-aspartyl protease [Piriformospora indica DSM 11827]
Length = 411
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 129/244 (52%), Positives = 164/244 (67%), Gaps = 15/244 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQY+ +I IGTPPQ F V+ DTGSSNLWVPS+ C SIAC+ H+KY S SS
Sbjct: 90 VPLNNYMNAQYYADITIGTPPQTFKVVLDTGSSNLWVPSTSCT-SIACFLHTKYDSSASS 148
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF S+D + +GDL +K Q+F EAT+EP L F KFDGILG
Sbjct: 149 TYKANGTEFAIRYGSGSLEGFVSQDTMTLGDLTIKKQDFAEATKEPGLAFAFGKFDGILG 208
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV P +YN ++QGL+ E VF+ F A E +GGE VFGG+D HY G+ T
Sbjct: 209 LAYDTISVNHITPPFYNAIDQGLLKEKVFT--FRVGASEADGGEAVFGGIDSSHYTGKIT 266
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +KGYW+ ++ GD ++ + TG AAI D+GTSL+ PTTI +N
Sbjct: 267 YVPVRRKGYWEVELESVAFGDDELELENTG------AAI-DTGTSLIVMPTTIAEMLNSE 319
Query: 315 IGAT 318
IGAT
Sbjct: 320 IGAT 323
>gi|212526768|ref|XP_002143541.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
gi|210072939|gb|EEA27026.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
Length = 395
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 126/239 (52%), Positives = 159/239 (66%), Gaps = 3/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQNF V+ DTGSSNLWVPSS C SIACY HSKY S SS
Sbjct: 72 VLVDNFLNAQYFSEITIGTPPQNFKVVLDTGSSNLWVPSSSCN-SIACYLHSKYDSSSSS 130
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++ GF S D V IGD+ +KDQ+F EAT EP L F +FDGILG
Sbjct: 131 TYKKNGSDFAIQYGSGSLEGFVSRDTVTIGDITIKDQDFAEATNEPGLAFAFGRFDGILG 190
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF ISV K VP +YNM+NQ ++EPVF+F+ + E + E FGG+D HY GE
Sbjct: 191 LGFDTISVNKIVPPFYNMLNQKSLDEPVFAFYLGDSNKEGDASEATFGGIDKSHYTGELV 250
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+ +K YW+ D D + G+ I D+GTSL+A P+T+ +N IGA+
Sbjct: 251 KIPLRRKAYWEVDF-DAIAFGENVAELE-NTGVILDTGTSLIALPSTLAELLNKEIGAS 307
>gi|398396710|ref|XP_003851813.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
gi|339471693|gb|EGP86789.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
Length = 398
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 124/252 (49%), Positives = 169/252 (67%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPS C SIACY HSKY G S+
Sbjct: 76 VGVSNFLNAQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSKDC-GSIACYLHSKYNHGDSN 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG I YG+G++ G+ S+D V+IGDL +K+Q F EAT EP L F +FDGI+G
Sbjct: 135 TYKQNGSDFAIQYGSGSLEGYISQDTVQIGDLKIKNQLFAEATSEPGLAFAFGRFDGIMG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P +YNM++QGL++E VF+F+ + + D+ + E +FGG++ DHY G+ T
Sbjct: 195 LGYDTISVNGIPPPFYNMIDQGLLDEKVFAFYLS-STDKGDESEAIFGGVNKDHYTGKMT 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D + + QT + G AI D+GTSL+A P+T+ +N IGA
Sbjct: 254 NIPLRRKAYWEVDFDAITLGDQTAELDSTG--AILDTGTSLIALPSTMAELLNKEIGAKK 311
Query: 318 --TGIVSQECKA 327
G S EC A
Sbjct: 312 GYNGQYSVECSA 323
>gi|403414885|emb|CCM01585.1| predicted protein [Fibroporia radiculosa]
Length = 414
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 181/296 (61%), Gaps = 26/296 (8%)
Query: 44 NRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNF 103
N V AR ++GE + + + G V L N+M+AQYF EI +GTP Q+F
Sbjct: 63 NMVLARPVHEDGEDLLWTQEEILVNGGHN-------VPLSNFMNAQYFAEIQLGTPAQSF 115
Query: 104 TVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSE 163
VI DTGSSNLWVPSSKC SIAC+ H+KY S S+TYK NG I YG+G++ GF S+
Sbjct: 116 KVILDTGSSNLWVPSSKCT-SIACFLHAKYDSSSSTTYKANGSEFSIQYGSGSMEGFVSQ 174
Query: 164 DHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLV 223
D +KIGDL +K Q+F EAT+EP L F KFDGILGLG+ ISV P +Y MV Q L+
Sbjct: 175 DLLKIGDLSIKHQDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHMTPPFYEMVAQKLI 234
Query: 224 NEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFD-----MGDVMI 278
+EPVF+F + EE+GGE VFGG+D Y G YVPV +K YW+ + +GD +
Sbjct: 235 DEPVFAFRL--GSSEEDGGEAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVALGDDEL 292
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
D + TG AAI D+GTSL+A PT I +N IGA G + +C V S
Sbjct: 293 DLEHTG------AAI-DTGTSLIALPTDIAEMINTQIGAQKQWNGQYTVDCSKVPS 341
>gi|392568782|gb|EIW61956.1| aspartic peptidase A1 [Trametes versicolor FP-101664 SS1]
Length = 415
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/280 (49%), Positives = 179/280 (63%), Gaps = 23/280 (8%)
Query: 43 NNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQN 102
N RV+ R K+GE + ++S G V L N+M+AQYF EI +GTPPQ+
Sbjct: 64 NVRVS-RPTVKDGEELFWTQDEFSTEG-------GHTVPLSNFMNAQYFAEITLGTPPQS 115
Query: 103 FTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFS 162
F VI DTGSSNLWVPS+KC SIAC+ H+KY S SSTYK NG I YG+G++ GF S
Sbjct: 116 FKVILDTGSSNLWVPSTKCT-SIACFLHAKYDSSASSTYKANGSEFSIQYGSGSMEGFVS 174
Query: 163 EDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGL 222
D + IGDL VK+ +F EAT+EP L F KFDGILGLG+ ISV VP +Y +VNQGL
Sbjct: 175 RDVLTIGDLTVKNLDFAEATKEPGLAFAFGKFDGILGLGYDTISVNHIVPPFYALVNQGL 234
Query: 223 VNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFD-----MGDVM 277
++ PVFSF + EE+GGE +FGG+D Y G+ YVPV +K YW+ + +GD
Sbjct: 235 LDSPVFSFRLGDS--EEDGGEAIFGGIDDSAYSGKIEYVPVRRKAYWEVELEKIRLGDEE 292
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++ + TG AAI D+GTSL+A P+ + +N IGA
Sbjct: 293 LELENTG------AAI-DTGTSLIALPSDLAEMLNAQIGA 325
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/258 (51%), Positives = 172/258 (66%), Gaps = 19/258 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF I +GTPPQ F VI DTGSSNLWVPS+KC SIAC+ H+KY S SS
Sbjct: 91 VPLTNFMNAQYFTTITLGTPPQEFKVILDTGSSNLWVPSTKCT-SIACFLHAKYDSSASS 149
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T+KKNG S I YG+G++ GF S D + IGDL + DQ+F EAT+EP L F KFDGILG
Sbjct: 150 THKKNGTSFKIEYGSGSMEGFVSNDVLSIGDLKIHDQDFAEATKEPGLAFAFGKFDGILG 209
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P +Y+MVN+GL++ PVFS F + EE+GGE VFGG+D Y G+
Sbjct: 210 LGYDTISVNHITPPFYSMVNKGLLDAPVFS--FRLGSSEEDGGEAVFGGIDESAYSGKIN 267
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y PV +K YW+ ++ GD +++ + TG AAI D+GTSL+A P+ + +N
Sbjct: 268 YAPVRRKAYWEVELPKVAFGDDVLELENTG------AAI-DTGTSLIALPSDVAEMLNAQ 320
Query: 315 IGAT----GIVSQECKAV 328
IGAT G + +CK V
Sbjct: 321 IGATKSWNGQYTVDCKKV 338
>gi|109018632|ref|XP_001090284.1| PREDICTED: cathepsin E isoform 4 [Macaca mulatta]
Length = 396
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 188/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSQNLDMIQFTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+GYWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|336373584|gb|EGO01922.1| hypothetical protein SERLA73DRAFT_177556 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386403|gb|EGO27549.1| hypothetical protein SERLADRAFT_461213 [Serpula lacrymans var.
lacrymans S7.9]
Length = 413
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/255 (50%), Positives = 166/255 (65%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQY+ EI +G+P Q F VI DTGSSNLWVPSSKC SIAC+ H+KY S SS
Sbjct: 92 VPLTNFMNAQYYTEITLGSPAQTFKVILDTGSSNLWVPSSKCT-SIACFLHTKYDSSSSS 150
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF S++ +KIGDL ++ Q+F EAT+EP L F KFDGILG
Sbjct: 151 TYKANGTEFSIQYGSGSMEGFVSQESMKIGDLSIQHQDFAEATKEPGLAFAFGKFDGILG 210
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P +YNM++QGL++EP+FS F + E++GGE VFGG+D Y G T
Sbjct: 211 LGYDTISVNHITPPFYNMIDQGLLDEPLFS--FRLGSSEDDGGEAVFGGIDSSAYTGSIT 268
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
YVPV +K YW+ ++ V G G A D+GTSL+A PT + +N IGAT
Sbjct: 269 YVPVRRKAYWEVELEKVSFGGDELDLENTGAA--IDTGTSLIALPTDVAEMLNTQIGATR 326
Query: 319 ---GIVSQECKAVVS 330
G +C V S
Sbjct: 327 SWNGQYQVDCAKVPS 341
>gi|60816208|gb|AAX36374.1| cathepsin E [synthetic construct]
Length = 396
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|4503145|ref|NP_001901.1| cathepsin E isoform a preproprotein [Homo sapiens]
gi|114572172|ref|XP_001163151.1| PREDICTED: cathepsin E isoform 2 [Pan troglodytes]
gi|181194|gb|AAA52130.1| cathepsin E precursor [Homo sapiens]
gi|181205|gb|AAA52300.1| cathepsin E [Homo sapiens]
gi|7339520|emb|CAB82850.1| procathepsin E [Homo sapiens]
gi|27502799|gb|AAH42537.1| Cathepsin E [Homo sapiens]
gi|61358295|gb|AAX41543.1| cathepsin E [synthetic construct]
gi|119611998|gb|EAW91592.1| cathepsin E, isoform CRA_a [Homo sapiens]
gi|158257546|dbj|BAF84746.1| unnamed protein product [Homo sapiens]
gi|325463731|gb|ADZ15636.1| cathepsin E [synthetic construct]
Length = 396
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|321250483|ref|XP_003191823.1| endopeptidase [Cryptococcus gattii WM276]
gi|317458290|gb|ADV20036.1| Endopeptidase, putative [Cryptococcus gattii WM276]
Length = 432
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 133/264 (50%), Positives = 170/264 (64%), Gaps = 19/264 (7%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D V L NYM+AQYF +I +GTP Q F VI DTGSSNLWVPS C SIAC+ HSKY S
Sbjct: 107 DGHGVPLSNYMNAQYFAQIELGTPAQTFKVILDTGSSNLWVPSVGCT-SIACFLHSKYDS 165
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
+SSTYK NG +IHYG+G++ GF S+D + IGDL +K Q+F EAT+EP L F KFD
Sbjct: 166 SQSSTYKANGSDFEIHYGSGSLEGFISQDTLAIGDLAIKGQDFAEATKEPGLAFAFGKFD 225
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILGL + ISV VP +YNM+NQ L+++PVFSF + E +GGE +FGG+D Y
Sbjct: 226 GILGLAYDTISVNHIVPPFYNMLNQDLLDDPVFSFRLGSS--ENDGGEAIFGGIDKSAYS 283
Query: 256 GEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
G YVPV +KGYW+ ++ GD ++ + TG AAI D+GTSL+ PT +
Sbjct: 284 GSLHYVPVRRKGYWEVELESISFGDDELELENTG------AAI-DTGTSLIVMPTDVAEM 336
Query: 311 VNHAIGA----TGIVSQECKAVVS 330
+N IGA G + +C V S
Sbjct: 337 LNKEIGAEKSWNGQYTVDCNTVPS 360
>gi|296230582|ref|XP_002760770.1| PREDICTED: cathepsin E isoform 1 [Callithrix jacchus]
Length = 396
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 128/295 (43%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L+K L R+ AR E +S + + + ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRKHP-SLKKRLRARSQLSEFLKSQNLDMIQSTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +S+TY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNTYNQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + D+ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QAYWQIALDDIQVGG-TAMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|397504824|ref|XP_003822980.1| PREDICTED: cathepsin E [Pan paniscus]
Length = 396
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMLAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|112950081|gb|ABI26643.1| aspartic proteinase [Cucumis sativus]
Length = 399
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 182/294 (61%), Gaps = 21/294 (7%)
Query: 30 LYRIGLKKR---KFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL-GESGDADIVALKNY 85
++RI L+++ K NN AA++ + KY+++ NL G SG + + +
Sbjct: 18 MHRIPLQRQENFKLTKNNIQAAKVHLR---------NKYNVKSNLLGRSGTTEQLT-QGQ 67
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKN 144
+ ++Y+G IGIGTP Q FTV+FD+GSSNLWVPS+KC S AC H S SSTY N
Sbjct: 68 LTSEYYGTIGIGTPAQEFTVVFDSGSSNLWVPSAKCSSSDQACKNH---NSAASSTYVPN 124
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G+ I YGTG+++GF S D V + L ++ Q F EAT EP TF+ + FDGILGL ++
Sbjct: 125 GEQFSIQYGTGSLTGFLSTDTVTVNGLTIQSQTFAEATNEPGSTFVDSTFDGILGLAYET 184
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
IS VP +YNMV+Q LV+ PVFS +F R+ GE++FGG D Y+G YVPVT
Sbjct: 185 ISQDNVVPPFYNMVSQSLVSNPVFSVYFGRSKAANNNGEVIFGGSDSTVYQGPINYVPVT 244
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
Q+GYWQF M V ++GQ AIAD+GTSLLA PT+ +N AIGAT
Sbjct: 245 QQGYWQFTMDGVYVNGQQ---VISSAQAIADTGTSLLAAPTSAFYTLNEAIGAT 295
>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
Length = 412
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 191/312 (61%), Gaps = 27/312 (8%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
GG ++ L N R++ R ++GE + ++S+ G V L N+M+
Sbjct: 47 GGRSQVPLMGAGTGRNVRLS-RPAVQDGEELFWTQEEFSVEGGHN-------VPLSNFMN 98
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
AQYF EI +GTPPQ F VI DTGSSNLWVPS KC SIAC+ H+KY S SSTYK NG
Sbjct: 99 AQYFAEISLGTPPQTFKVILDTGSSNLWVPSVKCT-SIACFLHTKYDSSSSSTYKANGTE 157
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+G++ GF S+D +IGDL V +F EAT+EP L F KFDGILGL + I+V
Sbjct: 158 FSIQYGSGSMEGFVSQDTFRIGDLTVDGLDFAEATKEPGLAFAFGKFDGILGLAYDTIAV 217
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
P +Y+++N+GLV+EPVFS F + E++GGE +FGG+D Y G+ YVPV +K
Sbjct: 218 NHITPPFYHLINKGLVDEPVFS--FRLGSSEDDGGEAIFGGVDDSAYTGKIQYVPVRRKA 275
Query: 268 YWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT---- 318
YW+ + +GD +++ ++TG AAI D+GTSL+A PT I +N IGAT
Sbjct: 276 YWEVELEKVSLGDDVLELESTG------AAI-DTGTSLIALPTDIAEMINTQIGATKSWN 328
Query: 319 GIVSQECKAVVS 330
G + +C V S
Sbjct: 329 GQYTVDCAKVPS 340
>gi|355756059|gb|EHH59806.1| hypothetical protein EGM_10003 [Macaca fascicularis]
Length = 423
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 6/289 (2%)
Query: 70 NLGESGDAD---IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSI 125
LG D +V L ++DAQYFGEIG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+
Sbjct: 56 RLGAPSPGDKPALVPLSKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSV 115
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
C+FH ++ SS+++ NG I YGTG + G SED + IG + F EA E
Sbjct: 116 PCWFHHRFNPNASSSFQPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWES 175
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
SL F +++ DGILGLGF +SV P +V QGL+++PVFSF+ NR+++ +GGE+V
Sbjct: 176 SLVFTISRPDGILGLGFPILSVEGVRPPMDVLVEQGLLDKPVFSFYLNRDSEVADGGELV 235
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
GG DP HY T+VPVT YWQ M V + G CA GCAAI D+GT ++ GPT
Sbjct: 236 LGGSDPAHYIPPLTFVPVTVPAYWQIHMERVTV-GSGLTLCARGCAAILDTGTPVIIGPT 294
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
I ++ AIG +++ E S+ + +++L+ V L+ + Y
Sbjct: 295 EEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLTAQDY 343
>gi|262073106|ref|NP_001159993.1| cathepsin D precursor [Bos taurus]
gi|296471411|tpg|DAA13526.1| TPA: cathepsin D [Bos taurus]
Length = 410
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 179/286 (62%), Gaps = 27/286 (9%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMD Y+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ H KY S +SST
Sbjct: 73 LKNYMD--YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWTHRKYNSDKSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKI---------GDLVVKDQEFIEATREPSLTFLL 191
Y KNG + DIHYG+G++SG+ S+D V + G + V+ Q F EA ++P + F+
Sbjct: 131 YVKNGTTFDIHYGSGSLSGYLSQDTVSVPCNPSSSSPGGVTVQRQTFGEAIKQPGVVFIA 190
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
AKFDGILG+ + ISV +PV+ N++ Q LV++ VFSF+ NR+ + GGE++ GG D
Sbjct: 191 AKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDKNVFSFFLNRDPKAQPGGELMLGGTDS 250
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y+G + VT++ YWQ M D + G + C GGC AI D+GTSL+ GP + ++
Sbjct: 251 KYYRGSLMFHNVTRQAYWQIHM-DQLDVGSSLTVCKGGCEAIVDTGTSLIVGPVEEVREL 309
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
AIGA ++ E C+ V S L +V+ KL + Y
Sbjct: 310 QKAIGAVPLIQGEYMIPCEKVSS----------LPQVTVKLGGKDY 345
>gi|336273300|ref|XP_003351405.1| hypothetical protein SMAC_03712 [Sordaria macrospora k-hell]
Length = 381
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 187/309 (60%), Gaps = 12/309 (3%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKF-DLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
LL ++ + G++ + LKK + V + + T +R S + +
Sbjct: 4 ALLTAAMLLGSAQAGVHTMKLKKVPLAEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 74 SGDADI-----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
+ DA + V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H+KY S SSTYKKNG S +I YG+G++SGF S+D + IGD+ + DQ F EAT EP L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F +FDGILGLG+ I+V P +Y MV Q LV+EPVFSF+ AD++ E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL---ADQDGESEVVFGG 239
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
++ D Y G+ T +P+ +K YW+ D D + G+ G I D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDF-DAIGYGEDIADLE-GHGVILDTGTSLIALPSQLA 297
Query: 309 TQVNHAIGA 317
+N IGA
Sbjct: 298 EMLNAQIGA 306
>gi|397485042|ref|XP_003813672.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 130/289 (44%), Positives = 177/289 (61%), Gaps = 6/289 (2%)
Query: 70 NLGESGDAD---IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSI 125
LG D +V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+
Sbjct: 55 KLGAPSPGDKPALVPLSKFLDAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSV 114
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
C+FH ++ SS++K NG I YGTG + G SED + IG + F EA E
Sbjct: 115 PCWFHHRFNPNASSSFKPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWES 174
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
SL F +++ DGILGLGF +SV P +V QGL+++PVFSF+ NR+ + +GGE+V
Sbjct: 175 SLVFTVSRPDGILGLGFPILSVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELV 234
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
GG DP HY T+VPVT YWQ M V + + T CA GCAAI D+GT ++ GPT
Sbjct: 235 LGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPT 293
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
I ++ AIG +++ E S+ + +++L+ V L+ + Y
Sbjct: 294 EEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDY 342
>gi|380092926|emb|CCC09679.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 410
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 187/309 (60%), Gaps = 12/309 (3%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKF-DLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
LL ++ + G++ + LKK + V + + T +R S + +
Sbjct: 4 ALLTAAMLLGSAQAGVHTMKLKKVPLAEQLESVPIDMQVQHLGQKYTGLRPESHTQAMFK 63
Query: 74 SGDADI-----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
+ DA + V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPSS+C SIACY
Sbjct: 64 ATDAQVTGNHPVPISNFMNAQYFSEITLGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H+KY S SSTYKKNG S +I YG+G++SGF S+D + IGD+ + DQ F EAT EP L
Sbjct: 123 LHNKYESSESSTYKKNGTSFEIQYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F +FDGILGLG+ I+V P +Y MV Q LV+EPVFSF+ AD++ E+VFGG
Sbjct: 183 FAFGRFDGILGLGYSRIAVNGITPPFYKMVEQKLVDEPVFSFYL---ADQDGESEVVFGG 239
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
++ D Y G+ T +P+ +K YW+ D D + G+ G I D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDF-DAIGYGEDIADLEGH-GVILDTGTSLIALPSQLA 297
Query: 309 TQVNHAIGA 317
+N IGA
Sbjct: 298 EMLNAQIGA 306
>gi|348511299|ref|XP_003443182.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 397
Score = 247 bits (630), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 172/285 (60%), Gaps = 4/285 (1%)
Query: 43 NNRVAARLDSKEGESFRT--SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPP 100
R RL S G S ++ K + + S + L N++D+QY+G I IGTPP
Sbjct: 28 KTRSLRRLMSDNGMSVDELRALAKSTGSPDSAPSPQLPVERLTNFLDSQYYGIISIGTPP 87
Query: 101 QNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISG 159
QNFTV+FDTGSSNLWVPS C IAC+FH +Y S +SSTY KNG I YGTG++SG
Sbjct: 88 QNFTVLFDTGSSNLWVPSIHCSLLDIACWFHHRYNSKKSSTYAKNGTEFSIQYGTGSLSG 147
Query: 160 FFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVN 219
F S D V I L V Q+F EA ++P +TF A+FDG+LG+G+ ISV +PV+ +
Sbjct: 148 FISGDTVTIAGLSVPGQQFGEAVKQPGITFAFARFDGVLGMGYPSISVDNVMPVFDTAMA 207
Query: 220 QGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMID 279
L+ + +FSF+ +R+ GGE++ GG DP +Y G+ YV VT+K +WQ M V +
Sbjct: 208 AKLLPQNIFSFYISRDPTAAVGGELMLGGTDPQYYTGDLHYVNVTRKAFWQIGMNRVDVG 267
Query: 280 GQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q T C GC AI D+GTSL+ GP + + AIGA ++ E
Sbjct: 268 NQLT-LCKAGCQAIVDTGTSLIVGPKEEVKALQKAIGAIPLLMGE 311
>gi|402906426|ref|XP_003916003.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 423
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 128/289 (44%), Positives = 178/289 (61%), Gaps = 6/289 (2%)
Query: 70 NLGESGDAD---IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSI 125
LG D +V L ++DAQYFGEIG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+
Sbjct: 56 RLGAPSPGDKPALVPLSKFLDAQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSV 115
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
C+FH ++ SS+++ NG I YGTG + G SED + IG + F EA E
Sbjct: 116 PCWFHHRFNPNASSSFQPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWES 175
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
SL F +++ DGILGLGF ++V P +V QGL+++PVFSF+ NR+++ +GGE+V
Sbjct: 176 SLVFTISRPDGILGLGFPILAVEGVPPPLDVLVEQGLLDKPVFSFYLNRDSEVADGGELV 235
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT 305
GG DP HY T+VPVT YWQ M V + G CA GCAAI D+GT ++ GPT
Sbjct: 236 LGGSDPAHYIPPLTFVPVTVPAYWQIHMERVTV-GSGLTLCARGCAAILDTGTPVIIGPT 294
Query: 306 TIITQVNHAIGATGIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
I ++ AIG +++ E S+ + +++L+ V L+ + Y
Sbjct: 295 EEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLTAQDY 343
>gi|426333516|ref|XP_004028322.1| PREDICTED: cathepsin E isoform 1 [Gorilla gorilla gorilla]
Length = 396
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 126/295 (42%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIG+ +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGSAPV 306
>gi|194374823|dbj|BAG62526.1| unnamed protein product [Homo sapiens]
Length = 325
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 122/249 (48%), Positives = 163/249 (65%), Gaps = 2/249 (0%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 68 VPLSKFLDAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K +G I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 128 SSFKPSGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+FNR+ + +GGE+V GG DP HY
Sbjct: 188 GLGFPILSVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + + T CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 248 TFVPVTVPAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGI 306
Query: 319 GIVSQECKA 327
+++ E ++
Sbjct: 307 PLLAGEVRS 315
>gi|114678578|ref|XP_530061.2| PREDICTED: napsin-A-like [Pan troglodytes]
Length = 420
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 173/276 (62%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 68 VPLSKFLDAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K NG I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 128 SSFKPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+ NR+ + +GGE+V GG DP HY
Sbjct: 188 GLGFPILSVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + + T CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 248 TFVPVTVPAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGI 306
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++L+ V L+ + Y
Sbjct: 307 PLLAGEYIIRCSEIPKLPAVSLLIGGVWFTLTAQDY 342
>gi|6561816|gb|AAF17080.1| aspartyl protease 3 [Homo sapiens]
Length = 450
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 174/276 (63%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 68 VPLSKFLDAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K +G I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 128 SSFKPSGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+FNR+ + +GGE+V GG DP HY
Sbjct: 188 GLGFPILSVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + + T CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 248 TFVPVTVPAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGI 306
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++L+ V L+ + Y
Sbjct: 307 PLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDY 342
>gi|119592251|gb|EAW71845.1| hCG1733572, isoform CRA_a [Homo sapiens]
Length = 449
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 174/276 (63%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 68 VPLSKFLDAQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNAS 127
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K +G I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 128 SSFKPSGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 187
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF +SV P +V QGL+++PVFSF+FNR+ + +GGE+V GG DP HY
Sbjct: 188 GLGFPILSVEGVRPPLDVLVEQGLLDKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPL 247
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + + T CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 248 TFVPVTVPAYWQIHMERVKVGSRLT-LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGI 306
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++L+ V L+ + Y
Sbjct: 307 PLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLTAQDY 342
>gi|85094599|ref|XP_959917.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|59802879|sp|Q01294.2|CARP_NEUCR RecName: Full=Vacuolar protease A; Flags: Precursor
gi|28921374|gb|EAA30681.1| vacuolar protease A precursor [Neurospora crassa OR74A]
gi|40804614|emb|CAF05874.1| aspartic proteinase, pepstatin-sensitive [Neurospora crassa]
gi|336467530|gb|EGO55694.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2508]
gi|350287820|gb|EGZ69056.1| aspartic proteinase, pepstatin-sensitive [Neurospora tetrasperma
FGSC 2509]
Length = 396
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 190/317 (59%), Gaps = 28/317 (8%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKE--------GESFRTSIRKYS 66
LL ++ + G++ + LKK +A +L+S G+ + T +R S
Sbjct: 4 ALLTAAMLLGSAQAGVHTMKLKKVP------LAEQLESVPIDVQVQHLGQKY-TGLRTES 56
Query: 67 LRGNLGESGDADI-----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
+ ++ DA + V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C
Sbjct: 57 HTQAMFKATDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC 116
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
SIACY H+KY S SSTYKKNG S I YG+G++SGF S+D + IGD+ + DQ F EA
Sbjct: 117 -GSIACYLHNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEA 175
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T EP L F +FDGILGLG+ I+V P +Y MV Q LV+EPVFSF+ AD++
Sbjct: 176 TSEPGLAFAFGRFDGILGLGYDRIAVNGITPPFYKMVEQKLVDEPVFSFYL---ADQDGE 232
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA-GGCAAIADSGTSL 300
E+VFGG++ D Y G+ T +P+ +K YW+ D + G F G I D+GTSL
Sbjct: 233 SEVVFGGVNKDRYTGKITTIPLRRKAYWEVDFDAI---GYGKDFAELEGHGVILDTGTSL 289
Query: 301 LAGPTTIITQVNHAIGA 317
+A P+ + +N IGA
Sbjct: 290 IALPSQLAEMLNAQIGA 306
>gi|1039445|gb|AAA79878.1| vacuolar protease A [Neurospora crassa]
Length = 396
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 185/310 (59%), Gaps = 14/310 (4%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKF-DLNNRVAARLDSKEGESFRTSIRKYSLRGNLGE 73
LL ++ + G++ + LKK D V + + T +R S + +
Sbjct: 4 ALLTAAMLLGSAQAGVHTMKLKKVPLADELESVPIDVQVQHLGQKYTGLRTESHTQAMFK 63
Query: 74 SGDADI-----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
+ DA + V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIACY
Sbjct: 64 ATDAQVSGNHPVPITNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQC-GSIACY 122
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H+KY S SSTYKKNG S I YG+G++SGF S+D + IGD+ + DQ F EAT EP L
Sbjct: 123 LHNKYESSESSTYKKNGTSFKIEYGSGSLSGFVSQDRMTIGDITINDQLFAEATSEPGLA 182
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F +FDGILGLG+ ++V P +Y MV Q LV+EPVFSF+ AD++ E+VFGG
Sbjct: 183 FAFGRFDGILGLGYDRLAVPGITPPFYKMVEQKLVDEPVFSFYL---ADQDGESEVVFGG 239
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA-GGCAAIADSGTSLLAGPTTI 307
++ D Y G+ T +P+ +K YW+ D + G F G I D+GTSL+A P+ +
Sbjct: 240 VNKDRYTGKITTIPLRRKAYWEVDFDAI---GYGKDFAELEGHGVILDTGTSLIALPSQL 296
Query: 308 ITQVNHAIGA 317
+N IGA
Sbjct: 297 AEMLNAQIGA 306
>gi|302657131|ref|XP_003020295.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
gi|306531031|sp|D4DEN7.1|CARP_TRIVH RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291184114|gb|EFE39677.1| hypothetical protein TRV_05606 [Trichophyton verrucosum HKI 0517]
Length = 400
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 118/239 (49%), Positives = 154/239 (64%), Gaps = 3/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 77 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG I YG+G++ GF S+D VKIGD+ +K+Q F EAT EP L F +FDGI+G
Sbjct: 136 TYSKNGTKFAIRYGSGSLEGFVSQDSVKIGDMTIKNQLFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV P +YNM++QGL++EPVFSF+ E + + FGG D H+ G+ T
Sbjct: 196 MGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDTKHFTGDMT 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+ +K YW+ D + + T G I D+GTSL+A PTT+ +N IGAT
Sbjct: 256 TIPLRRKAYWEVDFDAISLGEDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGAT 312
>gi|340373429|ref|XP_003385244.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 382
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 120/237 (50%), Positives = 154/237 (64%), Gaps = 3/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NY+DA+Y+G I IGTPPQNF VIFDTGSSNLW+PSSKC AC H +Y SST
Sbjct: 58 LTNYLDAEYYGNITIGTPPQNFLVIFDTGSSNLWIPSSKCDPKDKACQTHHQYNHDHSST 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y KN I YGTG ++GF S D V I +L V Q+F EA +P TF+ A+FDGILG+
Sbjct: 118 YVKNDTKFAIQYGTGNLTGFLSVDTVTIANLTVPAQKFAEAVEQPGDTFVNAQFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV +P + N+V Q LV +PVF F+ +R+ + GGE+ GG DP HYK Y
Sbjct: 178 AWPSISVDGVIPFFNNLVQQSLVAQPVFGFYLDRDENGTLGGELALGGTDPSHYKAPINY 237
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
VP++ K YWQF + + + G T C+ GC AIAD+GTSLL GP+ + ++ IGA
Sbjct: 238 VPLSDKTYWQFKLDKIKVGGTT--LCSNGCQAIADTGTSLLVGPSVDVQKIMKEIGA 292
>gi|195046656|ref|XP_001992194.1| GH24344 [Drosophila grimshawi]
gi|193893035|gb|EDV91901.1| GH24344 [Drosophila grimshawi]
Length = 373
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/276 (48%), Positives = 179/276 (64%), Gaps = 20/276 (7%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG--NLGESGDADIVALKNYMD 87
L+R+ + K + N V R + K +++ LRG N+ + DAD L N ++
Sbjct: 18 LHRVPILKHE----NFVKTRENVKAEKAY--------LRGKYNMPSARDAD-EELSNSIN 64
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGK 146
Y+G I IGTPPQ+F V+FD+GSSNLWVPSS+C+F IAC H+KY +SSTY NG+
Sbjct: 65 MAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSRCFFLDIACQNHNKYDHDKSSTYVANGE 124
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
S I YG+G++SGF S D V + L +K Q F EAT EP +F AKFDGILG+ +Q IS
Sbjct: 125 SFSIQYGSGSLSGFLSTDDVDVSGLTIKSQTFAEATNEPGTSFNNAKFDGILGMAYQSIS 184
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
VP +YNMV+QGLV++ VFSF+ R+ +GGE++FGG DP Y G+ +YVP+++
Sbjct: 185 SDNVVPPFYNMVSQGLVDDSVFSFYLARDGTSTTDGGELIFGGSDPAKYTGDLSYVPISE 244
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
+GYWQF + IDGQT G AIAD+GTSLL
Sbjct: 245 QGYWQFAVDSATIDGQTLG---ESFQAIADTGTSLL 277
>gi|171679543|ref|XP_001904718.1| hypothetical protein [Podospora anserina S mat+]
gi|170939397|emb|CAP64625.1| unnamed protein product [Podospora anserina S mat+]
Length = 397
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 128/268 (47%), Positives = 171/268 (63%), Gaps = 20/268 (7%)
Query: 67 LRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIA 126
+G+L + V + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS C SIA
Sbjct: 64 FQGSLADPKGIHPVPISNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVDC-GSIA 122
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
CY HSKY S SST+K NG S +I YG+G++SG+ S+D + IGD+ +K+Q+F EAT EP
Sbjct: 123 CYLHSKYDSSASSTFKANGSSFEIRYGSGSLSGYVSQDTMTIGDIKIKEQDFAEATSEPG 182
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
L F +FDGI+GLGF ISV VP +Y M+ Q L++EPVF+F + AD E E+ F
Sbjct: 183 LAFAFGRFDGIMGLGFDRISVNGIVPPFYKMIEQKLIDEPVFAF---KLADTEGESEVTF 239
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GG+D D YKG+ +P+ +K YW+ D GD D + TG I D+GTSL+
Sbjct: 240 GGVDKDAYKGKLITIPLRRKAYWEVDFDAISYGDDTADLENTGI-------ILDTGTSLI 292
Query: 302 AGPTTIITQVNHAIGA----TGIVSQEC 325
A P+ + +N IGA TG + +C
Sbjct: 293 ALPSQLAEMLNAQIGAKKGYTGQYTVDC 320
>gi|354497676|ref|XP_003510945.1| PREDICTED: napsin-A [Cricetulus griseus]
Length = 569
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 117/238 (49%), Positives = 157/238 (65%), Gaps = 2/238 (0%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L +M+ QYFG+IG+GTPPQNFTV+FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 63 VPLYKFMNTQYFGDIGLGTPPQNFTVVFDTGSSNLWVPSVRCHFFSLPCWFHRRFNPKAS 122
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YG+G ++G S+D++ IG++ F EA E S+ F LA FDGIL
Sbjct: 123 SSFRPNGTKLAIQYGSGQLTGILSQDNLTIGEIRGVSVTFGEALWESSMVFTLAHFDGIL 182
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P MV QGL+ +P+FSF+ NR+A+ +GGE+V GG DP HY
Sbjct: 183 GLGFPSLAVDGVQPPLDAMVEQGLLQKPIFSFYLNRDAEGSDGGELVLGGSDPAHYIPPL 242
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
T++PVT YWQ M V + G CA GC I D+GTSL+ GP+ I +N AIG
Sbjct: 243 TFIPVTIPAYWQVHMESVNV-GTGLSLCAQGCGVILDTGTSLITGPSEEIHALNKAIG 299
>gi|332247693|ref|XP_003272996.1| PREDICTED: cathepsin E [Nomascus leucogenys]
Length = 396
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 125/295 (42%), Positives = 185/295 (62%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ + IGA +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNTIGAAPV 306
>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
Length = 392
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 171/276 (61%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 40 VPLSNYWDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPSAS 99
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K NG I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 100 SSFKPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 159
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P +V QGL+++P+FSF+ NR+ +GGE+V GG DP HY
Sbjct: 160 GLGFPILAVEGVRPPLDVLVKQGLLDKPIFSFYLNRDPKVADGGELVLGGSDPAHYIPPL 219
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 220 TFVPVTVPAYWQIHMERVKV-GSGLTLCARGCAAILDTGTPVIVGPTEEIRALHAAIGGI 278
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++L+A V L+ + Y
Sbjct: 279 PLLAGEYIIRCSEIPKLPAVSLLIAGVWFNLTAQDY 314
>gi|73915318|gb|AAZ92540.1| aspartyl protease 1 [Coccidioides posadasii]
gi|73915320|gb|AAZ92541.1| aspartyl protease 1 [Coccidioides posadasii]
Length = 399
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 120/242 (49%), Positives = 158/242 (65%), Gaps = 3/242 (1%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D V + N+++AQYF EI IG PPQNF V+ DTGSSNLWVPSS+C SIACY H+KY S
Sbjct: 71 DGHNVLVDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDS 129
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTYKKNG I YG+G++SGF S+D ++IGDL ++ Q+F EAT EP L F +FD
Sbjct: 130 SASSTYKKNGTEFAIRYGSGSLSGFVSQDTLRIGDLTIEGQDFAEATNEPGLAFAFGRFD 189
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILGLG+ ISV K VP +YNM+N+GL++EPVF F+ E + FGG+D +
Sbjct: 190 GILGLGYDTISVNKIVPPFYNMINEGLIDEPVFGFYLGDTNKEGDDSYATFGGVDSSLFS 249
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
GE +P+ +K YW+ D + + G I D+GTSL+A P+T+ +N I
Sbjct: 250 GEMIKIPLRRKAYWEVDFDAIAFGNERAELEDTGI--ILDTGTSLIALPSTLAELLNREI 307
Query: 316 GA 317
GA
Sbjct: 308 GA 309
>gi|326911558|ref|XP_003202125.1| PREDICTED: cathepsin E-A-like [Meleagris gallopavo]
Length = 404
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 167/243 (68%), Gaps = 2/243 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+G I +GTPPQ+FTV+FDTGSSN WVPS C S AC H +++S S +Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G+ + YGTG + G ++D ++I ++ +K Q+F E+ EP +TF LA FDG+LGLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG A+PV+ +++NQ LV EPVFSF+ R D E GGE++ GG+D YKG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT+K YWQ + ++ I G+ FC+ GC AI DSGTSL+ GP++ I ++ IGA+
Sbjct: 252 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 322 SQE 324
S E
Sbjct: 311 SGE 313
>gi|403294878|ref|XP_003938389.1| PREDICTED: cathepsin E [Saimiri boliviensis boliviensis]
Length = 396
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 126/295 (42%), Positives = 186/295 (63%), Gaps = 7/295 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L+K L ++ AR E +S + +++ + +S + ++ NY
Sbjct: 18 QGSLHRVPLRKHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSTDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +S+TY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNTYNQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 QAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 306
>gi|378731872|gb|EHY58331.1| vacuolar protease A [Exophiala dermatitidis NIH/UT8656]
Length = 398
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 134/306 (43%), Positives = 185/306 (60%), Gaps = 22/306 (7%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDA------DI-----VALK 83
+K +K L +++A + G+ R KY+ + G + D DI V ++
Sbjct: 23 MKLQKVPLEQQLSA---ANIGDHLRALRHKYTQKTLGGPAEDIFRHTSIDIDSPHEVPVE 79
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
N+++AQYF I +GTPPQ F V+ DTGSSNLWVPSS+C SIACY H KY S SSTYKK
Sbjct: 80 NFLNAQYFSTIALGTPPQEFKVVLDTGSSNLWVPSSEC-GSIACYLHQKYDSSASSTYKK 138
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG I YG+G ++GF S+D ++IGDL +KDQ F EAT EP L F +FDGILGLG+
Sbjct: 139 NGSEFGIRYGSGEVAGFISQDILRIGDLKIKDQLFGEATSEPGLAFAFGRFDGILGLGYD 198
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
I+V P +YNM++QGL++EPVF+F+ D E E FGG+D DHY G+ +P+
Sbjct: 199 TIAVNHIPPPFYNMIDQGLLDEPVFAFYLGNTNDGTE-SEATFGGIDKDHYTGKMVKIPL 257
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TG 319
+K YW+ ++ + +T G I D+GTSL+A P+T+ +N IGA G
Sbjct: 258 RRKAYWEVNLDAITFGKETADLDNTGV--ILDTGTSLIALPSTLAELLNKEIGAKKGFNG 315
Query: 320 IVSQEC 325
+ EC
Sbjct: 316 QYTVEC 321
>gi|195382956|ref|XP_002050194.1| GJ22010 [Drosophila virilis]
gi|194144991|gb|EDW61387.1| GJ22010 [Drosophila virilis]
Length = 394
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 154/253 (60%), Gaps = 6/253 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY+DAQYFG I IGTPPQ F VIFDTGS+NLWVPS C+ +AC HS+Y S S
Sbjct: 65 VPLSNYLDAQYFGPISIGTPPQKFNVIFDTGSANLWVPSESCHQKLACQIHSRYNSRHSR 124
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK +GK DI YG+G+++G+ S+D V++ L + +Q F EAT P FL AKFDGI G
Sbjct: 125 SYKSDGKQFDIQYGSGSLAGYLSQDTVRVAGLEITNQTFAEATEMPGPIFLAAKFDGIFG 184
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L ++ IS+ P +Y ++ Q L+ PVFS + NR A +GG + FGG P +Y+G T
Sbjct: 185 LAYRGISIQNIKPPFYAVMEQNLLKRPVFSVYLNRIASSRQGGYLFFGGSSPRYYRGNFT 244
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
YVPVT + YWQ + I C GC I D+GTS LA P +N +IG T
Sbjct: 245 YVPVTHRAYWQVKLEAARIG--PLQLCLNGCQVIIDTGTSFLAVPYEQAILINESIGGTP 302
Query: 319 ---GIVSQECKAV 328
G S C+ V
Sbjct: 303 AAYGQFSVPCEQV 315
>gi|402226359|gb|EJU06419.1| endopeptidase [Dacryopinax sp. DJM-731 SS1]
Length = 413
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 124/240 (51%), Positives = 160/240 (66%), Gaps = 7/240 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++M+AQYF EI +GTPPQ F V+ DTGSSNLWVPS KC SIAC+ H KY S SS
Sbjct: 92 VPLTDFMNAQYFAEITLGTPPQTFKVVLDTGSSNLWVPSIKCT-SIACFLHQKYDSAASS 150
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG + +IHYG+G++ GF S D + IGDL V+ +F EAT+EP L F L +FDGILG
Sbjct: 151 TYKSNGTAFEIHYGSGSMEGFVSNDLLTIGDLQVQKLDFAEATKEPGLAFALGRFDGILG 210
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
L + ISV PV+Y M+NQ L+ PVF+F N +AD GGE FGG+D Y G+
Sbjct: 211 LAYDTISVLHMTPVFYQMINQKLLENPVFAFRLGNSDAD---GGEATFGGIDESAYTGKI 267
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
YVPV +KGYW+ ++ + + G+ + G A D+GTSL+A P+ I +N IGAT
Sbjct: 268 DYVPVRRKGYWEIELDKISLGGEDLELESTGAA--IDTGTSLIALPSDIAEMLNKEIGAT 325
>gi|195134380|ref|XP_002011615.1| GI11125 [Drosophila mojavensis]
gi|193906738|gb|EDW05605.1| GI11125 [Drosophila mojavensis]
Length = 371
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 183/315 (58%), Gaps = 15/315 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
+ + VV + + L+R+ + K++ N V R + K S + KY L G
Sbjct: 5 ITVLAVVLAVASAELHRVPVLKQE----NFVKTRGNIKAELSHLRA--KYGL-----PVG 53
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
A L N ++ Y+G I IGTPPQNF V+FD+GSSNLWVPS C S AC H++Y S
Sbjct: 54 RAVEEELSNSLNMAYYGAITIGTPPQNFKVLFDSGSSNLWVPSKNCP-SYACEVHNQYDS 112
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTY+ NG+S I YGTG++SGF S D V + L +K Q F EAT EP F A FD
Sbjct: 113 SASSTYEANGESFSIQYGTGSLSGFLSTDTVDVNGLSIKKQTFAEATNEPGTNFNNANFD 172
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+G+Q IS VP +YNMV+Q L+++ VFSF+ R+ +GGE++FGG D Y
Sbjct: 173 GILGMGYQSISQDNVVPPFYNMVSQDLIDQSVFSFYLARDGTSSQGGELIFGGSDSSLYS 232
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+ TYVP++Q+GYWQF M ++G + C C AIAD+GTSLL P +N +
Sbjct: 233 GDFTYVPISQEGYWQFTMAGASVEGYS--LC-DNCQAIADTGTSLLVAPANAYELLNEIL 289
Query: 316 GATGIVSQECKAVVS 330
+C V S
Sbjct: 290 NVNDEGLVDCSTVSS 304
>gi|258563860|ref|XP_002582675.1| vacuolar protease A [Uncinocarpus reesii 1704]
gi|237908182|gb|EEP82583.1| vacuolar protease A [Uncinocarpus reesii 1704]
Length = 400
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 160/250 (64%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IG PPQNF V+ DTGSSNLWVPSS+C SIAC+ HSKY S SS
Sbjct: 74 VLVDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSQCG-SIACFLHSKYDSSASS 132
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF S+D ++IGDLVVK+Q+F EAT EP L F +FDGILG
Sbjct: 133 TYKKNGTEFSIRYGSGSLSGFVSQDTLRIGDLVVKEQDFAEATNEPGLAFAFGRFDGILG 192
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +YNM+NQ L++EPVF F+ E + FGG+D + +
Sbjct: 193 LGYDTISVNKIVPPFYNMLNQKLIDEPVFGFYLGDTNKEGDDSYATFGGVDDSLFSDDMI 252
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ D V G I D+GTSL+A P+T+ +N IGA
Sbjct: 253 KIPLRRKAYWEVDFDAVTFGNDRAELENTGI--ILDTGTSLIALPSTLAELLNKEIGAKK 310
Query: 318 --TGIVSQEC 325
G + EC
Sbjct: 311 SWNGQYTVEC 320
>gi|340966614|gb|EGS22121.1| aspartic-type endopeptidase-like protein [Chaetomium thermophilum
var. thermophilum DSM 1495]
Length = 396
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 190/317 (59%), Gaps = 27/317 (8%)
Query: 15 CLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAA--------RLDSK-EGESFRTSIRKY 65
LL V+ + G +++ LK +K L+ ++ A +L K G R S
Sbjct: 4 ALLTAAVLLGSAQGAVHK--LKLQKVPLSEQLDAVPIEIQVQQLGQKYMGTRSRQSHSDA 61
Query: 66 SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 125
+G + E+ + V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPS C SI
Sbjct: 62 VWKGMMPEAMGSHPVPISNFMNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSVDC-GSI 120
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
ACY H+KY S SSTYK NG +I YG+G++SGF S+D ++IGD+ VK Q+F EAT EP
Sbjct: 121 ACYLHTKYDSSASSTYKPNGTKFEIRYGSGSLSGFVSQDVLRIGDITVKGQDFAEATSEP 180
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIV 245
L F +FDGILGLG+ ISV + VP +YNM+ Q +++EPVF+F+ +D E+
Sbjct: 181 GLAFAFGRFDGILGLGYDTISVNRIVPPFYNMIEQKVIDEPVFAFYL---SDTSGQSEVT 237
Query: 246 FGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSL 300
FGG+D YKG+ T +P+ +K YW+ D GD + + TG I D+GTSL
Sbjct: 238 FGGIDKTKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTG-------VILDTGTSL 290
Query: 301 LAGPTTIITQVNHAIGA 317
+A P+ + +N +GA
Sbjct: 291 IALPSQLAEMLNAQLGA 307
>gi|118082412|ref|XP_416090.2| PREDICTED: cathepsin E-A-like [Gallus gallus]
Length = 404
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 167/243 (68%), Gaps = 2/243 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+G I +GTPPQ+FTV+FDTGSSN WVPS C S AC H +++S S +Y
Sbjct: 73 LYDYMNAQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC-ISEACRVHQRFKSFLSDSY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G+ + YGTG + G ++D ++I ++ +K Q+F E+ EP +TF LA FDG+LGLG
Sbjct: 132 EHGGEPFSLQYGTGQLLGIAAKDTLQISNISIKGQDFGESVFEPGMTFALAHFDGVLGLG 191
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG A+PV+ +++NQ LV EPVFSF+ R D E GGE++ GG+D YKG +V
Sbjct: 192 YPSLAVGNALPVFDSIMNQKLVEEPVFSFYLKRGDDTENGGELILGGIDHSLYKGSIHWV 251
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT+K YWQ + ++ I G+ FC+ GC AI DSGTSL+ GP++ I ++ IGA+
Sbjct: 252 PVTEKSYWQIHLNNIKIQGRVV-FCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGASPSR 310
Query: 322 SQE 324
S E
Sbjct: 311 SGE 313
>gi|195046637|ref|XP_001992191.1| GH24623 [Drosophila grimshawi]
gi|193893032|gb|EDV91898.1| GH24623 [Drosophila grimshawi]
Length = 374
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 183/316 (57%), Gaps = 14/316 (4%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESG 75
+ + VV + + L+R+ + K + N V R + K S + KY L +
Sbjct: 5 ITVLAVVLALASAELHRVPILKHE----NFVKTRGNVKAELSHLRA--KYGLPTGIRAVT 58
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D + L N M+ Y+G I IGTPPQ+F V+FD+GSSNLWVPS+ C S AC H++Y S
Sbjct: 59 DEE---LSNSMNMAYYGAITIGTPPQSFEVLFDSGSSNLWVPSNTCT-STACEVHNQYDS 114
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTY+ NG+S I YGTG++SGF S D V I L V Q F EAT EP F A FD
Sbjct: 115 SASSTYQSNGESFSIQYGTGSLSGFLSTDTVDINGLSVTSQTFAEATDEPGTNFNNANFD 174
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR-NADEEEGGEIVFGGMDPDHY 254
GILG+G+Q IS VPV+YNMV+QGLV++ VFSF+ R +GGE++FGG D Y
Sbjct: 175 GILGMGYQTISQDDVVPVFYNMVSQGLVDQSVFSFYLARAGTSTTDGGELIFGGSDSSLY 234
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G+ TYVPV+Q+GYWQF M DG + C C AIAD+GTSL+ P +N
Sbjct: 235 SGDLTYVPVSQEGYWQFTMDSATADGNS--LC-DDCQAIADTGTSLIVAPANAYELLNEI 291
Query: 315 IGATGIVSQECKAVVS 330
+ +C + S
Sbjct: 292 LNVDDEGLVDCSTISS 307
>gi|169861123|ref|XP_001837196.1| endopeptidase [Coprinopsis cinerea okayama7#130]
gi|116501918|gb|EAU84813.1| endopeptidase [Coprinopsis cinerea okayama7#130]
Length = 411
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 137/295 (46%), Positives = 181/295 (61%), Gaps = 28/295 (9%)
Query: 45 RVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFT 104
R AR DS++GE L + + V L N+M+AQY+ EI +GTPPQ F
Sbjct: 63 RRVARPDSRDGEQ---------LFWTQDDLKNGHKVPLTNFMNAQYYTEITLGTPPQTFK 113
Query: 105 VIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSED 164
VI DTGSSNLWVPS KC SIAC+ H+KY S +S+TYK NG I YG+G++ GF S+D
Sbjct: 114 VILDTGSSNLWVPSIKCT-SIACFLHTKYDSSQSTTYKANGTEFSIQYGSGSMEGFVSQD 172
Query: 165 HVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVN 224
+ IGDL +K Q+F EA +EP L F KFDGILGL + ISV + VP +YNM+NQ L++
Sbjct: 173 TLGIGDLTIKGQDFAEALKEPGLAFAFGKFDGILGLAYDTISVNRIVPPFYNMINQKLID 232
Query: 225 EPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM-----GDVMID 279
PVF+F + EE+GGE FGG+D + Y G+ YVPV +K YW+ ++ GD ++
Sbjct: 233 SPVFAFRI--GSSEEDGGEATFGGIDHEAYTGKLHYVPVRRKAYWEVELEKISFGDDELE 290
Query: 280 GQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
+ TG AAI D+GTSL+A PT + +N IGA G +C V S
Sbjct: 291 LEHTG------AAI-DTGTSLIALPTDMAEMLNTQIGARKSWNGQYQVDCNKVPS 338
>gi|322708430|gb|EFZ00008.1| vacuolar protease A [Metarhizium anisopliae ARSEF 23]
Length = 395
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/243 (53%), Positives = 165/243 (67%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI +GTPPQ F V+ DTGSSNLWVPS C SIACY HS Y S SS
Sbjct: 75 VPVSNFMNAQYFSEITVGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACYLHSTYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG S +I YG+G++SGF S+D V IGDL +KDQ+F EAT EP L F KFDGILG
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVTIGDLKIKDQDFAEATSEPGLAFAFGKFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ +SV K VP +Y M+NQ L++EPVF+F+ + EEG E VFGG+D DHY G+
Sbjct: 194 LGYDTLSVNKIVPPFYQMINQKLLDEPVFAFYLGSS---EEGSEAVFGGIDKDHYTGKIE 250
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + TG AI D+GTSL P+T+ +N
Sbjct: 251 YIPLRRKAYWEVDIHSIAFGDDVAELDRTG-------AILDTGTSLNVLPSTLAELLNKE 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|452981069|gb|EME80829.1| hypothetical protein MYCFIDRAFT_89289 [Pseudocercospora fijiensis
CIRAD86]
Length = 396
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 119/242 (49%), Positives = 164/242 (67%), Gaps = 4/242 (1%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D VA+ N+++AQYF +I IGTPPQ F V+ DTGSSNLWVPS C SIACY HSKY
Sbjct: 70 DGHPVAVDNFLNAQYFSQIAIGTPPQEFKVVLDTGSSNLWVPSQDC-GSIACYLHSKYDH 128
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
G S+TYK+NG I YG+G++ G+ S+D V+IGDL +K+Q F EAT EP L F +FD
Sbjct: 129 GESTTYKQNGSDFAIRYGSGSLEGYVSQDTVQIGDLKIKNQLFAEATSEPGLAFAFGRFD 188
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GI+GLG+ ISV P +YNM++QGL++E F+F+ + + D+ + E +FGG++ DHY
Sbjct: 189 GIMGLGYDTISVNGIPPPFYNMIDQGLLDEKKFAFYLS-STDKGDESEAIFGGVNEDHYT 247
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G+ +P+ +K YW+ D+ + QT A G AI D+GTSL+A P+T+ +N I
Sbjct: 248 GKMINIPLRRKAYWEVDLDAITFGDQTAEIDATG--AILDTGTSLIALPSTLAELLNKEI 305
Query: 316 GA 317
GA
Sbjct: 306 GA 307
>gi|195029909|ref|XP_001987814.1| GH19747 [Drosophila grimshawi]
gi|193903814|gb|EDW02681.1| GH19747 [Drosophila grimshawi]
Length = 390
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 125/269 (46%), Positives = 164/269 (60%), Gaps = 9/269 (3%)
Query: 64 KYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF 123
KYSLR G+S + V L NY+DAQYFG I IGTPPQ F VIFDTGS+NLWVPS C+
Sbjct: 48 KYSLRTR-GDSLRS--VPLSNYLDAQYFGPISIGTPPQTFNVIFDTGSANLWVPSETCHR 104
Query: 124 SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATR 183
+AC HS+Y + RS +YK NG DI YG+G+++G+ S+D V++ L + +Q F EAT
Sbjct: 105 KLACQIHSRYNAKRSRSYKSNGSQFDIQYGSGSLTGYLSQDTVRMAGLELLNQTFAEATD 164
Query: 184 EPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGE 243
P FL AKFDGI GLG+Q IS+ P +Y ++ Q L+ PVFS + NR++ +GG
Sbjct: 165 MPGPIFLAAKFDGIFGLGYQAISIKNIKPPFYAVMEQSLLERPVFSVYLNRDSTSLQGGY 224
Query: 244 IVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAG 303
+ FGG +Y+G TYVPVT + YWQ + I C GC I D+GTS +A
Sbjct: 225 LFFGGSSRRYYRGNFTYVPVTHRAYWQVKLEAAYIG--KLQMCQKGCHVIIDTGTSFIAV 282
Query: 304 PTTIITQVNHAIGAT----GIVSQECKAV 328
P +N +IG T G S C+ V
Sbjct: 283 PYEQAILINESIGGTPAAYGQFSVPCEQV 311
>gi|119187279|ref|XP_001244246.1| hypothetical protein CIMG_03687 [Coccidioides immitis RS]
gi|303317132|ref|XP_003068568.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|6760077|gb|AAF28186.1|AF162132_1 aspartyl proteinase [Coccidioides posadasii]
gi|240108249|gb|EER26423.1| aspartyl proteinase [Coccidioides posadasii C735 delta SOWgp]
gi|392870962|gb|EAS32810.2| vacuolar protease A [Coccidioides immitis RS]
Length = 399
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 119/238 (50%), Positives = 157/238 (65%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IG PPQNF V+ DTGSSNLWVPSS+C SIACY H+KY S SS
Sbjct: 75 VLVDNFLNAQYFSEISIGNPPQNFKVVLDTGSSNLWVPSSECG-SIACYLHNKYDSSASS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG I YG+G++SGF S+D ++IGDL ++ Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKKNGTEFAIRYGSGSLSGFVSQDTLRIGDLTIEGQDFAEATNEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +YNM+N+GL++EPVF F+ E + FGG+D + GE
Sbjct: 194 LGYDTISVNKIVPPFYNMINEGLIDEPVFGFYLGDTNKEGDDSYATFGGVDSSLFSGEMI 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+P+ +K YW+ D + + G I D+GTSL+A P+T+ +N IGA
Sbjct: 254 KIPLRRKAYWEVDFDAIAFGNERAELEDTGI--ILDTGTSLIALPSTLAELLNREIGA 309
>gi|410974821|ref|XP_003993838.1| PREDICTED: cathepsin D [Felis catus]
Length = 418
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 196/338 (57%), Gaps = 27/338 (7%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSK-EGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L RI L KF R + L E + I KY+ +G +G LKNY+DA
Sbjct: 31 LERIPL--HKFTSVRRTMSELGGPVEDLIAKGPISKYA-QGVPAVTGGPIPEILKNYLDA 87
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKS 147
QY+GEIGIGTPPQ FTV+FDTGS+NLWVPS C IAC+ S Y KNG S
Sbjct: 88 QYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWGGSVAXXXXXXXYVKNGTS 147
Query: 148 ADIHYGTGAISGFFSEDHVKI-------GDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
DIHYG+G++SG+ S+D V + + V+ Q F EA ++P +TF+ AKFDGILG+
Sbjct: 148 FDIHYGSGSLSGYLSQDTVSVPCQTPTVAGVKVERQIFGEAIKQPGITFIAAKFDGILGM 207
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV +PV+ N++ Q LV + +FSF+ NR+ + + GGE++ GG D +YKG +Y
Sbjct: 208 AYPRISVDDVLPVFDNLMKQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSY 267
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ VT+K YWQ M V + G + C GGC AI D+GTSL+ GP + ++ AIGA +
Sbjct: 268 LNVTRKAYWQVHMDQVDV-GTSLTLCKGGCEAILDTGTSLMVGPVDEVRELQKAIGAVPL 326
Query: 321 VSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
+ E C+ V + L +V+ KL + YK
Sbjct: 327 IQGEYMIPCEKVST----------LPEVTVKLGGKGYK 354
>gi|302497761|ref|XP_003010880.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
gi|306531030|sp|D4B385.1|CARP_ARTBC RecName: Full=Probable vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|291174425|gb|EFE30240.1| hypothetical protein ARB_02919 [Arthroderma benhamiae CBS 112371]
Length = 400
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 117/239 (48%), Positives = 153/239 (64%), Gaps = 3/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 77 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG I YG+G++ GF S D VKIGD+ +K Q F EAT EP L F +FDGI+G
Sbjct: 136 TYSKNGTKFAIRYGSGSLEGFVSRDSVKIGDMTIKKQLFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV P +YNM++QGL++EPVFSF+ + + + FGG D +H+ G+ T
Sbjct: 196 MGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQSVVTFGGSDTNHFTGDMT 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+ +K YW+ D + + T G I D+GTSL+A PTT+ +N IGAT
Sbjct: 256 TIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGAT 312
>gi|354478111|ref|XP_003501259.1| PREDICTED: cathepsin E-like isoform 1 [Cricetulus griseus]
Length = 396
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 193/327 (59%), Gaps = 7/327 (2%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
G ++R+ L+K + + ++ AR S +S + ++S + +S + ++ NY+
Sbjct: 19 GAIHRVPLRKHQ-SIRKKLRARGKLSDFWKSQNLDMIQFSESCKMDQSTNEPLI---NYL 74
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H + +SSTY++ G
Sbjct: 75 DVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSSTYEEVGN 133
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ ++
Sbjct: 134 HFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLGYPSLA 193
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG PV+ NM+ Q LV+ P+FS + + + G E+ FGG DP H+ G ++PVT++
Sbjct: 194 VGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQ 253
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECK 326
GYWQ + V + G T FC+ GC AI D+GTSL+ GP+ I Q+ AIGAT + +
Sbjct: 254 GYWQIALDGVQV-GDTVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGATPMDGEYAV 312
Query: 327 AVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ + +L VS LS Y
Sbjct: 313 DCANLNTMPNVAFILNGVSYTLSPTAY 339
>gi|189211129|ref|XP_001941895.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977988|gb|EDU44614.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 399
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 123/250 (49%), Positives = 165/250 (66%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI +GTPPQ F VI DTGSSNLWVPSS C SIACY H+KY S SS
Sbjct: 77 VPVTNFLNAQYFSEISLGTPPQTFKVILDTGSSNLWVPSSSCN-SIACYLHTKYDSSSSS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +I YG+G++SGF S D +IGDL VK+Q+F EAT EP L F +FDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM+ QGL++EPVF+F+ + ++++ E FGG+D Y G+
Sbjct: 196 LGYDTISVKGIVPPFYNMLEQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ ++ + +T G I D+GTSL+A P+TI +N IGA
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 318 --TGIVSQEC 325
G + EC
Sbjct: 313 SFNGQYTVEC 322
>gi|315051426|ref|XP_003175087.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
gi|311340402|gb|EFQ99604.1| hypothetical protein MGYG_02617 [Arthroderma gypseum CBS 118893]
Length = 401
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 117/239 (48%), Positives = 153/239 (64%), Gaps = 3/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 78 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 136
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG I YG+G++ GF S+D VKIGD+ +KDQ F EAT EP L F +FDGI+G
Sbjct: 137 TYHKNGTKFAIRYGSGSLEGFVSQDDVKIGDMTIKDQLFAEATSEPGLAFAFGRFDGIMG 196
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV P +Y M++QGL++EPVFSF+ E + + FGG D H+ G+ T
Sbjct: 197 MGFSSISVNGITPPFYKMIDQGLIDEPVFSFYLGDTNKEGDQSVVTFGGSDESHFTGKMT 256
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+ +K YW+ + + + T G I D+GTSL+A PTT+ +N IGAT
Sbjct: 257 TIPLRRKAYWEVEFNAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINSQIGAT 313
>gi|332241360|ref|XP_003269848.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 125/276 (45%), Positives = 172/276 (62%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L ++DAQYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C +FS+ C+FH ++ S
Sbjct: 69 VPLSKFLDAQYFGEIGLGTPPQNFTVTFDTGSSNLWVPSRRCHFFSVPCWFHHRFNPNAS 128
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S++K NG I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 129 SSFKPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P +V QGL+++P+FSF+ NR+ + +GGE+V GG DP HY
Sbjct: 189 GLGFPILAVEGVRPPLDVLVEQGLLDKPIFSFYLNRDPEVADGGELVLGGSDPAHYIPPL 248
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + G CA GCAAI D+GT ++ GPT I ++ AIG
Sbjct: 249 TFVPVTVPAYWQIHMERVKV-GSGLTLCARGCAAILDTGTPVIIGPTEEIRALHAAIGGI 307
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++L+ V L+ + Y
Sbjct: 308 SLLAGEYLIRCSEIPKLPAVSLLIGGVWFNLTAQDY 343
>gi|198421979|ref|XP_002130758.1| PREDICTED: similar to Ctsd protein [Ciona intestinalis]
Length = 385
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 130/275 (47%), Positives = 168/275 (61%), Gaps = 3/275 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L NYMDAQYFGEI IGTP Q FTVIFDTGSSNLWVPS+ C + AC H+KY S SST
Sbjct: 55 LTNYMDAQYFGEISIGTPEQTFTVIFDTGSSNLWVPSASCPSTNYACMTHNKYNSAASST 114
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +G+ I YGTG++ G+ S D VKI + Q F EA EP +TF+ AKFDGILG+
Sbjct: 115 YVADGEEFRIQYGTGSMVGYDSVDTVKIAGVPSTSQTFAEALEEPGITFVAAKFDGILGM 174
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ I+V PV+ M QG V++ +F+F+ NR+ + +GGEI GG++P Y G+ Y
Sbjct: 175 GYPNIAVNGMKPVFNQMFEQGAVDQNLFAFYLNRDPEAADGGEITLGGVNPARYVGDFNY 234
Query: 261 VPVTQKGYWQFDMGDVMI-DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
VT++GYWQ M + I D T C GGC I DSGTSL+ GP+ +N AIGA
Sbjct: 235 HDVTRQGYWQIKMDGLSIADTAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAIK 294
Query: 320 IVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
V E + + E I +L + L+ + Y
Sbjct: 295 FVQGEYLVICRRIPEMPDITFVLDGIEYVLTPQDY 329
>gi|327296035|ref|XP_003232712.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
gi|326465023|gb|EGD90476.1| hypothetical protein TERG_06704 [Trichophyton rubrum CBS 118892]
Length = 400
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 117/239 (48%), Positives = 155/239 (64%), Gaps = 3/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 77 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY +NG I YG+G++ GF S D+VKIGDL +K+Q F EAT EP L F +FDGI+G
Sbjct: 136 TYSRNGTKFAIRYGSGSLEGFVSRDNVKIGDLTIKNQLFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV P +YNM++QGL++EPVFSF+ + + + FGG D +H+ G+ T
Sbjct: 196 MGFSSISVNGIPPPFYNMIDQGLLDEPVFSFYLGDTNKDGDQSVVTFGGSDTNHFTGDMT 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+ +K YW+ D + + T G I D+GTSL+A PTT+ +N IGAT
Sbjct: 256 TIPLRRKAYWEVDFDAISLGKDTAALENTGI--ILDTGTSLIALPTTLAEMINTQIGAT 312
>gi|291416270|ref|XP_002724368.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 133/297 (44%), Positives = 180/297 (60%), Gaps = 32/297 (10%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+DA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNPATKYFPKETFATVSTESLENYLDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS+ C S+AC H ++ SSTY+ ++ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTYQGTSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + +Q F + EP LTFL A FDGILGLG+ IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLGYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + ++D+E+G ++FGG+D +Y G +VPV+ +GYWQ M V
Sbjct: 202 WNEGLVSQDLFSVYL--SSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVS 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVS 330
I+G+T CA C AI D+GTSLLAGPT+ I+ + IGA+ + E C A+ S
Sbjct: 260 INGETIA-CADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNLLGENVISCSAISS 315
>gi|195430468|ref|XP_002063276.1| GK21477 [Drosophila willistoni]
gi|194159361|gb|EDW74262.1| GK21477 [Drosophila willistoni]
Length = 402
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 180/304 (59%), Gaps = 20/304 (6%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
L+R +R+F+ + R++ R +R YS+ GE + L NY+DAQ
Sbjct: 35 LHRFPSARRRFE---QFGIRME-------RLRLR-YSVMPRDGEKLRTE--PLTNYLDAQ 81
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSA 148
YFG I IGTPPQ F VIFDTGS+NLWVPS+ C S+AC HS++ + RS++Y G
Sbjct: 82 YFGPITIGTPPQIFKVIFDTGSANLWVPSTSCSPASVACMIHSRFHAKRSTSYYPIGAPF 141
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
IHYG+G++SG+ S D V++ L +++Q F EAT P FL AKFDGI GLG++ ISV
Sbjct: 142 AIHYGSGSLSGYLSRDTVRVAGLEIENQVFAEATNMPGPIFLAAKFDGIFGLGYRSISVQ 201
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
+ P +Y M+ Q L+ PVFS + NR+ +EGG + FGG +P +Y G TYVPV+++ Y
Sbjct: 202 RIKPPFYAMMEQNLLASPVFSVYLNRDVAAKEGGALFFGGSNPQYYTGNFTYVPVSRRSY 261
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQE 324
WQ M I + C GC I D+GTS LA P +N +IG T G+ S
Sbjct: 262 WQITMDSAHI--KDLNLCEQGCEVIIDTGTSFLAMPYDQAMLINKSIGGTPSSYGMFSIP 319
Query: 325 CKAV 328
C+ V
Sbjct: 320 CEQV 323
>gi|283806592|ref|NP_001164549.1| pepsin II-1 precursor [Oryctolagus cuniculus]
gi|129777|sp|P28712.1|PEPA1_RABIT RecName: Full=Pepsin II-1; AltName: Full=Pepsin A; Flags: Precursor
gi|22218074|dbj|BAC07514.1| pepsinogen II-1 [Oryctolagus cuniculus]
Length = 387
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 174/281 (61%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+DA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNLATKYFPKETFASVSTESLENYLDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS+ C S+AC+ H ++ SST++ ++ I YGTG++
Sbjct: 83 TPPQEFTVIFDTGSSNLWVPSTYCS-SLACFLHKRFNPDDSSTFQATSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G++ +Q F + EP +TFL+A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGNIEDTNQIFGLSKTEPGITFLVAPFDGILGLAYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + + N E+G ++FGG+D +Y G +VPV+ +GYWQ M +
Sbjct: 202 WNEGLVSEDLFSVYLSSNG--EKGSMVMFGGIDSSYYTGSLNWVPVSHEGYWQITMDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
I+G+T CA C A+ D+GTSLLAGPT+ I+++ IGA+
Sbjct: 260 INGETIA-CADSCQAVVDTGTSLLAGPTSAISKIQSYIGAS 299
>gi|402072590|gb|EJT68339.1| vacuolar protease A [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 396
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 120/243 (49%), Positives = 165/243 (67%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQY+ EI +GTPPQ+F V+ DTGSSNLWVPS C SIACY HSKY S SS
Sbjct: 75 VPVSNFMNAQYYSEITVGTPPQSFKVVLDTGSSNLWVPSQSC-GSIACYLHSKYDSSASS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +I YG+G++SGF S D ++IGD+ +K+Q+F EAT+EP L F +FDGILG
Sbjct: 134 TYKKNGTEFEITYGSGSLSGFVSNDVMQIGDIKIKNQDFAEATKEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LGF +SV K VP +Y M++Q L++EPVF+F+ AD+++ E +FGG++ DH G+
Sbjct: 194 LGFDRLSVNKMVPPFYQMIDQKLIDEPVFAFYL---ADQDDESEAIFGGINKDHIDGKII 250
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
+P+ +K YW+ D +GD + + + TG I D+GTSL PT + +N
Sbjct: 251 EIPLRRKAYWEVDFDAIALGDEVGELENTG-------VILDTGTSLNVLPTQLAEMLNAQ 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|392586802|gb|EIW76137.1| Asp-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 409
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 165/255 (64%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI +G+P Q F VI DTGSSNLWVPS++C SIAC+ H+KY S S+
Sbjct: 88 VPLSNFMNAQYFTEIELGSPAQTFKVILDTGSSNLWVPSAQCT-SIACFLHAKYDSSSSA 146
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ GF S+D +KIGD+ + Q+F EAT+EP LTF KFDGILG
Sbjct: 147 SYKANGTEFSIQYGTGSMEGFVSQDTLKIGDVSISHQDFAEATKEPGLTFAFGKFDGILG 206
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV P YNM+NQGL++EP+FS F + E +GGE VFGG+D Y G+
Sbjct: 207 LGYDTISVNHITPPVYNMINQGLLDEPLFS--FRLGSSESDGGEAVFGGIDHSAYTGDIE 264
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YVPV +K YW+ ++ V G + G A D+GTSL+A PT + +N IGA
Sbjct: 265 YVPVRRKAYWEVELEKVSFGGDELELESTGAA--IDTGTSLIALPTDVAEMLNTQIGAKR 322
Query: 318 --TGIVSQECKAVVS 330
G + +C V S
Sbjct: 323 SWNGQYTIDCSKVPS 337
>gi|449549767|gb|EMD40732.1| hypothetical protein CERSUDRAFT_44393 [Ceriporiopsis subvermispora
B]
Length = 413
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 127/258 (49%), Positives = 172/258 (66%), Gaps = 19/258 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQYF EI +GTPPQ+F V+ DTGSSNLWVPS+KC SIAC+ H+KY S SS
Sbjct: 91 VPLSNFMNAQYFAEITLGTPPQSFKVVLDTGSSNLWVPSTKCT-SIACFLHAKYDSSASS 149
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG +IHYG+G++ GF S+D + IGD+ + + +F EAT+EP L F KFDGILG
Sbjct: 150 SYKANGTEFEIHYGSGSMEGFISQDVLSIGDISINNLDFAEATKEPGLAFAFGKFDGILG 209
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV VP +Y+MVN+ L++ PVFSF + EE+GGE +FGG+D Y G+
Sbjct: 210 LAYDTISVNHVVPPFYHMVNKNLIDSPVFSFRL--GSSEEDGGEAIFGGVDESAYTGKID 267
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +K YW+ + +GD ++ + TG AAI D+GTSL+A P+ + +N
Sbjct: 268 YVPVRRKAYWEVELQKISLGDDELELENTG------AAI-DTGTSLIALPSDMAEMLNTQ 320
Query: 315 IGA----TGIVSQECKAV 328
IGA G + EC+ V
Sbjct: 321 IGAKRSWNGQYTVECEKV 338
>gi|255936729|ref|XP_002559391.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584011|emb|CAP92037.1| Pc13g09680 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 398
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 125/250 (50%), Positives = 164/250 (65%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIAC+ HSKY S SS
Sbjct: 75 VLVDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+KNG +I YG+G++SGF S D ++IGDL VK Q+F EAT EP L F +FDGILG
Sbjct: 134 TYEKNGTEFEIRYGSGSLSGFVSRDTLQIGDLKVKGQDFAEATNEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +Y+M+NQ LV+EPVF+F+ + + FGG+D HY GE
Sbjct: 194 LGYDTISVNKMVPPFYHMINQKLVDEPVFAFYLGDANKDGDNSVATFGGIDESHYTGELI 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ ++ + + G I D+GTSL+A P+T+ +N IGA
Sbjct: 254 KIPLRRKAYWEVELNSIALGDNVAELENTGV--ILDTGTSLIALPSTMAELLNKEIGATK 311
Query: 318 --TGIVSQEC 325
TG S EC
Sbjct: 312 GFTGQYSVEC 321
>gi|194756946|ref|XP_001960731.1| GF13504 [Drosophila ananassae]
gi|190622029|gb|EDV37553.1| GF13504 [Drosophila ananassae]
Length = 402
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 186/324 (57%), Gaps = 22/324 (6%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN 70
L L PV T N LY + L++ +F+ ++ R+D + S RGN
Sbjct: 11 LWLLPV---TCNWELYSVPLRRFPSARHRFE---KLGIRMDRLR---LKYSSESSEERGN 61
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYF 129
+ L NY+DAQYFG I IGTPPQ F VIFDTGSSNLWVPS+ C + +AC
Sbjct: 62 SRTKWNVKSTTLSNYLDAQYFGPITIGTPPQTFQVIFDTGSSNLWVPSATCSSTMVACRV 121
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS+Y + RS +Y+ G IHYG+G+++GF S D V++ L ++DQ F EAT P F
Sbjct: 122 HSRYYARRSRSYRPIGDHFVIHYGSGSLAGFLSTDTVRVAGLEIEDQVFAEATNMPGPIF 181
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGG 248
L AKFDGI GL ++ IS+ + P +Y M+ QGL+ VFS + NR+ ++EEGG + FGG
Sbjct: 182 LAAKFDGIFGLAYRSISMQRIKPPFYAMIEQGLLPRAVFSVYLNRHLGNQEEGGVLFFGG 241
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+P++Y+G TYVPV+++ YWQ M I + C GC I D+GTS LA P
Sbjct: 242 SNPEYYRGNFTYVPVSRRAYWQVKMDAATI--RKLELCQNGCEVIIDTGTSFLALPYDQA 299
Query: 309 TQVNHAIG----ATGIVSQECKAV 328
+N +IG A G S C V
Sbjct: 300 ILINKSIGGRPSAYGQFSVPCDQV 323
>gi|426333405|ref|XP_004028268.1| PREDICTED: renin [Gorilla gorilla gorilla]
Length = 406
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 184/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G +R +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWRQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNAD--EEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRDSENFQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|291409618|ref|XP_002721075.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 128/281 (45%), Positives = 174/281 (61%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+DA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNPATKYFPKETFATVSTESLENYLDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS+ C S+AC H ++ SSTY+ ++ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTYQGTSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + +Q F + EP LTFL A FDGILGLG+ IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLGYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + ++D+E+G ++FGG+D +Y G +VPV+ +GYWQ M +
Sbjct: 202 WNEGLVSQDLFSVYL--SSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSIS 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
I+G+T CA C AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 INGETIA-CADSCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|330930051|ref|XP_003302872.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
gi|311321500|gb|EFQ89048.1| hypothetical protein PTT_14856 [Pyrenophora teres f. teres 0-1]
Length = 399
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 121/250 (48%), Positives = 166/250 (66%), Gaps = 8/250 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI +GTPPQ F V+ DTGSSNLWVPS+ C SIACY H+KY S SS
Sbjct: 77 VPVSNFLNAQYFSEISLGTPPQTFKVVLDTGSSNLWVPSTSCN-SIACYLHTKYDSSSSS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +I YG+G++SGF S D +IGDL VK+Q+F EAT EP L F +FDGI+G
Sbjct: 136 TYKKNGTEFEIRYGSGSLSGFVSNDVFQIGDLKVKNQDFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YNM++QGL++EPVF+F+ + ++++ E FGG+D Y G+
Sbjct: 196 LGYDTISVKGIVPPFYNMLDQGLLDEPVFAFYLG-DTNQQQESEATFGGIDESKYTGKMI 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+P+ +K YW+ ++ + +T G I D+GTSL+A P+TI +N IGA
Sbjct: 255 KLPLRRKAYWEVELDALTFGKETAEMDNTGI--ILDTGTSLIALPSTIAELLNKEIGAKK 312
Query: 318 --TGIVSQEC 325
G + EC
Sbjct: 313 SFNGQYTVEC 322
>gi|355745980|gb|EHH50605.1| hypothetical protein EGM_01462 [Macaca fascicularis]
Length = 401
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 189/300 (63%), Gaps = 12/300 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSQNLDMIQFTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHV-----KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+S I YGTG++SG D V ++ L V Q+F E+ EP TF+ A+FDGILGL
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 192
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +
Sbjct: 193 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLDW 252
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT++GYWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 311
>gi|50557048|ref|XP_505932.1| YALI0F27071p [Yarrowia lipolytica]
gi|49651802|emb|CAG78744.1| YALI0F27071p [Yarrowia lipolytica CLIB122]
Length = 396
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/306 (43%), Positives = 178/306 (58%), Gaps = 17/306 (5%)
Query: 39 KFDLNNRVAARLDSKE----------GESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
K +N A L KE G+ + +K L D L NY++A
Sbjct: 22 KVSINKMSTAELLGKENGFEDHLRMMGQKYMGKFQKLGEFNELASIQDVSNSPLTNYLNA 81
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QY+ EI IGTPPQ F VI DTGSSNLWVPS +C SIACY H KY S SS+YK NG +
Sbjct: 82 QYYTEIEIGTPPQKFNVILDTGSSNLWVPSVQCN-SIACYLHQKYDSAASSSYKANGTAF 140
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+I YG+G++ GF S+D +K+G LV+ +Q+F EAT EP L F KFDGILGL + ISV
Sbjct: 141 EIQYGSGSMEGFVSQDTLKLGSLVLPEQDFAEATSEPGLAFAFGKFDGILGLAYDTISVN 200
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
K VP YN VN+GL+++ FSF+ +GG FGG+D D+++G+ T++PV +K Y
Sbjct: 201 KIVPPVYNAVNRGLLDKNQFSFFLGDTNKGTDGGVATFGGVDEDYFEGKITWLPVRRKAY 260
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQE 324
W+ + + + QT G A D+GTSLLA P+ + +N IGAT G + E
Sbjct: 261 WEVEFNSITLGDQTAELVNTGAA--IDTGTSLLALPSGLAEVLNSEIGATKGWSGQYTVE 318
Query: 325 CKAVVS 330
C V S
Sbjct: 319 CDKVDS 324
>gi|355558837|gb|EHH15617.1| hypothetical protein EGK_01732 [Macaca mulatta]
Length = 401
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 189/300 (63%), Gaps = 12/300 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSQNLDMIQFTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHV-----KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+S I YGTG++SG D V ++ L V Q+F E+ EP TF+ A+FDGILGL
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSAFSCQVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 192
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +
Sbjct: 193 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGVGSELIFGGYDHSHFSGSLNW 252
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT++GYWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 VPVTKQGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 311
>gi|354478113|ref|XP_003501260.1| PREDICTED: cathepsin E-like isoform 2 [Cricetulus griseus]
Length = 363
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 126/292 (43%), Positives = 182/292 (62%), Gaps = 7/292 (2%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLD-SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
G ++R+ L+K + + ++ AR S +S + ++S + +S + ++ NY+
Sbjct: 19 GAIHRVPLRKHQ-SIRKKLRARGKLSDFWKSQNLDMIQFSESCKMDQSTNEPLI---NYL 74
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H + +SSTY++ G
Sbjct: 75 DVEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSQSSTYEEVGN 133
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ ++
Sbjct: 134 HFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLGYPSLA 193
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG PV+ NM+ Q LV+ P+FS + + + G E+ FGG DP H+ G ++PVT++
Sbjct: 194 VGGVTPVFDNMMAQNLVDLPIFSVYMSSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQ 253
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
GYWQ + V + G T FC+ GC AI D+GTSL+ GP+ I Q+ AIGAT
Sbjct: 254 GYWQIALDGVQV-GDTVMFCSEGCQAIVDTGTSLITGPSHKIKQLQEAIGAT 304
>gi|209154266|gb|ACI33365.1| Cathepsin D precursor [Salmo salar]
Length = 402
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 120/254 (47%), Positives = 165/254 (64%), Gaps = 6/254 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSST 140
L N+MDAQY+G I IGTPPQ+FTV+FDTGSSNLWVPS C F +AC+ H +Y S +SST
Sbjct: 74 LTNFMDAQYYGVISIGTPPQDFTVLFDTGSSNLWVPSIHCSFLDVACWLHHRYNSKKSST 133
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +NG I YG G++SGF S D V + + V Q+F EA ++P +TF +A+FDG+LG+
Sbjct: 134 YVQNGTKFSIQYGRGSLSGFISGDTVSLAGMQVTGQQFGEAVKQPGITFAVARFDGVLGM 193
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ISV PV+ + L+ + +FSF+ +R+ GGE++ GG DP +Y G+ Y
Sbjct: 194 GYPTISVNNITPVFDTAMAAKLLPQNIFSFYISRDPLAAVGGELMLGGTDPLYYTGDLHY 253
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V VT+K YWQ +M +V + Q T C GC AI D+GTSL+ GP + ++ AIGA +
Sbjct: 254 VNVTRKAYWQIEMSNVEVGNQLT-LCKAGCQAIVDTGTSLIIGPAEEVRVLHKAIGALPL 312
Query: 321 VSQE----CKAVVS 330
+ E CK V S
Sbjct: 313 LMGEYWIDCKKVPS 326
>gi|46397366|sp|P14091.2|CATE_HUMAN RecName: Full=Cathepsin E; Contains: RecName: Full=Cathepsin E form
I; Contains: RecName: Full=Cathepsin E form II; Flags:
Precursor
Length = 401
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 187/300 (62%), Gaps = 12/300 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ L ++ AR E +S + +++ ++ +S ++ NY
Sbjct: 18 QGSLHRVPLRRHP-SLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQSAKEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTYSQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHV-----KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+S I YGTG++SG D V ++ L V Q+F E+ EP TF+ A+FDGILGL
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSAFATQVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 192
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +
Sbjct: 193 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 252
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT++ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 VPVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 311
>gi|296230510|ref|XP_002760737.1| PREDICTED: renin isoform 1 [Callithrix jacchus]
gi|50401196|sp|Q9TSZ1.1|RENI_CALJA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|6687184|emb|CAB64879.1| preprorenin [Callithrix jacchus]
Length = 400
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 123/278 (44%), Positives = 177/278 (63%), Gaps = 11/278 (3%)
Query: 49 RLDSKEGESFRTSIRKYSL-RGNLGESGDADI-----VALKNYMDAQYFGEIGIGTPPQN 102
R+ K S R S+++ + LG A + V L NYMD QY+GEIGIGTPPQ
Sbjct: 33 RISLKRMPSIRESLKERGVDMARLGPERMALVNITSSVILTNYMDTQYYGEIGIGTPPQT 92
Query: 103 FTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG + Y TG +SGF
Sbjct: 93 FKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFL 152
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++GK P++ N+++QG
Sbjct: 153 SQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVGMGFSEQAIGKVTPLFDNIISQG 211
Query: 222 LVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMID 279
L+ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G WQ M V +
Sbjct: 212 LLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIRTGLWQIPMKGVSV- 270
Query: 280 GQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 271 GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 308
>gi|348559312|ref|XP_003465460.1| PREDICTED: napsin-A-like [Cavia porcellus]
Length = 523
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 174/286 (60%), Gaps = 12/286 (4%)
Query: 75 GDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSK 132
GD+ V L +++ QYFGEIG+GTPPQNF+V+FDTGSSNLWVPS C +FS+ C+FH +
Sbjct: 59 GDSPFFVPLSKFLNVQYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKSCRFFSLPCWFHHR 118
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y SS++ NG I YGTG +SG S+D + IG + F EA EPSL F A
Sbjct: 119 YDPKASSSFCPNGTKFAIQYGTGRLSGILSQDKLTIGGINNVSVTFGEALWEPSLVFAFA 178
Query: 193 KFDGILGLGFQEISVGKAVPVWYN-MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
FDGI GLGF ++V VP + MV QGL+++PVFSF+ NR+ + GGE+V GG DP
Sbjct: 179 SFDGIFGLGFPALAV-DGVPTPLDVMVEQGLLDKPVFSFYLNRDFEGTHGGELVLGGSDP 237
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
HY T+VPVT YWQ M VM+ G CA GCAAI D+GTSL+ GP+ I +
Sbjct: 238 AHYIPPLTFVPVTIPAYWQIHMDRVMV-GTGLTLCAQGCAAIVDTGTSLITGPSEEIRAL 296
Query: 312 NHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ AIG ++ E C + + I+ LL V L+ + Y
Sbjct: 297 HRAIGGLPWLAGEHFIQCSKIPTL---PPISFLLGGVWFNLTAQDY 339
>gi|195485971|ref|XP_002091310.1| GE13586 [Drosophila yakuba]
gi|194177411|gb|EDW91022.1| GE13586 [Drosophila yakuba]
Length = 404
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 182/312 (58%), Gaps = 20/312 (6%)
Query: 16 LLLFPVVFSTPNG---GLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
L + PV F P LYR+ L++ +F+ RL K E + ++
Sbjct: 11 LFVLPVQFPPPVSCTLQLYRVPLRRFPSARHRFEKLGIRMDRLRLKYAEE----VSQFRG 66
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IA 126
GNL + L NY+DAQYFG I IGTPPQ+F VIFDTGSSNLWVPS+ C +A
Sbjct: 67 EGNL----EVKSTPLSNYLDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSATCASRMVA 122
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H++Y + RS++++ G IHYG+G++ GF S D V++ L ++DQ F EAT P
Sbjct: 123 CRVHNRYFAKRSTSHQVRGDRFAIHYGSGSLFGFLSTDTVRVAGLEIRDQTFAEATEMPG 182
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
FL AKFDGI GLG++ IS+ + P +Y M+ QGL+ +P+FS + +R+ E+EGG I F
Sbjct: 183 PIFLAAKFDGIFGLGYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHG-EKEGGAIFF 241
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG +P +Y G TYV V+ + YWQ M +I + C GC I D+GTS LA P
Sbjct: 242 GGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVI--RNLELCQQGCEVIIDTGTSFLALPYD 299
Query: 307 IITQVNHAIGAT 318
+N +IG T
Sbjct: 300 QAILINESIGGT 311
>gi|309319873|pdb|2X0B|A Chain A, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319875|pdb|2X0B|C Chain C, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319877|pdb|2X0B|E Chain E, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
gi|309319879|pdb|2X0B|G Chain G, Crystal Structure Of Human Angiotensinogen Complexed With
Renin
Length = 383
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 183/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 20 RESLKERGVDM-----ARL----GPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 64
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 65 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 124
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 125 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 183
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 184 TPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 243
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 244 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 291
>gi|326475448|gb|EGD99457.1| aspartyl proteinase [Trichophyton tonsurans CBS 112818]
gi|326477485|gb|EGE01495.1| vacuolar protease A [Trichophyton equinum CBS 127.97]
Length = 400
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 152/238 (63%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 77 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY KNG I YG+G++ GF S D+VKIGD+ +K Q F EAT EP L F +FDGI+G
Sbjct: 136 TYSKNGTKFAIRYGSGSLEGFVSRDNVKIGDMTIKKQLFAEATSEPGLAFAFGRFDGIMG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF ISV P +YNM++QGL++EPVFSF+ + + + FGG D H+ G+ T
Sbjct: 196 MGFSSISVNGITPPFYNMIDQGLIDEPVFSFYLGDTNKDGDQSVVTFGGSDASHFTGDMT 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+P+ +K YW+ D + + T G I D+GTSL+A PTT+ +N IGA
Sbjct: 256 TIPLRRKAYWEVDFDAISLGEDTAALENTG--VILDTGTSLIALPTTLAEMINTQIGA 311
>gi|297662235|ref|XP_002809619.1| PREDICTED: renin [Pongo abelii]
Length = 406
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 183/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G YV + + G
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYVNLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|449282010|gb|EMC88940.1| Cathepsin E-B, partial [Columba livia]
Length = 387
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 113/237 (47%), Positives = 164/237 (69%), Gaps = 2/237 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+G + +GTPPQ FTV+FDTGSSN WVPS+ C S AC H K++S S +Y
Sbjct: 55 LYDYMNAQYYGVVSVGTPPQRFTVVFDTGSSNFWVPSAYC-ISEACRVHQKFKSFLSDSY 113
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G++ + YGTG + G +D ++I ++ +K Q+F E+ EP TF+ A FDG+LGLG
Sbjct: 114 EHGGEAFSLQYGTGQLLGVAGKDTLQISNISIKGQDFGESVFEPGSTFVFAHFDGVLGLG 173
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG A+PV+ +++NQ LV EP+FSF+ R D E GGE++ GG+D YKG +V
Sbjct: 174 YPSLAVGNALPVFDSIMNQQLVEEPIFSFYLKREDDTENGGELILGGIDHSLYKGSIHWV 233
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PVT+K YWQ + ++ I G+ FC+ GC AI DSGTSL+ GP++ I ++ IGA+
Sbjct: 234 PVTEKSYWQIHLNNIKIQGR-VAFCSHGCEAIVDSGTSLITGPSSQIRRLQEYIGAS 289
>gi|37790800|gb|AAR03502.1| renin [Homo sapiens]
gi|119611911|gb|EAW91505.1| renin [Homo sapiens]
Length = 403
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 127/287 (44%), Positives = 182/287 (63%), Gaps = 19/287 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
P++ N+++QG++ E VFSF++NRN+ + GG+IV GG DP HY+G Y+ + + G WQ
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRNS-QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQ 265
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 266 IQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 311
>gi|390477486|ref|XP_003735302.1| PREDICTED: cathepsin E isoform 2 [Callithrix jacchus]
Length = 401
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 187/300 (62%), Gaps = 12/300 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L+K L R+ AR E +S + + + ++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRKHP-SLKKRLRARSQLSEFLKSQNLDMIQSTESCSMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H++++ +S+TY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKRHTRFQPSQSNTYNQPG 132
Query: 146 KSADIHYGTGAISGFFSEDHV-----KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+S I YGTG++SG D V ++ L V Q+F E+ EP TF+ A+FDGILGL
Sbjct: 133 QSFSIQYGTGSLSGIIGADQVSAFSWQVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGL 192
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +
Sbjct: 193 GYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNW 252
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPVT++ YWQ + D+ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 253 VPVTKQAYWQIALDDIQVGG-TAMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 311
>gi|4506475|ref|NP_000528.1| renin preproprotein [Homo sapiens]
gi|57114109|ref|NP_001009122.1| renin precursor [Pan troglodytes]
gi|132326|sp|P00797.1|RENI_HUMAN RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|38503275|sp|P60016.1|RENI_PANTR RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|11118368|gb|AAG30305.1|AF193456_1 renin [Pan troglodytes]
gi|190994|gb|AAA60363.1| renin [Homo sapiens]
gi|337340|gb|AAD03461.1| renin [Homo sapiens]
gi|29126911|gb|AAH47752.1| Renin [Homo sapiens]
gi|49168484|emb|CAG38737.1| REN [Homo sapiens]
gi|54311156|gb|AAH33474.1| Renin [Homo sapiens]
gi|166706825|gb|ABY87560.1| renin [Homo sapiens]
gi|208967276|dbj|BAG73652.1| renin [synthetic construct]
gi|312153236|gb|ADQ33130.1| renin [synthetic construct]
Length = 406
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 183/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|429860373|gb|ELA35113.1| vacuolar protease a [Colletotrichum gloeosporioides Nara gc5]
Length = 399
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/250 (51%), Positives = 167/250 (66%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS +C SIACY HSKY S SS
Sbjct: 75 VPVSNFMNAQYFSEISIGTPPQSFKVVLDTGSSNLWVPSQQC-GSIACYLHSKYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G+++GF S+D V IGD+ +K Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKSNGSEFEIHYGSGSLTGFVSQDDVSIGDIKIKKQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +Y MVNQ ++EPVF+F+ DE + E VFGG+D HY+G+ T
Sbjct: 194 LGYDTISVNKIVPPFYQMVNQKAIDEPVFAFYLGDTNDEGDESEAVFGGVDDSHYEGKIT 253
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
Y+P+ +K YW+ D+ + + +T G AI D+GTSL P+ + +N IGA
Sbjct: 254 YIPLRRKAYWEVDLDAITLGDETADL--EGHGAILDTGTSLNVLPSALAELLNKEIGAKK 311
Query: 318 --TGIVSQEC 325
G S EC
Sbjct: 312 GFNGQYSVEC 321
>gi|210109642|gb|ACJ07131.1| cathepsin D-like protein, partial [Homarus gammarus]
Length = 231
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 118/232 (50%), Positives = 158/232 (68%), Gaps = 5/232 (2%)
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKS 147
QY+G I IGTP Q F VIFDTGSSNLW+PS KC+ ++AC H++Y S +SSTY +NG +
Sbjct: 1 QYYGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLACRLHNRYDSTKSSTYIENGTA 60
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
DI YG+GA+ GF S D+V++G + Q F EAT+EP L F++ KFDGILG+ F EISV
Sbjct: 61 FDIQYGSGALHGFLSSDNVEMGGVNAMGQTFAEATQEPGLAFIMGKFDGILGMAFTEISV 120
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNA---DEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
V+ MV QG V++P+FSF+ N + +E GGE+V GG DP+HY+GE YVPV+
Sbjct: 121 MGIPTVFDTMVAQGAVDQPIFSFYLNHDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVS 180
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+ GYWQ + + TGFC C AI D+GTSL+AGP + ++ H +G
Sbjct: 181 KVGYWQVTAEAIKVGDNVTGFC-NPCEAIVDTGTSLIAGPNAEVQEIVHMLG 231
>gi|195159706|ref|XP_002020719.1| GL15694 [Drosophila persimilis]
gi|194117669|gb|EDW39712.1| GL15694 [Drosophila persimilis]
Length = 401
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 116/240 (48%), Positives = 161/240 (67%), Gaps = 4/240 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G+I IGTPPQ F V+FDTGSSNLW+PS++C + +AC H++Y + SST
Sbjct: 80 LGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLSTDVACQQHNQYNASASST 139
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y N ++ I YGTG+++G+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 140 YVANSQNFSIQYGTGSVTGYLATDTVTINGLAIANQTFGEAVSQPGSSFTDVAFDGILGM 199
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q I+V VP +YN+ QGL++EP F F+ RN EEGG+++ GG+D G+ TY
Sbjct: 200 GYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSEEGGQLLLGGVDETLMAGDLTY 259
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV+Q+GYWQF + ++ +G T C GC AIAD+GTSLLA P + TQ+N IGA I
Sbjct: 260 VPVSQEGYWQFSVNNISWNG--TVLC-DGCQAIADTGTSLLACPQAVYTQINQLIGAVLI 316
>gi|449280945|gb|EMC88160.1| Cathepsin E, partial [Columba livia]
Length = 374
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 115/239 (48%), Positives = 155/239 (64%), Gaps = 2/239 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY+D +YFG+I IGTPPQNFTV+FDTGSSNLWVPS C S AC H+K++ +SSTY
Sbjct: 48 LINYLDMEYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACAEHAKFQPSQSSTY 106
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G I YGTG+++G D V + L V +Q+F E+ EP FL A FDG+LGL
Sbjct: 107 QAIGTPFSIQYGTGSLTGVIGSDQVVVEGLTVNNQQFAESISEPGKAFLDAPFDGVLGLA 166
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++V PV+ NM+ Q LV P+FS + + N + GGE++FGG DP + G +V
Sbjct: 167 YPSLAVDGVTPVFDNMMAQNLVELPIFSVYLSTNPESSLGGELLFGGFDPSRFMGTLNWV 226
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVTQ+GYWQ + ++ + G T FC GC AI D+GTSL+ GPT + + IGAT +
Sbjct: 227 PVTQQGYWQIQLDNIQLAG-TVAFCTNGCQAIVDTGTSLITGPTKDVKVLQKYIGATPV 284
>gi|384490965|gb|EIE82161.1| hypothetical protein RO3G_06866 [Rhizopus delemar RA 99-880]
Length = 403
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 166/255 (65%), Gaps = 9/255 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQY+GEI IGTP Q FTVIFDTGSSNLWVPS+ C S AC H +Y S +S+
Sbjct: 78 VPLSNYMNAQYYGEIQIGTPAQTFTVIFDTGSSNLWVPSTHC-MSFACLMHRRYSSSKST 136
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+KN I YG+G++ G S+D +++G + ++DQ F E+T EP LTF +A+FDGI G
Sbjct: 137 TYRKNETDFVIRYGSGSLQGINSQDTLRVGGIEIRDQGFAESTVEPGLTFAMARFDGIFG 196
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGE 257
LG+ ISV + VP +YNM+N+ L+++ +FSFW + D GGE+ FGG+D + G
Sbjct: 197 LGYDTISVQQTVPPFYNMINKKLIDQEIFSFWLSDTNDGNNNLGGELAFGGIDEARFSGN 256
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG- 316
T+ PVT+KGYW+ ++ + + Q G A D+GTSLL PT + VN+ IG
Sbjct: 257 ITWSPVTRKGYWEIELQNTKFNDQPMNM--GSIGAAIDTGTSLLIAPTAVAEFVNNQIGG 314
Query: 317 ---ATGIVSQECKAV 328
A G + +C +V
Sbjct: 315 QADAYGQYTVDCSSV 329
>gi|224085770|ref|XP_002189383.1| PREDICTED: cathepsin E [Taeniopygia guttata]
Length = 435
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 155/236 (65%), Gaps = 2/236 (0%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
Y+D +YFG+I IGTPPQNFTV+FDTGSSNLWVPS C S AC H+++ +SSTY+
Sbjct: 112 YLDMEYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC-VSKACTEHTRFHPTQSSTYQVI 170
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G I YGTG+++G D V + L V +Q+F E+ EP FL A+FDGILGL +
Sbjct: 171 GTPFSIQYGTGSLTGIIGSDQVAVEGLAVSNQQFAESISEPGKAFLDAEFDGILGLAYPS 230
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
++V PV+ NM+ Q LV P+FS + + N D +GGE++FGG D + G +VPVT
Sbjct: 231 LAVDGVTPVFDNMMAQNLVELPIFSVYMSSNPDSPQGGEVLFGGFDTSRFTGTLNWVPVT 290
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
Q+GYWQ + ++ + G T FCA GC AI D+GTSL+ GPT I ++ + IGA +
Sbjct: 291 QQGYWQIQLDNIQLGGTVT-FCANGCQAIVDTGTSLITGPTKEIKKLQNLIGAVSV 345
>gi|73535294|pdb|1TZS|A Chain A, Crystal Structure Of An Activation Intermediate Of
Cathepsin E
Length = 351
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 118/239 (49%), Positives = 161/239 (67%), Gaps = 2/239 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS+++ +SSTY
Sbjct: 17 LINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHSRFQPSQSSTY 75
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G+S I YGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG
Sbjct: 76 SQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLG 135
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG PV+ NM+ Q LV+ P+FS + + N + G E++FGG D H+ G +V
Sbjct: 136 YPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGSLNWV 195
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++ YWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA +
Sbjct: 196 PVTKQAYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPV 253
>gi|392575952|gb|EIW69084.1| hypothetical protein TREMEDRAFT_39371 [Tremella mesenterica DSM
1558]
Length = 446
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 180/307 (58%), Gaps = 22/307 (7%)
Query: 34 GLKKRKFDLNNRVAARLDSKEGESFRTSI------RKYSLRGNLGESGDADIVALKNYMD 87
G K + ++ A ++ E F + K L+G G V L +YM+
Sbjct: 80 GRKHKHYEHEGHHAPDFSDEDVERFWAQMIGQGDSEKRVLKGGHG-------VPLSDYMN 132
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
AQY+ I IGTPPQ F V+ DTGSSNLWVPSS C SIAC+ HSKY S SSTYK NG
Sbjct: 133 AQYYAPITIGTPPQEFKVVLDTGSSNLWVPSSSCT-SIACFLHSKYDSSASSTYKANGSD 191
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+G++ GF S D V I DL +K Q+F EAT+EP L F KFDGI+GL + ISV
Sbjct: 192 FAIRYGSGSLEGFVSSDTVTIADLSLKHQDFAEATKEPGLAFAFGKFDGIMGLAYDTISV 251
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
VP +Y M+N+GL++EPVFSF +DE +GGE +FGG+D Y G+ YVP+ +KG
Sbjct: 252 NHIVPPFYTMLNRGLLDEPVFSFRL--GSDENDGGECIFGGVDDSAYTGKIQYVPIRRKG 309
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQ 323
YW+ ++ + G+ AAI D+GTSL+ P+ + +N IGAT G +
Sbjct: 310 YWEVELEKIGF-GEEELELENTGAAI-DTGTSLIVMPSDVAEMLNKEIGATKSWNGQYTV 367
Query: 324 ECKAVVS 330
+C V S
Sbjct: 368 DCNTVPS 374
>gi|337347|gb|AAA60364.1| renin [Homo sapiens]
Length = 403
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 124/287 (43%), Positives = 180/287 (62%), Gaps = 19/287 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ + G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM---------ASLGPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAQFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
P++ N+++QG++ E VFSF++NRN+ + GG+IV GG DP HY+G Y+ + + G WQ
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRNS-QSLGGQIVLGGSDPQHYEGNFHYINLIKTGVWQ 265
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 266 IQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSCIEKLMEALGA 311
>gi|226288833|gb|EEH44345.1| vacuolar protease A [Paracoccidioides brasiliensis Pb18]
Length = 400
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 118/243 (48%), Positives = 162/243 (66%), Gaps = 13/243 (5%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPS++C SIAC+ H+KY S SS
Sbjct: 76 VLVDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T++KNG I YG+G++SGF S+D V+IGD+ V +Q+F EAT EP L F +FDGILG
Sbjct: 135 THRKNGTEFAIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP++Y M+NQ L++ PVF F+ + + + E FGG+D H+ GE T
Sbjct: 195 LGYDTISVNHIVPLFYQMINQKLLDMPVFGFYLGNSDVDGDDSEATFGGIDESHFTGELT 254
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
+ + ++ YW+ D+ G+ M + + TG I D+GTSLLA P+TI +N
Sbjct: 255 TISLRRRAYWEVDLDAIIFGNEMAELENTG-------VILDTGTSLLALPSTIAELLNKQ 307
Query: 315 IGA 317
IGA
Sbjct: 308 IGA 310
>gi|402857516|ref|XP_003893299.1| PREDICTED: renin [Papio anubis]
gi|62287423|sp|Q6DLS0.1|RENI_MACFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50346961|gb|AAT75162.1| renin [Macaca fascicularis]
Length = 406
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 183/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLAL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNILSQGVLKEDVFSFYYNRDSENAQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|283806612|ref|NP_001164557.1| pepsin II-2/3 precursor [Oryctolagus cuniculus]
gi|129781|sp|P27821.1|PEPA2_RABIT RecName: Full=Pepsin II-2/3; AltName: Full=Pepsin A; Flags:
Precursor
gi|165600|gb|AAA85369.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/281 (45%), Positives = 173/281 (61%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +++NY+DA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNPATKYFPKETFATVSTESMENYLDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS+ C S+AC H ++ SSTY+ ++ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTYQGTSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + +Q F + EPSLTFL A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGSIEDTNQIFGLSKTEPSLTFLFAPFDGILGLAYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + ++D+E+G ++FGG+D +Y G +VPV+ +GYWQ M V
Sbjct: 202 WNEGLVSQDLFSVYL--SSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVS 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
I+G+T CA C AI D+GTSLL GPT+ I+ + IGA+
Sbjct: 260 INGETIA-CADSCQAIVDTGTSLLTGPTSAISNIQSYIGAS 299
>gi|397504905|ref|XP_003823019.1| PREDICTED: renin [Pan paniscus]
Length = 406
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 182/289 (62%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLTL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGSV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|45643446|gb|AAS72876.1| aspartyl protease [Triatoma infestans]
Length = 387
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 184/308 (59%), Gaps = 12/308 (3%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F LL+ V P+ Y + L K + + +E + ++ S+R Y + +
Sbjct: 2 FTFVLLVVAAVGIIPSQSYYHVPLYKM---YKSPRSVEEPQRELKDYKDSLRMYPMLKKI 58
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G L+N ++ QY+G + +GTPPQ TV+FDTGS+NLWVP + C S AC H+
Sbjct: 59 GRE------ILRNSLNTQYYGNVTLGTPPQELTVVFDTGSANLWVPLANCP-SFACIIHN 111
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
Y +SSTY+ NGK+ I+YGTG+I+G S D ++IGDL VK+Q F EA + + F
Sbjct: 112 TYDHKQSSTYQPNGKALRINYGTGSITGEMSSDVLQIGDLQVKNQLFGEAPQVSNSPFGR 171
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
+K DGILGL F I+ G+A+P ++NM++QGL+++PVFS + NRN DEE GGEI+FGG+D
Sbjct: 172 SKADGILGLAFPPIAKGQAIPPFFNMIDQGLLDKPVFSVYLNRNPDEEVGGEIIFGGVDE 231
Query: 252 DHYKGEH-TYVPVTQKGYWQFDMGDVMIDGQT-TGFCAGGCAAIADSGTSLLAGPTTIIT 309
+ E T VP+T YW F M +V G +C GC A AD+GTS + GPT +
Sbjct: 232 KRFNKESLTTVPLTNPTYWMFKMDEVSTSGTNGKSWCQNGCRATADTGTSFIVGPTKEVA 291
Query: 310 QVNHAIGA 317
++ + A
Sbjct: 292 EIMEFLDA 299
>gi|198475392|ref|XP_001357030.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
gi|198138802|gb|EAL34096.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
Length = 401
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 116/240 (48%), Positives = 161/240 (67%), Gaps = 4/240 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G+I IGTPPQ F V+FDTGSSNLW+PS++C + +AC H++Y + SST
Sbjct: 80 LGNSMNMYYYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLSTDVACQQHNQYNASASST 139
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y N ++ I YGTG+++G+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 140 YVANSQNFSIQYGTGSVTGYLAMDTVTINGLAIANQTFGEAVSQPGSSFTDVAFDGILGM 199
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q I+V VP +YN+ QGL++EP F F+ RN EEGG+++ GG+D G+ TY
Sbjct: 200 GYQTIAVDSVVPPFYNLYEQGLIDEPTFGFYLARNGSSEEGGQLLLGGVDETLMAGDLTY 259
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
VPV+Q+GYWQF + ++ +G T C GC AIAD+GTSLLA P + TQ+N IGA I
Sbjct: 260 VPVSQEGYWQFSVNNISWNG--TVLC-DGCQAIADTGTSLLACPQAVYTQINQLIGAVLI 316
>gi|225681688|gb|EEH19972.1| cathepsin D [Paracoccidioides brasiliensis Pb03]
Length = 349
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 118/243 (48%), Positives = 162/243 (66%), Gaps = 13/243 (5%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPS++C SIAC+ H+KY S SS
Sbjct: 25 VLVDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPSAQC-MSIACFLHNKYDSSVSS 83
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T++KNG I YG+G++SGF S+D V+IGD+ V +Q+F EAT EP L F +FDGILG
Sbjct: 84 THRKNGTEFTIRYGSGSLSGFVSQDVVRIGDMTVNNQDFAEATSEPGLAFAFGRFDGILG 143
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP++Y M+NQ L++ PVF F+ + + + E FGG+D H+ GE T
Sbjct: 144 LGYDSISVNHIVPLFYQMINQKLLDTPVFGFYLGNSDVDGDDSEATFGGIDESHFTGELT 203
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
+ + ++ YW+ D+ G+ M + + TG I D+GTSLLA P+TI +N
Sbjct: 204 TISLRRRAYWEVDLDAIIFGNEMAELENTG-------VILDTGTSLLALPSTIAELLNKQ 256
Query: 315 IGA 317
IGA
Sbjct: 257 IGA 259
>gi|425767355|gb|EKV05929.1| Vacuolar protease A [Penicillium digitatum PHI26]
gi|425779798|gb|EKV17829.1| Vacuolar protease A [Penicillium digitatum Pd1]
Length = 399
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 124/250 (49%), Positives = 162/250 (64%), Gaps = 7/250 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVPSS+C SIAC+ HSKY S SS
Sbjct: 76 VLVDNFLNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCS-SIACFLHSKYDSSSSS 134
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+KNG +I YG+G++SGF S D ++IGDL V+ Q+F EAT EP L F +FDGILG
Sbjct: 135 TYQKNGTDFEIRYGSGSLSGFVSRDTLQIGDLKVEGQDFAEATNEPGLAFAFGRFDGILG 194
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +Y M+ Q LV+EPVF+F+ + + FGG+D HY GE
Sbjct: 195 LGYDTISVNKMVPPFYQMIKQKLVDEPVFAFYLGDANKDGDNSVATFGGIDESHYTGELI 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+PV +K YW+ ++ + + G I D+GTSL+A P+T+ +N IGA
Sbjct: 255 KIPVRRKAYWEVELNSIALGNNVAELDDTGV--ILDTGTSLIALPSTMAELLNKEIGATK 312
Query: 318 --TGIVSQEC 325
TG S EC
Sbjct: 313 GFTGQYSVEC 322
>gi|326933745|ref|XP_003212960.1| PREDICTED: cathepsin E-like [Meleagris gallopavo]
Length = 403
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 132/305 (43%), Positives = 182/305 (59%), Gaps = 14/305 (4%)
Query: 21 VVFSTPNGGLYRIGL------KKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL-GE 73
V+ P GL R L + R + R +L S+ ++ R + +Y+ +L GE
Sbjct: 14 VLCIAPCSGLKRPALCRVTLTRHRSLRKSLRDRGQL-SQFWKAHRLDMVQYTQDCSLFGE 72
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
+ + L NY+D +YFG+I IGTPPQNFTVIFDTGSSNLWVPS C S AC H+++
Sbjct: 73 ANEP----LINYLDMEYFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT-SKACTNHARF 127
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
+ RSSTY+ G + YGTG+++G D V + + V +Q F E+ EP F ++
Sbjct: 128 QPSRSSTYQPLGLPISLQYGTGSLTGIIGSDQVTVEGMTVCNQPFAESVSEPGKAFQDSE 187
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGL + ++V PV+ NM+ Q LV P+FS + + N D GGE++FGG DP
Sbjct: 188 FDGILGLAYPSLAVDGVTPVFDNMMAQDLVELPIFSVYMSANPDSSLGGEVLFGGFDPSR 247
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
+ G +VPVT +GYWQ + +V + G T FCA GC AI D+GTSLL GPT I ++
Sbjct: 248 FLGTLHWVPVTVQGYWQIQLDNVQVGG-TVVFCANGCQAIVDTGTSLLTGPTKDIKEMQR 306
Query: 314 AIGAT 318
IGAT
Sbjct: 307 YIGAT 311
>gi|74136391|ref|NP_001028088.1| renin precursor [Macaca mulatta]
gi|67461396|sp|Q6DLW5.2|RENI_MACMU RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|61699710|gb|AAT74864.2| prorenin [Macaca mulatta]
Length = 406
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 183/289 (63%), Gaps = 20/289 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLAL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNILSQGVLKEDVFSFYYNRDSENAQSLGGQIVLGGSDPQHYEGNFHYINLIKTGV 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 267 WQIPMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|24653643|ref|NP_610961.1| CG10104 [Drosophila melanogaster]
gi|7303185|gb|AAF58249.1| CG10104 [Drosophila melanogaster]
Length = 404
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 179/312 (57%), Gaps = 20/312 (6%)
Query: 16 LLLFPVVFSTPNG---GLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
L + PV F P LYR+ L++ +F+ RL K E
Sbjct: 11 LFILPVQFQHPVSCKLQLYRVPLRRFPSARHRFEKLGIRMDRLRLKYAEEVS------HF 64
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IA 126
RG + + L NY+DAQYFG I IGTPPQ F VIFDTGSSNLWVPS+ C + +A
Sbjct: 65 RGEWNSAVKS--TPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSATCASTMVA 122
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H++Y + RS++++ G IHYG+G++SGF S D V++ L ++DQ F EAT P
Sbjct: 123 CRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFAEATEMPG 182
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
FL AKFDGI GL ++ IS+ + P +Y M+ QGL+ +P+FS + +RN E++GG I F
Sbjct: 183 PIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNG-EKDGGAIFF 241
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG +P +Y G TYV V+ + YWQ M +I + C GC I D+GTS LA P
Sbjct: 242 GGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVI--RNLELCQQGCEVIIDTGTSFLALPYD 299
Query: 307 IITQVNHAIGAT 318
+N +IG T
Sbjct: 300 QAILINESIGGT 311
>gi|400598686|gb|EJP66395.1| vacuolar protease A [Beauveria bassiana ARSEF 2860]
Length = 395
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 132/267 (49%), Positives = 170/267 (63%), Gaps = 20/267 (7%)
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
+ ES D V + N+ +AQYF EI IGTPPQ F V+ DTGSSNLWVPS C SIAC+
Sbjct: 65 QVAESKDGHPVPVTNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFL 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SSTYKKNG +IHYG+G+++GF S D V IGDL +K+ +F EAT EP L F
Sbjct: 124 HSTYDSSSSSTYKKNGSDFEIHYGSGSLTGFVSNDVVSIGDLTIKNTDFAEATSEPGLAF 183
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+FDGILGLG+ ISV K VP +Y M+NQ L++EPVF+F+ E+ G E +FGG+
Sbjct: 184 AFGRFDGILGLGYDTISVNKMVPPFYQMINQKLIDEPVFAFYL---GSEDSGSEAIFGGV 240
Query: 250 DPDHYKGEHTYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
D DHY+G+ Y+P+ +K YW+ D GD + + + TG I D+GTSL P
Sbjct: 241 DKDHYEGKIEYIPLRRKAYWEVDFDAIAFGDEVAELENTG-------VILDTGTSLNTLP 293
Query: 305 TTIITQVNHAIGAT----GIVSQECKA 327
T + +N IGA G S +CKA
Sbjct: 294 TDLAELLNKEIGAKKGFGGQYSIDCKA 320
>gi|328860092|gb|EGG09199.1| hypothetical protein MELLADRAFT_42703 [Melampsora larici-populina
98AG31]
Length = 429
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 119/241 (49%), Positives = 160/241 (66%), Gaps = 6/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF EI IGTPPQ+F VI DTGSSNLWVPS++C SIAC+ HSKY SS
Sbjct: 103 VPLSNYLNAQYFSEITIGTPPQSFKVILDTGSSNLWVPSTRCT-SIACFLHSKYDCEASS 161
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ G S D V+IGDL ++D +F E+T+EP L F KFDGILG
Sbjct: 162 SYKANGTEFQIRYGSGSLEGVISNDVVRIGDLEIRDTDFAESTKEPGLAFAFGKFDGILG 221
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNAD---EEEGGEIVFGGMDPDHYKG 256
LG+ ISV VP +Y M+ QGL++EPVF+F+ + + + +GGE +FGG+D HY+G
Sbjct: 222 LGYDTISVLHTVPPFYEMIEQGLLDEPVFAFYLGTSHESGVDNQGGEAIFGGIDEAHYEG 281
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+ Y PV ++GYW+ + V + G A D+GTSL+A PT +N ++G
Sbjct: 282 DIHYAPVRRRGYWEVALEGVRFGKEEMKLVNVGAA--IDTGTSLIALPTDTAEIINASLG 339
Query: 317 A 317
A
Sbjct: 340 A 340
>gi|115719|sp|P00795.2|CATD_PIG RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
Length = 345
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 138/288 (47%), Positives = 182/288 (63%), Gaps = 26/288 (9%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNYMDAQY+GEIGIGTPPQ FTV+FDTGSSNLWVPS C IAC+ H KY SG+SST
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 66
Query: 141 YKKNGKSADIHYGTGAISGFFS-EDHVKI---------GDLVVKDQEFIEATREPSLTFL 190
Y KNG + IHYG+G++SG+ S +D V + G + V+ Q F EAT++P LTF+
Sbjct: 67 YVKNGTTFAIHYGSGSLSGYLSSQDTVSVPCNSALSGVGGIKVERQTFGEATKQPGLTFI 126
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGILG+ + ISV VPV+ N++ Q LV++ +FSF+ NR+ + GGE++ GG+D
Sbjct: 127 AAKFDGILGMAYPRISVNNVVPVFDNLMQQKLVDKDIFSFYLNRDPGAQPGGELMLGGID 186
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+YKG Y VT+K YWQ M V + G + C GGC AI D+GTSL+ G + +
Sbjct: 187 SKYYKGSLDYHNVTRKAYWQIHMNQVAV-GSSLTLCKGGCEAIVDTGTSLIVGQPEEVRE 245
Query: 311 VNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYK 354
+ AIGA ++ E C+ V S L V+ L + YK
Sbjct: 246 LGKAIGAVPLIQGEYMIPCEKVPS----------LPDVTVTLGGKKYK 283
>gi|21063965|gb|AAM29212.1| AT05209p [Drosophila melanogaster]
Length = 404
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 131/312 (41%), Positives = 179/312 (57%), Gaps = 20/312 (6%)
Query: 16 LLLFPVVFSTPNG---GLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
L + PV F P LYR+ L++ +F+ RL K E
Sbjct: 11 LFILPVQFQHPVSCKLQLYRVPLRRFPSARHRFEKLGIRMDRLRLKYAEEVS------HF 64
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IA 126
RG + + L NY+DAQYFG I IGTPPQ F VIFDTGSSNLWVPS+ C + +A
Sbjct: 65 RGEWNSAVKS--TPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSATCASTMVA 122
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H++Y + RS++++ G IHYG+G++SGF S D V++ L ++DQ F EAT P
Sbjct: 123 CRVHNRYFAKRSTSHQVRGDHFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFAEATEMPG 182
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
FL AKFDGI GL ++ IS+ + P +Y M+ QGL+ +P+FS + +RN E++GG I F
Sbjct: 183 PIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRNG-EKDGGAIFF 241
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG +P +Y G TYV V+ + YWQ M +I + C GC I D+GTS LA P
Sbjct: 242 GGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVI--RNLELCQQGCEVIIDTGTSFLALPYD 299
Query: 307 IITQVNHAIGAT 318
+N +IG T
Sbjct: 300 QAILINESIGGT 311
>gi|158254091|gb|AAI54325.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 162/252 (64%), Gaps = 6/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSST 140
L N+MDAQY+G I IGTPPQ+F+V+FDTGSSNLWVPS C F IAC+ H +Y S +SST
Sbjct: 84 LTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHRRYNSKKSST 143
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +NG I YG G++SGF S+D V + L V Q+F EA ++P + F +A+FDG+LG+
Sbjct: 144 YVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVFAVARFDGVLGM 203
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV + PV+ + ++ + +FSF+ NR+ + GGE++ GG D ++ G+ Y
Sbjct: 204 AYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHY 263
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V VT+K YWQ M +V + G T C GC AI D+GTS++ GP + + AIGA +
Sbjct: 264 VNVTRKAYWQIKMDEVQV-GSTLTLCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPL 322
Query: 321 VSQE----CKAV 328
+ E CK +
Sbjct: 323 LMGEYWIDCKKI 334
>gi|41053329|ref|NP_956325.1| uncharacterized protein LOC336746 precursor [Danio rerio]
gi|34783813|gb|AAH56836.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 114/252 (45%), Positives = 162/252 (64%), Gaps = 6/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSST 140
L N+MDAQY+G I IGTPPQ+F+V+FDTGSSNLWVPS C F IAC+ H +Y S +SST
Sbjct: 84 LTNFMDAQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHRRYNSKKSST 143
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y +NG I YG G++SGF S+D V + L V Q+F EA ++P + F +A+FDG+LG+
Sbjct: 144 YVQNGTEFSIQYGRGSLSGFISQDTVNLAGLNVTGQQFAEAVKQPGIVFAVARFDGVLGM 203
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV + PV+ + ++ + +FSF+ NR+ + GGE++ GG D ++ G+ Y
Sbjct: 204 AYPAISVDRVTPVFDTAMAAKILPQNIFSFYINRDPAGDVGGELMLGGFDQQYFNGDLHY 263
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V VT+K YWQ M +V + G T C GC AI D+GTS++ GP + + AIGA +
Sbjct: 264 VNVTRKAYWQIKMDEVQV-GSTLTLCKSGCQAIVDTGTSMITGPVQEVRALQKAIGAIPL 322
Query: 321 VSQE----CKAV 328
+ E CK +
Sbjct: 323 LMGEYWIDCKKI 334
>gi|283806610|ref|NP_001164556.1| pepsin II-4 precursor [Oryctolagus cuniculus]
gi|129787|sp|P28713.1|PEPA4_RABIT RecName: Full=Pepsin II-4; AltName: Full=Pepsin A; Flags: Precursor
gi|22218076|dbj|BAC07515.1| pepsinogen II-4 [Oryctolagus cuniculus]
Length = 387
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/281 (45%), Positives = 172/281 (61%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+DA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNPATKYFPKETFATVSTESLENYLDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS+ C S+AC H ++ SSTY+ ++ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSTYCS-SLACALHKRFNPEDSSTYQGTSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + +Q F + EP LTFL A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGSIEDTNQIFGLSKTEPGLTFLFAPFDGILGLAYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + ++D+E+G ++FGG+D +Y G +VPV+ +GYWQ M V
Sbjct: 202 WNEGLVSQDLFSVYL--SSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVS 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
I+G+T CA C AI D+GTSLL GPT+ I+ + IGA+
Sbjct: 260 INGETIA-CADSCQAIVDTGTSLLTGPTSAISNIQSYIGAS 299
>gi|366991455|ref|XP_003675493.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
gi|342301358|emb|CCC69126.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
Length = 406
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/311 (42%), Positives = 181/311 (58%), Gaps = 15/311 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVA-----ARLDSKEGESFRTSIRKYSLRGN--LGESGDADI 79
N ++++ ++K LN V A L K F + + S + GD
Sbjct: 22 NAKVHKLKIQKEDLTLNEDVTFEQHIASLGHKYMNHFERANPEVSFSRDHPFFAEGDGHN 81
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF +I +GTPPQNF VI DTGSSNLWVPSS+C S+AC+ HSKY SS
Sbjct: 82 VPLTNYLNAQYFADISVGTPPQNFKVILDTGSSNLWVPSSECN-SLACFLHSKYDHDASS 140
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 141 SYKANGTKFAIQYGSGSLEGYISQDTLNIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 200
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
L + ISV K VP +YN + QGL++E F+F+ + DE+ GGEI GG+D +KG+
Sbjct: 201 LAYDTISVDKVVPPFYNAIEQGLLDEKKFAFYLGDTKKDEKNGGEITIGGIDESKFKGDI 260
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
++PV +K YW+ + + Q G A D+GTSL+ P+ + +N IGA
Sbjct: 261 EWLPVRRKAYWEVKFEGIALGDQYAALENHGAA--IDTGTSLITLPSGLAEIINTEIGAK 318
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 319 KGWTGQYTLDC 329
>gi|322700747|gb|EFY92500.1| vacuolar protease A [Metarhizium acridum CQMa 102]
Length = 395
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 125/238 (52%), Positives = 163/238 (68%), Gaps = 6/238 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IG+PPQ+F V+ DTGSSNLWVPS C SIACY HS Y S SS
Sbjct: 75 VPVSNFMNAQYFSEITIGSPPQSFKVVLDTGSSNLWVPSQSCN-SIACYLHSTYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG S +I YG+G++SGF S+D V IGDL ++ Q+F EAT EP L F KFDGILG
Sbjct: 134 TYKKNGSSFEIRYGSGSLSGFVSQDVVSIGDLKIEHQDFAEATSEPGLAFAFGKFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ +SV K VP +Y M++Q L++EPVF+F+ +EEG E VFGG+D +HY GE
Sbjct: 194 LGYDTLSVNKIVPPFYQMIDQKLLDEPVFAFYL---GSKEEGSEAVFGGIDKNHYTGELE 250
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
Y+P+ +K YW+ D+ + + + G AI D+GTSL P+T+ +N IGA
Sbjct: 251 YLPLRRKAYWEVDINSIALGDEIAELDHTG--AILDTGTSLNVLPSTLAELLNKEIGA 306
>gi|431892878|gb|ELK03306.1| Cathepsin E [Pteropus alecto]
Length = 396
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 124/292 (42%), Positives = 181/292 (61%), Gaps = 7/292 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L+K + L ++ A+ E +S ++ +Y+ + +S + ++ NY
Sbjct: 18 QGSLHRVPLRKYQ-SLRKKLRAQGQLSEFWKSQNLNMIQYTESCTMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I +G+PPQNFTVIFDTGSSNLWVPS C S AC H+++ +S TY G
Sbjct: 74 LDMEYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDTYSTVG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
IHYGTG++SG D V + L V Q+F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 SHFSIHYGTGSLSGIIGADQVSVEGLTVVSQQFGESVTEPGQTFVNAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDVPMFSVYMSSDLEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+GYWQ + + + G FC+ GC AI D+GTSL+ GP+ I Q+ AIGA
Sbjct: 253 QGYWQIALDTIQVGGAVI-FCSEGCQAIVDTGTSLITGPSEEIKQLQKAIGA 303
>gi|118138205|pdb|2I4Q|A Chain A, Human ReninPF02342674 COMPLEX
gi|118138206|pdb|2I4Q|B Chain B, Human ReninPF02342674 COMPLEX
Length = 336
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 163/241 (67%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 6 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 65
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 66 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 124
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF E ++G+ P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G
Sbjct: 125 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 184
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+G
Sbjct: 185 NFHYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 243
Query: 317 A 317
A
Sbjct: 244 A 244
>gi|99031884|pdb|2BKS|A Chain A, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031885|pdb|2BKS|B Chain B, Crystal Structure Of Renin-Pf00074777 Complex
gi|99031886|pdb|2BKT|A Chain A, Crystal Structure Of Renin-Pf00257567 Complex
gi|99031887|pdb|2BKT|B Chain B, Crystal Structure Of Renin-Pf00257567 Complex
gi|119390207|pdb|2IKO|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390208|pdb|2IKO|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390209|pdb|2IKU|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390210|pdb|2IKU|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitors
gi|119390211|pdb|2IL2|A Chain A, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|119390212|pdb|2IL2|B Chain B, Crystal Structure Of Human Renin Complexed With Inhibitor
gi|151568107|pdb|2V0Z|C Chain C, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568108|pdb|2V0Z|O Chain O, Crystal Structure Of Renin With Inhibitor 10 (Aliskiren)
gi|151568109|pdb|2V10|C Chain C, Crystal Structure Of Renin With Inhibitor 9
gi|151568110|pdb|2V10|O Chain O, Crystal Structure Of Renin With Inhibitor 9
gi|151568111|pdb|2V11|C Chain C, Crystal Structure Of Renin With Inhibitor 6
gi|151568112|pdb|2V11|O Chain O, Crystal Structure Of Renin With Inhibitor 6
gi|151568113|pdb|2V12|C Chain C, Crystal Structure Of Renin With Inhibitor 8
gi|151568114|pdb|2V12|O Chain O, Crystal Structure Of Renin With Inhibitor 8
gi|157830213|pdb|1BBS|A Chain A, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157830214|pdb|1BBS|B Chain B, X-Ray Analyses Of Peptide Inhibitor Complexes Define The
Structural Basis Of Specificity For Human And Mouse
Renins
gi|157833710|pdb|1RNE|A Chain A, The Crystal Structure Of Recombinant Glycosylated Human
Renin Alone And In Complex With A Transition State
Analog Inhibitor
gi|157836332|pdb|2REN|A Chain A, Structure Of Recombinant Human Renin, A Target For
Cardiovascular- Active Drugs, At 2.5 Angstroms
Resolution
gi|193885216|pdb|2V13|A Chain A, Crystal Structure Of Renin With Inhibitor 7
gi|193885217|pdb|2V16|C Chain C, Crystal Structure Of Renin With Inhibitor 3
gi|193885218|pdb|2V16|O Chain O, Crystal Structure Of Renin With Inhibitor 3
gi|242556522|pdb|3G72|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556523|pdb|3G72|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|308388162|pdb|3OQF|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|308388163|pdb|3OQF|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazine
Inhibitor Complexes
gi|310689956|pdb|3OOT|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689957|pdb|3OOT|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689958|pdb|3OQK|A Chain A, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|310689959|pdb|3OQK|B Chain B, Crystal Structure Analysis Of Renin-Indole-Piperazin
Inhibitor Complexes
gi|342350963|pdb|3Q3T|A Chain A, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|342350964|pdb|3Q3T|B Chain B, Alkyl Amine Renin Inhibitors: Filling S1 From S3
gi|345110923|pdb|3SFC|A Chain A, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|345110924|pdb|3SFC|B Chain B, Structure-Based Optimization Of Potent 4- And
6-Azaindole-3- Carboxamides As Renin Inhibitors
gi|358439749|pdb|3Q4B|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439750|pdb|3Q4B|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439751|pdb|3Q5H|A Chain A, Clinically Useful Alkyl Amine Renin Inhibitors
gi|358439752|pdb|3Q5H|B Chain B, Clinically Useful Alkyl Amine Renin Inhibitors
gi|400261138|pdb|3VSW|A Chain A, Human Renin In Complex With Compound 8
gi|400261139|pdb|3VSW|B Chain B, Human Renin In Complex With Compound 8
gi|400261140|pdb|3VSX|A Chain A, Human Renin In Complex With Compound 18
gi|400261141|pdb|3VSX|B Chain B, Human Renin In Complex With Compound 18
gi|430800765|pdb|3VYD|A Chain A, Human Renin In Complex With Inhibitor 6
gi|430800766|pdb|3VYD|B Chain B, Human Renin In Complex With Inhibitor 6
gi|430800767|pdb|3VYE|A Chain A, Human Renin In Complex With Inhibitor 7
gi|430800768|pdb|3VYE|B Chain B, Human Renin In Complex With Inhibitor 7
gi|430800769|pdb|3VYF|A Chain A, Human Renin In Complex With Inhibitor 9
gi|430800770|pdb|3VYF|B Chain B, Human Renin In Complex With Inhibitor 9
gi|449802496|pdb|4GJ8|A Chain A, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802497|pdb|4GJ8|B Chain B, Crystal Structure Of Renin In Complex With Pkf909-724
(compound 3)
gi|449802498|pdb|4GJ9|A Chain A, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802499|pdb|4GJ9|B Chain B, Crystal Structure Of Renin In Complex With Gp055321
(compound 4)
gi|449802500|pdb|4GJA|A Chain A, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802501|pdb|4GJA|B Chain B, Crystal Structure Of Renin In Complex With Nvp-ayl747
(compound 5)
gi|449802502|pdb|4GJB|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802503|pdb|4GJB|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bbv031
(compound 6)
gi|449802504|pdb|4GJC|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802505|pdb|4GJC|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bch965
(compound 9)
gi|449802506|pdb|4GJD|A Chain A, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
gi|449802507|pdb|4GJD|B Chain B, Crystal Structure Of Renin In Complex With Nvp-bgq311
(compound 12)
Length = 340
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 163/241 (67%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 10 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 69
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 70 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 128
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF E ++G+ P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G
Sbjct: 129 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 188
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+G
Sbjct: 189 NFHYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 247
Query: 317 A 317
A
Sbjct: 248 A 248
>gi|190613737|pdb|3D91|A Chain A, Human Renin In Complex With Remikiren
gi|190613738|pdb|3D91|B Chain B, Human Renin In Complex With Remikiren
gi|242556515|pdb|3G6Z|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556516|pdb|3G6Z|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556519|pdb|3G70|A Chain A, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|242556520|pdb|3G70|B Chain B, Design And Preparation Of Potent, Non-Peptidic,
Bioavailable Renin Inhibitors
gi|290560276|pdb|3K1W|A Chain A, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|290560277|pdb|3K1W|B Chain B, New Classes Of Potent And Bioavailable Human Renin
Inhibitors
gi|315113750|pdb|3OWN|A Chain A, Potent Macrocyclic Renin Inhibitors
gi|315113751|pdb|3OWN|B Chain B, Potent Macrocyclic Renin Inhibitors
Length = 341
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 163/241 (67%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 10 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 69
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 70 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 128
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF E ++G+ P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G
Sbjct: 129 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 188
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+G
Sbjct: 189 NFHYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 247
Query: 317 A 317
A
Sbjct: 248 A 248
>gi|110590169|pdb|2G24|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590170|pdb|2G24|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590171|pdb|2G26|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590172|pdb|2G26|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590173|pdb|2G27|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110590174|pdb|2G27|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591465|pdb|2FS4|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591466|pdb|2FS4|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591524|pdb|2G1N|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591525|pdb|2G1N|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591526|pdb|2G1O|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591527|pdb|2G1O|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591528|pdb|2G1R|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591529|pdb|2G1R|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591530|pdb|2G1S|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591531|pdb|2G1S|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591532|pdb|2G1Y|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591533|pdb|2G1Y|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591534|pdb|2G20|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591535|pdb|2G20|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
C Ring
gi|110591536|pdb|2G21|A Chain A, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591537|pdb|2G21|B Chain B, Ketopiperazine-Based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591538|pdb|2G22|A Chain A, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
gi|110591539|pdb|2G22|B Chain B, Ketopiperazine-based Renin Inhibitors: Optimization Of The
"c" Ring
Length = 333
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 163/241 (67%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 3 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 62
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 63 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 121
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF E ++G+ P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G
Sbjct: 122 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 181
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+G
Sbjct: 182 NFHYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 240
Query: 317 A 317
A
Sbjct: 241 A 241
>gi|1065326|pdb|1HRN|A Chain A, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1065327|pdb|1HRN|B Chain B, High Resolution Crystal Structures Of Recombinant Human
Renin In Complex With Polyhydroxymonoamide Inhibitors
gi|1310896|pdb|1BIM|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310897|pdb|1BIM|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310898|pdb|1BIL|A Chain A, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|1310899|pdb|1BIL|B Chain B, Crystallographic Studies On The Binding Modes Of P2-P3
Butanediamide Renin Inhibitors
gi|241913388|pdb|3GW5|A Chain A, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|241913389|pdb|3GW5|B Chain B, Crystal Structure Of Human Renin Complexed With A Novel
Inhibitor
gi|283807203|pdb|3KM4|A Chain A, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
gi|283807204|pdb|3KM4|B Chain B, Optimization Of Orally Bioavailable Alkyl Amine Renin
Inhibitors
Length = 337
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 163/241 (67%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 7 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 66
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 67 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 125
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF E ++G+ P++ N+++QG++ E VFSF++NR+++ + GG+IV GG DP HY+G
Sbjct: 126 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEG 185
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+G
Sbjct: 186 NFHYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALG 244
Query: 317 A 317
A
Sbjct: 245 A 245
>gi|358057753|dbj|GAA96408.1| hypothetical protein E5Q_03075 [Mixia osmundae IAM 14324]
Length = 453
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 119/257 (46%), Positives = 161/257 (62%), Gaps = 9/257 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+++AQYF +I +GTPPQ F V+ DTGSSNLWVPS++C SIAC+ H KY + SS
Sbjct: 128 VPLSNFLNAQYFADITLGTPPQEFKVVLDTGSSNLWVPSTRCS-SIACFLHKKYDASASS 186
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG I YG+G++ G S D + IGD+ +K Q+F E+T+EP L F KFDGILG
Sbjct: 187 TYKENGTEFKIQYGSGSLEGVISNDVMTIGDITIKKQDFAESTKEPGLAFAFGKFDGILG 246
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE--EGGEIVFGGMDPDHYKGE 257
L + I+V P +YNM+ GLV++ FSFW AD E GGE V GG DP HYKG+
Sbjct: 247 LAYDRIAVQHVTPPFYNMIADGLVDKAEFSFWLGDTADGEGAPGGEFVMGGTDPAHYKGK 306
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ PV +KGYW+ ++ + + G A D+GTSL+A P+ + +N IGA
Sbjct: 307 IQWAPVRRKGYWEVELSKIKFGKDELELESTGAA--IDTGTSLIALPSDLAELLNKEIGA 364
Query: 318 ----TGIVSQECKAVVS 330
G + +C A+ S
Sbjct: 365 KKSWNGQYTVDCAAIPS 381
>gi|403294825|ref|XP_003938364.1| PREDICTED: renin [Saimiri boliviensis boliviensis]
Length = 400
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 121/278 (43%), Positives = 177/278 (63%), Gaps = 11/278 (3%)
Query: 49 RLDSKEGESFRTSIRKYSL-RGNLGESGDADI-----VALKNYMDAQYFGEIGIGTPPQN 102
R+ K S R S+++ + LG A + V L NYMD QY+GEIGIGTPPQ
Sbjct: 33 RISLKRMPSIRESLKERGVDMARLGPERMALVNVTSSVILTNYMDTQYYGEIGIGTPPQI 92
Query: 103 FTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG + Y TG +SGF
Sbjct: 93 FKVVFDTGSSNVWVPSSKCSRLYTACAYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFL 152
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+ P++ N+++QG
Sbjct: 153 SQDVITVGGITVT-QTFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPLFDNIISQG 211
Query: 222 LVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMID 279
++ E VFSF++NR+++ + GG+IV GG DP HY+G Y+ + + G WQ M V +
Sbjct: 212 VLKEDVFSFYYNRDSENSQSLGGQIVLGGSDPQHYEGNFHYINLIRTGLWQIPMKGVSV- 270
Query: 280 GQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 271 GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 308
>gi|2288908|emb|CAA71859.1| cathepsin E [Mus musculus]
Length = 397
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 130/328 (39%), Positives = 186/328 (56%), Gaps = 7/328 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ + L ++ A+ E S + + S N+ S + ++ NY
Sbjct: 19 QGALHRVPLRRHQ-SLRKKLRAQGQLSEFWRSHNLDMTRLSESCNVYSSVNEPLI---NY 74
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IGTPPQNFTVIFDTGSSNLWVPS C S AC H + +S TY + G
Sbjct: 75 LDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDTYTEVG 133
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ +
Sbjct: 134 NHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLGYPSL 193
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ G PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT+
Sbjct: 194 AAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTK 253
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+ YWQ + + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT I +
Sbjct: 254 QAYWQIALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPIDGEYA 312
Query: 326 KAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ + L+ +VS L+ Y
Sbjct: 313 VDCATLDTMPNVTFLINEVSYTLNPTDY 340
>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 416
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/243 (50%), Positives = 161/243 (66%), Gaps = 15/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L N+M+AQY+ EI IGTPPQ F VI DTGSSNLWVPSS+C SIAC+ H+KY S SS
Sbjct: 95 VPLTNFMNAQYYTEIDIGTPPQTFKVILDTGSSNLWVPSSQCT-SIACFLHTKYDSSASS 153
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ GF S D + GD+ + +F EAT+EP L F KFDGILG
Sbjct: 154 SYKANGTEFSIQYGSGSMEGFVSNDDIVFGDMSLSSVDFAEATKEPGLAFAFGKFDGILG 213
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + I+V PV+Y +VNQG+++EPVFSF + E++GGE +FGG+DP Y G+
Sbjct: 214 LAYDTIAVNHITPVFYELVNQGIISEPVFSFRLGSS--EDDGGEAIFGGIDPSAYSGKID 271
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y PV +K YW+ ++ GD ++ + TG AAI D+GTSL+A PT + +N
Sbjct: 272 YAPVRRKAYWEVELEKVSFGDDDLELENTG------AAI-DTGTSLIALPTDVAEMLNTQ 324
Query: 315 IGA 317
IGA
Sbjct: 325 IGA 327
>gi|126723599|ref|NP_001075713.1| cathepsin E precursor [Oryctolagus cuniculus]
gi|1168791|sp|P43159.1|CATE_RABIT RecName: Full=Cathepsin E; Flags: Precursor
gi|402729|gb|AAC37308.1| procathepsin E [Oryctolagus cuniculus]
Length = 396
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 185/304 (60%), Gaps = 10/304 (3%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L R+ L+ R+ L ++ A+ E ++ + + +Y+ + +S + ++ NY
Sbjct: 18 QGTLDRVPLR-RQPSLRKKLRAQGQLSEFWKAHKVDMVQYTETCTMEQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDT SSNLWVPS C S AC H ++R +S+TY + G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTVSSNLWVPSVYCT-SPACQMHPQFRPSQSNTYSEVG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ +
Sbjct: 133 TPFSIAYGTGSLTGIIGADQVSVQGLTVVGQQFGESVKEPGQTFVNAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ G PV+ NM+ Q LV+ P+FS + + N + G E+ FGG D H+ G +VPVT+
Sbjct: 193 AAGGVTPVFDNMMAQNLVSLPMFSVYMSSNPEGGSGSELTFGGYDSSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT---GIVS 322
+GYWQ + ++ + G FC GC AI D+GTSL+ GP+ I Q+ AIGAT G +
Sbjct: 253 QGYWQIALDEIQVGGSPM-FCPEGCQAIVDTGTSLITGPSDKIIQLQAAIGATPMDGEYA 311
Query: 323 QECK 326
EC+
Sbjct: 312 VECE 315
>gi|166235886|ref|NP_031825.2| cathepsin E preproprotein [Mus musculus]
gi|341940308|sp|P70269.2|CATE_MOUSE RecName: Full=Cathepsin E; Flags: Precursor
gi|5748654|emb|CAA08880.2| cathepsin E protein [Mus musculus]
gi|74146932|dbj|BAE25449.1| unnamed protein product [Mus musculus]
gi|74192082|dbj|BAE34257.1| unnamed protein product [Mus musculus]
gi|74219155|dbj|BAE26716.1| unnamed protein product [Mus musculus]
gi|74222421|dbj|BAE38113.1| unnamed protein product [Mus musculus]
gi|148707758|gb|EDL39705.1| cathepsin E [Mus musculus]
Length = 397
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 130/328 (39%), Positives = 186/328 (56%), Gaps = 7/328 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ + L ++ A+ E S + + S N+ S + ++ NY
Sbjct: 19 QGALHRVPLRRHQ-SLRKKLRAQGQLSEFWRSHNLDMTRLSESCNVYSSVNEPLI---NY 74
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IGTPPQNFTVIFDTGSSNLWVPS C S AC H + +S TY + G
Sbjct: 75 LDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDTYTEVG 133
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ +
Sbjct: 134 NHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLGYPSL 193
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ G PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT+
Sbjct: 194 AAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTK 253
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+ YWQ + + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT I +
Sbjct: 254 QAYWQIALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPDKIKQLQEAIGATPIDGEYA 312
Query: 326 KAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ + L+ +VS L+ Y
Sbjct: 313 VDCATLDTMPNVTFLINEVSYTLNPTDY 340
>gi|345797646|ref|XP_545694.3| PREDICTED: cathepsin E [Canis lupus familiaris]
Length = 396
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 182/308 (59%), Gaps = 30/308 (9%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGE---SGDADIV--------------ALKNYMDAQYF 91
R+ + +S R +R RG L E S + D++ L NY+D +YF
Sbjct: 23 RVPLRRHQSLRKKLRA---RGQLSEFWKSQNLDMIQFTESCTMNQDTNEPLINYLDMEYF 79
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G I IG+PPQNFTVIFDTGSSNLWVPS C S AC H+K+ +S+TY G I
Sbjct: 80 GTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHAKFYPSQSNTYSALGNQFSIQ 138
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG++SG D V + LVV Q+F E+ EP TF+ A+FDGILGLG+ ++VG
Sbjct: 139 YGTGSLSGIIGADQVNVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGLGYPSLAVGGVT 198
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT++GYWQ
Sbjct: 199 PVFDNMMAQNLVDIPMFSVYMSSDPEGGTGSELIFGGYDHSHFSGNLNWVPVTKQGYWQI 258
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQ 331
+ + + G T FC+ GC AI D+GTSL+ GP+ I Q+ +AIGA + + +
Sbjct: 259 ALDAIQVGG-TVMFCSEGCQAIVDTGTSLITGPSDEIKQLQNAIGA--------EPMDGE 309
Query: 332 YGEEIINM 339
YG E N+
Sbjct: 310 YGVECANL 317
>gi|430811193|emb|CCJ31368.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 411
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 179/308 (58%), Gaps = 24/308 (7%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES-------------- 74
GL+R+ L RK ++RV + E S RKY+L + G
Sbjct: 21 GLHRLEL--RKIPGDHRVNKVHNDIEAYSLA---RKYTLFYSYGRDERKNKEPIIHGKPL 75
Query: 75 -GDADIVALKNYMDAQYFGEIGIGTPPQNF-TVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+A V+L N+ +AQ +I IGTPPQ F V+ DTGSSNLWVPSSKC S+AC HSK
Sbjct: 76 GTNAHEVSLTNFFNAQCRIDITIGTPPQTFKVVVLDTGSSNLWVPSSKCT-SLACIIHSK 134
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S SSTY NG +I YG+G+ISGF S D + D+V+ QEF EA EP TF
Sbjct: 135 YDSSLSSTYIANGSKFEIRYGSGSISGFISTDKFSVSDIVLPAQEFAEAMSEPGFTFTFG 194
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGLG+ I+V +P +YNMV Q +NEPVF+FW + EGGE FGG+DP
Sbjct: 195 RFDGILGLGYSSIAVNGIIPPFYNMVEQNAINEPVFAFWMGNIEKDIEGGECTFGGIDPM 254
Query: 253 HYKGEHTYVPVTQKGYWQ-FDMGDVMIDGQTTGFCA-GGCAAIADSGTSLLAGPTTIITQ 310
HY+G+ TY+PV +K YW+ F + D+ F AI D+GTSL+ P I
Sbjct: 255 HYEGDLTYIPVRRKAYWEAFCLVDLSFFAYGKDFIGMENVGAILDTGTSLIVMPKNIADL 314
Query: 311 VNHAIGAT 318
+N+AIGAT
Sbjct: 315 LNNAIGAT 322
>gi|68051036|emb|CAI46901.1| nothepsin [Podarcis siculus]
Length = 414
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 158/236 (66%), Gaps = 2/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +YM+AQY+GE+ +GTPPQ FTV+FDTGSS+ WVPS++CY S AC H ++ S S +Y
Sbjct: 73 LYDYMNAQYYGEVSVGTPPQRFTVVFDTGSSDFWVPSARCY-SKACSMHKRFESFMSYSY 131
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ G+ + YGTG++ G ++D V+ +L ++ Q+F E EP LTF A FDG+LGLG
Sbjct: 132 AQVGEPFYLQYGTGSLIGVTAKDTVQFSNLSIEAQDFGEVRYEPDLTFTFAHFDGVLGLG 191
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ +SV +PV+ M+ Q L+ EPVFSF NR + E GGE++FGG+D YKG +V
Sbjct: 192 YPSLSVLHGLPVFDGMLRQQLIEEPVFSFILNRGGNTENGGELIFGGIDHSLYKGSIHWV 251
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT++ YW+ M +V I G C GCAAI DSGTSL+ GP + I ++ IGA
Sbjct: 252 PVTEQKYWKIHMDNVKIQGHIAA-CKDGCAAIVDSGTSLITGPPSQIIRLQQKIGA 306
>gi|46395759|sp|Q800A0.1|CATE_RANCA RecName: Full=Cathepsin E; Flags: Precursor
gi|29647357|dbj|BAC75398.1| cathepsin E [Rana catesbeiana]
Length = 397
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 177/311 (56%), Gaps = 18/311 (5%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKK----RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
F + LL+ V G+ R+ LK+ RK + L +K+G F S
Sbjct: 4 FLVVLLILSFVH-----GIIRVPLKRQKSMRKILKEKGKLSHLWTKQGNEFLQLSDSCSS 58
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIAC 127
E L NY+D +YFG+I IGTPPQ FTVIFDTGSSNLWVPS C S AC
Sbjct: 59 PETASE-------PLMNYLDVEYFGQISIGTPPQQFTVIFDTGSSNLWVPSIYCT-SQAC 110
Query: 128 YFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSL 187
H++YR S+TY NG++ I YGTG ++G D V + + V+ Q F E+ EP
Sbjct: 111 TKHNRYRPSESTTYVSNGEAFFIQYGTGNLTGILGIDQVTVQGITVQSQTFAESVSEPGS 170
Query: 188 TFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFG 247
TF + FDGILGL + ++V +PV+ NM+ Q LV P+F + NR+ + +GGE+V G
Sbjct: 171 TFQDSNFDGILGLAYPNLAVDNCIPVFDNMIAQNLVELPLFGVYMNRDPNSADGGELVLG 230
Query: 248 GMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTI 307
G D + G+ +VP+T +GYWQ + + + GQ FC+ GC AI D+GTSL+ GP+
Sbjct: 231 GFDTSRFSGQLNWVPITVQGYWQIQVDSIQVAGQVI-FCSDGCQAIVDTGTSLITGPSGD 289
Query: 308 ITQVNHAIGAT 318
I Q+ + IG T
Sbjct: 290 IEQLQNYIGVT 300
>gi|119567604|gb|ABL84270.1| aspartic protease [Musca domestica]
Length = 379
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 126/285 (44%), Positives = 175/285 (61%), Gaps = 12/285 (4%)
Query: 52 SKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGS 111
SK+ S + +R + NL + D L+N ++ +Y+G+I IGTPPQ F V+FDTGS
Sbjct: 34 SKDIRSEKAVLRS---KYNLPQPRDVSDEPLENSLNMKYYGDITIGTPPQKFVVLFDTGS 90
Query: 112 SNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGD 170
SNLWVPSS C+ + IAC H++Y SSTY KNG+ I YG+G++SGF S+D V +
Sbjct: 91 SNLWVPSSHCWIWDIACKKHNQYNHDDSSTYVKNGELISISYGSGSMSGFLSQDDVTVEG 150
Query: 171 LVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSF 230
L +K+Q F EA EP +F A FDGI G+ +Q ++ VP +YNM QGLV+ +FSF
Sbjct: 151 LTIKNQVFAEAMNEPGNSFTDANFDGIFGMAYQSLAEDNVVPPFYNMFAQGLVDANMFSF 210
Query: 231 WFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGC 290
NR+ +GG+++ GG+D Y G+ TYVPV+ +GYWQF++ I GQ+ C
Sbjct: 211 LLNRDGTSTDGGQMILGGVDSSLYTGDITYVPVSSQGYWQFEVTSGAIKGQS---ICDNC 267
Query: 291 AAIADSGTSLLAGPTTIITQVNHAIGAT-----GIVSQECKAVVS 330
AIAD+GTSL+ P+ +N IGAT G +C AV S
Sbjct: 268 QAIADTGTSLIVAPSDAYNTLNAEIGATYNEDDGNYYVDCSAVDS 312
>gi|301030231|gb|ADK47877.1| cathepsin D [Triatoma infestans]
Length = 390
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 121/267 (45%), Positives = 168/267 (62%), Gaps = 9/267 (3%)
Query: 53 KEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSS 112
KE + ++ ++ YS+ G L+N + QY+G I +GTPPQ FTVIFDTGSS
Sbjct: 40 KELKVYKDGLKMYSMLKKSGRE------VLRNSFNTQYYGNITLGTPPQEFTVIFDTGSS 93
Query: 113 NLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLV 172
NLW+PS+ C S+AC H+ Y RSSTY+ +G+ + YGTG+I+G S D ++IGDL
Sbjct: 94 NLWIPSAVCS-SVACRVHNTYDHDRSSTYQPDGRILRLTYGTGSIAGIMSSDVLQIGDLQ 152
Query: 173 VKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF 232
VK+Q F EA + F AK DGILGL F I+ AVP ++NM+ Q L+++PVFS +
Sbjct: 153 VKNQLFGEALQVSDSPFARAKPDGILGLAFPSIAQDHAVPPFFNMIKQELLDKPVFSVYL 212
Query: 233 NRNADEEEGGEIVFGGMDPDHYKGEH-TYVPVTQKGYWQFDMGDVMIDGQT-TGFCAGGC 290
NRN DEE GGEI+FGG+D + Y E T VP+T YW F M + + T +C GC
Sbjct: 213 NRNPDEEVGGEIIFGGVDEELYNKESMTTVPLTSTSYWMFQMDGISTSAEDGTSWCQNGC 272
Query: 291 AAIADSGTSLLAGPTTIITQVNHAIGA 317
IAD+GTS + GP++ + ++ +GA
Sbjct: 273 PGIADTGTSFIVGPSSDVDEIMELVGA 299
>gi|355681644|gb|AER96811.1| cathepsin E [Mustela putorius furo]
Length = 375
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 187/310 (60%), Gaps = 15/310 (4%)
Query: 31 YRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQ 89
+R+ LK+R+ L ++ AR E +S + +++ + + + ++ NY+D +
Sbjct: 1 HRVPLKRRQ-SLRKKLRARGQLSEFWKSQNLDMIQFTESCAMNQDANEPLI---NYLDME 56
Query: 90 YFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSAD 149
YFG I +G+PPQNFTVIFDTGSSNLWVPS C S AC H+++ +SSTY G
Sbjct: 57 YFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHTRFYPSQSSTYSTLGSHFS 115
Query: 150 IHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGK 209
I YGTG++SG D V + LVV Q+F E+ EP TF+ A+FDGILGLG+ ++VG
Sbjct: 116 IQYGTGSLSGILGADQVNVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGLGYPSLAVGG 175
Query: 210 AVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYW 269
PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT++GYW
Sbjct: 176 VTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLNWVPVTKQGYW 235
Query: 270 QFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVV 329
Q + + + G FC+ GC AI D+GTSL+ GP+ I Q+ AIGA + +
Sbjct: 236 QIALDAIQVGGAVM-FCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGA--------EPMD 286
Query: 330 SQYGEEIINM 339
+YG E N+
Sbjct: 287 GEYGVECANL 296
>gi|410986349|ref|XP_003999473.1| PREDICTED: cathepsin E [Felis catus]
Length = 396
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 179/289 (61%), Gaps = 7/289 (2%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ L++ L ++ AR E +S + +Y+ + + + ++ NYMD
Sbjct: 21 LHRVPLRRHP-SLRKKLRARGQLPEFWKSQNLDMIQYTEACTISQGANEPLI---NYMDT 76
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H+++ +S TY G
Sbjct: 77 EYFGSISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHARFYPSQSDTYSALGNHF 135
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG++SG D V + L+V Q+F E+ EP TF+ A+FDGILGLG+ ++VG
Sbjct: 136 SIQYGTGSLSGIIGTDQVYVEGLLVVGQQFGESVTEPGQTFVNAEFDGILGLGYPSLAVG 195
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT++GY
Sbjct: 196 GVTPVFDNMMAQNLVDIPMFSVYMSSDPESGVGSELIFGGYDHSHFSGTLNWVPVTKQGY 255
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ + DV+ G T FC+ GC AI D+GTSL+ GP+ I Q+ AIGA
Sbjct: 256 WQIAL-DVIQVGGTVMFCSEGCQAIVDTGTSLITGPSDKIKQLQKAIGA 303
>gi|358385852|gb|EHK23448.1| hypothetical protein TRIVIDRAFT_215801 [Trichoderma virens Gv29-8]
Length = 395
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 122/243 (50%), Positives = 160/243 (65%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPS C SIAC+ H+ Y S SS
Sbjct: 75 VPVTNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCN-SIACFLHATYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK+NG +IHYG+G+++GF S D V IGDL ++ Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKQNGSDFEIHYGSGSLTGFISNDVVTIGDLKIQKQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV +P +Y MVNQ L++EPVF+F+ +EG E VFGG+D HY G+
Sbjct: 194 LGYDTISVNGIIPPFYQMVNQKLLDEPVFAFYLGSG---DEGSEAVFGGVDESHYSGKIE 250
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ + +N
Sbjct: 251 YIPLRRKAYWEVDLDSIAFGDEVAELENTG-------AILDTGTSLNVLPSGLAELLNAE 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|156843876|ref|XP_001645003.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
gi|156115658|gb|EDO17145.1| hypothetical protein Kpol_1072p15 [Vanderwaltozyma polyspora DSM
70294]
Length = 399
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 190/335 (56%), Gaps = 29/335 (8%)
Query: 8 ITAGFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLD---SKEGESFRTSIRK 64
+T F LC+ V N L+++ L+KR D + RL S G + K
Sbjct: 2 LTVYFLLCVTWLGSV----NAKLHKVSLEKRD-DSEDIKGIRLSDYISHLGRKYLNQFEK 56
Query: 65 YSLRGNLGESGDADI----VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
++ + I V L NY++AQY+ ++ IGTPPQ F VI DTGSSNLWVPS
Sbjct: 57 AFPEVHISKKDQFTIESHDVPLDNYLNAQYYTDVSIGTPPQKFKVILDTGSSNLWVPSVG 116
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C S+ACY HSKY SSTY+ NG I YG+G++ G+ S+D + IGDL++ Q+F E
Sbjct: 117 CS-SLACYLHSKYDHSLSSTYRSNGSDFVIQYGSGSLKGYISQDTLTIGDLIIPQQDFAE 175
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
AT EP L F KFDGILGL + ISV KAVP YN +++GL+++P+F+F+ +
Sbjct: 176 ATAEPGLAFAFGKFDGILGLAYDSISVNKAVPPLYNAIHRGLLDKPMFAFYLGDEKSSKN 235
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYW--QFD---MGDVMIDGQTTGFCAGGCAAIAD 295
GGE FGG DP ++GE ++PV +K YW QFD +GD + + G A D
Sbjct: 236 GGEATFGGYDPSRFEGEIKWLPVRRKAYWEVQFDGIKLGDKFMKLE-------GHGAAID 288
Query: 296 SGTSLLAGPTTIITQVNHAIGA----TGIVSQECK 326
+GTSL+ P+ I +N+ IGA G + +CK
Sbjct: 289 TGTSLITLPSQIADFLNNEIGAKKSWNGQYTIDCK 323
>gi|384485237|gb|EIE77417.1| hypothetical protein RO3G_02121 [Rhizopus delemar RA 99-880]
Length = 399
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 161/241 (66%), Gaps = 6/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+GEI +GTPPQ F+V+FDTGSSN WVPS++C FS+AC H +Y + RSS
Sbjct: 70 VPLANYLNAQYYGEISLGTPPQIFSVVFDTGSSNTWVPSTRC-FSLACLTHRRYSASRSS 128
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY +NG I YGTGA+ G S+D +++G + + +Q+F E+T EP LTF+ A+FDGI G
Sbjct: 129 TYVRNGTQFSITYGTGALQGVISQDTLRVGGIQIDNQQFAESTIEPGLTFIYAQFDGIFG 188
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR---NADEEEGGEIVFGGMDPDHYKG 256
LG+ ISV + VP +YNMVN+ L++E VFSFW N A+ + GGEI FG +D Y G
Sbjct: 189 LGYDTISVQRVVPPFYNMVNRNLISESVFSFWINDINVQAENDIGGEIAFGEIDQTRYTG 248
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+ + PV +KGYW+ + + + A D+GTSL+ PT++ +++ +G
Sbjct: 249 DLIWSPVQRKGYWEIAIDNFRVGADPVN--PSSLTAAIDTGTSLILVPTSVSIEIHARLG 306
Query: 317 A 317
A
Sbjct: 307 A 307
>gi|194883084|ref|XP_001975634.1| GG20455 [Drosophila erecta]
gi|190658821|gb|EDV56034.1| GG20455 [Drosophila erecta]
Length = 404
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 130/312 (41%), Positives = 177/312 (56%), Gaps = 20/312 (6%)
Query: 16 LLLFPVVFSTPNG---GLYRIGLKK-----RKFDLNNRVAARLDSKEGESFRTSIRKYSL 67
L + PV F P LYR+ L++ +F+ RL K E
Sbjct: 11 LFILPVQFQPPVSCTLQLYRVPLRRFPSARHRFEKLGIRMDRLRLKYAEEVS------HF 64
Query: 68 RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IA 126
RG + + L NY+DAQYFG I IGTPPQ+F VIFDTGSSNLWVPS+ C +A
Sbjct: 65 RGEW--NSEVKATPLSNYLDAQYFGPITIGTPPQSFKVIFDTGSSNLWVPSATCASRMVA 122
Query: 127 CYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPS 186
C H++Y + RS++++ G IHYG+G++ GF S D V++ L + DQ F EAT P
Sbjct: 123 CRVHNRYFAKRSTSHQVRGDRFAIHYGSGSLFGFLSTDTVRVAGLEIHDQTFAEATEMPG 182
Query: 187 LTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVF 246
FL AKFDGI GL ++ IS+ + P +Y M+ QGL+ +P+FS + +R+ E+EGG I F
Sbjct: 183 PIFLAAKFDGIFGLAYRSISMQRIKPPFYAMMEQGLLTKPIFSVYLSRHG-EKEGGAIFF 241
Query: 247 GGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
GG +P +Y G TYV V+ + YWQ M +I + C GC I D+GTS LA P
Sbjct: 242 GGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVI--RNLELCQQGCEVIIDTGTSFLALPYD 299
Query: 307 IITQVNHAIGAT 318
+N +IG T
Sbjct: 300 QAILINESIGGT 311
>gi|195161645|ref|XP_002021673.1| GL26637 [Drosophila persimilis]
gi|194103473|gb|EDW25516.1| GL26637 [Drosophila persimilis]
Length = 387
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 158/238 (66%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+N ++ +Y+G IGIGTP Q F V+FDTGS+NLWVPS+KC ++AC H++Y SG+SST
Sbjct: 63 LQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQYHSGQSST 122
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+S I YGTG+++GF SED V + + ++ Q F EA EP TF+ A F GI+GL
Sbjct: 123 YVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAPFAGIMGL 182
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F+ I+V P + NM+ QGL++EPV SF+ R +GGE++ GG+DP Y G TY
Sbjct: 183 AFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSLYTGNLTY 242
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPV+ GYWQF + V G C+ GC AIAD+GTSL+ P ++N +GAT
Sbjct: 243 VPVSVAGYWQFKVNSVKSGGIL--LCS-GCQAIADTGTSLIVVPEAAYAKINSLLGAT 297
>gi|1657354|emb|CAA66056.1| procathepsin E [Mus musculus]
gi|13529380|gb|AAH05432.1| Cathepsin E [Mus musculus]
gi|71059833|emb|CAJ18460.1| Ctse [Mus musculus]
Length = 397
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 185/328 (56%), Gaps = 7/328 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ + L ++ A+ E S + + S N+ S + ++ NY
Sbjct: 19 QGALHRVPLRRHQ-SLRKKLRAQGQLSEFWRSHNLDMTRLSESCNVYSSVNEPLI---NY 74
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IGTPPQNFTVIFDTGSSNLWVPS C S AC H + +S TY + G
Sbjct: 75 LDMEYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSDTYTEVG 133
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ +
Sbjct: 134 NHFSIQYGTGSLTGIIGADQVSVEGLTVDGQQFGESVKEPGQTFVNAEFDGILGLGYPSL 193
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+ G PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT+
Sbjct: 194 AAGGVTPVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTK 253
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+ YWQ + + + G T FC+ GC AI D+GTSL+ GP I + AIGAT I +
Sbjct: 254 QAYWQIALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPDKIKHLQEAIGATPIDGEYA 312
Query: 326 KAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ + L+ +VS L+ Y
Sbjct: 313 VDCATLDTMPNVTFLINEVSYTLNPTDY 340
>gi|255713834|ref|XP_002553199.1| KLTH0D11264p [Lachancea thermotolerans]
gi|238934579|emb|CAR22761.1| KLTH0D11264p [Lachancea thermotolerans CBS 6340]
Length = 417
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 126/258 (48%), Positives = 169/258 (65%), Gaps = 18/258 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF EI +GTPPQ+F VI DTGSSNLWVPS +C S+AC+ HSKY SS
Sbjct: 93 VPLSNYLNAQYFTEITLGTPPQSFKVILDTGSSNLWVPSDEC-GSLACFLHSKYSHDASS 151
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG + I YG+G++ G+ S+D + IGDL + Q+F EAT EP L F KFDGILG
Sbjct: 152 SYKANGTNFAIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILG 211
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG-GEIVFGGMDPDHYKGEH 258
LG+ I+V K VP Y +N GL++EP F+F+ N D EE GE+ FGG+D YKG
Sbjct: 212 LGYDTIAVDKVVPPVYKAINDGLLDEPRFAFYLNNADDSEESTGEVTFGGIDSSKYKGNI 271
Query: 259 TYVPVTQKGYWQ--FD---MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
T++PV +K YW+ FD +GD + + TG AAI D+GTSL+A P+ + +N
Sbjct: 272 TWLPVRRKAYWEVKFDGIGLGDEYAELEGTG------AAI-DTGTSLIALPSGLAEVLNA 324
Query: 314 AIGA----TGIVSQECKA 327
IGA +G + +C++
Sbjct: 325 EIGAKKGWSGQYTVDCES 342
>gi|308809631|ref|XP_003082125.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
gi|116060592|emb|CAL55928.1| putative vacuaolar aspartic proteinase (ISS) [Ostreococcus tauri]
Length = 505
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 109/219 (49%), Positives = 149/219 (68%), Gaps = 7/219 (3%)
Query: 124 SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATR 183
S+ C H+K+ S S TY+ +G I YG+G++SGF S+D V +GD+ VK Q F EAT+
Sbjct: 91 SVPCDLHAKFDSAASETYEADGTPFAIQYGSGSLSGFLSQDDVTVGDITVKGQYFAEATK 150
Query: 184 EPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE---- 239
EP + FL AKFDGILGLGF ISV K PV+YNM+ Q L+++ +FSFW NR ++ +
Sbjct: 151 EPGIAFLFAKFDGILGLGFDTISVDKVKPVFYNMMEQKLIDKNMFSFWLNRTSNVDGTPS 210
Query: 240 -EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAG--GCAAIADS 296
GGE+VFGG DP H+ GEHTY PVT+ GYWQ M D + G++ G C G GC IAD+
Sbjct: 211 VTGGELVFGGSDPKHFVGEHTYAPVTRAGYWQIKMDDFKVAGRSLGVCKGENGCQVIADT 270
Query: 297 GTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEE 335
GTSLL GP ++ ++N IGA ++ +EC+ ++ QY +E
Sbjct: 271 GTSLLTGPADVVKKINDYIGAHSMLGEECRMLIDQYADE 309
>gi|126309849|ref|XP_001370462.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 152/236 (64%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWVPS C S AC H ++ +SSTY
Sbjct: 65 LANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSKSSTY 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
NG++ + YGTG+++G F D V I + + +QEF + EP F+ A+FDGILGL
Sbjct: 124 SSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS G A V + + L+N PVF+F+ + N + GGE+VFGG+D Y G+ +
Sbjct: 184 YPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWA 243
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT++ YWQ + I GQ TG+C+GGC AI D+GTSLL P I +++ IGA
Sbjct: 244 PVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGA 299
>gi|388326405|pdb|3VCM|A Chain A, Crystal Structure Of Human Prorenin
gi|388326406|pdb|3VCM|B Chain B, Crystal Structure Of Human Prorenin
Length = 335
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 113/239 (47%), Positives = 161/239 (67%), Gaps = 6/239 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NYMD QY+GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + S
Sbjct: 10 VILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDS 69
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+YK NG + Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++
Sbjct: 70 SSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVV 128
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF E ++G+ P++ N+++QG++ E VFSF++NR++ GG+IV GG DP HY+G
Sbjct: 129 GMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRDS---LGGQIVLGGSDPQHYEGNF 185
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
Y+ + + G WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 186 HYINLIKTGVWQIQMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 243
>gi|355558869|gb|EHH15649.1| Renin [Macaca mulatta]
gi|355746005|gb|EHH50630.1| Renin [Macaca fascicularis]
Length = 406
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/290 (44%), Positives = 182/290 (62%), Gaps = 22/290 (7%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL G + +++ +L GN S V L NYMD QY+
Sbjct: 43 RESLKERGVDM-----ARL----GPEWSQPMKRLAL-GNTTSS-----VILTNYMDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F V+FDTGSSN+WVPSSKC AC +H + + SS+YK NG +
Sbjct: 88 GEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFDASDSSSYKHNGTELTL 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
Y TG +SGF S+D + +G + V Q F E T P+L F+LA+FDG++G+GF E ++G+
Sbjct: 148 RYSTGTVSGFLSQDIITVGGITVT-QMFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNR---NADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
P++ N+++QG++ E VFSF++NR NA + GG+IV GG DP HY+G Y+ + + G
Sbjct: 207 TPIFDNILSQGVLKEDVFSFYYNRWGLNA-QSLGGQIVLGGSDPQHYEGNFHYINLIKTG 265
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ M V + G +T C GC A+ D+G S ++G T+ I ++ A+GA
Sbjct: 266 VWQIPMKGVSV-GSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA 314
>gi|38303893|gb|AAH62002.1| Ctse protein [Rattus norvegicus]
Length = 398
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 179/322 (55%), Gaps = 16/322 (4%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV--------------ALKNYMDAQYF 91
V R+ + +S R +R + S + D++ L NY+D +YF
Sbjct: 22 VLHRVPLRRHQSLRKKLRAQGQLSDFWRSHNLDVIEFSESCNVDKGINEPLINYLDMEYF 81
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G + IG+P QNFTVIFDTGSSNLWVPS C S AC H + +SSTY + G I
Sbjct: 82 GTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSSTYMEVGNHFSIQ 140
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+++G D V + L V+ Q+F E+ +EP TF+ A+FDGILGLG+ ++VG
Sbjct: 141 YGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGLGYPSLAVGGVT 200
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT++GYWQ
Sbjct: 201 PVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQI 260
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQ 331
+ + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT + + +
Sbjct: 261 ALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPMDGEYAVDCATL 319
Query: 332 YGEEIINMLLAKVSAKLSLRHY 353
+ L+ VS LS Y
Sbjct: 320 NMMPNVTFLINGVSYTLSPTAY 341
>gi|344312912|emb|CCC33063.1| cathepsin D-1 [Dermanyssus gallinae]
Length = 383
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 160/253 (63%), Gaps = 6/253 (2%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGR 137
I L N+ DAQY+G I IGTPPQ F VIFDTGSS+LWVPSSKC S IAC HSKY + +
Sbjct: 54 IEPLNNFGDAQYYGPITIGTPPQTFQVIFDTGSSDLWVPSSKCPSSNIACATHSKYNAEK 113
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY NG I YG+G++SG S D V + + V Q F E T E +F+ K+DGI
Sbjct: 114 SSTYVANGTKFAIQYGSGSVSGVLSTDTVSVSGITVTKQTFGEITEESGDSFIYGKYDGI 173
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LG+G+ EI+ +PV+ MV Q +V + +FSF+ R+ G E+V GG+DP HYKG+
Sbjct: 174 LGMGYPEIA-SSGLPVFDQMVKQKVVEKAIFSFFLTRDPQHPIGSELVLGGIDPKHYKGD 232
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPT----TIITQVNH 313
TY P+T++ YWQF + V ++G+ C GC IAD+GTSL GPT + +Q++
Sbjct: 233 ITYAPLTRESYWQFRVDKVTLNGKAAPVCQKGCEGIADTGTSLFVGPTADVAALASQLDA 292
Query: 314 AIGATGIVSQECK 326
A G+ +C+
Sbjct: 293 QETAPGLYLVDCE 305
>gi|149707989|ref|XP_001491088.1| PREDICTED: cathepsin E [Equus caballus]
Length = 396
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 187/314 (59%), Gaps = 15/314 (4%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
+G L+R+ L++ L ++ AR E +S + +++ + +S + ++ NY
Sbjct: 18 HGLLHRVPLRRHP-SLRKKLRARGQLSEFWKSQNLDMIQFTETCTMDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H+++ +S+TY G
Sbjct: 74 LDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SSACKTHTRFYPSQSNTYSMVG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG++SG D V + L V Q F E+ EP TF+ A+FDGILGLG+ +
Sbjct: 133 SQFSIQYGTGSLSGIIGADQVSVEGLTVVGQRFGESVTEPGQTFVDAEFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+GYWQ + + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGA
Sbjct: 253 QGYWQIALDAIQVGG-TVMFCSQGCQAIVDTGTSLITGPPDKIKQLQEAIGA-------- 303
Query: 326 KAVVSQYGEEIINM 339
+ + +Y E +N+
Sbjct: 304 QPMDGEYAVECVNL 317
>gi|126309851|ref|XP_001370482.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 114/236 (48%), Positives = 152/236 (64%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWVPS C S AC H ++ +SSTY
Sbjct: 65 LANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACTNHPQFNPSQSSTY 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
NG++ + YGTG+++G F D V I + + +QEF + EP F+ A+FDGILGL
Sbjct: 124 SSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS G A V + + L+N PVF+F+ + N + GGE+VFGG+D Y G+ +
Sbjct: 184 YPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWA 243
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT++ YWQ + I GQ TG+C+GGC AI D+GTSLL P I +++ IGA
Sbjct: 244 PVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGA 299
>gi|195121164|ref|XP_002005091.1| GI20282 [Drosophila mojavensis]
gi|193910159|gb|EDW09026.1| GI20282 [Drosophila mojavensis]
Length = 392
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/279 (43%), Positives = 160/279 (57%), Gaps = 7/279 (2%)
Query: 56 ESFRTSIRKYSLR-GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNL 114
+ F + + LR G +A V L NY+DAQYFG I IGTP Q F VIFDTGS+NL
Sbjct: 38 QQFGIRMDRLRLRYGIWANEQEARSVPLSNYLDAQYFGPISIGTPQQTFNVIFDTGSANL 97
Query: 115 WVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVK 174
WVPS C +AC HS++ + +SS+Y+ NGK DI YG+G+++G+ S D V++ L +
Sbjct: 98 WVPSESCQKKLACQIHSRFNAKKSSSYRSNGKRFDIQYGSGSLAGYLSHDTVRVAGLEIP 157
Query: 175 DQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR 234
+Q F EAT P FL AKFDGI GLG++ IS+ P +Y ++ Q L+ PVFS + NR
Sbjct: 158 NQTFAEATDMPGPIFLAAKFDGIFGLGYRGISIQNIKPPFYAIMEQNLLKRPVFSVYLNR 217
Query: 235 NADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIA 294
+GG + FGG +Y+G TYVPVT + YWQ + I C GC I
Sbjct: 218 ELGSNQGGYLFFGGSSSRYYRGNFTYVPVTHRAYWQVKLETARIG--KLQLCLNGCQVII 275
Query: 295 DSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVV 329
D+GTS LA P +N +IG T G S C V
Sbjct: 276 DTGTSFLAVPYEQAILINESIGGTPAAYGQFSVPCDQVA 314
>gi|343425806|emb|CBQ69339.1| probable PEP4-aspartyl protease [Sporisorium reilianum SRZ2]
Length = 419
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 162/253 (64%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++++AQYF +I +GTP Q F VI DTGSSNLWVPS+KC SIAC+ H KY S SS
Sbjct: 98 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 156
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YKKNG I YG+G++ G S+D +KIGDL +K Q+F EAT EP L F KFDGILG
Sbjct: 157 SYKKNGTEFKIQYGSGSMEGIVSQDTLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 216
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV VP +Y M++QGL++ P SF+ + EE+GGE VFGG+D HY G+
Sbjct: 217 LAYDTISVNGIVPPFYQMIDQGLLDSPQVSFYLGSS--EEDGGEAVFGGIDESHYSGKIH 274
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
+ PV +KGYW+ + D + G G AAI D+GTSL+A T +N IGAT
Sbjct: 275 WAPVKRKGYWEVAL-DKLALGDEELELENGSAAI-DTGTSLIAMATDTAEILNAEIGATK 332
Query: 319 ---GIVSQECKAV 328
G S +C V
Sbjct: 333 SWNGQYSVDCDKV 345
>gi|403299330|ref|XP_003940442.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 425
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 121/276 (43%), Positives = 169/276 (61%), Gaps = 3/276 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRS 138
V L ++D QYFGEIG+GTPPQNFTV FDTGSSNLWVPS +C+ S + C+FH ++ S
Sbjct: 73 VPLSKFLDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSKRCHLSSVPCWFHHRFDPKAS 132
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+++ NG I YGTG + G SED + IG + F EA E SL F +++ DGIL
Sbjct: 133 SSFQPNGTKFAIQYGTGRVDGILSEDKLTIGGIKGASVIFGEALWESSLVFTVSRPDGIL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GLGF ++V P +V QGL+++PVFSF+ NR+ + +GGE+V GG DP HY
Sbjct: 193 GLGFPILAVEGVRPPLDVLVEQGLLDKPVFSFYLNRDPEVADGGELVLGGSDPAHYIPPL 252
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
T+VPVT YWQ M V + + T CA GCAA+ D+GT ++ GP I ++ AIG
Sbjct: 253 TFVPVTVPAYWQIHMERVKVGSELT-LCARGCAAVLDTGTPVIIGPAEEIRALHKAIGGL 311
Query: 319 GIVSQECKAVVSQYGE-EIINMLLAKVSAKLSLRHY 353
+++ E S+ + +++ L V L+ + Y
Sbjct: 312 PLLAGEYIIRCSEIPKLPTVSLFLGGVWFNLTAQDY 347
>gi|291409605|ref|XP_002721070.1| PREDICTED: pepsin II-1-like [Oryctolagus cuniculus]
Length = 387
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 170/281 (60%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHTPNLATKYFPKETFASVSTESLENYLDTEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS+ C S+AC H ++ SST++ ++ I YGTG++
Sbjct: 83 TPPQEFTVIFDTGSSNLWVPSTYCS-SLACILHKRFNPDDSSTFQATSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + +Q F + EP L L+A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVKVGSIEDTNQIFGLSKTEPGLFLLVAPFDGILGLAYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
NQGLV++ +FS + ++DE++G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNQGLVSQDLFSVYL--SSDEQKGSLVMFGGIDSSYYTGSLNWVPVSHEGYWQITVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA C A+ D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADSCQAVVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|2851407|sp|P16228.3|CATE_RAT RecName: Full=Cathepsin E; Flags: Precursor
gi|1113086|dbj|BAA08128.1| cathepsin E precursor [Rattus rattus]
gi|149058663|gb|EDM09820.1| cathepsin E, isoform CRA_a [Rattus norvegicus]
Length = 398
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 179/322 (55%), Gaps = 16/322 (4%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV--------------ALKNYMDAQYF 91
V R+ + +S R +R + S + D++ L NY+D +YF
Sbjct: 22 VLHRVPLRRHQSLRKKLRAQGQLSDFWRSHNLDMIEFSESCNVDKGINEPLINYLDMEYF 81
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G + IG+P QNFTVIFDTGSSNLWVPS C S AC H + +SSTY + G I
Sbjct: 82 GTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSSTYMEVGNHFSIQ 140
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+++G D V + L V+ Q+F E+ +EP TF+ A+FDGILGLG+ ++VG
Sbjct: 141 YGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGLGYPSLAVGGVT 200
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT++GYWQ
Sbjct: 201 PVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQI 260
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQ 331
+ + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT + + +
Sbjct: 261 ALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGATPMDGEYAVDCATL 319
Query: 332 YGEEIINMLLAKVSAKLSLRHY 353
+ L+ VS LS Y
Sbjct: 320 NMMPNVTFLINGVSYTLSPTAY 341
>gi|254583898|ref|XP_002497517.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
gi|238940410|emb|CAR28584.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
Length = 418
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 120/255 (47%), Positives = 166/255 (65%), Gaps = 7/255 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ E+ +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 95 VPLTNYLNAQYYTEVSLGTPPQNFKVILDTGSSNLWVPSTECS-SLACFLHSKYDHDSSS 153
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ G+ S+D + +GDL + Q+F EAT EP L F KFDGILG
Sbjct: 154 SYKPNGTEFAIRYGSGSLEGYISQDTLNLGDLSITKQDFAEATSEPGLQFAFGKFDGILG 213
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +YN QGL++EP F+F+ R+ + ++GG FGG+D Y+GE T
Sbjct: 214 LGYDTISVDGVVPPFYNAWKQGLLDEPKFAFYLGRDGESQDGGVATFGGVDDSKYEGEIT 273
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
++P+ +K YW+ + + G+ AAI D+GTSL+A P+ + +N IGA
Sbjct: 274 WLPIRRKAYWEVKFDGIGL-GEEYAELENHGAAI-DTGTSLIALPSGLAEIINAEIGAKK 331
Query: 318 --TGIVSQECKAVVS 330
TG + EC+A S
Sbjct: 332 SWTGQYTVECEARSS 346
>gi|327279867|ref|XP_003224677.1| PREDICTED: cathepsin E-A-like [Anolis carolinensis]
Length = 406
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 116/255 (45%), Positives = 164/255 (64%), Gaps = 6/255 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L +YM+ +Y+GE+ IGTP Q FTVIFDTGS++ WVPS+ C S AC H K+++ S
Sbjct: 71 VKLCDYMNTEYYGEVSIGTPAQKFTVIFDTGSADFWVPSAYC-ISDACELHQKFKAFSSE 129
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+Y G+ + YGTG + G ++D V+IG++ ++DQ F E+ EP +TF A FDG+LG
Sbjct: 130 SYAHGGQKFTLQYGTGRLMGIVAKDKVQIGNITIEDQAFGESVFEPGMTFAFAHFDGVLG 189
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ +SV ++PV+ N++ Q LV EP+FSF NR + + GG ++ GG+D + G
Sbjct: 190 LGYPTLSVTNSMPVFDNIIKQHLVEEPLFSFSLNREHNVDNGGVLILGGIDHSLFTGPIH 249
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG--- 316
+ PVT+KGYWQ M V I GQ T C GC AI DSGTSL+ GP + I ++ +IG
Sbjct: 250 WFPVTKKGYWQIHMNSVKIQGQVTS-CISGCEAIVDSGTSLITGPLSQIVRLQQSIGAFP 308
Query: 317 -ATGIVSQECKAVVS 330
ATG +C+ V S
Sbjct: 309 TATGEFLVDCRRVSS 323
>gi|344276734|ref|XP_003410162.1| PREDICTED: LOW QUALITY PROTEIN: renin-like [Loxodonta africana]
Length = 409
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 190/315 (60%), Gaps = 11/315 (3%)
Query: 13 FLCLLLFPVVFSTP--NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN 70
L +L F P +G RI LKK + R + + + T ++S R +
Sbjct: 12 LLLVLWGSCTFGLPADSGTFRRIFLKKMP---SVRESLKERGVDVAKLSTEWSQFSKRVS 68
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYF 129
LG +G + ++ L NY+D QY+GEIGIGTPPQ F VIFDTGS+NLWVPSSKC AC
Sbjct: 69 LG-NGTSPMI-LTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSSKCSPLYTACET 126
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
H++Y S SS+Y +N I+YG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 127 HNRYDSSESSSYVENKMEFTINYGSGKVKGFLSQDVVTMGGITVT-QTFGEVTELPVIPF 185
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LAKFDGILG+GF +V PV+ N+++QG++ E VFS +++RN+ GGEIV GG
Sbjct: 186 MLAKFDGILGMGFPAQAVSGVTPVFDNIISQGVLKEDVFSVYYSRNS-HLLGGEIVLGGS 244
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP +Y+G YV +++ G WQ M V + T FC GCAA+ D+G S + GPT+ +
Sbjct: 245 DPQYYQGNFHYVSLSKNGLWQIKMKGVSVRSATL-FCEEGCAAMVDTGASFITGPTSSLK 303
Query: 310 QVNHAIGATGIVSQE 324
+ A+GA +++ E
Sbjct: 304 LLMDALGAKELITNE 318
>gi|195156259|ref|XP_002019018.1| GL25647 [Drosophila persimilis]
gi|198476098|ref|XP_002132261.1| GA25369 [Drosophila pseudoobscura pseudoobscura]
gi|194115171|gb|EDW37214.1| GL25647 [Drosophila persimilis]
gi|198137543|gb|EDY69663.1| GA25369 [Drosophila pseudoobscura pseudoobscura]
Length = 373
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 176/290 (60%), Gaps = 16/290 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGES 74
L L V+ + + L+R+ + K + N V R + +S+ RT KY L +
Sbjct: 6 LALIAVLVAVASAELHRVPVLKEQ----NFVKTRQNVLAEKSYLRT---KYQLP----QP 54
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYR 134
D + L N M+ Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y
Sbjct: 55 RDINEETLSNSMNMAYYGAITIGTPAQSFKVLFDSGSSNLWVPSNTCQ-SDACLTHNQYD 113
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S S+TY NG+S I YGTG+++G+ SED V + L V Q F E+T EP F A F
Sbjct: 114 SSASTTYVANGESFSIQYGTGSLTGYLSEDTVDVNGLKVTSQTFAESTNEPGTNFNNANF 173
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ ++ ++V PV+YNMV++GLV++ VFSF+ R +GGE++FGG D Y
Sbjct: 174 DGILGMAYESLAVDGVAPVFYNMVSEGLVDQSVFSFYLARAGSSSDGGELIFGGSDSSLY 233
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
G TYVP++++GYWQF M DG + CA C AIAD+GTSL+ P
Sbjct: 234 TGALTYVPISEQGYWQFTMASASSDGNS--LCA-DCQAIADTGTSLIVAP 280
>gi|198477124|ref|XP_002136749.1| GA22213 [Drosophila pseudoobscura pseudoobscura]
gi|198145061|gb|EDY71766.1| GA22213 [Drosophila pseudoobscura pseudoobscura]
Length = 373
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 176/290 (60%), Gaps = 16/290 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGES 74
L L V+ + + L+R+ + K + N V R + +S+ RT KY L +
Sbjct: 6 LALIAVLVAVASAELHRVPVLKEQ----NFVKTRQNVLAEKSYLRT---KYQLP----QP 54
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYR 134
D + L N M+ Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y
Sbjct: 55 RDINEETLSNSMNMAYYGAITIGTPAQSFKVLFDSGSSNLWVPSNTCQ-SDACLTHNQYD 113
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S S+TY NG+S I YGTG+++G+ SED V + L V Q F E+T EP F A F
Sbjct: 114 SSASTTYVANGESFSIQYGTGSLTGYLSEDTVDVNGLKVTSQTFAESTNEPGTNFNNANF 173
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ ++ ++V PV+YNMV++GLV++ VFSF+ R +GGE++FGG D Y
Sbjct: 174 DGILGMAYESLAVDGVAPVFYNMVSEGLVDQSVFSFYLARAGSSSDGGELIFGGSDSSLY 233
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
G TYVP++++GYWQF M DG + CA C AIAD+GTSL+ P
Sbjct: 234 TGALTYVPISEQGYWQFTMASASSDGNS--LCA-DCQAIADTGTSLIVAP 280
>gi|444316168|ref|XP_004178741.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
gi|387511781|emb|CCH59222.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
Length = 413
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 119/256 (46%), Positives = 167/256 (65%), Gaps = 8/256 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQYF +I IGTPPQ+F V+ DTGSSNLWVPS +C S+ACY HSKY SS
Sbjct: 89 VPLSNYMNAQYFADIKIGTPPQSFKVVLDTGSSNLWVPSKEC-GSLACYLHSKYNHDESS 147
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG + I YG+G++ G+ S+D ++IGDL + Q+F EAT EP ++F KFDGILG
Sbjct: 148 TYKANGSAFAIQYGSGSLEGYISQDVMEIGDLKITKQDFAEATSEPGISFAFGKFDGILG 207
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
L + I+V + VP YN +NQGL++EP F+F+ + + ++ GGE VFGG+D ++G+
Sbjct: 208 LAYDTIAVNRVVPPVYNAINQGLLDEPKFAFYLGDASKSKDNGGEAVFGGIDETKFEGDI 267
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + T G A D+GTSL+ P+ + +N IGA
Sbjct: 268 TWLPVRRKAYWEVKLEGLGLGEEYTELENHGAA--IDTGTSLITLPSGLAEIINSEIGAK 325
Query: 318 ---TGIVSQECKAVVS 330
TG + EC S
Sbjct: 326 KGWTGQYTIECDKRAS 341
>gi|71021685|ref|XP_761073.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
gi|46100637|gb|EAK85870.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
Length = 418
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 162/253 (64%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++++AQYF +I +GTP Q+F VI DTGSSNLWVPS+KC SIAC+ H KY S SS
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQDFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YKKNG I YG+G++ G S D +KIGDL +K Q+F EAT EP L F KFDGILG
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV VP Y M+NQGL++ P SF+ + EE+GGE VFGG+D HY G+
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGSS--EEDGGEAVFGGIDDSHYTGKIH 273
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
+ PV +KGYW+ + D + G G AAI D+GTSL+A T +N IGAT
Sbjct: 274 WSPVKRKGYWEVAL-DKLALGDEELELDNGSAAI-DTGTSLIAMATDTAEILNAEIGATK 331
Query: 319 ---GIVSQECKAV 328
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|149058665|gb|EDM09822.1| cathepsin E, isoform CRA_c [Rattus norvegicus]
Length = 365
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 168/287 (58%), Gaps = 16/287 (5%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV--------------ALKNYMDAQYF 91
V R+ + +S R +R + S + D++ L NY+D +YF
Sbjct: 22 VLHRVPLRRHQSLRKKLRAQGQLSDFWRSHNLDMIEFSESCNVDKGINEPLINYLDMEYF 81
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G + IG+P QNFTVIFDTGSSNLWVPS C S AC H + +SSTY + G I
Sbjct: 82 GTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SPACKAHPVFHPSQSSTYMEVGNHFSIQ 140
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+++G D V + L V+ Q+F E+ +EP TF+ A+FDGILGLG+ ++VG
Sbjct: 141 YGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGLGYPSLAVGGVT 200
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT++GYWQ
Sbjct: 201 PVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQI 260
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT
Sbjct: 261 ALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGAT 306
>gi|56269596|gb|AAH86835.1| Nots protein [Danio rerio]
Length = 443
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 139/360 (38%), Positives = 206/360 (57%), Gaps = 23/360 (6%)
Query: 9 TAGFFLCLLLFPVVFSTPNGGLYRIGLK-----KRKFDLNNRVAARLDSKEGESF-RTSI 62
+AG L L+L + FS GL R+ L+ + + + ++ L + + F R +
Sbjct: 30 SAGLILILVLH-LGFSH---GLLRVALRQYPSVRSRLRASAQLEEFLKQHQPDMFSRRYV 85
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
+ + + G L N+MDAQ+FG+I +G P QNFTV+FDTGSS+LWVPSS C
Sbjct: 86 QCFPPAQHFLRLGGRITERLYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC- 144
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
S AC H+K+++ SSTY +G+ IHYG+G + G + D +K+G + V++Q F EA
Sbjct: 145 VSQACALHNKFKAFESSTYTHDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAV 204
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
EP +F+LA+FDG+LGLGF +++ K PV+ +M+ Q ++++PVFSF+ N GG
Sbjct: 205 YEPGFSFVLAQFDGVLGLGFPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNNGS-GFGG 263
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC---AGGCAAIADSGTS 299
E+VFGGMD + ++PVTQKGYWQ + V + G + FC GC AI D+GTS
Sbjct: 264 ELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALS-FCYRSVQGCQAIVDTGTS 322
Query: 300 LLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYKK 355
L+ GP I + IGAT + E C V S +++ L+ V LS Y +
Sbjct: 323 LIGGPARDILILQQFIGATPTANGEFVVDCVRVSSL---PVVSFLINSVEYSLSGEQYIR 379
>gi|301786118|ref|XP_002928474.1| PREDICTED: cathepsin E-like [Ailuropoda melanoleuca]
Length = 396
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 126/311 (40%), Positives = 187/311 (60%), Gaps = 15/311 (4%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ L++ + L ++ AR E +S + +++ + + + ++ NY+D
Sbjct: 21 LHRVPLRRHQ-SLRKKLRARGQLSEFWKSQNLDMIQFTESCTMNQDANEPLI---NYLDM 76
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS++ +S+TY G
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRFYPSQSNTYSVLGSHF 135
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG++SG D V + LVV Q+F E+ EP TF+ A+FDGILGLG+ ++VG
Sbjct: 136 SIQYGTGSLSGIIGADQVDVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGLGYPSLAVG 195
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT++GY
Sbjct: 196 GVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLHWVPVTKQGY 255
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQ + + + G FC+ GC AI D+GTSL+ GP+ + Q+ AIGA + +
Sbjct: 256 WQIALDAIQVGGAVM-FCSEGCQAIVDTGTSLITGPSDKVKQLQKAIGA--------EPM 306
Query: 329 VSQYGEEIINM 339
+YG E N+
Sbjct: 307 DGEYGVECANL 317
>gi|291409616|ref|XP_002721074.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 170/280 (60%), Gaps = 28/280 (10%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G ++NYMDA+YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLQDYLKTHSPNPATKYFPNAAYAKESTEKMENYMDAEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS C S+AC FH ++ +SSTY+ K+ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSIYCS-SLACAFHKQFNPKKSSTYQATDKTVSIAYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D VK+G + Q F + EP TF+ A FDGILGLG+ IS A PV+ NM
Sbjct: 142 TGILGYDIVKVGSIDDTHQIFGLSETEPGDTFVFAPFDGILGLGYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
+ LV+E +FS + ++D+++G ++FGG+D +YKG +VPV+ +GYWQF M V
Sbjct: 202 WDHRLVSEDLFSVYL--SSDDKKGSLVMFGGIDESYYKGSLHWVPVSYEGYWQFTMDSVT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
I+G+T CA C AI D+GTSLLAGPT I+++ I A
Sbjct: 260 INGKTIA-CADSCQAIIDTGTSLLAGPTNAISKIQRHIRA 298
>gi|6978719|ref|NP_037070.1| cathepsin E precursor [Rattus norvegicus]
gi|1113084|dbj|BAA07285.1| cathepsin E precursor [Rattus norvegicus]
Length = 365
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 168/287 (58%), Gaps = 16/287 (5%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV--------------ALKNYMDAQYF 91
V R+ + +S R +R + S + D++ L NY+D +YF
Sbjct: 22 VLHRVPLRRHQSLRKKLRAQGQLSDFWRSHNLDMIEFSESCNVDKGINEPLINYLDMEYF 81
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G + IG+P QNFTVIFDTGSSNLWVPS C S AC H + +SSTY + G I
Sbjct: 82 GTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT-SSACKAHPVFHPSQSSTYMEVGNHFSIQ 140
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+++G D V + L V+ Q+F E+ +EP TF+ A+FDGILGLG+ ++VG
Sbjct: 141 YGTGSLTGIIGADQVSVEGLTVEGQQFGESVKEPGQTFVNAEFDGILGLGYPSLAVGGVT 200
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+ Q LV P+FS + + + G E+ FGG DP H+ G ++PVT++GYWQ
Sbjct: 201 PVFDNMMAQNLVALPMFSVYLSSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQI 260
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ + + G T FC+ GC AI D+GTSL+ GP I Q+ AIGAT
Sbjct: 261 ALDGIQV-GDTVMFCSEGCQAIVDTGTSLITGPPKKIKQLQEAIGAT 306
>gi|331215715|ref|XP_003320537.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
gi|309299527|gb|EFP76118.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 430
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 117/242 (48%), Positives = 155/242 (64%), Gaps = 6/242 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF EI +GTPPQ+F V+ DTGSSNLWVPS++C SIAC+ HSKY S
Sbjct: 104 VPLSNYLNAQYFSEISLGTPPQSFKVVLDTGSSNLWVPSTRCT-SIACFLHSKYDCEASE 162
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY+ NG I YG+G++ G S D + IGDL V D +F E+T+EP L F KFDGI G
Sbjct: 163 TYQANGTEFKIRYGSGSLEGVISNDVLTIGDLTVPDVDFAESTKEPGLAFAFGKFDGIFG 222
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNR---NADEEEGGEIVFGGMDPDHYKG 256
LG+ ISV VP +Y M+ G++++PVF+F+ N + GGE+VFGG+D HY+G
Sbjct: 223 LGYDTISVLHTVPPFYKMMENGMLDDPVFAFYLGSAQGNKADPNGGEVVFGGVDEAHYEG 282
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
E Y PV ++GYW+ ++ V + G A D+GTSL+A PT +N IG
Sbjct: 283 EIFYAPVRRRGYWEVELKSVKFGKEEMKLHNVGAA--IDTGTSLIALPTDTAEIINAEIG 340
Query: 317 AT 318
AT
Sbjct: 341 AT 342
>gi|110277433|gb|ABG57251.1| vacuolar protease A [Trichoderma atroviride]
gi|358394485|gb|EHK43878.1| hypothetical protein TRIATDRAFT_137844 [Trichoderma atroviride IMI
206040]
Length = 395
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 123/243 (50%), Positives = 158/243 (65%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SIAC+ HS Y S SS
Sbjct: 75 VPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFLHSTYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YKKNG +IHYG+G+++GF S D V IGDL +K Q+F EAT EP L F +FDGILG
Sbjct: 134 SYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLQIKGQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y MVNQ L++EPVF+F+ +EG FGG+D HY G+
Sbjct: 194 LGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYLGSG---DEGSVATFGGVDESHYSGKIE 250
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ I +N
Sbjct: 251 YIPLRRKAYWEVDLDSIAFGDEVAELENTG-------AILDTGTSLNVLPSGIAELLNAE 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|332024604|gb|EGI64802.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 361
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 122/252 (48%), Positives = 152/252 (60%), Gaps = 7/252 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N+ +AQY+G I IGTP Q F V+FDTGS+NLWVPS C I C H KY + S T
Sbjct: 33 LLNFRNAQYYGVISIGTPRQRFKVLFDTGSANLWVPSVHCNLEDITCLSHRKYNNRTSRT 92
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG DI Y G +SG+ S D V + L + +Q F EA EP + FL AKFDGILG+
Sbjct: 93 YIPNGTLFDIQYEYGTLSGYLSTDVVNVAGLNIINQTFGEAINEPGIAFLYAKFDGILGM 152
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGGMDPDHYKGEHT 259
G+ IS+ PV+ NMV QGLV+ P+FSF+ NRN D G ++ GG DP Y GE T
Sbjct: 153 GYPNISILGVTPVFTNMVQQGLVSSPIFSFYLNRNLLDSSAGSVLILGGSDPALYDGELT 212
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
YV VT KGYWQF M + ++ +T C GC AIAD+G S LAGP T I + I
Sbjct: 213 YVNVTHKGYWQFTMDKIQMENET--LCVNGCQAIADTGFSRLAGPPTDIAIITSRIAIDD 270
Query: 318 -TGIVSQECKAV 328
G+V +C +
Sbjct: 271 FNGVVYVDCDQI 282
>gi|344277046|ref|XP_003410316.1| PREDICTED: cathepsin E [Loxodonta africana]
Length = 396
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 124/292 (42%), Positives = 177/292 (60%), Gaps = 7/292 (2%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G L+R+ L++ K L ++ R E +S + +++ +S + ++ NY
Sbjct: 18 QGSLHRVPLRRHK-SLRKKLRERGQLSEFWKSQNLDMIQFTETCTRDQSANEPLI---NY 73
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
D +YFG I IG+P QNFTVIFDTGSSNLWVPS C S AC H ++ +SSTY G
Sbjct: 74 FDTEYFGAISIGSPSQNFTVIFDTGSSNLWVPSVYCT-SQACQTHPRFYPSQSSTYSSLG 132
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
I YGTG++SG D V + L V DQ+F E+ +EP TF+ + FDGILGLG+ +
Sbjct: 133 SPFSISYGTGSLSGIIGTDQVSVEGLTVIDQQFGESVKEPGQTFVDSAFDGILGLGYPSL 192
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG PV+ NM+ Q LV+ P+FS + + + G E++FGG D H+ G +VPVT+
Sbjct: 193 AVGGVTPVFDNMMAQNLVDLPMFSVYMSSDPAGGMGSELIFGGYDHSHFSGSLNWVPVTK 252
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+GYWQ + ++ + G T FC+ GC AI D+GTSL+ GP+ I Q+ AIGA
Sbjct: 253 QGYWQIALDNIQVGG-TVMFCSEGCQAIVDTGTSLITGPSNNIKQLQRAIGA 303
>gi|344234771|gb|EGV66639.1| Asp-domain-containing protein [Candida tenuis ATCC 10573]
Length = 425
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 125/256 (48%), Positives = 164/256 (64%), Gaps = 18/256 (7%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY +AQYF EI +GTP Q F VI DTGSSNLW+PS C S+ACY HSKY SSTY
Sbjct: 103 LSNYANAQYFTEIEVGTPGQPFKVILDTGSSNLWIPSQDCS-SLACYLHSKYDHDASSTY 161
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+GA+ G+ S D ++IGDL++K+Q+F EAT EP L F KFDGILGL
Sbjct: 162 KANGSEFAIQYGSGAMEGYVSTDALRIGDLLIKNQDFAEATSEPGLAFAFGKFDGILGLA 221
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN +NQGL++E F+F+ + N DEE+GG FGG D + G+ T+
Sbjct: 222 YDTISVNKIVPPVYNAINQGLLDEKSFAFYLGDTNKDEEDGGVATFGGYDESKFTGKITW 281
Query: 261 VPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+PV +K YW+ +GD + ++TG AAI D+GTSL+ P+++ +N I
Sbjct: 282 LPVRRKAYWEVSLEGLGLGDEFAELKSTG------AAI-DTGTSLITLPSSLAEIINAKI 334
Query: 316 GA----TGIVSQECKA 327
GA +G + EC A
Sbjct: 335 GAVKSWSGQYTVECDA 350
>gi|197247086|gb|AAI65335.1| Nots protein [Danio rerio]
Length = 416
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 139/360 (38%), Positives = 206/360 (57%), Gaps = 23/360 (6%)
Query: 9 TAGFFLCLLLFPVVFSTPNGGLYRIGLK-----KRKFDLNNRVAARLDSKEGESF-RTSI 62
+AG L L+L + FS GL R+ L+ + + + ++ L + + F R +
Sbjct: 3 SAGLILILVLH-LGFSH---GLLRVALRQYPSVRSRLRASAQLEEFLKQHQPDMFSRRYV 58
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
+ + + G L N+MDAQ+FG+I +G P QNFTV+FDTGSS+LWVPSS C
Sbjct: 59 QCFPPAQHFLRLGGRITERLYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC- 117
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
S AC H+K+++ SSTY +G+ IHYG+G + G + D +K+G + V++Q F EA
Sbjct: 118 VSQACALHNKFKAFESSTYTHDGRVFGIHYGSGHLLGVMARDELKVGSVCVQNQVFGEAV 177
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
EP +F+LA+FDG+LGLGF +++ K PV+ +M+ Q ++++PVFSF+ N GG
Sbjct: 178 YEPGFSFVLAQFDGVLGLGFPQLAEEKGSPVFDSMMEQNMLDQPVFSFYLTNNGS-GFGG 236
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC---AGGCAAIADSGTS 299
E+VFGGMD + ++PVTQKGYWQ + V + G + FC GC AI D+GTS
Sbjct: 237 ELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALS-FCYRSVQGCQAIVDTGTS 295
Query: 300 LLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYKK 355
L+ GP I + IGAT + E C V S +++ L+ V LS Y +
Sbjct: 296 LIGGPARDILILQQFIGATPTANGEFVVDCVRVSSL---PVVSFLINSVEYSLSGEQYIR 352
>gi|50306705|ref|XP_453326.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642460|emb|CAH00422.1| KLLA0D05929p [Kluyveromyces lactis]
Length = 409
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 120/252 (47%), Positives = 165/252 (65%), Gaps = 7/252 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF EI +G+PPQ+F VI DTGSSNLWVPS++C S+AC+ H+KY SS
Sbjct: 86 VPLTNYLNAQYFTEITLGSPPQSFKVILDTGSSNLWVPSAEC-GSLACFLHTKYDHEASS 144
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ G+ S D + IGDLV+ DQ+F EAT EP L F KFDGILG
Sbjct: 145 TYKANGSEFAIQYGSGSLEGYVSRDLLTIGDLVIPDQDFAEATSEPGLAFAFGKFDGILG 204
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV + VP YN + L+++PVF+F+ + E+GGE FGG+D + Y GE T
Sbjct: 205 LAYDSISVNRIVPPVYNAIKNKLLDDPVFAFYLGDSDKSEDGGEASFGGIDEEKYTGEIT 264
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
++PV +K YW+ + + G+ G AAI D+GTSL+A P+ + +N IGA
Sbjct: 265 WLPVRRKAYWEVKFEGIGL-GEEYATLEGHGAAI-DTGTSLIALPSGLAEILNAEIGAKK 322
Query: 318 --TGIVSQECKA 327
+G S +C++
Sbjct: 323 GWSGQYSVDCES 334
>gi|388856266|emb|CCF50075.1| probable PEP4-aspartyl protease [Ustilago hordei]
Length = 418
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 161/253 (63%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++++AQYF +I +GTP Q F VI DTGSSNLWVPS+KC SIAC+ H KY S SS
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSNKCS-SIACFLHKKYDSSASS 155
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YKKNG I YG+G++ G S D +KIGDL +K Q+F EAT EP L F KFDGILG
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV VP Y M+NQGL++ P SF+ + E++GGE VFGG+D HY G+
Sbjct: 216 LAYDTISVNGIVPPMYQMINQGLLDAPQVSFYLGSS--EQDGGEAVFGGIDESHYTGKIH 273
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
+ PV +KGYW+ + D + G G AAI D+GTSL+A T +N IGAT
Sbjct: 274 WAPVKRKGYWEVAL-DKLALGDEALELDNGSAAI-DTGTSLIAMATDTAEILNAEIGATK 331
Query: 319 ---GIVSQECKAV 328
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|125984612|ref|XP_001356070.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
gi|54644388|gb|EAL33129.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
Length = 387
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 115/239 (48%), Positives = 158/239 (66%), Gaps = 6/239 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
L+N ++ +Y+G IGIGTP Q F V+FDTGS+NLWVPS+KC ++AC H++Y S +SST
Sbjct: 63 LQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQYHSEQSST 122
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+S I YGTG+++GF SED V + + ++ Q F EA EP TF+ A F GI+GL
Sbjct: 123 YVANGESFSIQYGTGSLTGFLSEDTVWVAGIEIQQQTFAEALNEPGSTFVSAPFAGIMGL 182
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F+ I+V P + NM+ QGL++EPV SF+ R +GGE++ GG+DP Y G TY
Sbjct: 183 AFKSIAVDGVTPPFDNMIAQGLLDEPVISFYLQRQGTAVQGGELILGGVDPSLYTGNLTY 242
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGF-CAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPV+ GYWQF + V ++ GF GC AIAD+GTSL+ P ++N +GAT
Sbjct: 243 VPVSVAGYWQFKVNSV----KSGGFLLCSGCQAIADTGTSLIVVPEAAYAKINSLLGAT 297
>gi|388579370|gb|EIM19694.1| aspartyl proteinase [Wallemia sebi CBS 633.66]
Length = 411
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 123/260 (47%), Positives = 169/260 (65%), Gaps = 22/260 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ + N+++AQY+ EIG+G+P Q F V+ DTGSSNLWVPS+KC SIAC+ H K+ S
Sbjct: 89 LPVSNFLNAQYYAEIGLGSPEQKFNVVLDTGSSNLWVPSNKC-MSIACFLHRKFNPEESK 147
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG +I YG+G++ G +D + I DL VK+Q F EAT EP L F KFDGILG
Sbjct: 148 SYKANGTDFEIRYGSGSLKGIVGQDTLAIDDLHVKNQLFAEATSEPGLAFAFGKFDGILG 207
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGE--IVFGGMDPDHYKGE 257
LG+ ISV P +YN+++QGL++EPVFSF+ DE+ G E VFGG+D DHYKG+
Sbjct: 208 LGYDTISVNDIPPPFYNLIDQGLLDEPVFSFYL---TDEQSGKESQAVFGGIDHDHYKGQ 264
Query: 258 HTYVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
YVP+ +KGYW+ ++ GD ++ + TG AAI D+GTSL+A PT + +N
Sbjct: 265 LHYVPLRRKGYWEVELEKLTFGDDEVELENTG------AAI-DTGTSLIAIPTDMAEMLN 317
Query: 313 HAIGA----TGIVSQECKAV 328
IGA +G + +C V
Sbjct: 318 KMIGAKKSWSGQYTVDCNKV 337
>gi|194760707|ref|XP_001962579.1| GF14369 [Drosophila ananassae]
gi|190616276|gb|EDV31800.1| GF14369 [Drosophila ananassae]
Length = 371
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 174/290 (60%), Gaps = 16/290 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDS-KEGESFRTSIRKYSLRGNLGES 74
L L V+ + + L+R+ + K + N V R + E RT + SLRG E
Sbjct: 5 LALIAVLVALASAELHRVPILKEE----NFVKTRQNVLAEKAYLRTKYQLPSLRGVNEEQ 60
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYR 134
L N ++ Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y
Sbjct: 61 -------LANSLNMAYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTCT-SDACQTHNQYD 112
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S SSTY NG+S I YGTG+++G+ SED V + L ++ Q F E+T EP F A F
Sbjct: 113 SSASSTYVANGESFSIQYGTGSLTGYLSEDTVDVNGLKIQSQTFAESTSEPGTNFNNANF 172
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ ++ ++V P +YNMV+QGLV+ VFSF+ R+ +GGE++FGG D Y
Sbjct: 173 DGILGMAYEALAVDGVAPPFYNMVSQGLVDSSVFSFYLARDGTSSQGGELIFGGSDSSLY 232
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
G+ TYVP++++GYWQF M DG T C C AIAD+GTSL+ P
Sbjct: 233 SGDLTYVPISEQGYWQFTMDGSSFDGYT--LCE-DCQAIADTGTSLIVAP 279
>gi|149725197|ref|XP_001502028.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 130/299 (43%), Positives = 179/299 (59%), Gaps = 36/299 (12%)
Query: 62 IRKYSLRGNLGESGD------------------------ADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G D L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRQNLIEHGLLEEFLKKHTPNPASKFFPKEAATLVDSEPLENYLDEEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS+ C S+ACY H ++ +SSTY+ +S I YGTG++
Sbjct: 83 TPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEKSSTYQATSESISITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + +EP LA FDGILGLG+ IS A PV+ N+
Sbjct: 142 TGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGILGLGYPSISASGATPVFDNI 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
+QGLV++ +FS + ++D+E G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 WDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA------TGIVSQECKAVVS 330
I+G++ C+GGC AI D+GTSLLAGPT+ I + IGA G++S C A+ S
Sbjct: 260 INGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLLGEGVIS--CSAIDS 315
>gi|156846613|ref|XP_001646193.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
gi|156116867|gb|EDO18335.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
Length = 402
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 110/238 (46%), Positives = 156/238 (65%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ L NY++AQY+ +I +GTP Q+F VI DTGSSNLWVPS C S+ACY H+KY SS
Sbjct: 79 IPLSNYLNAQYYTDITLGTPAQSFKVILDTGSSNLWVPSVDCN-SLACYLHAKYDHSDSS 137
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG + I YG+G++ G+ S+D ++IGDLV+ Q+F EAT EP L F KFDGILG
Sbjct: 138 TYKKNGTTFSIQYGSGSMEGYISQDVLQIGDLVIPGQDFAEATSEPGLAFAFGKFDGILG 197
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + I+V + VP +YN +N+ LV+EP+FSF+ + E+GG++ FGG D + G+ T
Sbjct: 198 LAYDTIAVNRVVPPFYNAINKKLVDEPIFSFYLGDDTKSEDGGQVTFGGYDSSLFTGDIT 257
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++PV +K YW+ + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 258 WLPVRRKAYWEVKFDAIALGNEVADLVNHGAA--IDTGTSLITLPSGLAEVINSQIGA 313
>gi|21629629|gb|AAM61957.1| synthetic renin 2/1d [Mus musculus]
Length = 401
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 186/316 (58%), Gaps = 30/316 (9%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKK----------RKFDLNNRVAARLDSKEGESFRTSI 62
L LL P FS P G + RI LKK R D+ R++A D + S T++
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMPSVREILEERGVDMT-RLSAEWDVRTKRSSLTNL 69
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC- 121
V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC
Sbjct: 70 --------------ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCS 115
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
+AC HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E
Sbjct: 116 RLYLACGIHSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEV 174
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T P + F+LAKFDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR G
Sbjct: 175 TELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGP-HLLG 233
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GE+V GG DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S +
Sbjct: 234 GEVVLGGSDPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFI 292
Query: 302 AGPTTIITQVNHAIGA 317
+ PT+ + + A+GA
Sbjct: 293 SAPTSSLKLIMQALGA 308
>gi|194218276|ref|XP_001501986.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 171/280 (61%), Gaps = 28/280 (10%)
Query: 62 IRKYSLRGNLGESGD------------------------ADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G D L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRQNLIEHGLLEDFLKKHTPNPASKFFPKEAATLVDSEPLENYLDEEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS+ C S+ACY H ++ +SSTY+ +S I YGTG++
Sbjct: 83 TPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEKSSTYQATSESISITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + +EP LA FDGILGLG+ IS A PV+ N+
Sbjct: 142 TGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGILGLGYPSISASGATPVFDNI 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
+QGLV++ +FS + ++D+E G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 WDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
I+G++ C+GGC AI D+GTSLLAGPT+ I + IGA
Sbjct: 260 INGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGA 298
>gi|126309845|ref|XP_001370435.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 113/236 (47%), Positives = 151/236 (63%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWV S C S AC H ++ +SSTY
Sbjct: 65 LANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASIYCQ-SQACTNHPQFNPSKSSTY 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
NG++ + YGTG+++G F D V I + + +QEF + EP F+ A+FDGILGL
Sbjct: 124 SSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTNFVYAQFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS G A V + + L+N PVF+F+ + N + GGE+VFGG+D Y G+ +
Sbjct: 184 YPAISSGGATTVMQGFLQENLLNSPVFAFYLSGNENSNNGGEVVFGGVDTSMYTGDIYWA 243
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT++ YWQ + I GQ TG+C+GGC AI D+GTSLL P I +++ IGA
Sbjct: 244 PVTEEAYWQIAINGFSIGGQATGWCSGGCQAIVDTGTSLLTAPQQIFSELMQYIGA 299
>gi|351710945|gb|EHB13864.1| Cathepsin E, partial [Heterocephalus glaber]
Length = 391
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 166/280 (59%), Gaps = 13/280 (4%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGDADI----------VALKNYMDAQYFGEIGIGT 98
R+ + ES R +R + +S + D+ L NY+D +YFG I IG+
Sbjct: 23 RVPLQRHESLRKKLRAQGQLTDFWKSQNLDLDQCSTMQGSSEPLINYLDMEYFGTISIGS 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
PPQNFTVIFDTGSSNLWVPS C S AC H + S+TY + G I YGTG+++
Sbjct: 83 PPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPVFHPSLSNTYSEVGNPFSIQYGTGSLT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ ++ G PV+ NM+
Sbjct: 142 GIIGADQVSVEGLTVVGQQFGESVKEPGQTFVHAEFDGILGLGYPSLAAGGVTPVFDNMM 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
Q LV P+FS + + N GGE+ FGG DP H+ G +VPVT++ YWQ + +++
Sbjct: 202 AQNLVALPLFSVYMSSNPG-GSGGELTFGGYDPSHFSGSLNWVPVTKQAYWQIALDGILV 260
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
G + FC+ GC AI D+GTSL+ GP I Q+ A+GAT
Sbjct: 261 -GDSVMFCSEGCQAIVDTGTSLITGPPPKIKQLQEALGAT 299
>gi|194762106|ref|XP_001963199.1| GF19728 [Drosophila ananassae]
gi|190616896|gb|EDV32420.1| GF19728 [Drosophila ananassae]
Length = 390
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 116/237 (48%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L + D +Y+G + IGTP QNF ++FDTGS+NLWVPS+KC S AC H+KY SG SST
Sbjct: 67 LHDSADREYYGLLSIGTPKQNFNILFDTGSANLWVPSAKCSASNKACQKHNKYHSGESST 126
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+S I YGTG++SGF S D V++ + +K Q F EAT EP TF AKF GILGL
Sbjct: 127 YVANGESFSIEYGTGSLSGFLSTDTVEVAGIQIKSQTFAEATNEPGSTFTDAKFAGILGL 186
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F+ I+V P W NM+ Q L++EPV SF+ +GGE++ GG+D YKG T+
Sbjct: 187 AFKSIAVDGVTPPWDNMIEQKLLDEPVISFYLKLKGTAVQGGEMILGGIDSSLYKGSLTW 246
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
VPVT+ YWQF + + G F + AIAD+GTSL+ P T++N+ IGA
Sbjct: 247 VPVTKAAYWQFKLTAIKTKGV---FISRNTQAIADTGTSLIVLPKAAYTRINNLIGA 300
>gi|164657049|ref|XP_001729651.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
gi|159103544|gb|EDP42437.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
Length = 419
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 175/304 (57%), Gaps = 15/304 (4%)
Query: 31 YRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQY 90
Y + K++ F L + L + ++ S ++ R G + V L ++++AQY
Sbjct: 53 YGVNNKQQPFSLYQDASGDLRVQSADAGMVS-EQWHARAKAGHN-----VPLTDFLNAQY 106
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADI 150
F +I +G+PPQ+F VI DTGS+NLWVPS C SIAC H KY + S TY+ NG I
Sbjct: 107 FADIELGSPPQSFKVILDTGSANLWVPSESCT-SIACLLHKKYDNSLSKTYQANGSEFQI 165
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
HYG+G++ GF S D ++IGDL VKDQ+F EA +EP L F KFDGILGL + ISV K
Sbjct: 166 HYGSGSMEGFVSRDTLRIGDLDVKDQDFAEAIKEPGLAFAFGKFDGILGLAYDTISVNKI 225
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
VP +Y M Q L+++ F F+ + E EGGE FGG+DP ++G Y PV ++GYW+
Sbjct: 226 VPPFYRMKEQNLLDQNQFGFYL--GSSESEGGEATFGGVDPSRFEGPIVYAPVRRRGYWE 283
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECK 326
+ + + G A D+GTSL+A PT + +N IGA TG S +C
Sbjct: 284 VALNKIGFGNEELVLTRTGAA--IDTGTSLIAMPTDVAEILNKEIGAKRSWTGQYSVDCS 341
Query: 327 AVVS 330
V S
Sbjct: 342 KVPS 345
>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
Length = 379
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 125/272 (45%), Positives = 170/272 (62%), Gaps = 11/272 (4%)
Query: 53 KEGESFRTSIRKYSLRGNLGESGDADIVALK-----NYMDAQYFGEIGIGTPPQNFTVIF 107
KE +S IRK L+ G + A+I L NY+D Y+G+I IGTP Q F V+F
Sbjct: 28 KETKSKANEIRK--LKAKYGGTPKAEIRDLVVEKLFNYVDDSYYGKITIGTPGQEFLVLF 85
Query: 108 DTGSSNLWVPSSKCYF-SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHV 166
DTGSSNLWVP + C + AC H+ Y SST+ K G+S I YG+G++SG+ ED V
Sbjct: 86 DTGSSNLWVPVAPCSADNAACENHNTYDPSASSTHVKKGESFSIQYGSGSLSGYLVEDTV 145
Query: 167 KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEP 226
+ L +K Q F AT EP TF+ A FDGI+G+GF+ I+V P WYNM++Q L++E
Sbjct: 146 DVEGLKIKKQVFAAATNEPGETFVYAPFDGIMGMGFKSIAVDDVTPPWYNMISQHLISEK 205
Query: 227 VFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC 286
VFSF+ R +EGG +V GG D +Y+G+ YVPV+++GYWQF+M + ++G C
Sbjct: 206 VFSFYLARRGTSDEGGVMVVGGNDDRYYEGDFHYVPVSEQGYWQFEMAEAHVNG--VRIC 263
Query: 287 AGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
C AIAD+GTSL+A PT ++ IGAT
Sbjct: 264 -DRCQAIADTGTSLIAVPTDKYEEIQKEIGAT 294
>gi|346322842|gb|EGX92440.1| vacuolar protease A precursor [Cordyceps militaris CM01]
Length = 395
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 121/247 (48%), Positives = 161/247 (65%), Gaps = 6/247 (2%)
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
+ ES + +V + N+ +AQYF EI IGTPPQ F V+ DTGSSNLWVPS C SIAC+ H
Sbjct: 66 VAESKNGHLVPVSNFANAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS-SIACFLH 124
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
S Y S SSTYKKNG +IHYG+G+++G+ S D V+IGDL +K+ +F EAT EP L F
Sbjct: 125 STYDSSSSSTYKKNGSDFEIHYGSGSLTGYVSNDVVRIGDLTIKNTDFAEATNEPGLAFA 184
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+FDGILGLG+ ISV VP +Y M+ Q L++EPVF+F+ EEEG E VFGG+D
Sbjct: 185 FGRFDGILGLGYDTISVNHMVPPFYQMIKQKLLDEPVFAFYL---GSEEEGSEAVFGGVD 241
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+HY+G+ Y+P+ +K YW+ D + + G I D+GTSL P+ +
Sbjct: 242 KNHYEGKIEYLPLRRKAYWEVDFDAIAFGKEVAELENTGV--ILDTGTSLNTLPSDLAEL 299
Query: 311 VNHAIGA 317
+N IGA
Sbjct: 300 LNKEIGA 306
>gi|195034430|ref|XP_001988894.1| GH11416 [Drosophila grimshawi]
gi|193904894|gb|EDW03761.1| GH11416 [Drosophila grimshawi]
Length = 400
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 119/253 (47%), Positives = 160/253 (63%), Gaps = 5/253 (1%)
Query: 67 LRGNLGES-GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS- 124
L G++ +S D L N + Y+GEI IGTPPQ F V+FDTGSSNLWVPS C +
Sbjct: 64 LSGSVEDSNADYTTEELSNNQNMDYYGEIAIGTPPQYFKVVFDTGSSNLWVPSVNCLPTD 123
Query: 125 IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATRE 184
+AC H++Y S SSTY NG+S I YGTG+++G+ S D V I L + +Q F EAT +
Sbjct: 124 LACQTHNQYNSSASSTYVANGESFSIQYGTGSLTGYLSSDTVSISGLSIVNQSFAEATSQ 183
Query: 185 PSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEI 244
P+ +F FDGILG+ + I+ VP +YN+ NQGL+++P F F+ N E GGE+
Sbjct: 184 PNSSFTGVPFDGILGMAYSSIAEDSVVPPFYNLWNQGLIDKPTFGFYLTHNGSAELGGEL 243
Query: 245 VFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
+ GG+D ++G T VPV+Q GYWQF M V +D C+ C AIAD+GTSLLA P
Sbjct: 244 ILGGVDNTLFEGNLTSVPVSQMGYWQFAMAVVAMDNNV--ICS-DCQAIADTGTSLLAVP 300
Query: 305 TTIITQVNHAIGA 317
+T +N+ IGA
Sbjct: 301 ANQLTYINNIIGA 313
>gi|380865655|gb|AFF19538.1| pepsin F, partial [Camelus dromedarius]
Length = 354
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 119/286 (41%), Positives = 173/286 (60%), Gaps = 10/286 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLR--GNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVI 106
R + +E + + +Y+ R N + L+NY+D Y +I IGTPPQNF V+
Sbjct: 25 RENLREKNMLKDFLEQYTYRLSDNTAPAKRVYTQPLRNYLDLVYIADISIGTPPQNFKVV 84
Query: 107 FDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHV 166
FDTGS+NLWVPS C S AC HS + RS+T+ G+S +I YGTG I+GF D V
Sbjct: 85 FDTGSANLWVPSIYCD-SKACANHSVFNPPRSTTFSLEGRSFEITYGTGKIAGFLGYDTV 143
Query: 167 KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEP 226
+IG+LV+ Q F + +EP + A FDGILGLG+ +S+ PV+ N+ Q L+ EP
Sbjct: 144 RIGNLVIGSQAFGMSQKEPGIFLEHAVFDGILGLGYPALSIVGTTPVFDNLKKQRLLKEP 203
Query: 227 VFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC 286
+F+F+ + +E G ++FGG+D +YKGE +VPV+Q+ YWQ M + ++G+ G C
Sbjct: 204 IFAFYL--STKKENGSVVMFGGLDHSYYKGELKWVPVSQRLYWQISMDSITMNGKILG-C 260
Query: 287 AGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAV 328
GGC AI D+GT++L GPT ++T + AI A + E C A+
Sbjct: 261 KGGCQAIVDTGTAVLVGPTNVVTNIQKAINARPLTGYEYFIGCDAI 306
>gi|195433875|ref|XP_002064932.1| GK15196 [Drosophila willistoni]
gi|194161017|gb|EDW75918.1| GK15196 [Drosophila willistoni]
Length = 415
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 113/245 (46%), Positives = 156/245 (63%), Gaps = 10/245 (4%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI-ACYFHSKYRSGRSS 139
+L N + +Y+ + IGTPPQ F ++ DTGS+NLWVPSSKC ++ AC H +Y S SS
Sbjct: 87 SLGNAYNTEYYITVHIGTPPQEFRLLIDTGSANLWVPSSKCPSTVKACAAHQRYNSSASS 146
Query: 140 TYKKNGKSADIHYGTG-----AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
TYK N + I Y + A+ GF S+D V IGDL +K+Q F E T EP TFL + F
Sbjct: 147 TYKANNTAFQIEYASNTAGGVALDGFLSQDTVAIGDLAIKNQVFAEMTNEPDGTFLTSPF 206
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGGMDPDH 253
DG++GL + IS+ +P YN+++QGL+ EP+FS + NRN + GGE++ GG+DP
Sbjct: 207 DGMIGLAYASISINGVIPPLYNLISQGLIPEPIFSIYLNRNGTNATNGGELILGGIDPAL 266
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y G TYVPV+Q+GYWQF+M ++ Q FC C AI D GTSL+ P + I ++N
Sbjct: 267 YSGCLTYVPVSQQGYWQFEMTSATLNDQE--FC-DNCQAILDVGTSLIVVPNSEIKEINQ 323
Query: 314 AIGAT 318
+G T
Sbjct: 324 ILGVT 328
>gi|290543422|ref|NP_001166408.1| cathepsin E precursor [Cavia porcellus]
gi|115721|sp|P25796.1|CATE_CAVPO RecName: Full=Cathepsin E; Flags: Precursor
gi|191295|gb|AAA37052.1| procathepsin E [Cavia porcellus]
gi|1246041|gb|AAB35844.1| procathepsin E [Cavia]
Length = 391
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 123/291 (42%), Positives = 172/291 (59%), Gaps = 10/291 (3%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMD 87
G L+R+ L +R+ + R R + E +++ + + + L NY+D
Sbjct: 19 GALHRVPLSRRE---SLRKKLRAQGQLTELWKSQNLNMDQCSTIQSANEP----LINYLD 71
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
+YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H + SSTY++ G S
Sbjct: 72 MEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACQTHPVFHPSLSSTYREVGNS 130
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YGTG+++G D V + L V Q+F E+ +EP TF+ A+FDGILGLG+ ++
Sbjct: 131 FSIQYGTGSLTGIIGADQVSVEGLTVVGQQFGESVQEPGKTFVHAEFDGILGLGYPSLAA 190
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
G PV+ NM+ Q LV P+FS + + N G E+ FGG DP H+ G +VPVT++
Sbjct: 191 GGVTPVFDNMMAQNLVALPMFSVYMSSNPG-GSGSELTFGGYDPSHFSGSLNWVPVTKQA 249
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
YWQ + + + G + FC+ GC AI D+GTSL+ GP I Q+ A+GAT
Sbjct: 250 YWQIALDGIQV-GDSVMFCSEGCQAIVDTGTSLITGPPGKIKQLQEALGAT 299
>gi|338712318|ref|XP_001501960.2| PREDICTED: pepsin II-1-like [Equus caballus]
Length = 397
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 169/280 (60%), Gaps = 28/280 (10%)
Query: 62 IRKYSLRGNLGESGD------------------------ADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G AD L+NY+D +YFG I IG
Sbjct: 33 VRKKSLRKNLIEHGLLEDFLKKHTPNPASKFLPKEAATLADTEPLENYLDEEYFGTISIG 92
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS+ C S+ACY H ++ +SSTY+ +S I YGTG++
Sbjct: 93 TPPQEFTVIFDTGSSNLWVPSTYCS-SLACYDHKRFNPEKSSTYRATSESISITYGTGSM 151
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + +EP LA FDGILGL + IS A PV+ N+
Sbjct: 152 TGILGYDTVRVGGIEDTNQIFGLSEKEPGFFLFLAPFDGILGLAYPSISASGATPVFDNI 211
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
+QGLV++ +FS + + N +E G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 212 WDQGLVSQDLFSVYLSSN--DESGSVVMFGGIDSSYYTGSLHWVPVSHEGYWQITVDSIT 269
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++G++ C+GGC A+ D+GTSLL GPT+ I + IGA
Sbjct: 270 VNGESIA-CSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGA 308
>gi|296810640|ref|XP_002845658.1| vacuolar protease A [Arthroderma otae CBS 113480]
gi|263406266|sp|C5FS55.1|CARP_NANOT RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase PEP2; AltName: Full=Aspartic protease
PEP2; Flags: Precursor
gi|238843046|gb|EEQ32708.1| vacuolar protease A [Arthroderma otae CBS 113480]
Length = 395
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 112/227 (49%), Positives = 148/227 (65%), Gaps = 3/227 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+++AQYF EI IGTPPQ F V+ DTGSSNLWVP C SIAC+ HS Y S SS
Sbjct: 77 VLIDNFLNAQYFSEISIGTPPQTFKVVLDTGSSNLWVPGKDCS-SIACFLHSTYDSSASS 135
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
T+ +NG S I YG+G++ GF S+D+V+IGD+ +K+Q F EAT EP L F +FDGILG
Sbjct: 136 TFTRNGTSFAIRYGSGSLEGFVSQDNVQIGDMKIKNQLFAEATSEPGLAFAFGRFDGILG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+G+ ISV K P +Y MV QGLV+EPVFSF+ + + + FGG D HY G+ T
Sbjct: 196 MGYDTISVNKITPPFYKMVEQGLVDEPVFSFYLGDTNKDGDQSVVTFGGADKSHYTGDIT 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
+P+ +K YW+ + + + T G I D+GTSL+A PTT
Sbjct: 256 TIPLRRKAYWEVEFNAITLGKDTATLDNTGI--ILDTGTSLIALPTT 300
>gi|256274192|gb|EEU09100.1| Pep4p [Saccharomyces cerevisiae JAY291]
Length = 405
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 118/251 (47%), Positives = 162/251 (64%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 318 KGWTGQYTLDC 328
>gi|365986877|ref|XP_003670270.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
gi|343769040|emb|CCD25027.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
Length = 408
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 120/253 (47%), Positives = 161/253 (63%), Gaps = 8/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ L NY++AQYF +I +GTPPQ+F VI DTGSSNLWVPS +C S+ACY HSKY +SS
Sbjct: 84 IPLSNYLNAQYFADITLGTPPQSFKVILDTGSSNLWVPSVECG-SLACYLHSKYDHDKSS 142
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 143 SYKPNGTDFAIRYGTGSLEGYISQDTLNIGDLNIPKQDFAEATSEPGLTFAFGKFDGILG 202
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
L + ISV K VP +YN + Q L++E F+F+ + N E+GGEI GG+D +KG+
Sbjct: 203 LAYDSISVNKVVPPFYNAIEQELLDEKKFAFYLGDANKKSEDGGEITIGGIDKTKFKGDI 262
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
++PV +K YW+ + + Q G A D+GTSL+A P+ + +N IGA
Sbjct: 263 DWLPVRRKAYWEVKFEGIGLGDQFAELENHGAA--IDTGTSLIALPSGLAEIINTEIGAK 320
Query: 318 ---TGIVSQECKA 327
TG + EC A
Sbjct: 321 KGWTGQYTVECDA 333
>gi|6325103|ref|NP_015171.1| Pep4p [Saccharomyces cerevisiae S288c]
gi|115643|sp|P07267.1|CARP_YEAST RecName: Full=Saccharopepsin; AltName: Full=Aspartate protease;
Short=PrA; Short=Proteinase A; AltName:
Full=Carboxypeptidase Y-deficient protein 4; AltName:
Full=Proteinase YSCA; Flags: Precursor
gi|172122|gb|AAB63975.1| vacuolar proteinase A precursor [Saccharomyces cerevisiae]
gi|1370328|emb|CAA97859.1| PEP4 [Saccharomyces cerevisiae]
gi|1403555|emb|CAA65567.1| P2585 protein [Saccharomyces cerevisiae]
gi|151942645|gb|EDN60991.1| vacuolar proteinase A [Saccharomyces cerevisiae YJM789]
gi|190407806|gb|EDV11071.1| vacuolar proteinase A [Saccharomyces cerevisiae RM11-1a]
gi|259150002|emb|CAY86805.1| Pep4p [Saccharomyces cerevisiae EC1118]
gi|285815388|tpg|DAA11280.1| TPA: Pep4p [Saccharomyces cerevisiae S288c]
gi|323302701|gb|EGA56507.1| Pep4p [Saccharomyces cerevisiae FostersB]
gi|323331178|gb|EGA72596.1| Pep4p [Saccharomyces cerevisiae AWRI796]
gi|323346153|gb|EGA80443.1| Pep4p [Saccharomyces cerevisiae Lalvin QA23]
gi|323351977|gb|EGA84516.1| Pep4p [Saccharomyces cerevisiae VL3]
gi|365762755|gb|EHN04288.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295854|gb|EIW06957.1| Pep4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 405
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 118/251 (47%), Positives = 162/251 (64%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 318 KGWTGQYTLDC 328
>gi|349581664|dbj|GAA26821.1| K7_Pep4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 405
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 118/251 (47%), Positives = 162/251 (64%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 318 KGWTGQYTLDC 328
>gi|443894057|dbj|GAC71407.1| aspartyl protease [Pseudozyma antarctica T-34]
Length = 418
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 125/253 (49%), Positives = 160/253 (63%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++++AQYF +I +GTP Q F VI DTGSSNLWVPS+KC SIAC+ H KY S SS
Sbjct: 97 VPLTDFLNAQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS-SIACFLHKKYDSSASS 155
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YKKNG I YG+G++ G S D +KIGDL +K Q+F EAT EP L F KFDGILG
Sbjct: 156 SYKKNGTEFKIQYGSGSMEGIVSNDVLKIGDLTIKGQDFAEATSEPGLAFAFGKFDGILG 215
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + ISV VP Y M++QGL++ P SF+ + E +GGE VFGG+D HY G+
Sbjct: 216 LAYDTISVNGIVPPMYQMIDQGLLDAPQVSFYLGSS--EADGGEAVFGGIDDSHYTGKIH 273
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT- 318
+ PV +KGYW+ + D + G G AAI D+GTSL+A T +N IGAT
Sbjct: 274 WAPVKRKGYWEVAL-DKLALGDEELELDNGSAAI-DTGTSLIAMATDTAEILNAEIGATK 331
Query: 319 ---GIVSQECKAV 328
G S +C+ V
Sbjct: 332 SWNGQYSVDCEKV 344
>gi|432943847|ref|XP_004083297.1| PREDICTED: cathepsin E-A-like [Oryzias latipes]
Length = 412
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 193/332 (58%), Gaps = 25/332 (7%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL LK + D NR A+ +S R LG S + + N+MDA
Sbjct: 37 GLLDQFLKDNQPDTFNRRYAQCFPPGTQSLR-----------LGRSSEK----IYNFMDA 81
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
QY+GEI +GTP QNF+VIFDTGSS+LWVPSS C S AC FH +++ +SS++ +G++
Sbjct: 82 QYYGEIRLGTPEQNFSVIFDTGSSDLWVPSSYC-VSQACAFHRHFKAFKSSSFHHDGRTF 140
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
IHYG+G + G +D ++IG+L V +QEF E+ EP TF+ AKFDG+LGL + ++
Sbjct: 141 GIHYGSGHLLGVMGKDTLRIGNLTVLNQEFGESVYEPGSTFVTAKFDGVLGLAYPSLAEI 200
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEE-EGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
PV+ NM+ Q +++EP+FSF+ +R+ + G+++ GG D Y G +VPVT KG
Sbjct: 201 IGKPVFDNMLAQKILDEPIFSFYLSRSKSKSVPEGQLLLGGTDESLYSGPINWVPVTIKG 260
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQ 323
YWQ M V + G + C GC AI D+GTSL+AGP I +++ IGAT G
Sbjct: 261 YWQIRMDSVSVQG-VSSLCRRGCEAIVDTGTSLIAGPPREILRLHQLIGATPTHFGDFVV 319
Query: 324 ECKAVVSQYGEEIINMLLAKVSAKLSLRHYKK 355
+C + S + +L +V L+ HY +
Sbjct: 320 DCARLSSL---PHVTFVLGEVEYTLTSEHYIR 348
>gi|132329|sp|P00796.1|RENI2_MOUSE RecName: Full=Renin-2; AltName: Full=Angiotensinogenase; AltName:
Full=Submandibular gland renin; Contains: RecName:
Full=Renin-2 heavy chain; Contains: RecName:
Full=Renin-2 light chain; Flags: Precursor
gi|15029868|gb|AAH11157.1| Ren2 protein [Mus musculus]
Length = 401
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 186/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC
Sbjct: 66 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 241
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 301 LIMQALGA 308
>gi|291409620|ref|XP_002721076.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 168/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G L+NY+D +YFG IGIG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTPNLATKYLPKAAFDSVPTETLENYLDTEYFGTIGIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TP Q+FTVIFDTGSSNLWVPS C S AC H+K+ SST++ +S I YGTG++
Sbjct: 83 TPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+GF D VK+G++ +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + ++D+E G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSEDLFSVYL--SSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA GC AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADGCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|148669271|gb|EDL01218.1| mCG6933 [Mus musculus]
Length = 401
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 186/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC
Sbjct: 66 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 241
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 301 LIMQALGA 308
>gi|15079273|gb|AAH11473.1| Ren2 protein [Mus musculus]
Length = 401
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 186/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC
Sbjct: 66 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 183 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 241
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 301 LIMQALGA 308
>gi|207340638|gb|EDZ68928.1| YPL154Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 385
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 162/252 (64%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 317
Query: 318 ---TGIVSQECK 326
TG + +C
Sbjct: 318 KGWTGQYTLDCN 329
>gi|444731560|gb|ELW71913.1| Cathepsin D [Tupaia chinensis]
Length = 684
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 142/354 (40%), Positives = 186/354 (52%), Gaps = 53/354 (14%)
Query: 26 PNGGLYRIGLKKRKFDLNNRVAARLDSK-EGESFRTSIRKYSLRGNLGESGDADIVA-LK 83
P L RI L KF R + E I KYS + + LK
Sbjct: 17 PAAALIRIPL--HKFPSIRRTLTEMGGPVENLIAHEPISKYSQEAPTPAATKGPVPEILK 74
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYK 142
NYMDAQY+GEIGIGTPPQ FTVIFDTGS+NLWVPS C AC+FH KY S +SSTY
Sbjct: 75 NYMDAQYYGEIGIGTPPQCFTVIFDTGSANLWVPSIHCGMLDFACWFHHKYNSKKSSTYA 134
Query: 143 KNGKSADIHYGTGAI----SGFFSEDHVKIG---DLVVKDQE------------------ 177
KNG S DIHY +G+ E ++G D V++DQE
Sbjct: 135 KNGSSFDIHYRSGSQWLRQPLRVPEPGHRVGTDIDPVLRDQELWGNMSRGDSQPHTEPSC 194
Query: 178 ----------------FIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
F EAT++P +TFL AKFDGILG+ + ISV VPV+ N++ Q
Sbjct: 195 WKVPCHTVSVRVDKQTFGEATKQPGITFLAAKFDGILGMAYPRISVDNVVPVFDNLMKQK 254
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI-DG 280
LV + +F+F+ NR+ + GGE++ GG+D +Y G Y VT+K YWQ M + + DG
Sbjct: 255 LVEKNIFAFYLNRDPSGQPGGELMLGGVDTKYYTGSLDYYNVTRKAYWQIHMDKLEVGDG 314
Query: 281 QTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVS 330
T C GC I D+GTSL+ GP + +++ A+GA ++ E C+ V S
Sbjct: 315 LT--LCQEGCEVIVDTGTSLIVGPVDEVRELHKAMGAVPLIQGEYMIPCEKVAS 366
>gi|401838744|gb|EJT42213.1| PEP4-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 405
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 163/251 (64%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + G G AAI D+GTSL+ P+ + +N +GA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGL-GDEYAELEGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 318 KGWTGQYTLDC 328
>gi|14278413|pdb|1G0V|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant,
Mvv
Length = 329
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 162/252 (64%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Query: 318 ---TGIVSQECK 326
TG + +C
Sbjct: 242 KGSTGQYTLDCN 253
>gi|365758066|gb|EHM99929.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 405
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 120/251 (47%), Positives = 163/251 (64%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLTIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDTISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + G G AAI D+GTSL+ P+ + +N +GA
Sbjct: 260 TWLPVRRKAYWEVKFEGIGL-GDEYAELEGHGAAI-DTGTSLITLPSGLAEMINAELGAK 317
Query: 318 ---TGIVSQEC 325
TG + +C
Sbjct: 318 KGWTGQYTLDC 328
>gi|118150650|ref|NP_112470.2| renin-2 [Mus musculus]
Length = 424
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 186/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 34 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 88
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC
Sbjct: 89 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 146
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 147 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 205
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 206 MLAQFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 264
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 265 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 323
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 324 LIMQALGA 331
>gi|323335315|gb|EGA76604.1| Pep4p [Saccharomyces cerevisiae Vin13]
Length = 368
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 162/252 (64%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 44 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 102
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 103 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 162
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 163 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 222
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 223 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 280
Query: 318 ---TGIVSQECK 326
TG + +C
Sbjct: 281 KGWTGQYTLDCN 292
>gi|354543755|emb|CCE40477.1| hypothetical protein CPAR2_105130 [Candida parapsilosis]
Length = 427
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 118/238 (49%), Positives = 151/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ HSKY SSTY
Sbjct: 104 LTNYLNAQYFTEIQIGTPGQTFKVILDTGSSNLWVPSQDCT-SLACFLHSKYDHDASSTY 162
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D V IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 163 KANGSEFSIQYGSGSMEGYISQDTVSIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 222
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN +NQ L++ P F F+ + N DEE+GG FGG D ++G+ T+
Sbjct: 223 YDTISVNKIVPPIYNAINQDLLDAPQFGFYLGDTNKDEEDGGVATFGGYDESLFQGKITW 282
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ + + + G A D+GTSL+ P+T+ +N IGAT
Sbjct: 283 LPVRRKAYWEVAFEGIGLGDEYAELFKTGAA--IDTGTSLITLPSTLAEIINSKIGAT 338
>gi|401623301|gb|EJS41405.1| pep4p [Saccharomyces arboricola H-6]
Length = 405
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 122/256 (47%), Positives = 165/256 (64%), Gaps = 18/256 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 81 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 140 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 200 LGYDSISVDKVVPPFYNAIQQDLLDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDI 259
Query: 259 TYVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
T++PV +K YW+ +GD + + G AAI D+GTSL+ P+ + +N
Sbjct: 260 TWLPVRRKAYWEVKFEGIGLGDEFAELENHG------AAI-DTGTSLITLPSGLAEMINA 312
Query: 314 AIGA----TGIVSQEC 325
IGA TG + +C
Sbjct: 313 EIGAKKGWTGQYTLDC 328
>gi|7766834|pdb|1DP5|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant
Inhibitor
gi|7766836|pdb|1DPJ|A Chain A, The Structure Of Proteinase A Complexed With Ia3 Peptide
Inhibitor
gi|22218637|pdb|1FMU|A Chain A, Structure Of Native Proteinase A In P3221 Space Group.
gi|22218638|pdb|1FMX|A Chain A, Structure Of Native Proteinase A In The Space Group P21
gi|22218639|pdb|1FMX|B Chain B, Structure Of Native Proteinase A In The Space Group P21
gi|225346|prf||1301217A proteinase A,Asp
Length = 329
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 162/252 (64%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Query: 318 ---TGIVSQECK 326
TG + +C
Sbjct: 242 KGWTGQYTLDCN 253
>gi|195339961|ref|XP_002036585.1| GM18746 [Drosophila sechellia]
gi|194130465|gb|EDW52508.1| GM18746 [Drosophila sechellia]
Length = 392
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 150/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L+N M+ +Y+G I IGTP Q F ++FDTGS+NLWVPS+ C S AC H+KY S SST
Sbjct: 68 LQNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSASCPASNTACQRHNKYNSAASST 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+ I YGTG++SGF S D V I + ++DQ F EA EP TF+ A F GILGL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSTDTVTIAGISIQDQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F I+V P + NMV+QGL++EPV SF+ R GGE++ GG+D Y+G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPV+ YWQF + + +G GC AIAD+GTSL+A P ++N +GAT
Sbjct: 248 VPVSVPAYWQFTVNTIKTNGI---LLCNGCQAIADTGTSLIAVPLAAYRKINRQLGAT 302
>gi|2624629|pdb|2JXR|A Chain A, Structure Of Yeast Proteinase A
gi|10835733|pdb|1FQ4|A Chain A, Crystal Structure Of A Complex Between Hydroxyethylene
Inhibitor Cp- 108,420 And Yeast Aspartic Proteinase A
gi|10835734|pdb|1FQ5|A Chain A, X-Ray Struture Of A Cyclic Statine Inhibitor Pd-129,541
Bound To Yeast Proteinase A
gi|10835735|pdb|1FQ6|A Chain A, X-Ray Structure Of Glycol Inhibitor Pd-133,450 Bound To
Saccharopepsin
gi|10835736|pdb|1FQ7|A Chain A, X-Ray Structure Of Inhibitor Cp-72,647 Bound To
Saccharopepsin
gi|10835737|pdb|1FQ8|A Chain A, X-Ray Structure Of Difluorostatine Inhibitor Cp81,198
Bound To Saccharopepsin
Length = 329
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 118/252 (46%), Positives = 162/252 (64%), Gaps = 8/252 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQNF VI DTGSSNLWVPS++C S+AC+ HSKY SS
Sbjct: 5 VPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNEC-GSLACFLHSKYDHEASS 63
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YGTG++ G+ S+D + IGDL + Q+F EAT EP LTF KFDGILG
Sbjct: 64 SYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDGILG 123
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
LG+ ISV K VP +YN + Q L++E F+F+ + + D E GGE FGG+D +KG+
Sbjct: 124 LGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDI 183
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 184 TWLPVRRKAYWEVKFEGIGLGDEYAELESHGAA--IDTGTSLITLPSGLAEMINAEIGAK 241
Query: 318 ---TGIVSQECK 326
TG + +C
Sbjct: 242 KGWTGQYTLDCN 253
>gi|195471992|ref|XP_002088286.1| GE18491 [Drosophila yakuba]
gi|194174387|gb|EDW87998.1| GE18491 [Drosophila yakuba]
Length = 392
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 117/238 (49%), Positives = 149/238 (62%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ +Y+G I IGTP Q F ++FDTGS+NLWVPSS C S IAC H+KY S SST
Sbjct: 68 LHNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSSSCPASNIACKKHNKYNSAASST 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+ I YGTG++SG S D V I + ++DQ F EA EP TF+ A F GILGL
Sbjct: 128 YVANGEEFAIEYGTGSLSGILSTDTVTIAGISIQDQTFGEALNEPGTTFVDAPFAGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F I+V P + NMV+QGL++EPV SF+ R GGE++ GG+D YKG TY
Sbjct: 188 AFSAIAVDGVTPPFDNMVSQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYKGSLTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPV+ YWQF + + +G C GC AIAD+GTSL+ P ++N +GAT
Sbjct: 248 VPVSVPAYWQFAVNTIKTNGIV--LC-NGCQAIADTGTSLIVAPLAAYRKINRQLGAT 302
>gi|407260952|ref|XP_003946102.1| PREDICTED: renin-1-like [Mus musculus]
Length = 400
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 177/309 (57%), Gaps = 12/309 (3%)
Query: 14 LCLLLFPVVFSTPN--GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
L LL P FS P RI LKK V L+ + + R S +
Sbjct: 6 LLLLWSPCTFSLPTRTATFERIPLKKMP-----SVREILEERGVDMTRLSAERGVFTKRP 60
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
V L NY++ QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC H
Sbjct: 61 SLINLTSPVVLTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIH 120
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
S Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F+
Sbjct: 121 SLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEVTELPLIPFM 179
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGG 248
LAKFDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 180 LAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRKTKGSHLLGGEVVLGG 239
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
DP HY+G YV +++ WQ M V + G +T C GCA + D+G+S ++ PT+ +
Sbjct: 240 SDPQHYQGNFHYVSISKTDSWQITMKGVSV-GSSTLLCEEGCAVVVDTGSSFISAPTSSL 298
Query: 309 TQVNHAIGA 317
+ A+GA
Sbjct: 299 KLIMQALGA 307
>gi|126310959|ref|XP_001372683.1| PREDICTED: chymosin-like [Monodelphis domestica]
Length = 383
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 120/275 (43%), Positives = 167/275 (60%), Gaps = 18/275 (6%)
Query: 64 KYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF 123
KY L G + D L NY+D+QYFG+I IGTPPQ FTV+FDTGSSNLWVPS C
Sbjct: 53 KYQLYGEAAKVTDE---PLTNYLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN- 108
Query: 124 SIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATR 183
S AC H ++ S+T++ + I YGTG++ G D V + +VV DQ F +T+
Sbjct: 109 SDACQNHHRFNPASSTTFRSTQEPLSIQYGTGSMEGVLGYDTVTVSQIVVPDQIFGLSTQ 168
Query: 184 EPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGE 243
EP F ++FDGILGLG+ ++ +A PV+ NM+N+ LV + +FS + +R++ +G
Sbjct: 169 EPGEIFTYSEFDGILGLGYPSLAEDQATPVFDNMMNKNLVAQDLFSVYMSRDS---QGSM 225
Query: 244 IVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAG 303
++ G +DP +Y G +VPVT++GYWQF + + ++GQ C GGC AI D+GTSLL G
Sbjct: 226 LILGAIDPSYYTGSLHWVPVTEQGYWQFSVDSITVNGQVVA-CEGGCQAILDTGTSLLVG 284
Query: 304 PTTIITQVNHAIGATGIVSQECKAVVSQYGEEIIN 338
P+ I + IGAT QYGE IN
Sbjct: 285 PSYDIANIQSIIGATQ----------GQYGEYDIN 309
>gi|149245862|ref|XP_001472682.1| PREDICTED: renin-1-like isoform 1 [Mus musculus]
Length = 425
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 178/307 (57%), Gaps = 11/307 (3%)
Query: 14 LCLLLFPVVFSTPN--GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
L LL P FS P RI LKK V L+ + + R S +
Sbjct: 34 LLLLWSPCTFSLPTRTATFERIPLKKMP-----SVREILEERGVDMTRLSAERGVFTKRP 88
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
V L NY++ QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC H
Sbjct: 89 SLINLTSPVVLTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIH 148
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
S Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F+
Sbjct: 149 SLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEVTELPLIPFM 207
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
LAKFDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR + GGE+V GG D
Sbjct: 208 LAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGS-HLLGGEVVLGGSD 266
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY+G YV +++ WQ M V + G +T C GCA + D+G+S ++ PT+ +
Sbjct: 267 PQHYQGNFHYVSISKTDSWQITMKGVSV-GSSTLLCEEGCAVVVDTGSSFISAPTSSLKL 325
Query: 311 VNHAIGA 317
+ A+GA
Sbjct: 326 IMQALGA 332
>gi|449542760|gb|EMD33738.1| hypothetical protein CERSUDRAFT_56642 [Ceriporiopsis subvermispora
B]
Length = 395
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 128/259 (49%), Positives = 162/259 (62%), Gaps = 9/259 (3%)
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
SG V L NY +AQYF E+ +GTPPQNF VI DTGSSNLWVPS C SIAC+ HSK
Sbjct: 67 HSGPGHNVLLSNYANAQYFTEVSLGTPPQNFKVILDTGSSNLWVPSVHC-MSIACFMHSK 125
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
Y S +SS+Y NG S +I YG+G++ G S+D + IGDL + +Q+F EAT+EP L+F
Sbjct: 126 YDSSKSSSYNANGSSFEIQYGSGSMQGIVSQDTLSIGDLNITNQDFAEATKEPGLSFTFG 185
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
KFDGILGL + ISV P +YNMV QGL++ P+FSF + D GGE +FGG D
Sbjct: 186 KFDGILGLAYNSISVNYITPPFYNMVEQGLLDNPIFSF---KLGDAPLGGEAIFGGTDES 242
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y GE Y PV ++ YW+ ++ V + Q F G A D+GTSL+A PT T +N
Sbjct: 243 AYTGEIIYAPVRRQAYWEVELDKVTLGDQVFEFQDTGAA--IDTGTSLIAVPTAQATAIN 300
Query: 313 HAIGAT---GIVSQECKAV 328
IGAT G EC +
Sbjct: 301 KLIGATSKSGTYVVECSTI 319
>gi|340518711|gb|EGR48951.1| predicted protein [Trichoderma reesei QM6a]
Length = 395
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 125/243 (51%), Positives = 162/243 (66%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ+F V+ DTGSSNLWVPS C SIAC+ HS Y S SS
Sbjct: 75 VPVTNFMNAQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQSCN-SIACFLHSTYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G+++GF S D V IGDL +K Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKPNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y MVNQ L++EPVF+F+ + +EG E VFGG+D HY+G+
Sbjct: 194 LGYDTISVNGIVPPFYQMVNQKLIDEPVFAFYLGSS---DEGSEAVFGGVDDAHYEGKIE 250
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ + +N
Sbjct: 251 YIPLRRKAYWEVDLDSIAFGDEVAELENTG-------AILDTGTSLNVLPSGLAELLNAE 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|432116085|gb|ELK37212.1| Cathepsin E [Myotis davidii]
Length = 396
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 181/294 (61%), Gaps = 7/294 (2%)
Query: 28 GGLYRIGLKKRKFDLNNRVAARLDSKE-GESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
G L+R+ L++R+ L ++ AR E +S + +++ + +S + +V NY+
Sbjct: 19 GPLHRVPLRRRQ-SLRKKLRARGQLSEFWKSQHLDMIQFTESCTMDQSVNEPLV---NYL 74
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
D +YFG I IG+PPQNFTVIFDTGSSNLWVPS C S AC H ++ +SSTY G
Sbjct: 75 DMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SPACKTHPRFSPSQSSTYSSPGS 133
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTG++SG ED V + L V Q+F E+ EP TF+ A+FDGILGLG+ ++
Sbjct: 134 HFFIQYGTGSLSGVIGEDQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILGLGYPSLA 193
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VG PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +VPVT++
Sbjct: 194 VGGVTPVFDNMMAQNLVDVPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQ 253
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
GYWQ + + + G FC+ GC AI D+GTSL+ GP I Q+ AIGA +
Sbjct: 254 GYWQIALDTIQVGGAVM-FCSEGCQAIVDTGTSLITGPPAEIKQLQKAIGAEPV 306
>gi|395534115|ref|XP_003769093.1| PREDICTED: gastricsin-like [Sarcophilus harrisii]
Length = 392
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 110/236 (46%), Positives = 151/236 (63%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWV S C S AC H ++ +SSTY
Sbjct: 67 LANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVSSIYCQ-SQACTNHPQFNPNQSSTY 125
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
NG++ + YGTG+++G F D V I + + +QEF + EP +F+ A+FDGILGL
Sbjct: 126 SSNGQTFSLQYGTGSLTGVFGYDTVTIQGISITNQEFGLSETEPGTSFVYAQFDGILGLA 185
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS G A V ++ + L+N PVF+F+ + N + GGE+ FGG+D + G+ +
Sbjct: 186 YPSISSGGATTVMQGLLQENLINAPVFAFYLSGNENSNNGGEVTFGGVDTSMFTGDIYWA 245
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVTQ+ YWQ + I GQ TG+C+ GC A+ D+GTSLL P I +++ IGA
Sbjct: 246 PVTQEAYWQIAINGFSIGGQATGWCSEGCQAVVDTGTSLLTAPQQIFSELMQYIGA 301
>gi|13676837|ref|NP_112469.1| renin-1 precursor [Mus musculus]
gi|132327|sp|P06281.1|RENI1_MOUSE RecName: Full=Renin-1; AltName: Full=Angiotensinogenase; AltName:
Full=Kidney renin; Flags: Precursor
gi|53931|emb|CAA34636.1| unnamed protein product [Mus musculus]
gi|26342875|dbj|BAC35094.1| unnamed protein product [Mus musculus]
gi|26351563|dbj|BAC39418.1| unnamed protein product [Mus musculus]
gi|38512029|gb|AAH61053.1| Renin 1 structural [Mus musculus]
gi|148707703|gb|EDL39650.1| mCG131545 [Mus musculus]
Length = 402
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 178/307 (57%), Gaps = 11/307 (3%)
Query: 14 LCLLLFPVVFSTPN--GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
L LL P FS P RI LKK V L+ + + R S
Sbjct: 11 LLLLWSPCTFSLPTRTATFERIPLKKMP-----SVREILEERGVDMTRLSAEWGVFTKRP 65
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
+ V L NY++ QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC H
Sbjct: 66 SLTNLTSPVVLTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIH 125
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
S Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F+
Sbjct: 126 SLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPFM 184
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
LAKFDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR + GGE+V GG D
Sbjct: 185 LAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGS-HLLGGEVVLGGSD 243
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY+G YV +++ WQ M V + G +T C GCA + D+G+S ++ PT+ +
Sbjct: 244 PQHYQGNFHYVSISKTDSWQITMKGVSV-GSSTLLCEEGCAVVVDTGSSFISAPTSSLKL 302
Query: 311 VNHAIGA 317
+ A+GA
Sbjct: 303 IMQALGA 309
>gi|200688|gb|AAA40043.1| renin (Ren-1-d) [Mus musculus]
gi|148669208|gb|EDL01155.1| mCG129412 [Mus musculus]
Length = 402
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 178/307 (57%), Gaps = 11/307 (3%)
Query: 14 LCLLLFPVVFSTPN--GGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
L LL P FS P RI LKK V L+ + + R S +
Sbjct: 11 LLLLWSPCTFSLPTRTATFERIPLKKMP-----SVREILEERGVDMTRLSAERGVFTKRP 65
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
V L NY++ QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC H
Sbjct: 66 SLINLTSPVVLTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIH 125
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
S Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F+
Sbjct: 126 SLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDVVTVGGITVT-QTFGEVTELPLIPFM 184
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
LAKFDG+LG+GF +VG PV+ ++++QG++ E VFS ++NR + GGE+V GG D
Sbjct: 185 LAKFDGVLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGS-HLLGGEVVLGGSD 243
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
P HY+G YV +++ WQ M V + G +T C GCA + D+G+S ++ PT+ +
Sbjct: 244 PQHYQGNFHYVSISKTDSWQITMKGVSV-GSSTLLCEEGCAVVVDTGSSFISAPTSSLKL 302
Query: 311 VNHAIGA 317
+ A+GA
Sbjct: 303 IMQALGA 309
>gi|385301236|gb|EIF45441.1| proteinase a [Dekkera bruxellensis AWRI1499]
Length = 429
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 119/241 (49%), Positives = 155/241 (64%), Gaps = 13/241 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYM+AQYF EI +GTP Q F VI DTGSSNLWVPSS C S+ACY H+KY +SSTY
Sbjct: 108 LTNYMNAQYFSEIELGTPGQKFKVILDTGSSNLWVPSSDCA-SLACYLHTKYDHEQSSTY 166
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
KKNG I YG+G++ G+ S+D +KI DL + +Q+F EAT EP L F KFDGILGLG
Sbjct: 167 KKNGSEFSIQYGSGSMKGYISQDTLKISDLEITNQDFAEATEEPGLAFAFGKFDGILGLG 226
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV VP YN +N GL++ P FSF+ + E+GG FGG+D + G+ T++
Sbjct: 227 YDTISVNHIVPPVYNAINSGLLDNPQFSFYLGDTSKTEDGGVCTFGGIDDSKFTGKITWL 286
Query: 262 PVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
PV +K YW+ +GD + Q+ G AAI D+GTSL+ P+ + +N IG
Sbjct: 287 PVRRKAYWEVKFEGIGLGDEYAELQSHG------AAI-DTGTSLIVLPSQLAEILNSEIG 339
Query: 317 A 317
A
Sbjct: 340 A 340
>gi|313220508|emb|CBY31359.1| unnamed protein product [Oikopleura dioica]
gi|313229843|emb|CBY07548.1| unnamed protein product [Oikopleura dioica]
Length = 397
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/280 (46%), Positives = 172/280 (61%), Gaps = 25/280 (8%)
Query: 69 GNLGESGDADIVA--LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI- 125
GNL AD+ L NY DAQYFG + IGTP QNFTVIFDTGSSNLWVPSSKC I
Sbjct: 30 GNLHSKYRADVPTNELTNYFDAQYFGPLTIGTPAQNFTVIFDTGSSNLWVPSSKCDPHIG 89
Query: 126 ---ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI----GDLVVKDQEF 178
AC H+KY S SST+ ++G +I YGTG++ GF S D + I G L+ K F
Sbjct: 90 TGFACLNHNKYDSDLSSTWTEDGTKFEIQYGTGSMVGFQSTDDIDIAPGSGGLIAKQATF 149
Query: 179 IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-- 236
EA EP +TFL A FDGI+GL + ISV A P++ ++ +G VN VF+F+ +RN+
Sbjct: 150 AEAVEEPGITFLAAAFDGIMGLAYPSISVNGATPIYNQLMEEGQVNG-VFAFFVHRNSSK 208
Query: 237 --DEEEGGEIVFGGMDPDHYKG----EHTYVPVTQKGYWQFDMGDVMIDG------QTTG 284
+ + GGEI +GG++P+ ++G + V+++ YWQ +MG V ++G Q
Sbjct: 209 PGESDIGGEIAWGGVNPERFEGTFPDSFIWHEVSRQAYWQVNMGTVTVNGDGFVSDQPIV 268
Query: 285 FCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
C GGC I DSGTSL+ GPT I Q+N AIGA ++ E
Sbjct: 269 MCEGGCQGIVDSGTSLITGPTEITDQINKAIGAIEFIAGE 308
>gi|194858591|ref|XP_001969211.1| GG24067 [Drosophila erecta]
gi|190661078|gb|EDV58270.1| GG24067 [Drosophila erecta]
Length = 372
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 123/276 (44%), Positives = 169/276 (61%), Gaps = 16/276 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ + K + N V R + +S+ RT + SLR D L N M+
Sbjct: 19 LHRVPILKEQ----NFVKTRQNVLAEKSYLRTKYQLPSLRS-------VDEEQLSNSMNM 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y S SSTY NG+S
Sbjct: 68 AYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTCQ-SDACKTHNQYDSSASSTYVANGESF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G+ S D V + L ++ Q F E+T EP F A FDGILG+ ++ ++V
Sbjct: 127 SIQYGTGSLTGYLSTDTVDVNGLSIQSQTFAESTNEPGTNFNDANFDGILGMAYESLAVD 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P +YNMV+QGLV+ VFSF+ R+ +GGE++FGG D Y G TYVP++++GY
Sbjct: 187 GVTPPFYNMVSQGLVDSSVFSFYLARDGTSTKGGELIFGGSDSSLYSGSLTYVPISEQGY 246
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
WQF+M IDG + C C AIAD+GTSL+ P
Sbjct: 247 WQFNMASSSIDGFS--LC-DDCQAIADTGTSLIVAP 279
>gi|223468|prf||0807285A renin precursor
Length = 401
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 185/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC
Sbjct: 66 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGI 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+G +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 241
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 301 LIMQALGA 308
>gi|409050032|gb|EKM59509.1| hypothetical protein PHACADRAFT_250062 [Phanerochaete carnosa
HHB-10118-sp]
Length = 407
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 122/258 (47%), Positives = 163/258 (63%), Gaps = 19/258 (7%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
+ L+N+M+AQYF I IGTPPQ+F VI DTGSSNLWVPS++C SIAC+ H KY SG SS
Sbjct: 87 LPLQNFMNAQYFTTIEIGTPPQSFNVILDTGSSNLWVPSTQCT-SIACFLHKKYDSGSSS 145
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF S D + +GD+ + Q+F EAT+EP L F KFDGILG
Sbjct: 146 TYKPNGSEFSIQYGSGSMEGFVSRDVLTMGDITIGQQDFAEATKEPGLAFAFGKFDGILG 205
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + I+V P YNM +GL+ +PVF+F + EE+ GE FGG+D ++G+
Sbjct: 206 LAYDTIAVNHITPPHYNMFEKGLIEKPVFAFRL--GSTEEDAGEATFGGIDESAFEGKLH 263
Query: 260 YVPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
VPV +K YW+ + +GD ++ + TG AAI D+GTSL+A PT + +N
Sbjct: 264 RVPVRRKAYWEVELEKVRLGDDELELEDTG------AAI-DTGTSLIALPTDMAEMINAQ 316
Query: 315 IGA----TGIVSQECKAV 328
IGA G + EC V
Sbjct: 317 IGAKRGWNGQYTVECSTV 334
>gi|432090679|gb|ELK24020.1| Renin [Myotis davidii]
Length = 404
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 126/294 (42%), Positives = 172/294 (58%), Gaps = 18/294 (6%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL EG F G + V L NY+D QY+
Sbjct: 42 RESLKERGVDV-----ARLLRAEGSQFS---------GRPPFTNSTAPVVLTNYLDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S SSTY +NG I
Sbjct: 88 GEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSLESSTYMENGTEFTI 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YG+G ++GF S+D V +G + V Q F E T P + F+LAKFDG+LG+GF +V
Sbjct: 148 QYGSGKVNGFLSQDAVTVGGITVT-QTFGEVTELPLMPFMLAKFDGVLGMGFPAQAVAGV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
PV+ ++++Q ++ E VFS +++RN+ GGEIV GG DP +Y+G YV +++ G WQ
Sbjct: 207 TPVFDHILSQRVLKEDVFSVYYSRNS-HLLGGEIVLGGSDPQYYQGNFHYVSISKTGSWQ 265
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
M V + T C GC A+ D+G S ++GPT+ + + +GA + + E
Sbjct: 266 IKMKGVSVRSSTL-LCEEGCMAVVDTGASYISGPTSSLRLLMETLGAKELSTDE 318
>gi|335955136|gb|AEH76574.1| pepsinogen [Epinephelus bruneus]
Length = 375
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/304 (43%), Positives = 179/304 (58%), Gaps = 22/304 (7%)
Query: 38 RKFDLNNRVAARLDSKEGESFRTSIRKYSLRG--NLGESGDADIVALKNYMDAQYFGEIG 95
K L AR D +E + +KY +SG ++ N D Y+G I
Sbjct: 18 HKIPLIKGKTARQDLQEKGLWEEYRKKYPYNPMVKFQQSGTE---SMTNDADLSYYGVIS 74
Query: 96 IGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
IGTPPQ+FTVIFDTGSSNLWVPS C S AC H K+ +SST+K + I YGTG
Sbjct: 75 IGTPPQSFTVIFDTGSSNLWVPSVYCN-SQACQNHRKFNPQQSSTFKWGDQPLSIQYGTG 133
Query: 156 AISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
+++G + D+V++G + V++Q F I T P + + A DGILGL FQ I+ VPV+
Sbjct: 134 SMTGRLAIDNVEVGGITVQNQVFGISQTEAPFMAHMAA--DGILGLAFQTIAADNVVPVF 191
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
NMV QGLV++P+FS + + + D +G E+VFGG+D HY G+ T+VP+T YWQ M
Sbjct: 192 DNMVKQGLVSQPLFSVYLSSHGD--QGSEVVFGGIDNSHYTGQVTWVPLTSATYWQIKMD 249
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGE 334
V I+GQT CAGGC AI D+GTSL+ GPT I +N +GA+ +QYGE
Sbjct: 250 GVKINGQTVA-CAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS----------TNQYGE 298
Query: 335 EIIN 338
+N
Sbjct: 299 STVN 302
>gi|126681053|gb|ABO26561.1| cathepsin D-like aspartic protease [Ixodes ricinus]
Length = 382
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/273 (45%), Positives = 166/273 (60%), Gaps = 12/273 (4%)
Query: 57 SFRTSIRKY-SLRGNLGESG-----DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTG 110
+FR + ++ S+R L E G L N +D +Y+G I IGTPPQ+F VIFDTG
Sbjct: 22 AFRIPLTRFKSVRKQLAEEGIYIHEGPYPEPLVNLLDVEYYGPISIGTPPQDFQVIFDTG 81
Query: 111 SSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGD 170
S+NLW+PSSKC + C H +Y S +SSTY+ +G++ I YG+G + GF S+D +IG
Sbjct: 82 SANLWLPSSKCT-TKYCLHHHRYDSSKSSTYEADGRNFTIVYGSGNVEGFISKDVCRIGS 140
Query: 171 LVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEP-VFS 229
V Q EA + L A FDGILGL + I+V VPV+ NM+ QGL+ E VFS
Sbjct: 141 AKVSGQPLGEALVVGGESLLEAPFDGILGLAYPSIAVDGVVPVFDNMMKQGLLGEQNVFS 200
Query: 230 FWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTG----F 285
+ NR+ +EGGE++FGG+D DHYKG TYVPVT KGYWQF + V +
Sbjct: 201 VYLNRDPSSKEGGEVLFGGIDHDHYKGSITYVPVTAKGYWQFHVDGVKSVSASKSAPELL 260
Query: 286 CAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
C GC AIAD+GTSL+ GP + +N +G T
Sbjct: 261 CKDGCEAIADTGTSLITGPPEEVDSLNQYLGGT 293
>gi|254596794|gb|ACT75642.1| pepsinogen A [Channa argus]
Length = 361
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/258 (44%), Positives = 167/258 (64%), Gaps = 16/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IGTPPQ+F+VIFD+GSSNLWVPS C S AC H+K+ +SS++
Sbjct: 45 MTNDADMSYYGVISIGTPPQSFSVIFDSGSSNLWVPSVYCSSSQACQNHNKFNPQQSSSF 104
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
+ NG+S I YGTG+++G+ D V +G + V +Q F + + P + + A DGILGL
Sbjct: 105 QWNGESLSIQYGTGSMTGYLGADTVGVGGVSVANQVFGLSQSEAPFMAHMQA--DGILGL 162
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV+QGLV++P+FS + + N+ +G E+VFGG+D +HY G+ +
Sbjct: 163 AFQSIASDNVVPVFNNMVSQGLVSQPMFSVYLSSNS--AQGSEVVFGGVDSNHYTGQIAW 220
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P+T YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I+ +N +GA+
Sbjct: 221 IPLTSATYWQIKMDSVSINGQTVA-CSGGCQAIIDTGTSLIVGPTSDISNINSWVGAS-- 277
Query: 321 VSQECKAVVSQYGEEIIN 338
QYG+ +N
Sbjct: 278 --------TDQYGDATVN 287
>gi|260940805|ref|XP_002615242.1| hypothetical protein CLUG_04124 [Clavispora lusitaniae ATCC 42720]
gi|238850532|gb|EEQ39996.1| hypothetical protein CLUG_04124 [Clavispora lusitaniae ATCC 42720]
Length = 406
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 151/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 84 LTNYLNAQYFTEIQLGTPGQTFKVILDTGSSNLWVPSRDCS-SLACFLHTKYDHDESSTY 142
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+GA+ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 143 KANGSEFSIQYGSGAMEGYISQDVLAIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 202
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN + QGL++ P F F+ + N +EE GG FGG D +KG+ T+
Sbjct: 203 YDTISVNKIVPPVYNAIAQGLLDAPQFGFYLGDTNKNEENGGVATFGGYDEALFKGDLTW 262
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ + + + A G A D+GTSL+ P+++ +N IGAT
Sbjct: 263 LPVRRKAYWEVSFDGIGLGDEYAELTATGAA--IDTGTSLITLPSSLAEIINAKIGAT 318
>gi|22218078|dbj|BAC07516.1| pepsinogen III [Oryctolagus cuniculus]
Length = 387
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 124/281 (44%), Positives = 167/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G L+NY+D +YFG IGIG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTPNLATKYLPKAAFDSVPTETLENYLDTEYFGTIGIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TP Q+FTVIFDTGSSNLWVPS C S AC H+K+ SST++ +S I YGTG++
Sbjct: 83 TPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNKFNPEDSSTFQATSESLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+GF D VK+G++ +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + ++D+E G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSEDLFSVYL--SSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA C AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADSCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|403217759|emb|CCK72252.1| hypothetical protein KNAG_0J01710 [Kazachstania naganishii CBS
8797]
Length = 415
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/251 (46%), Positives = 160/251 (63%), Gaps = 8/251 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQY+ +I +GTPPQ F VI DTGSSNLWVPSS+C S+AC+ H KY SS
Sbjct: 91 VPLSNYLNAQYYTDITLGTPPQQFKVILDTGSSNLWVPSSEC-GSLACFLHEKYDHSASS 149
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+YK NG I YG+G++ G+ S+D + IGDL + Q+F EAT EP L F KFDGILG
Sbjct: 150 SYKANGTDFSIQYGSGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLAFAFGKFDGILG 209
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEH 258
L + ISV K VP +YN + Q L++E F+F+ + N D E+GGE +FGG+D Y G+
Sbjct: 210 LAYDTISVDKVVPPFYNALEQDLLDEAKFAFYLGDTNKDAEDGGEAIFGGVDKSKYTGDV 269
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA- 317
T++PV +K YW+ + + + + + G A D+GTSL+ P+ + +N IGA
Sbjct: 270 TWLPVRRKAYWEVKLEGLGLGDEYAELESHGAA--IDTGTSLITLPSGLAEIINSEIGAK 327
Query: 318 ---TGIVSQEC 325
TG + EC
Sbjct: 328 KGWTGQYTLEC 338
>gi|18152941|gb|AAB68519.2| proteinase A [Ogataea angusta]
gi|320580237|gb|EFW94460.1| proteinase A [Ogataea parapolymorpha DL-1]
Length = 413
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/237 (48%), Positives = 153/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q+F VI DTGSSNLWVPSS C S+ACY H+KY SSTY
Sbjct: 91 LTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNLWVPSSDC-TSLACYLHTKYDHDESSTY 149
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+KNG S I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 150 QKNGSSFAIQYGSGSLEGYVSQDTLTIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 209
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV + VP YN +N GL++ P F F+ + + E++GGE FGG D Y G+ T+
Sbjct: 210 YDTISVNRIVPPIYNAINLGLLDTPQFGFYLGDTSKSEQDGGEATFGGYDVSKYTGDITW 269
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+PV +K YW+ + + + G A D+GTSL+A P+ + +N IGA
Sbjct: 270 LPVRRKAYWEVKFSGIALGDEYAPLENTGAA--IDTGTSLIALPSQLAEILNSQIGA 324
>gi|149244964|ref|XP_001527016.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
gi|146449410|gb|EDK43666.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
Length = 429
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 151/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTPPQ F VI DTGSSNLWVPS C S+AC+ HSKY SS+Y
Sbjct: 106 LTNYLNAQYFTEIQLGTPPQTFKVILDTGSSNLWVPSKDCS-SLACFLHSKYDHDASSSY 164
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 165 KANGSEFSIQYGSGSMEGYISQDILSIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 224
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL++ P SF+ + N DE +GG FGG D ++G+ T+
Sbjct: 225 YDTISVNHIVPPVYNAINQGLLDSPQVSFYLGDTNKDENDGGVATFGGYDESLFQGKITW 284
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ + + + G A D+GTSL+ P+T+ +N IGAT
Sbjct: 285 LPVRRKAYWEVAFEGLGLGDEYAELIQTGAA--IDTGTSLITLPSTLAEIINAKIGAT 340
>gi|221048011|gb|ACL98113.1| pepsinogen [Epinephelus coioides]
Length = 311
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 122/259 (47%), Positives = 165/259 (63%), Gaps = 17/259 (6%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLWVPS C S AC H K+ +SST
Sbjct: 60 SMTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSST 118
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
+K + I YGTG+++G + D+V++G + V++Q F I T P + + A DGILG
Sbjct: 119 FKWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILG 176
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L FQ I+ VPV+ NMV QGLV++P+FS + + + E+G E+VFGG+D HY G+ T
Sbjct: 177 LAFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSSHG--EQGSEVVFGGIDSSHYTGQVT 234
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+VP+T YWQ M V I+GQT CAGGC AI D+GTSL+ GPT I +N +GA+
Sbjct: 235 WVPLTSATYWQIKMDGVKINGQTVA-CAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS- 292
Query: 320 IVSQECKAVVSQYGEEIIN 338
+QYGE +N
Sbjct: 293 ---------TNQYGESTVN 302
>gi|448517372|ref|XP_003867779.1| vacuolar aspartic proteinase [Candida orthopsilosis Co 90-125]
gi|380352118|emb|CCG22342.1| vacuolar aspartic proteinase [Candida orthopsilosis]
Length = 429
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 116/238 (48%), Positives = 151/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ HSKY SSTY
Sbjct: 106 LTNYLNAQYFTEIQIGTPGQTFKVILDTGSSNLWVPSQDCT-SLACFLHSKYDHDASSTY 164
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D V IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 165 KANGSEFSIQYGSGSMEGYISQDTVSIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 224
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV + VP YN +NQ L++ P F F+ + N DEE+GG FGG D ++G+ T+
Sbjct: 225 YDTISVNRIVPPIYNAINQELLDSPQFGFYLGDTNKDEEDGGVATFGGYDESLFQGKITW 284
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ + + + G A D+GTSL+ P+++ +N IGAT
Sbjct: 285 LPVRRKAYWEVSFEGIGLGDEYAELTKTGAA--IDTGTSLITLPSSLAEIINAKIGAT 340
>gi|162423778|gb|ABX89619.1| pepsinogen [Diplodus sargus]
Length = 376
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 192/328 (58%), Gaps = 30/328 (9%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEG--ESFRTSIRKYSLRGN 70
+L +L V FS L+R+ L K K A + ++G E FR Y+
Sbjct: 3 WLVVLSALVAFSE---CLHRMPLIKGK------TARQTLQEQGKWEEFRKQ-HPYNPMAK 52
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
+SG ++ N D Y+G + IGTPPQ+F+VIFDTGSSNLW+PS C S AC H
Sbjct: 53 FIQSGTE---SMTNDADLSYYGVVSIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACQNH 108
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
K+ +SST+K + I YGTG+++G+ + D V++G + V +Q F + E +
Sbjct: 109 KKFNPQQSSTFKWGNQQLSIQYGTGSMTGYLASDVVEVGGISVANQVFGISQTEAAFMAS 168
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+A DGILGL FQ I+ VPV+YNMV QGLV++P+FS + + N+ E+G E+VFGG D
Sbjct: 169 MAA-DGILGLAFQSIASDNVVPVFYNMVKQGLVSQPMFSVYLSGNS--EQGSEVVFGGTD 225
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
HY G+ T++P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I
Sbjct: 226 SSHYTGQITWIPLSSATYWQISMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINN 284
Query: 311 VNHAIGATGIVSQECKAVVSQYGEEIIN 338
+N +GA+ ++YGE +N
Sbjct: 285 MNSWVGAS----------TNEYGEATVN 302
>gi|200702|gb|AAA40050.1| renin [Mus musculus]
Length = 401
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 14/308 (4%)
Query: 14 LCLLLFPVVFSTPNGGLY-RIGLKKRKFDLNNRVAARLDSKEGESFRTSIR--KYSLRGN 70
L LL P FS P G + RI LKK V L+ + + R S ++ R +
Sbjct: 11 LLLLWSPCTFSLPTGTTFERIPLKKMP-----SVREILEERGVDMTRLSAEWDVFTKRSS 65
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYF 129
L + V L NY+++QY+GEIGIGTPPQ F V+FDTGS+NLWVPS+KC +AC
Sbjct: 66 LTDL--ISPVVLTNYLNSQYYGEIGIGTPPQTFKVMFDTGSANLWVPSTKCSRLYLACGI 123
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS Y S SS+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F
Sbjct: 124 HSLYESSDSSSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPF 182
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+LA+FDG+LG+G +VG PV+ ++++QG++ E VFS ++NR GGE+V GG
Sbjct: 183 MLAQFDGVLGMGLSRSAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGS 241
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
DP+HY+G+ YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ +
Sbjct: 242 DPEHYQGDFHYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLK 300
Query: 310 QVNHAIGA 317
+ A+GA
Sbjct: 301 LIMQALGA 308
>gi|291409611|ref|XP_002721072.1| PREDICTED: pepsin-3-like isoform 2 [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 168/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTLNLATKYLPKAAFDSVPTESLENYLDTEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS C S AC H+++ SST++ +S I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+GF D V +G++ +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + ++D++ G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSEDLFSVYL--SSDDDSGSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA GC AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADGCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|291409609|ref|XP_002721071.1| PREDICTED: pepsin-3-like isoform 1 [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 122/281 (43%), Positives = 168/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTLNLATKYLPKAAFDSVPTESLENYLDTEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS C S AC H+++ SST++ +S I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+GF D V +G++ +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGFLGYDTVNVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + ++D++ G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSEDLFSVYL--SSDDDSGSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA GC AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADGCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|20129385|ref|NP_609235.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|7297427|gb|AAF52686.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|54650878|gb|AAV37018.1| GH11417p [Drosophila melanogaster]
gi|220951556|gb|ACL88321.1| CG13095-PA [synthetic construct]
gi|220959834|gb|ACL92460.1| CG13095-PA [synthetic construct]
Length = 372
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 123/276 (44%), Positives = 167/276 (60%), Gaps = 16/276 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ + K + N V R + +S+ RT + SLR D L N M+
Sbjct: 19 LHRVPILKEQ----NFVKTRQNVLAEKSYLRTKYQLPSLRS-------VDEEQLSNSMNM 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y S SSTY NG+S
Sbjct: 68 AYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTCK-SDACLTHNQYDSSASSTYVANGESF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G+ S D V + L ++ Q F E+T EP F A FDGILG+ ++ ++V
Sbjct: 127 SIQYGTGSLTGYLSTDTVDVNGLSIQSQTFAESTNEPGTNFNDANFDGILGMAYESLAVD 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P +YNMV+QGLV+ VFSF+ R+ GGE++FGG D Y G TYVP++++GY
Sbjct: 187 GVAPPFYNMVSQGLVDNSVFSFYLARDGTSTMGGELIFGGSDASLYSGALTYVPISEQGY 246
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
WQF M IDG + C C AIAD+GTSL+ P
Sbjct: 247 WQFTMAGSSIDGYS--LC-DDCQAIADTGTSLIVAP 279
>gi|194854120|ref|XP_001968292.1| GG24793 [Drosophila erecta]
gi|190660159|gb|EDV57351.1| GG24793 [Drosophila erecta]
Length = 404
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 111/236 (47%), Positives = 156/236 (66%), Gaps = 4/236 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G IGIGTP Q F V+FDTGS+NLWVPS++C + +AC HS+Y S SST
Sbjct: 84 LGNSMNMYYYGLIGIGTPEQLFKVVFDTGSANLWVPSAQCLATDVACQQHSQYNSSASST 143
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+ +G++ I YGTG++SG+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 144 FVASGQNFSIQYGTGSVSGYLAMDTVTINGLAILNQTFGEAVSQPGASFTDVAFDGILGM 203
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q+I+ VP +YN+ +GL++EPVF F+ RN EGG++ GG D + GE TY
Sbjct: 204 GYQQIAEDFVVPPFYNLYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTY 263
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
PVTQ+GYWQF + ++ +G + GC AIAD+GTSL+A P+ Q+N+ IG
Sbjct: 264 TPVTQQGYWQFAVNNITWNGT---LISSGCQAIADTGTSLIAVPSAAYIQLNNLIG 316
>gi|18859121|ref|NP_571879.1| nothepsin [Danio rerio]
gi|12053847|emb|CAC20112.1| nothepsin [Danio rerio]
Length = 416
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 135/359 (37%), Positives = 202/359 (56%), Gaps = 21/359 (5%)
Query: 9 TAGFFLCLLLFPVVFSTPNGGLYRIGLK-----KRKFDLNNRVAARLDSKEGESF-RTSI 62
+AG L L+L + FST GL R+ L+ + + + ++ L + + F R +
Sbjct: 3 SAGLILILVLH-LGFST---GLLRVALRQYPSVRSRLRASAQLEEFLKQHQPDMFSRRYV 58
Query: 63 RKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY 122
+ + + G L N+MDAQ+FG+I +G P QNFTV+FDTGSS+LWVPSS C
Sbjct: 59 QCFPPAQHFLRPGRRVTERLYNFMDAQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYCV 118
Query: 123 FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
+ AC H+K+++ SSTY +G+ IHYG+G + G + D +K+G + V++Q F EA
Sbjct: 119 -TQACALHNKFKAFESSTYTHDGRVFGIHYGSGHLLGVMARDELKVGSVRVQNQVFGEAV 177
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
EP +F+LA+FDG+LGLGF +++ K PV+ M+ Q ++++PVFSF+ N GG
Sbjct: 178 YEPGFSFVLAQFDGVLGLGFPQLAEEKGSPVFDTMMEQNMLDQPVFSFYLTNNGS-GFGG 236
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT--TGFCAGGCAAIADSGTSL 300
E+VFG D + ++PVTQKGYWQ + V + G + GC AI D+GTSL
Sbjct: 237 ELVFGANDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALSFSDRSVQGCQAIVDTGTSL 296
Query: 301 LAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIINMLLAKVSAKLSLRHYKK 355
+ GP I + IGAT + E C V S +++ L+ V LS Y +
Sbjct: 297 IGGPARDILILQQFIGATPTANGEFVVDCVRVSSL---PVVSFLINSVEYSLSGEQYVR 352
>gi|166361871|gb|ABY87034.1| pepsinogen A1 [Epinephelus coioides]
gi|166361875|gb|ABY87036.1| pepsinogen A1 [Epinephelus coioides]
Length = 376
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 122/259 (47%), Positives = 165/259 (63%), Gaps = 17/259 (6%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLWVPS C S AC H K+ +SST
Sbjct: 60 SMTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS-SQACQNHRKFNPQQSST 118
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
+K + I YGTG+++G + D+V++G + V++Q F I T P + + A DGILG
Sbjct: 119 FKWGDQPLSIQYGTGSMTGHLAIDNVEVGGITVQNQVFGISRTEAPFMAHMTA--DGILG 176
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L FQ I+ VPV+ NMV QGLV++P+FS + + + E+G E+VFGG+D HY G+ T
Sbjct: 177 LAFQTIAADNVVPVFDNMVKQGLVSQPLFSVYLSSHG--EQGSEVVFGGIDSSHYTGQVT 234
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
+VP+T YWQ M V I+GQT CAGGC AI D+GTSL+ GPT I +N +GA+
Sbjct: 235 WVPLTSATYWQIKMDGVKINGQTVA-CAGGCQAIIDTGTSLIVGPTNDINNMNSWVGAS- 292
Query: 320 IVSQECKAVVSQYGEEIIN 338
+QYGE +N
Sbjct: 293 ---------TNQYGESTVN 302
>gi|444725492|gb|ELW66056.1| Gastricsin [Tupaia chinensis]
Length = 389
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 111/254 (43%), Positives = 161/254 (63%), Gaps = 11/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H ++ +SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTNHPRFNPSQSSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+G+ +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPA 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+S+G A M+ +G++ PVFSF+ + E+GG ++FGG+D Y G+ + PVT
Sbjct: 187 LSMGGATTALQGMLQEGVLTSPVFSFYLSNQQGSEDGGAVIFGGVDNSLYSGQIYWAPVT 246
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 247 QELYWQIGIEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSTLLQATGA-----QE 301
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 302 -----DEYGQFLVN 310
>gi|147905812|ref|NP_001079036.1| gastricsin precursor [Xenopus laevis]
gi|12082174|dbj|BAB20797.1| pepsinogen C [Xenopus laevis]
gi|213625030|gb|AAI69665.1| Pepsinogen C [Xenopus laevis]
gi|213626584|gb|AAI69663.1| Pepsinogen C [Xenopus laevis]
Length = 383
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 123/305 (40%), Positives = 175/305 (57%), Gaps = 15/305 (4%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
FL L L + S G+ R+ LKK K + R R + + + + Y+
Sbjct: 3 FLILALVCLQLSE---GIIRVPLKKFK---SMREVMRENGIKAPLVDPATKYYNQYATAY 56
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
E L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWV S+ C S AC H
Sbjct: 57 E-------PLSNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCQ-SQACTNHPL 108
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ +SSTY N + + YGTG+++G D V I ++ + QEF + EP F+ A
Sbjct: 109 FNPSQSSTYSSNQQQFSLQYGTGSLTGILGYDTVTIQNVAISQQEFGLSETEPGTNFVYA 168
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGL + I+VG A V M+ Q L+N+P+F F+ + + + GGE+ FGG+D +
Sbjct: 169 QFDGILGLAYPSIAVGGATTVMQGMMQQNLLNQPIFGFYLSGQSS-QNGGEVAFGGVDQN 227
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+Y G+ + PVT + YWQ + I+GQ TG+C+ GC AI D+GTSLL P ++ + +
Sbjct: 228 YYTGQIYWTPVTSETYWQIGIQGFSINGQATGWCSQGCQAIVDTGTSLLTAPQSVFSSLI 287
Query: 313 HAIGA 317
+IGA
Sbjct: 288 QSIGA 292
>gi|301786583|ref|XP_002928700.1| PREDICTED: pepsin A-like isoform 2 [Ailuropoda melanoleuca]
Length = 393
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 176/294 (59%), Gaps = 31/294 (10%)
Query: 62 IRKYSLRGNLGESG--------------------DADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E G +A ++A L+NYMD +YFG IGIGT
Sbjct: 23 VKKKSLRKNLKEHGLLKDFLKNHSPNPASKYFPQEAAVMATQPLENYMDMEYFGTIGIGT 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
PPQ FTVIFDTGSSNLWVPS C S AC H+++ +SSTY+ ++ I YGTG+++
Sbjct: 83 PPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYEGTSQTVSIAYGTGSMT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + +IS A PV+ NM
Sbjct: 142 GILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPQISSSGATPVFDNMW 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
NQGLV++ +FS + ++D++ G ++FGG+D ++ G +VPV+ +GYWQ M V I
Sbjct: 202 NQGLVSQDLFSVYL--SSDDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSVTI 259
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAV 328
+GQ C+ GC AI D+GTSLLAGPT I + IGA+ G ++ C A+
Sbjct: 260 NGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAI 312
>gi|50419019|ref|XP_458031.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
gi|49653697|emb|CAG86094.1| DEHA2C08074p [Debaryomyces hansenii CBS767]
Length = 416
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 118/250 (47%), Positives = 157/250 (62%), Gaps = 8/250 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q+F VI DTGSSNLWVPS C S+AC+ HSKY SSTY
Sbjct: 94 LTNYLNAQYFTEIQLGTPGQSFKVILDTGSSNLWVPSEDCS-SLACFLHSKYAHDSSSTY 152
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG S I YG+GA+ G+ S+D + IGDL++ Q+F EAT EP L F KFDGILGL
Sbjct: 153 KANGSSFSIQYGSGAMEGYVSQDTLAIGDLIIPKQDFAEATSEPGLAFAFGKFDGILGLA 212
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN + QGL+ EP F+F+ + + +EE+GG FGG+D D Y G+
Sbjct: 213 YNTISVNKIVPPVYNAIEQGLLEEPRFAFYLGDTSKNEEDGGVATFGGIDEDLYTGKVVD 272
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA--- 317
+PV +K YW+ + + + G A D+GTSL+ P+++ +N IGA
Sbjct: 273 LPVRRKAYWEVAFEGIGLGDEYAELTKTGAA--IDTGTSLITLPSSLAEIINSKIGAEKS 330
Query: 318 -TGIVSQECK 326
+G EC+
Sbjct: 331 WSGQYQIECE 340
>gi|195339269|ref|XP_002036242.1| GM12869 [Drosophila sechellia]
gi|194130122|gb|EDW52165.1| GM12869 [Drosophila sechellia]
Length = 372
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 123/276 (44%), Positives = 167/276 (60%), Gaps = 16/276 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ + K + N V R + +S+ RT + SLR D L N M+
Sbjct: 19 LHRVPILKEQ----NFVKTRQNVLAEKSYLRTKYQLPSLRS-------VDEEQLSNSMNM 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTP Q+F V+FD+GSSNLWVPS+ C S AC H++Y S SSTY NG+S
Sbjct: 68 AYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTCQ-SDACKTHNQYDSSASSTYVANGESF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G+ S D V + L ++ Q F E+T EP F A FDGILG+ ++ ++V
Sbjct: 127 SIQYGTGSLTGYLSTDTVDVNGLSIQSQTFAESTNEPGTNFNDANFDGILGMAYESLAVD 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P +YNMV+QGLV+ VFSF+ R+ GGE++FGG D Y G TYVP++++GY
Sbjct: 187 GVAPPFYNMVSQGLVDNSVFSFYLARDGTSTMGGELIFGGSDSSLYSGALTYVPISEQGY 246
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
WQF M IDG + C C AIAD+GTSL+ P
Sbjct: 247 WQFTMAGSSIDGYS--LC-DDCQAIADTGTSLIVAP 279
>gi|19921120|ref|NP_609458.1| CG17134 [Drosophila melanogaster]
gi|7297766|gb|AAF53016.1| CG17134 [Drosophila melanogaster]
gi|17944939|gb|AAL48533.1| RE02351p [Drosophila melanogaster]
gi|220947772|gb|ACL86429.1| CG17134-PA [synthetic construct]
gi|220957078|gb|ACL91082.1| CG17134-PA [synthetic construct]
Length = 391
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 151/238 (63%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ +Y+G I IGTP Q F ++FDTGS+NLWVPS+ C S AC H+KY S SST
Sbjct: 68 LHNSMNNEYYGVIAIGTPEQRFNILFDTGSANLWVPSASCPASNTACQRHNKYDSSASST 127
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG+ I YGTG++SGF S D V I + +++Q F EA EP TF+ A F GILGL
Sbjct: 128 YVANGEEFAIEYGTGSLSGFLSNDIVTIAGISIQNQTFGEALSEPGTTFVDAPFAGILGL 187
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
F I+V P + NM++QGL++EPV SF+ R GGE++ GG+D Y+G TY
Sbjct: 188 AFSAIAVDGVTPPFDNMISQGLLDEPVISFYLKRQGTAVRGGELILGGIDSSLYRGSLTY 247
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
VPV+ YWQF + + +G T C GC AIAD+GTSL+A P ++N +GAT
Sbjct: 248 VPVSVPAYWQFKVNTIKTNG--TLLC-NGCQAIADTGTSLIAVPLAAYRKINRQLGAT 302
>gi|448115983|ref|XP_004202951.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359383819|emb|CCE79735.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 120/256 (46%), Positives = 167/256 (65%), Gaps = 18/256 (7%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L +Y++AQY+ IG+G+P Q F VI DTGSSNLWVPS+ C S+AC+ HSKY SS+Y
Sbjct: 92 LVDYLNAQYYTTIGLGSPAQEFKVILDTGSSNLWVPSTDCS-SLACFLHSKYYHDESSSY 150
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K+NG I YGTG++ G+ S+D + + L ++ Q+F EAT EP LTF AKFDGILGL
Sbjct: 151 KQNGSDFSIQYGTGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLTFAFAKFDGILGLA 210
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN ++QGL++EP F+F+ +++ DE EGG FGG+D HYKG+
Sbjct: 211 YDSISVDNIVPPIYNAIDQGLLDEPKFAFYLGDKDKDENEGGVATFGGVDTKHYKGDIIE 270
Query: 261 VPVTQKGYWQ--FD---MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+PV +K YW+ FD +GD + +TG AAI D+GTSL+ P+++ +N I
Sbjct: 271 LPVRRKAYWEVSFDGIGLGDEYAELTSTG------AAI-DTGTSLITLPSSLAEIINAKI 323
Query: 316 GA----TGIVSQECKA 327
GA +G S +C +
Sbjct: 324 GAKKSWSGQYSVDCDS 339
>gi|301786581|ref|XP_002928699.1| PREDICTED: pepsin A-like isoform 1 [Ailuropoda melanoleuca]
gi|281347483|gb|EFB23067.1| hypothetical protein PANDA_018738 [Ailuropoda melanoleuca]
Length = 385
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 176/294 (59%), Gaps = 31/294 (10%)
Query: 62 IRKYSLRGNLGESG--------------------DADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E G +A ++A L+NYMD +YFG IGIGT
Sbjct: 23 VKKKSLRKNLKEHGLLKDFLKNHSPNPASKYFPQEAAVMATQPLENYMDMEYFGTIGIGT 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
PPQ FTVIFDTGSSNLWVPS C S AC H+++ +SSTY+ ++ I YGTG+++
Sbjct: 83 PPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQQSSTYEGTSQTVSIAYGTGSMT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + +IS A PV+ NM
Sbjct: 142 GILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPQISSSGATPVFDNMW 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
NQGLV++ +FS + ++D++ G ++FGG+D ++ G +VPV+ +GYWQ M V I
Sbjct: 202 NQGLVSQDLFSVYL--SSDDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSVTI 259
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAV 328
+GQ C+ GC AI D+GTSLLAGPT I + IGA+ G ++ C A+
Sbjct: 260 NGQAIA-CSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAI 312
>gi|194862073|ref|XP_001969914.1| GG23678 [Drosophila erecta]
gi|190661781|gb|EDV58973.1| GG23678 [Drosophila erecta]
Length = 392
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 118/258 (45%), Positives = 158/258 (61%), Gaps = 6/258 (2%)
Query: 64 KYSL--RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
KYS+ + S D +L+N M+ +Y+G I IGTP Q F ++FDTGS+NLWVPS+ C
Sbjct: 48 KYSVVSETSFSTSSAGDTESLQNSMNNEYYGVITIGTPQQRFNILFDTGSANLWVPSASC 107
Query: 122 YFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
S AC H+KY S SSTY NG+ I YGTG++SGF S D V I + ++DQ F E
Sbjct: 108 PASNTACQRHNKYNSTASSTYVANGEEFAIEYGTGSLSGFLSTDTVAIAGVTIRDQTFGE 167
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE 240
A EP TF+ A F GILGL F I+ P + NM++QG+++EPV SF+ R
Sbjct: 168 ALSEPGTTFVDAPFAGILGLAFSTIADDGVTPPFDNMISQGVLDEPVISFYLKRQGTAVL 227
Query: 241 GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSL 300
GGE++ GG+D YKG TYVPV+ YWQF + + +G C+ GC AIAD+GTSL
Sbjct: 228 GGELILGGIDSSLYKGSLTYVPVSVPAYWQFTVNTIKTNGVL--LCS-GCQAIADTGTSL 284
Query: 301 LAGPTTIITQVNHAIGAT 318
+ P ++N +GAT
Sbjct: 285 IVAPLAAYKRINRQLGAT 302
>gi|194764262|ref|XP_001964249.1| GF20814 [Drosophila ananassae]
gi|190619174|gb|EDV34698.1| GF20814 [Drosophila ananassae]
Length = 405
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 129/291 (44%), Positives = 173/291 (59%), Gaps = 24/291 (8%)
Query: 27 NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
NG L RI +++ R AA+ E F + +KY+L +G + L NY
Sbjct: 22 NGSLCRIPIQRSPNFKRTRAAAK-----AEMFYIA-KKYNLNVATNSTG---VEQLSNYD 72
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
+ QY+G I IGTP QNF V FDTGSSNLW+PSS+C S +C H++Y S +SSTYK NG
Sbjct: 73 NFQYYGSINIGTPGQNFQVQFDTGSSNLWIPSSQCT-SSSCMVHTRYSSYQSSTYKSNGS 131
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
+I YGTG++SGF S+D V + LV+++Q F E T E FL A FDGILGL F ++
Sbjct: 132 IFNITYGTGSVSGFMSQDVVSVAGLVIRNQTFGEVTSESGSNFLNASFDGILGLAFPMLA 191
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE-EEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
V P + N+++Q +V +PVFSF+ N GGE++ GG DP Y+G+ TYVPV+
Sbjct: 192 VNLVTPFFQNLISQKVVQQPVFSFYLRNNGTTVTYGGELILGGSDPKLYRGKLTYVPVSY 251
Query: 266 KGYWQF-----DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
YWQF MG+ +I + G AAIAD+GTSLL P TQ+
Sbjct: 252 PAYWQFYTDSIQMGNTLI--------STGDAAIADTGTSLLVAPQAEYTQI 294
>gi|283806594|ref|NP_001164550.1| pepsin-3 precursor [Oryctolagus cuniculus]
gi|129783|sp|P27822.1|PEPA3_RABIT RecName: Full=Pepsin-3; AltName: Full=Pepsin A; AltName:
Full=Pepsin III; Flags: Precursor
gi|165598|gb|AAA85370.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 123/281 (43%), Positives = 167/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G L+NY+D +YFG IGIG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTPNLATKYLPKAAFDSVPTETLENYLDTEYFGTIGIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TP Q+FTVIFDTGSSNLWVPS C S AC H+++ SST++ +S I YGTG++
Sbjct: 83 TPAQDFTVIFDTGSSNLWVPSVYCS-SAACSVHNQFNPEDSSTFQATSESLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+GF D VK+G++ +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGFLGYDTVKVGNIEDTNQIFGLSESEPGSFLYYAPFDGILGLAYPSISSSDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV+E +FS + ++D+E G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSEDLFSVYL--SSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA C AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADSCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|162423776|gb|ABX89618.1| pepsinogen [Sparus aurata]
Length = 376
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 118/258 (45%), Positives = 166/258 (64%), Gaps = 15/258 (5%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+FTVIFDTGSSNLWVPS C S AC H K+ +SST
Sbjct: 60 SMTNDADLSYYGVISIGTPPQSFTVIFDTGSSNLWVPSVYCS-SQACQNHKKFNPQQSST 118
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+K + I YGTG+++G+ + D V++G + V +Q F + E + +A DGILGL
Sbjct: 119 FKWGDQQLSIQYGTGSMTGYLASDTVEVGGISVANQVFGISQSEAAFMASMAA-DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+YNMV QGLV++P+FS + + N+ E+G E+VFGG D +HY G+ T+
Sbjct: 178 AFQSIASDNVVPVFYNMVQQGLVSQPMFSVYLSGNS--EQGSEVVFGGTDSNHYTGQITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ Y+Q M V I+GQT C+GGC AI D+GTSL+ GPT+ I +N +GA+
Sbjct: 236 IPLSSATYYQISMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINNMNSWVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
++YGE +N
Sbjct: 293 --------TNEYGEATVN 302
>gi|195386060|ref|XP_002051722.1| GJ17077 [Drosophila virilis]
gi|194148179|gb|EDW63877.1| GJ17077 [Drosophila virilis]
Length = 404
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 114/239 (47%), Positives = 155/239 (64%), Gaps = 4/239 (1%)
Query: 79 IVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGR 137
I L N + Y+G IGIGTPPQ F V+FDTGS+NLWVPS +C + +AC H++Y S
Sbjct: 81 IETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSANLWVPSVQCLPTDVACQNHNQYNSSA 140
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY NG+S I YGTG+++GF S D V I L + Q F EA +P+ +F FDGI
Sbjct: 141 SSTYVANGQSFSIQYGTGSLTGFLSTDTVTINGLSIACQTFGEAISQPNGSFTGVPFDGI 200
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LG+G+ I+V + VP +YN+ QGL++EP F F+ R ++GG++V GG+D + G
Sbjct: 201 LGMGYSTIAVDQVVPPFYNLYEQGLIDEPSFGFYLARTGSAQDGGQLVLGGVDYQLFSGN 260
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
TYVPV+Q+GYWQF + +++G C AIAD+GTSLLA P + TQ+N IG
Sbjct: 261 LTYVPVSQEGYWQFVVTSAVMNGFVV---CSNCQAIADTGTSLLACPGSSYTQLNQLIG 316
>gi|320588396|gb|EFX00865.1| aspartic endopeptidase pep2 [Grosmannia clavigera kw1407]
Length = 401
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 126/272 (46%), Positives = 173/272 (63%), Gaps = 12/272 (4%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTV 105
+ ARL S E F+T++ E+ D + + N+++AQYF EI IGTPPQ+F V
Sbjct: 52 MGARLGSHADEMFKTAVV---------ETDDNHPLPVSNFLNAQYFAEISIGTPPQSFKV 102
Query: 106 IFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDH 165
+ DTGSSNLWVPSS+C SIACY H+KY S SS+YK NG + YG+G++SGF S+D
Sbjct: 103 VLDTGSSNLWVPSSQCG-SIACYLHTKYDSESSSSYKSNGSAFAAQYGSGSLSGFVSQDT 161
Query: 166 VKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNE 225
V IGDL + Q+F EAT EP L F A+FDGILGLGF ISV VP +YN++NQ L++
Sbjct: 162 VSIGDLKIVKQDFAEATEEPGLAFAFARFDGILGLGFDTISVNHIVPPFYNLINQKLIDS 221
Query: 226 PVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGF 285
VF+F+ + + E VFGG+D HY G+ T +P+ +K YW+ D+ + + T
Sbjct: 222 GVFAFYLGNADSDGDDSEAVFGGVDKAHYTGKITTIPLRRKAYWEVDLDSISLGEDTAEL 281
Query: 286 CAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
G I D+GTSL+A P+++ +N IGA
Sbjct: 282 ENTGV--ILDTGTSLIALPSSLAEMLNAQIGA 311
>gi|291409613|ref|XP_002721073.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/281 (42%), Positives = 167/281 (59%), Gaps = 28/281 (9%)
Query: 62 IRKYSLRGNLGESG------------------------DADIVALKNYMDAQYFGEIGIG 97
+RK SLR NL E G +L+NY+D +YFG I IG
Sbjct: 23 VRKKSLRKNLIEKGLLKDYLKTHTPNLATKYFPKETFASVSTESLENYLDTEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ+FTVIFDTGSSNLWVPS+ C S AC H+++ SST++ ++ I YGTG++
Sbjct: 83 TPPQDFTVIFDTGSSNLWVPSTYCS-SAACTVHNRFNPDDSSTFQATSETLSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V +G + +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVNVGSIEDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISASDATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + ++D+E G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 WNEGLVSQDLFSVYL--SSDDESGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+DG+T CA GC AI D+GTSLLAGPT+ I+ + IGA+
Sbjct: 260 MDGETIA-CADGCQAIVDTGTSLLAGPTSAISNIQSYIGAS 299
>gi|150866171|ref|XP_001385673.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
gi|149387427|gb|ABN67644.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
Length = 417
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 155/243 (63%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q F VI DTGSSNLWVPS +C S+AC+ H+KY SSTY
Sbjct: 95 LTNYLNAQYFTEISLGTPAQQFKVILDTGSSNLWVPSQECS-SLACFLHTKYDHDSSSTY 153
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+GA+ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 154 KANGSEFSIQYGSGAMEGYVSQDTLAIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 213
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN + QGL++EP F+F+ + DE +GG FGG D + G+ T+
Sbjct: 214 YNTISVNKIVPPVYNALAQGLLDEPQFAFYLGDTKKDENDGGLATFGGYDESAFTGKITW 273
Query: 261 VPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+PV +K YW+ +GD + TG AAI D+GTSL+ P+++ +N I
Sbjct: 274 LPVRRKAYWEVSFEGIGLGDEYAELDNTG------AAI-DTGTSLITLPSSLAEIINAKI 326
Query: 316 GAT 318
GAT
Sbjct: 327 GAT 329
>gi|410986287|ref|XP_003999442.1| PREDICTED: renin [Felis catus]
Length = 407
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 180/304 (59%), Gaps = 24/304 (7%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
R LK+R D+ ARL ++ + ++ R + G S V L NY+D QY+
Sbjct: 43 RESLKERGVDV-----ARLGAEWSQ--------FTKRFSFGNSTSP--VVLTNYLDTQYY 87
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADI 150
GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S SS+Y +NG + I
Sbjct: 88 GEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSSESSSYMENGTAFAI 147
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
HYG+G + GF S+D V +G + V Q F E T P + F+LAKFDGILG+GF +VG
Sbjct: 148 HYGSGKVKGFLSQDEVTVGGITVT-QTFGEVTELPLIPFMLAKFDGILGMGFPAQAVGGV 206
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
PV+ ++++QG++ E VFS +++RN+ GGE+V GG DP +Y+G YV V++ G
Sbjct: 207 TPVFDHILSQGVLKEDVFSVYYSRNSKNSHLLGGEVVLGGSDPQYYQGNFHYVSVSKTGS 266
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE---- 324
WQ M V + T C GC + D+G S ++GPT+ + + +GA + E
Sbjct: 267 WQIKMKGVSVRSATV-VCEEGCMVVVDTGASYISGPTSSLRLLMETLGAKELSRNEYVVN 325
Query: 325 CKAV 328
CK V
Sbjct: 326 CKQV 329
>gi|149725292|ref|XP_001501875.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 113/237 (47%), Positives = 157/237 (66%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NYMD +YFG I IGTPPQ FTVIFDTGSSNLWVPS+ C S+AC H+++ SSTY
Sbjct: 67 LENYMDEEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSNHNRFNPEDSSTY 125
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ +S I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 126 EATSESVSITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYYAPFDGILGLA 185
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ NM +QGLV++ +FS + ++D+E G ++FGG+D +Y G +V
Sbjct: 186 YPSISASGATPVFDNMWDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDSSYYSGSLNWV 243
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ M + ++G++ C+GGC AI D+GTSLLAGPT+ I + IGA+
Sbjct: 244 PVSNEGYWQITMDSITMNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGAS 299
>gi|206611|gb|AAA42031.1| renin [Rattus norvegicus]
Length = 352
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 156/239 (65%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTP Q F VIFDTGS+NLWVPS+KC AC H+ Y S S
Sbjct: 74 VVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSES 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 134 SSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +V +PV+ ++++Q ++ E VFS +++R + GGE+V GG DP HY+G
Sbjct: 193 GMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRES-HLLGGEVVLGGSDPQHYQGNF 251
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + G T C GC A+ D+GTS ++GPT+ + + A+G
Sbjct: 252 HYVSISKAGSWQITMKGVSV-GPATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALGV 309
>gi|190576563|gb|ACE79054.1| gastricsin precursor (predicted) [Sorex araneus]
Length = 389
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 109/254 (42%), Positives = 159/254 (62%), Gaps = 11/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
Y+DA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ +SSTY N
Sbjct: 68 YLDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPSKSSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + ++ V QEF + EP F+ A+FDGI+G+ +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGDNFVYAQFDGIMGMAYPT 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+++G A M+ G ++ PVFSF+ + ++GG +VFGG+D Y G+ + PVT
Sbjct: 187 LAMGGATTALQGMLQAGALDSPVFSFYLSNQQSSQDGGAVVFGGVDNSLYTGQIFWTPVT 246
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + +I GQ TG+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 247 QELYWQIGVEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSALQQATGAQ------ 300
Query: 325 CKAVVSQYGEEIIN 338
+ QYG+ ++N
Sbjct: 301 ----LDQYGQMVVN 310
>gi|340374170|ref|XP_003385611.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 389
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 177/317 (55%), Gaps = 21/317 (6%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNL 71
F LL S P LYR + R+ R + R +KYS R
Sbjct: 6 LFALLLAVSYATSIP---LYRFKSAYETHRRDGRITMR------DVARGLEQKYS-RNLP 55
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFH 130
G GD + +K+Y+ AQY+G I +GTP Q+F +FDTGSSNLWVPS KC IAC H
Sbjct: 56 GYVGDDE--PMKDYLMAQYYGPISLGTPDQDFNCMFDTGSSNLWVPSKKCGLLDIACRLH 113
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
+KY S +SSTY NG + YG+GA SGFFS D++KIG+ + Q EAT EP + F+
Sbjct: 114 NKYDSTKSSTYIANGTKFSLQYGSGATSGFFSTDNMKIGNSTITKQSIGEATHEPGVAFV 173
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
AKFDGI G+ + IS + P + NM++Q LVN +F + + + GG++ GG +
Sbjct: 174 AAKFDGICGMAYPAISAERQTPFFDNMISQNLVNAGMFGVFLSADTSASLGGDLNLGGPN 233
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTG---FCAGGCAAIADSGTSLLAGPTTI 307
+Y G+ YVP+T K Y+ + +DG G C GGC I D+GTSL+AGPT
Sbjct: 234 EKYYTGDFNYVPLTSKTYYM-----IKVDGMNAGNLSLCDGGCNGIVDTGTSLIAGPTAE 288
Query: 308 ITQVNHAIGATGIVSQE 324
+T++ AIGA ++ E
Sbjct: 289 VTKIATAIGAKSTLAGE 305
>gi|281339451|gb|EFB15035.1| hypothetical protein PANDA_018433 [Ailuropoda melanoleuca]
Length = 388
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 181/318 (56%), Gaps = 40/318 (12%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGE---SGDADIV--------------ALKNYMDAQYF 91
R+ + +S R +R RG L E S + D++ L NY+D +YF
Sbjct: 5 RVPLRRHQSLRKKLRA---RGQLSEFWKSQNLDMIQFTESCTMNQDANEPLINYLDMEYF 61
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G I IG+PPQNFTVIFDTGSSNLWVPS C S AC HS++ +S+TY G I
Sbjct: 62 GTISIGSPPQNFTVIFDTGSSNLWVPSVYCT-SAACKTHSRFYPSQSNTYSVLGSHFSIQ 120
Query: 152 YGTGAISGFFSEDHV----------KIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
YGTG++SG D V ++ LVV Q+F E+ EP TF+ A+FDGILGLG
Sbjct: 121 YGTGSLSGIIGADQVDVTFFWVFSRQVEGLVVVGQQFGESVTEPGQTFVNAEFDGILGLG 180
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG PV+ NM+ Q LV+ P+FS + + + + G E++FGG D H+ G +V
Sbjct: 181 YPSLAVGGVTPVFDNMMAQNLVDIPMFSVYMSSDPEGGAGSELIFGGYDHSHFSGNLHWV 240
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT++GYWQ + + + G FC+ GC AI D+GTSL+ GP+ + Q+ AIGA
Sbjct: 241 PVTKQGYWQIALDAIQVGGAVM-FCSEGCQAIVDTGTSLITGPSDKVKQLQKAIGA---- 295
Query: 322 SQECKAVVSQYGEEIINM 339
+ + +YG E N+
Sbjct: 296 ----EPMDGEYGVECANL 309
>gi|30024582|dbj|BAC75704.1| proteinase A [Candida boidinii]
Length = 420
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 127/262 (48%), Positives = 159/262 (60%), Gaps = 9/262 (3%)
Query: 72 GESGDADIVA-LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
GE D+ A L NYM+AQYF EI +GTP Q F VI DTGSSNLWVP C S+ACY H
Sbjct: 87 GEFKDSKTDAPLTNYMNAQYFTEIQLGTPGQVFKVILDTGSSNLWVPGKDCS-SLACYLH 145
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
SKY SSTYKKNG I YGTG++ GF S D + IGDLV+ DQ F EAT EP LTF
Sbjct: 146 SKYDHDESSTYKKNGTEFAIRYGTGSLEGFVSSDTLTIGDLVIPDQGFAEATSEPGLTFA 205
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE-EGGEIVFGGM 249
KFDGILGL + ISV K VP Y ++ GL+++P FSF+ A E +GG FGG+
Sbjct: 206 FGKFDGILGLAYDTISVQKVVPPVYKAIDSGLLDKPQFSFYLGDTAKSETDGGVATFGGI 265
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D + G+ T++PV +K YW+ V + + G A D+GTSL+A P+ +
Sbjct: 266 DESKFNGKLTWLPVRRKAYWEVAFDGVGLGSEYAPLLNTGAA--IDTGTSLIALPSGLAE 323
Query: 310 QVNHAIGAT----GIVSQECKA 327
+N IGAT G + +C A
Sbjct: 324 ILNSEIGATKSWSGQYTIDCAA 345
>gi|195438439|ref|XP_002067144.1| GK24836 [Drosophila willistoni]
gi|194163229|gb|EDW78130.1| GK24836 [Drosophila willistoni]
Length = 372
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 170/290 (58%), Gaps = 16/290 (5%)
Query: 16 LLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDS-KEGESFRTSIRKYSLRGNLGES 74
++ ++ + + L+R+ + K + N + R + E RT R + R E
Sbjct: 5 FVILSILVALASAELHRVSVHKEQ----NFIKTRENVLAEKAYLRTKYRLPTTRAVNEEQ 60
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYR 134
L N ++ Y+G I IGTP Q+F V+FD+GSSNLW+PS+ C S AC H++Y
Sbjct: 61 -------LSNSLNMAYYGAISIGTPAQSFKVLFDSGSSNLWIPSNTCQ-STACLSHNQYD 112
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
S SSTY NG+S I YGTG+++G+ S D V + L + Q F E+T EP F A F
Sbjct: 113 SSASSTYVANGESFSIQYGTGSLTGYLSTDTVDVNGLKIVSQTFAESTNEPGTNFNNANF 172
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILG+ ++ ++V P +YNMV+Q LV+ VFSF+ R+ +GGE++FGG D Y
Sbjct: 173 DGILGMAYKSLAVDSVTPPFYNMVSQSLVDSSVFSFYLARDGSATDGGELIFGGSDASLY 232
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
G+ TYVP++++GYWQF+M DG T C C AIAD+GTSL+ P
Sbjct: 233 TGDLTYVPISEQGYWQFEMTSASFDGYT--LC-DDCQAIADTGTSLIVAP 279
>gi|118344566|ref|NP_001072055.1| nothepsin precursor [Takifugu rubripes]
gi|55771088|dbj|BAD69804.1| nothepsin [Takifugu rubripes]
Length = 414
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 175/287 (60%), Gaps = 19/287 (6%)
Query: 35 LKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEI 94
L++R+ DL R + G S R + ++S L NYMD Q++GEI
Sbjct: 47 LQERRPDLFQRRYFQCFPATGPSLR--VERFS-------------ETLYNYMDVQFYGEI 91
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
+GTP QNF+V+FDTGSS+LWVPS C H ++++ S++Y+ +G+ +IHYG+
Sbjct: 92 ELGTPGQNFSVVFDTGSSDLWVPSVYCVSQTCGTVHRRFKAFESTSYRHDGRVFEIHYGS 151
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G + G + D +K+ ++ V++QEF E+ EP + F++A FDGILG+G+ ++ PV+
Sbjct: 152 GHMLGIMARDTLKVNNVTVQNQEFGESVYEPGVAFVMAHFDGILGMGYPSLAQILGNPVF 211
Query: 215 YNMVNQGLVNEPVFSFW---FNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
NM+ Q +V EP+FSF+ + R + + GE++ GGMD D + G ++PVT KGYWQ
Sbjct: 212 DNMLAQQMVEEPIFSFYLSKYERFSGSKLQGELLLGGMDQDLFTGPINWLPVTTKGYWQI 271
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ V + G T FC GC AI D+GTSL+AGPT I ++ IGAT
Sbjct: 272 KVDSVAVQGVDT-FCPEGCQAIVDTGTSLIAGPTRDILRLQQLIGAT 317
>gi|348521340|ref|XP_003448184.1| PREDICTED: cathepsin E-A-like [Oreochromis niloticus]
Length = 406
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 121/291 (41%), Positives = 177/291 (60%), Gaps = 18/291 (6%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
GL LK D+ NR A+ S R LG + + + N+MDA
Sbjct: 36 GLLNEFLKDHAPDMINRRYAQCIPSSTPSLR-----------LGRTSER----IYNFMDA 80
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Q++GEI +GTP QNF+VIFDTGS++LWVPS+ C S AC H ++++ +S++++ +G+
Sbjct: 81 QFYGEISLGTPEQNFSVIFDTGSADLWVPSTYC-ISEACALHRRFKAFKSASFRHDGRRF 139
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I+YG+G + G ++D +KIG L + +QEF E+ EPS +FL AKFDG+LG+ +Q ++
Sbjct: 140 GIYYGSGHLLGTMAKDTLKIGGLTILNQEFGESVYEPSESFLTAKFDGVLGMSYQSLAEI 199
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADE-EEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
V+ NM+ Q LV++PVFSF+ +R + + GE++ GG + Y G + PVT KG
Sbjct: 200 LGTNVFDNMIAQKLVDQPVFSFYLSRKSSRTKPAGELLLGGTNEALYIGPINWHPVTAKG 259
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
YWQ M V + G FC GC AI D+GTSL+AGPT I ++ IGAT
Sbjct: 260 YWQIKMDSVAVQG-VNLFCPSGCQAIVDTGTSLIAGPTNDILRLQQLIGAT 309
>gi|50978822|ref|NP_001003117.1| pepsin A preproprotein [Canis lupus familiaris]
gi|73621384|sp|Q9GMY6.1|PEPA_CANFA RecName: Full=Pepsin A; Flags: Precursor
gi|9798660|dbj|BAB11752.1| pepsinogen A [Canis lupus familiaris]
Length = 386
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 127/296 (42%), Positives = 173/296 (58%), Gaps = 31/296 (10%)
Query: 62 IRKYSLRGNLGESG--------------------DADIVA---LKNYMDAQYFGEIGIGT 98
+RK SLR NL E G + ++A LKNYMD +YFG IGIGT
Sbjct: 23 VRKKSLRQNLIEHGLLNDFLKNQSPNPASKYFPQEPTVLATQSLKNYMDMEYFGTIGIGT 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
PPQ FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ + I YGTG+++
Sbjct: 83 PPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQESSTYQGTNRPVSIAYGTGSMT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + +IS A PV+ NM
Sbjct: 142 GILGYDTVQVGGIADTNQIFGLSETEPGSFLYYAPFDGILGLAYPQISASGATPVFDNMW 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
N+GLV++ +FS + ++D++ G ++FGG+D +Y G +VPV+ +GYWQ + V +
Sbjct: 202 NEGLVSQDLFSVYL--SSDDQSGSVVMFGGIDSSYYSGNLNWVPVSVEGYWQITVDSVTM 259
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
+GQ C+ GC AI D+GTSLLAGPT I + IGA+ G + C A+ S
Sbjct: 260 NGQAIA-CSDGCQAIVDTGTSLLAGPTNAIANIQSYIGASQNSYGQMVISCSAINS 314
>gi|73620985|sp|P81498.2|PEPC_SUNMU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C-1; Flags:
Precursor
gi|9798662|dbj|BAB11753.1| pepsinogen C [Suncus murinus]
Length = 389
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 157/250 (62%), Gaps = 5/250 (2%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ +SSTY N
Sbjct: 68 YMDASYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPNQSSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + ++ V QEF + EP F+ A+FDGI+G+ +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTMTVQNIKVPHQEFGLSQNEPGTNFIYAQFDGIMGMAYPS 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+++G A M+ +G + PVFSF+ + + GG ++FGG+D Y G+ + PVT
Sbjct: 187 LAMGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYTGQIFWAPVT 246
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GI 320
Q+ YWQ + + +I GQ TG+C GC AI D+GTSLL P ++ + A GA G
Sbjct: 247 QELYWQIGVEEFLIGGQATGWCQQGCQAIVDTGTSLLTVPQQFMSALQQATGAQQDQYGQ 306
Query: 321 VSQECKAVVS 330
++ C ++ S
Sbjct: 307 LAVNCNSIQS 316
>gi|410082415|ref|XP_003958786.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
gi|372465375|emb|CCF59651.1| hypothetical protein KAFR_0H02420 [Kazachstania africana CBS 2517]
Length = 416
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/238 (47%), Positives = 148/238 (62%), Gaps = 3/238 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY++AQYF +I IG+P Q F VI DTGSSNLWVPS C S+AC+ H+KY SS
Sbjct: 89 VPLNNYLNAQYFADISIGSPGQTFRVIMDTGSSNLWVPSVDCN-SLACFLHNKYDHRVSS 147
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY +NG I YG+GA+ G+ S D V +GDL + Q+F EAT EP L F KFDGI G
Sbjct: 148 TYVRNGTRFAIRYGSGALEGYMSNDTVTVGDLQIPKQDFAEATSEPGLAFAFGKFDGIFG 207
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L F ISV +AVP +YN VN+GL++ P F+F+ +EGGE+ FGG D + G T
Sbjct: 208 LAFDTISVNRAVPPFYNAVNRGLLDAPQFAFYLGDKRLRKEGGEVTFGGYDETRFTGNIT 267
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++PV ++ YW+ D + Q A G A D+GTSL+ P+ + +N IGA
Sbjct: 268 WLPVRREAYWEVDFNGISFGSQYAPLTATGAA--IDTGTSLITLPSGLAEILNAQIGA 323
>gi|146414892|ref|XP_001483416.1| hypothetical protein PGUG_04145 [Meyerozyma guilliermondii ATCC
6260]
gi|146391889|gb|EDK40047.1| hypothetical protein PGUG_04145 [Meyerozyma guilliermondii ATCC
6260]
Length = 408
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 113/238 (47%), Positives = 153/238 (64%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q F VI DTGSSNLWVPS+ C S+AC+ H+KY SSTY
Sbjct: 86 LSNYVNAQYFTEIQLGTPGQTFKVILDTGSSNLWVPSADCS-SLACFLHTKYDHDSSSTY 144
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+GA+ G+ S D + +GDL++ Q+F EAT EP L F KFDGILGL
Sbjct: 145 KANGSEFSIQYGSGAMEGYVSRDTLALGDLIIPRQDFAEATSEPGLAFAFGKFDGILGLA 204
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN ++QGL++EPVF+F + + DE +GG FGG D + G+ T+
Sbjct: 205 YNTISVNKIVPPIYNAIDQGLLDEPVFAFRLGDTSKDENDGGVATFGGYDKSQFTGKITW 264
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ + + + + G A D+GTSL+ P+++ +N IGAT
Sbjct: 265 LPVRRKAYWEVSFEGIGLGDEYAELTSTGAA--IDTGTSLITLPSSLAEIMNTKIGAT 320
>gi|410968030|ref|XP_003990516.1| PREDICTED: pepsin B-like [Felis catus]
Length = 390
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 115/266 (43%), Positives = 168/266 (63%), Gaps = 14/266 (5%)
Query: 76 DADIVALK---NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ D VA + NY+++ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+
Sbjct: 57 NNDAVAYEPFTNYLNSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCK-SQACSNHNT 115
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ SSTY+ NG++ ++YG+G+++ D V + ++V+ +QEF + EPS F A
Sbjct: 116 FNPSMSSTYQNNGQTYTLYYGSGSLTVLLGYDTVTVQNIVIHNQEFGLSEIEPSNPFYYA 175
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILG+ + ++VG + V +M+ QG + P+FSF+F+R E GGE++ GGM+
Sbjct: 176 NFDGILGMAYPNLAVGNSPTVMESMMQQGQLTSPIFSFYFSRQPTYEYGGELILGGMNSQ 235
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y GE + PVT++ YWQ + + ++ Q TG C+ GC AI D+GT +LA P + N
Sbjct: 236 FYSGEIVWTPVTRELYWQVAIDEFLVGNQPTGLCSQGCQAIVDTGTYVLAVPQQYM---N 292
Query: 313 HAIGATGIVSQECKAVVSQYGEEIIN 338
+ ATG A VSQYG+ ++N
Sbjct: 293 SFLQATG-------AEVSQYGDFVVN 311
>gi|348514690|ref|XP_003444873.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 377
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/259 (45%), Positives = 165/259 (63%), Gaps = 17/259 (6%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLWVPS C S AC H+++ +SST
Sbjct: 61 SMTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
++ +S I YGTG+++GF D V++G + V +Q F + T +T++ A DGILG
Sbjct: 120 FQWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILG 177
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L FQ I+ VPV+ M+ +GLV+EP+FS + + N+ E+G E+VFGG D HY G T
Sbjct: 178 LAFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSGNS--EQGSEVVFGGTDSTHYTGTIT 235
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
++P++ YWQ +M V I+GQT C+GGC AI D+GTSL+ GPTT I +N +GA+
Sbjct: 236 WIPLSSATYWQINMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS- 293
Query: 320 IVSQECKAVVSQYGEEIIN 338
Q G+ I+N
Sbjct: 294 ---------TDQSGDAIVN 303
>gi|327271207|ref|XP_003220379.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 162/254 (63%), Gaps = 11/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA Y+G+IGIGTP QNF V+FDTGSSNLWVPS C + AC H+++ +SSTY N
Sbjct: 67 YMDASYYGQIGIGTPAQNFLVLFDTGSSNLWVPSIYCN-TEACTRHARFNPSQSSTYSTN 125
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G ++GFF D + + ++VV +QEF + EP F+ A+FDGILG+ +
Sbjct: 126 GQTFFLQYGSGNLAGFFGYDTLTLQNIVVTNQEFGLSKNEPGANFIYAEFDGILGMAYPS 185
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
++VG A M+ + L+++ VFSF+ +R + + GGE+VFGG+D Y GE + PVT
Sbjct: 186 LAVGGATTALERMLQENLLSQSVFSFYLSRQPNSQYGGEVVFGGVDTRLYSGEIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + I GQ TG+C+ GC AI D+GTSLL P ++ A+GA
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSNFLSAVGAQQ----- 300
Query: 325 CKAVVSQYGEEIIN 338
+QYG+ +N
Sbjct: 301 -----NQYGQYAVN 309
>gi|414887123|tpg|DAA63137.1| TPA: hypothetical protein ZEAMMB73_794362 [Zea mays]
Length = 608
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 98/130 (75%), Positives = 120/130 (92%)
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
YNMV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYKG+HT+VPVT+KGYWQF+MG
Sbjct: 308 YNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMG 367
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGE 334
DV++DG++TGFCAGGCAA+ADSGTSLLAGPT IIT++N IGA G+VSQECK VVSQYG+
Sbjct: 368 DVLVDGKSTGFCAGGCAAVADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQ 427
Query: 335 EIINMLLAKV 344
+I+++LLA+
Sbjct: 428 QILDLLLAET 437
>gi|441648777|ref|XP_003266334.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Nomascus leucogenys]
Length = 388
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/305 (41%), Positives = 181/305 (59%), Gaps = 20/305 (6%)
Query: 39 KFDLNNRVAARLDSKE----GESFRTSIRKYSLRGNLGE-SGDADIVALKNYMDAQYFGE 93
K LN + R KE GE RT + + + G+ S + +A YMDA YFGE
Sbjct: 20 KXPLNEFKSIRETMKEKGLLGEFLRTHKYDPAWKYHFGDLSVSYEPMA---YMDAAYFGE 76
Query: 94 IGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYG 153
+ IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ +SSTY NG++ + YG
Sbjct: 77 VSIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSTNGQTFSLQYG 135
Query: 154 TGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPV 213
+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL + +SV +A
Sbjct: 136 SGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFIYARFDGIMGLAYPALSVDEATTA 195
Query: 214 WYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDM 273
MV +G + PVFSF+ + N + GG +VFGG+D Y G+ + PVTQ+ YWQ +
Sbjct: 196 MQGMVQEGALTSPVFSFYLS-NQEGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGI 254
Query: 274 GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYG 333
+ +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE +YG
Sbjct: 255 EEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE-----DEYG 304
Query: 334 EEIIN 338
+ ++N
Sbjct: 305 QFLVN 309
>gi|149058614|gb|EDM09771.1| renin 1, isoform CRA_a [Rattus norvegicus]
Length = 366
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 156/239 (65%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTP Q F VIFDTGS+NLWVPS+KC AC H+ Y S S
Sbjct: 38 VVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSES 97
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 98 SSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 156
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +V +PV+ ++++Q ++ E VFS +++R + GGE+V GG DP HY+G
Sbjct: 157 GMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRES-HLLGGEVVLGGSDPQHYQGNF 215
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + G T C GC A+ D+GTS ++GPT+ + + A+G
Sbjct: 216 HYVSISKAGSWQITMKGVSV-GPATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALGV 273
>gi|540097|gb|AAB08492.1| preprochymosin, partial [Sus scrofa]
Length = 380
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 183/320 (57%), Gaps = 24/320 (7%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRK--YSLRGNLGES 74
+L V+ + G+ R+ L+K K + R + KE ++K Y+L
Sbjct: 5 VLLAVLALSQGSGITRVPLRKGK-------SLRKELKERGLLEDFLQKQPYALSSKYSSF 57
Query: 75 GDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYR 134
G+ L NY+D QYFG+I IGTPPQ FTV+FDTGSS LWVPS C S AC H ++
Sbjct: 58 GEVASEPLTNYLDTQYFGKIYIGTPPQEFTVVFDTGSSELWVPSVYCK-SDACQNHHRFN 116
Query: 135 SGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
+SST++ K I YGTG+I GF D V + +V Q +T+EPS F ++F
Sbjct: 117 PSKSSTFQNLDKPLSIQYGTGSIQGFLGYDTVMVAGIVDAHQTVGLSTQEPSDIFTYSEF 176
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHY 254
DGILGLG+ E++ VPV+ NM+++ LV + +F+ + +RN +EG + G +DP +Y
Sbjct: 177 DGILGLGYPELASEYTVPVFDNMMHRHLVAQDLFAVYMSRN---DEGSMLTLGAIDPSYY 233
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G +VPVT + YWQF + V I+G C GGC AI D+GTS+LAGP++ I + A
Sbjct: 234 TGSLHWVPVTMQLYWQFTVDSVTINGVVVA-CNGGCQAILDTGTSMLAGPSSDILNIQMA 292
Query: 315 IGATGIVSQECKAVVSQYGE 334
IGAT SQYGE
Sbjct: 293 IGATE----------SQYGE 302
>gi|148747255|ref|NP_036774.4| renin precursor [Rattus norvegicus]
gi|1350571|sp|P08424.2|RENI_RAT RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|30027675|gb|AAP13916.1| renin [Rattus sp.]
gi|51261221|gb|AAH78878.1| Renin [Rattus norvegicus]
gi|149058615|gb|EDM09772.1| renin 1, isoform CRA_b [Rattus norvegicus]
Length = 402
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 156/239 (65%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTP Q F VIFDTGS+NLWVPS+KC AC H+ Y S S
Sbjct: 74 VVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSES 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 134 SSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +V +PV+ ++++Q ++ E VFS +++R + GGE+V GG DP HY+G
Sbjct: 193 GMGFPAQAVDGVIPVFDHILSQRVLKEEVFSVYYSRES-HLLGGEVVLGGSDPQHYQGNF 251
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + G T C GC A+ D+GTS ++GPT+ + + A+G
Sbjct: 252 HYVSISKAGSWQITMKGVSV-GPATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALGV 309
>gi|193735605|gb|ACF20292.1| vacuolar protease A [Trichoderma aureoviride]
gi|226374420|gb|ACO52389.1| vacuolar protease A [Trichoderma aureoviride]
Length = 395
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/243 (50%), Positives = 159/243 (65%), Gaps = 16/243 (6%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IG+PPQ F V+ DTGSSNLWVPS C SIAC+ HS Y S SS
Sbjct: 75 VPVTNFMNAQYFSEITIGSPPQTFKVVLDTGSSNLWVPSQSCN-SIACFLHSTYDSSSSS 133
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYKKNG +IHYG+G+++GF S D V IGDL +K Q+F EAT EP L F +FDGILG
Sbjct: 134 TYKKNGSDFEIHYGSGSLTGFISNDVVTIGDLKIKGQDFAEATSEPGLAFAFGRFDGILG 193
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV VP +Y MVNQ L++EPVF+F+ + +EG FGG+D H+ G+
Sbjct: 194 LGYDTISVNGIVPPFYQMVNQKLLDEPVFAFYL---GNSDEGSVATFGGVDESHFSGKIE 250
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+P+ +K YW+ D+ GD + + + TG AI D+GTSL P+ I +N
Sbjct: 251 YIPLRRKAYWEVDLDSIAFGDEVAELENTG-------AILDTGTSLNVLPSGIAELLNAE 303
Query: 315 IGA 317
IGA
Sbjct: 304 IGA 306
>gi|355329699|dbj|BAL14143.1| pepsinogen 2 [Pagrus major]
Length = 377
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 192/328 (58%), Gaps = 30/328 (9%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEG--ESFRTSIRKYSLRGN 70
+L +L V FS L+R+ L K K A + ++G E FR Y+
Sbjct: 4 WLVVLSALVAFSE---CLHRMPLIKGK------TARQTLQEQGKWEEFRKQ-YPYNPMAK 53
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
+SG ++ N D Y+G + IGTPPQ+FTVIFDTGSSNLW+PS C S AC H
Sbjct: 54 FVQSGTE---SMTNDADLSYYGVVSIGTPPQSFTVIFDTGSSNLWIPSVYCN-SQACQNH 109
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
K+ +SST+K ++ I YGTG+++G+ + D V++G + V +Q F + E +
Sbjct: 110 KKFNPQQSSTFKWGNEALSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISQTEAAFMAS 169
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
+A DGILGL FQ I+ VPV+ NM+ QGLV++P+FS + + N+ E+G E+VFGG D
Sbjct: 170 MAA-DGILGLAFQSIASDNVVPVFDNMIKQGLVSQPMFSVYLSGNS--EQGSEVVFGGTD 226
Query: 251 PDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQ 310
+HY G+ T++P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT I
Sbjct: 227 SNHYTGQITWIPLSSATYWQISMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTNDINN 285
Query: 311 VNHAIGATGIVSQECKAVVSQYGEEIIN 338
+N +GA+ +QYGE +N
Sbjct: 286 MNSWVGAS----------TNQYGEATVN 303
>gi|374431137|gb|AEZ51819.1| pepsin, partial [Oreochromis niloticus]
Length = 339
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 117/259 (45%), Positives = 165/259 (63%), Gaps = 17/259 (6%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLWVPS C S AC H+++ +SST
Sbjct: 23 SMTNDADLSYYGTISIGTPPQSFSVIFDTGSSNLWVPSVYCN-STACENHNQFNPSQSST 81
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
++ +S I YGTG+++GF D V++G + V +Q F + T +T++ A DGILG
Sbjct: 82 FQWGNQSLSIQYGTGSMTGFLGSDTVEVGGISVANQVFGLSQTEASFMTYMQA--DGILG 139
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L FQ I+ VPV+ M+ +GLV+EP+FS + + N+ E+G E+VFGG D HY G T
Sbjct: 140 LAFQSIASDNVVPVFNTMITEGLVSEPIFSVYLSGNS--EQGSEVVFGGTDSTHYTGTIT 197
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
++P++ YWQ +M V I+GQT C+GGC AI D+GTSL+ GPTT I +N +GA+
Sbjct: 198 WIPLSSATYWQINMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTTDINNLNSWVGAS- 255
Query: 320 IVSQECKAVVSQYGEEIIN 338
Q G+ I+N
Sbjct: 256 ---------TDQSGDAIVN 265
>gi|129786|sp|P27678.1|PEPA4_MACFU RecName: Full=Pepsin A-4; AltName: Full=Pepsin I/II; Flags:
Precursor
gi|38071|emb|CAA42425.1| prepropepsin A [Macaca fuscata]
Length = 388
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/298 (42%), Positives = 171/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
+RK SLR NL E G D L+NY+D +YFG IGI
Sbjct: 23 VRKKSLRRNLSEHGLLKDFLKKHNLNPASKYFPQAEAPTLIDEQPLENYLDVEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP QNFTV+FDTGSSNLWVPS CY S+AC H+ + SSTY+ K+ I YGTG+
Sbjct: 83 GTPAQNFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTYRATSKTVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D VK+G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVKVGGISDTNQIFGLSETEPGFFLYFAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQ LV++ +FS + +AD++ G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 IWNQRLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+T CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGKTIA-CAKGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISS 316
>gi|291223845|ref|XP_002731921.1| PREDICTED: expressed hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 959
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 112/237 (47%), Positives = 157/237 (66%), Gaps = 4/237 (1%)
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTY 141
Y+DA Y+GEIGIGTPP F V+FDTGSS LWVPS+ C S +AC FH+ Y + +SSTY
Sbjct: 634 NTYIDASYYGEIGIGTPPATFLVLFDTGSSYLWVPSAMCPESNMACAFHNSYDNLKSSTY 693
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+S +I YG+G++SG S D + IGD+ +++Q F E T P + +LA+FDGILGLG
Sbjct: 694 TATRESFNITYGSGSVSGVISRDTIVIGDVRIENQLFGETTAWPDTSIVLARFDGILGLG 753
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ + +PV+ NM+ Q L++EPVFS + + ++ GE++ GG D HY GE TY+
Sbjct: 754 YPNLQTRSILPVFDNMLAQHLISEPVFSVYVRGDGNK---GELILGGSDQHHYSGEFTYL 810
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PVT KGYWQF M + + + + +C GC A+ D+GTS++AGP I +N IGA
Sbjct: 811 PVTIKGYWQFTMDSIHVYDKPSQYCLDGCQAVVDTGTSVIAGPMEDIETLNTEIGAV 867
>gi|56971213|gb|AAH88063.1| LOC496913 protein, partial [Xenopus (Silurana) tropicalis]
Length = 380
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 113/276 (40%), Positives = 162/276 (58%), Gaps = 9/276 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGD-------ADIVALKNYMDAQYFGEIGIGTPPQ 101
++ K +S R +R++ ++ + + L NYMD Y+GEI IGTPPQ
Sbjct: 16 KVPLKRFKSMREVMREHGIKAPIVDPASKYYNQYATAFEPLANYMDMSYYGEISIGTPPQ 75
Query: 102 NFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
NF V+FDTGSSNLWV S+ C S AC H + +SSTY N + + YGTG+++G
Sbjct: 76 NFLVLFDTGSSNLWVASTNCQ-SQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGIL 134
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
D V I ++ + QEF + EP F+ A+FDGILGL + I+VG A V M+ Q
Sbjct: 135 GYDTVTIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLAYPSIAVGGATTVMQGMLQQN 194
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+NEPVF F+ + + + GGE+ FGG+D ++Y G+ + PVT + YWQ + I+GQ
Sbjct: 195 LLNEPVFGFYLS-GENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQ 253
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+G+C+ GC I D+GTSLL P +I + IGA
Sbjct: 254 ASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGA 289
>gi|301606848|ref|XP_002933026.1| PREDICTED: gastricsin isoform 2 [Xenopus (Silurana) tropicalis]
Length = 382
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 113/276 (40%), Positives = 162/276 (58%), Gaps = 9/276 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGD-------ADIVALKNYMDAQYFGEIGIGTPPQ 101
++ K +S R +R++ ++ + + L NYMD Y+GEI IGTPPQ
Sbjct: 19 KVPLKRFKSMREVMREHGIKAPIVDPASKYYNQYATAFEPLANYMDMSYYGEISIGTPPQ 78
Query: 102 NFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
NF V+FDTGSSNLWV S+ C S AC H + +SSTY N + + YGTG+++G
Sbjct: 79 NFLVLFDTGSSNLWVASTNCQ-SQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGIL 137
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
D V I ++ + QEF + EP F+ A+FDGILGL + I+VG A V M+ Q
Sbjct: 138 GYDTVTIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLAYPSIAVGGATTVMQGMLQQN 197
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+NEPVF F+ + + + GGE+ FGG+D ++Y G+ + PVT + YWQ + I+GQ
Sbjct: 198 LLNEPVFGFYLS-GENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQ 256
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+G+C+ GC I D+GTSLL P +I + IGA
Sbjct: 257 ASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGA 292
>gi|193499291|gb|ACF18588.1| pepsinogen A1 precursor [Siniperca scherzeri]
gi|193499301|gb|ACF18593.1| pepsinogen A1 precursor [Siniperca scherzeri]
Length = 378
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 111/240 (46%), Positives = 159/240 (66%), Gaps = 7/240 (2%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F VIFDTGSSNLWVPS C S AC H K+ G+SST
Sbjct: 63 SMTNDADLSYYGIISIGTPPQSFKVIFDTGSSNLWVPSVYCN-SAACNNHDKFNPGKSST 121
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
Y+ NG I YGTG+++G+ D V +G L VK+Q F + T P + ++ A DGILG
Sbjct: 122 YRNNGSPLTIQYGTGSMTGYLGYDTVTVGGLAVKNQIFGLSQTEAPFMQYMRA--DGILG 179
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + +S A PV+ NM+N+GLVN+ +FS + +A+ ++G + FGG+DP+HY G T
Sbjct: 180 LAYPRLSASGATPVFDNMMNEGLVNQDLFSVYL--SANSQQGSVVTFGGIDPNHYYGSIT 237
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
++P++ + YWQ + V ++GQ C+GGC AI D+GTSL+ GP + I+ +N +GA+G
Sbjct: 238 WIPLSSELYWQITVDSVTVNGQVVA-CSGGCQAIVDTGTSLIVGPQSSISNINSGVGASG 296
>gi|448113357|ref|XP_004202330.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359465319|emb|CCE89024.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 116/256 (45%), Positives = 167/256 (65%), Gaps = 18/256 (7%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY++AQY+ IG+G+P Q F V+ DTGSSNLWVPS+ C S+AC+ H+KY SS+Y
Sbjct: 92 LENYLNAQYYTTIGLGSPVQEFKVVLDTGSSNLWVPSTDCS-SLACFLHTKYDHSESSSY 150
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K+NG I YG+G++ G+ S+D + + L ++ Q+F EAT EP L F AKFDGILGL
Sbjct: 151 KQNGSEFAIRYGSGSLEGYVSQDTLNLAGLTIEKQDFAEATSEPGLAFAFAKFDGILGLA 210
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL++EP F+F+ +++ DE +GG FGG+D HYKG+
Sbjct: 211 YDTISVNNIVPPIYNAINQGLLDEPKFAFYLGDKDKDENDGGVATFGGVDTKHYKGDIVE 270
Query: 261 VPVTQKGYWQ--FD---MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+P+ +K YW+ FD +GD + +TG AAI D+GTSL+ P+++ +N I
Sbjct: 271 LPIRRKAYWEVSFDGIGLGDEYAELTSTG------AAI-DTGTSLITLPSSLAEIINAKI 323
Query: 316 GA----TGIVSQECKA 327
GA +G S +C +
Sbjct: 324 GAKKSWSGQYSVDCDS 339
>gi|301606846|ref|XP_002933025.1| PREDICTED: gastricsin isoform 1 [Xenopus (Silurana) tropicalis]
Length = 383
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 113/276 (40%), Positives = 162/276 (58%), Gaps = 9/276 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGD-------ADIVALKNYMDAQYFGEIGIGTPPQ 101
++ K +S R +R++ ++ + + L NYMD Y+GEI IGTPPQ
Sbjct: 19 KVPLKRFKSMREVMREHGIKAPIVDPASKYYNQYATAFEPLANYMDMSYYGEISIGTPPQ 78
Query: 102 NFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
NF V+FDTGSSNLWV S+ C S AC H + +SSTY N + + YGTG+++G
Sbjct: 79 NFLVLFDTGSSNLWVASTNCQ-SQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGIL 137
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
D V I ++ + QEF + EP F+ A+FDGILGL + I+VG A V M+ Q
Sbjct: 138 GYDTVTIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLAYPSIAVGGATTVMQGMLQQN 197
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+NEPVF F+ + + + GGE+ FGG+D ++Y G+ + PVT + YWQ + I+GQ
Sbjct: 198 LLNEPVFGFYLS-GENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQ 256
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+G+C+ GC I D+GTSLL P +I + IGA
Sbjct: 257 ASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGA 292
>gi|301606850|ref|XP_002933027.1| PREDICTED: gastricsin isoform 3 [Xenopus (Silurana) tropicalis]
Length = 380
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 113/276 (40%), Positives = 161/276 (58%), Gaps = 11/276 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGD-------ADIVALKNYMDAQYFGEIGIGTPPQ 101
++ K +S R +R++ ++ + + L NYMD Y+GEI IGTPPQ
Sbjct: 19 KVPLKRFKSMREVMREHGIKAPIVDPASKYYNQYATAFEPLANYMDMSYYGEISIGTPPQ 78
Query: 102 NFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
NF V+FDTGSSNLWV S+ C S AC H + +SSTY N + + YGTG+++G
Sbjct: 79 NFLVLFDTGSSNLWVASTNCQ-SQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGIL 137
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
D V I ++ + QEF + EP F+ A+FDGILGL + I+VG A V M+ Q
Sbjct: 138 GYDTVTIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLAYPSIAVGGATTVMQGMLQQN 197
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+NEPVF F+ + + GGE+ FGG+D ++Y G+ + PVT + YWQ + I+GQ
Sbjct: 198 LLNEPVFGFYLK---NTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQ 254
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+G+C+ GC I D+GTSLL P +I + IGA
Sbjct: 255 ASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGA 290
>gi|407726061|dbj|BAM46128.1| pepsinogen C [Cynops pyrrhogaster]
Length = 383
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 110/237 (46%), Positives = 148/237 (62%), Gaps = 2/237 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWV S+ C S AC H+ + +SSTY
Sbjct: 60 LSNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS-SSACTNHATFNPSQSSTY 118
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
N + I YGTG+++G D V I + + QEF + EP F+ A+FDGILGL
Sbjct: 119 TSNNQKFSIQYGTGSLTGILGYDTVSIQGITITQQEFALSVNEPGTNFVYAQFDGILGLA 178
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+ A V M+NQGL+++ +F F+ + + GGE+VFGG+D ++Y G+ T+
Sbjct: 179 YPSIAADGATTVMEGMMNQGLLSQNIFGFYLGQQG-SQSGGELVFGGVDSNYYTGQITWT 237
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PVTQ+ YWQ + ++GQ TG+C GC I D+GTSLL P I + IGAT
Sbjct: 238 PVTQQMYWQIGISGFGVNGQPTGWCGQGCQGIVDTGTSLLTAPGQYIAALMQEIGAT 294
>gi|255724976|ref|XP_002547417.1| vacuolar aspartic protease precursor [Candida tropicalis MYA-3404]
gi|240135308|gb|EER34862.1| vacuolar aspartic protease precursor [Candida tropicalis MYA-3404]
Length = 421
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/243 (48%), Positives = 155/243 (63%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q F VI DTGSSNLWVPS C S+AC+ HSKY SS+Y
Sbjct: 98 LTNYLNAQYFTEIELGTPGQPFKVILDTGSSNLWVPSQDCT-SLACFLHSKYDHDASSSY 156
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 157 KANGSEFSIQYGSGSMEGYISQDILTIGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 216
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL+++P F F+ N DE +GG FGG D ++G+ T+
Sbjct: 217 YDTISVNHIVPPIYNAINQGLLDKPQFGFYLGNTEKDENDGGLATFGGYDASLFQGKVTW 276
Query: 261 VPVTQKGYWQ-----FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+PV +K YW+ +GD + Q TG AAI D+GTSL+ P+++ +N I
Sbjct: 277 LPVRRKAYWEVAFEGIGLGDEYAELQKTG------AAI-DTGTSLITLPSSLAEIINAKI 329
Query: 316 GAT 318
GAT
Sbjct: 330 GAT 332
>gi|222425180|dbj|BAH20539.1| pepsinogen A-43 [Pongo abelii]
Length = 388
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/253 (46%), Positives = 161/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+ + SSTY
Sbjct: 68 LENYLDVEYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTY 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D VK+G + +Q F + EP A FDGILGL
Sbjct: 127 KSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFFAPFDGILGLA 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 187 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 245 PVTVEGYWQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 303
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 304 DGDMVVSCSAISS 316
>gi|82469881|gb|ABB77194.1| cathepsin D2-like protein [Schistosoma mansoni]
Length = 401
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 119/277 (42%), Positives = 166/277 (59%), Gaps = 5/277 (1%)
Query: 50 LDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDT 109
L+ + S + I K+ R ++G DI L+NY + +Y+GEI IGTPPQ F VIFDT
Sbjct: 38 LNYQWNLSVKQFINKWLQRLTSSKNG-IDIEYLENYQNIEYYGEISIGTPPQIFHVIFDT 96
Query: 110 GSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI 168
GS LW+PS KC S +AC H KY S +S TYK NG + YGTG SGF S D V I
Sbjct: 97 GSPYLWIPSKKCDPSNLACQLHHKYDSSKSLTYKPNGALFYVQYGTGTASGFLSSDCVHI 156
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG-KAVPVWYNMVNQGLVNEPV 227
G L + DQ F E +P F+ FDGI+G+GFQ+ S P++ NM+ Q LV++P+
Sbjct: 157 GSLNIVDQTFGEVINQPGKVFVNFHFDGIMGMGFQQTSQNSNPTPIFMNMIEQNLVDKPL 216
Query: 228 FSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA 287
F+ + N N D+ GEI+FGG+D +Y G TY V + YW ++ + I+G+ FC
Sbjct: 217 FAVYLNLNEDKTTSGEIMFGGIDDRYYTGNLTYSDVVSEEYWMINIDGISINGEI--FCP 274
Query: 288 GGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
G A+ D+GT+L++GPT I +N +G+ I + E
Sbjct: 275 SGSTALIDTGTALISGPTEKINNINKYLGSIQISNNE 311
>gi|195583376|ref|XP_002081498.1| GD11051 [Drosophila simulans]
gi|194193507|gb|EDX07083.1| GD11051 [Drosophila simulans]
Length = 399
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 169/316 (53%), Gaps = 21/316 (6%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
+L + L PV+F P + + K + + + R R S + IR LR
Sbjct: 2 WLLVSLLPVLFILPVQFQHPVSCKLQLYRVPLR---RFPSARHRFEKLGIRMDRLRLKYA 58
Query: 73 E---------SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF 123
E S L NY+DAQYFG I IGTPPQ F VIFDTGSSNLWVPS+ C
Sbjct: 59 EEVSHFRGDWSSAVKSTPLSNYLDAQYFGPITIGTPPQTFKVIFDTGSSNLWVPSATCAS 118
Query: 124 S-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEAT 182
+ +AC H++Y + RS +++ G IHYG+G++SGF S D V++ L ++DQ F EAT
Sbjct: 119 TMVACRVHNRYFAKRSKSHQARGDRFAIHYGSGSLSGFLSTDTVRVAGLEIRDQTFAEAT 178
Query: 183 REPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGG 242
P FL AKFDGI GL + IS+ + P +Y M+ QGL+ +P+F N
Sbjct: 179 EMPGPIFLAAKFDGIFGLAYHSISMQRIKPPFYAMMEQGLLTKPIF------NMARMMVE 232
Query: 243 EIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLA 302
I FGG +P +Y G TYV V+ + YWQ M +I + C GC I D+GTS LA
Sbjct: 233 PIFFGGSNPHYYTGNFTYVQVSHRAYWQVKMDSAVI--RNLELCQQGCEVIIDTGTSFLA 290
Query: 303 GPTTIITQVNHAIGAT 318
P +N +IG T
Sbjct: 291 LPYDQAILINESIGGT 306
>gi|222425184|dbj|BAH20541.1| pepsinogen A-14 [Pongo abelii]
Length = 388
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/253 (46%), Positives = 161/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+ + SSTY
Sbjct: 68 LENYLDVEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTY 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D VK+G + +Q F + EP A FDGILGL
Sbjct: 127 KSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFFAPFDGILGLA 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 187 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 245 PVTVEGYWQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 303
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 304 NGDMVVSCSAISS 316
>gi|410045159|ref|XP_001145764.3| PREDICTED: pepsin A-5 isoform 1 [Pan troglodytes]
Length = 434
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/254 (46%), Positives = 161/254 (63%), Gaps = 8/254 (3%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+ + SST
Sbjct: 67 PLENYLDVEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSST 125
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK K+ I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 126 YKSTSKTVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLFFAPFDGILGL 185
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +
Sbjct: 186 AYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNW 243
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
VPVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 244 VPVTVEGYWQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASEN 302
Query: 319 --GIVSQECKAVVS 330
G + C A+ S
Sbjct: 303 SDGDMVVSCSAISS 316
>gi|224458278|ref|NP_001138942.1| pepsinogen A precursor [Pongo abelii]
gi|222425178|dbj|BAH20538.1| pepsinogen A-75 [Pongo abelii]
Length = 388
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 117/253 (46%), Positives = 161/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+ + SSTY
Sbjct: 68 LENYLDVEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTY 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D VK+G + +Q F + EP A FDGILGL
Sbjct: 127 KSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFFAPFDGILGLA 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 187 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 245 PVTVEGYWQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 303
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 304 DGDMVVSCSAISS 316
>gi|253762217|gb|ACT35560.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 115/258 (44%), Positives = 167/258 (64%), Gaps = 17/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLW+PS C S AC H ++ +S+T+
Sbjct: 61 MTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTTF 119
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G+ + D V++G + V +Q F I T P + ++ A DGILGL
Sbjct: 120 KWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAYMQA--DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV QGLV++P+FS + + N+ E+G E+VFGG+D HY G+ T+
Sbjct: 178 AFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSSNS--EQGSEVVFGGIDSSHYTGQITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I +N +GA+
Sbjct: 236 IPLSSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE +++
Sbjct: 293 --------TNQYGEAVVS 302
>gi|195134382|ref|XP_002011616.1| GI11126 [Drosophila mojavensis]
gi|193906739|gb|EDW05606.1| GI11126 [Drosophila mojavensis]
Length = 421
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 111/245 (45%), Positives = 153/245 (62%), Gaps = 4/245 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L N ++ +Y+G I IGTPPQ F V+FDTGSSNLWVPS +C S +C H + S+TY
Sbjct: 74 LVNDVNMEYYGIITIGTPPQTFNVLFDTGSSNLWVPSIQCA-SPSCQDHMSFNPTLSTTY 132
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + + YG+G +SGF D + + LVV +Q F AT E + TF+ FDGILG+G
Sbjct: 133 RYTNEMITLEYGSGGMSGFLGIDVINVSGLVVANQTFGLATTELNNTFVRDGFDGILGMG 192
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++V VP +YNM+ QGL+ PVFSF+ RN ++GGE++FGG DP YKG TY
Sbjct: 193 YASLAVDNVVPPFYNMLAQGLIANPVFSFYLARNGTSQQGGELIFGGSDPSLYKGSMTYA 252
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
+TQ+ YWQF+M ++GQ C CAAIAD+GTSLL PT I ++ +G
Sbjct: 253 DITQQNYWQFNMDSATLNGQV--LCT-NCAAIADTGTSLLVAPTDIYNKIKVVLGVNTDD 309
Query: 322 SQECK 326
+ +C
Sbjct: 310 TIDCS 314
>gi|345802472|ref|XP_854465.2| PREDICTED: pepsin B-like [Canis lupus familiaris]
Length = 390
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 112/266 (42%), Positives = 166/266 (62%), Gaps = 14/266 (5%)
Query: 76 DADIVALK---NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ D VA + NY+++ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+
Sbjct: 57 NNDAVAYEPFTNYLNSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNT 115
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ SSTY+ NG++ ++YG+G+++ D V + ++V+ +QEF + EPS F A
Sbjct: 116 FNPSSSSTYRNNGQTYTLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYA 175
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILG+ + ++VG + V +MV QG + +P+FSF+F+R E GGE++ GG+D
Sbjct: 176 NFDGILGMAYPNLAVGDSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQ 235
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y GE + PVT++ YWQ + + +++ Q TG C+ GC AI D+GT +LA P +
Sbjct: 236 FYSGEIVWAPVTREMYWQVAIDEFLVNNQATGLCSQGCQAIVDTGTYVLAVPQQFM---- 291
Query: 313 HAIGATGIVSQECKAVVSQYGEEIIN 338
G QE A +Q G+ ++N
Sbjct: 292 ------GSFLQETGAQEAQNGDFVVN 311
>gi|224458280|ref|NP_001138943.1| gastricsin precursor [Pongo abelii]
gi|222425206|dbj|BAH20552.1| pepsinogen C [Pongo abelii]
Length = 388
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 115/254 (45%), Positives = 159/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFSF+ + N GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDEATTAMQGMVQEGALTSPVFSFYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|73621391|sp|Q9GMY4.1|PEPC_SORUN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798664|dbj|BAB11754.1| pepsinogen C [Sorex unguiculatus]
Length = 389
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 120/299 (40%), Positives = 177/299 (59%), Gaps = 8/299 (2%)
Query: 32 RIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYF 91
++ LKK K + R R GE RT + + + G+ A Y+DA YF
Sbjct: 20 KVPLKKFK---SIRETLREQGLLGEFLRTHPYDPAQKYHFGDFSVA--YEPMAYLDAAYF 74
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
GEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ +SSTY NG++ +
Sbjct: 75 GEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTGHARFNPSKSSTYSTNGQTFSLQ 133
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YG+G+++GFF D + + ++ V QEF + EP F+ A+FDGI+G+ + +++G A
Sbjct: 134 YGSGSLTGFFGYDTMTLQNIKVPHQEFGLSQNEPGENFVYAQFDGIMGMAYPTLAMGGAT 193
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
M+ G ++ PVFSF+ + ++GG +VFGG+D Y G+ + PVTQ+ YWQ
Sbjct: 194 TALQGMLQAGALDSPVFSFYLSNQQSSKDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQI 253
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVS 330
+ +I GQ TG+C+ GC AI D+GTSLL P ++ + A GA + Q+ + VV+
Sbjct: 254 GVEQFLIGGQATGWCSQGCQAIVDTGTSLLTVPQQYLSALQQATGAQ--LDQDGQMVVN 310
>gi|406608071|emb|CCH40505.1| Saccharopepsin [Wickerhamomyces ciferrii]
Length = 401
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 121/254 (47%), Positives = 160/254 (62%), Gaps = 17/254 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQY+ EI IGTP Q F VI DTGSSNLWVPS+ C S+ACY HSKY SSTY
Sbjct: 78 LTNYLNAQYYTEIQIGTPGQPFKVILDTGSSNLWVPSTDCS-SLACYLHSKYDHEASSTY 136
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D +++GDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 137 KANGSDFAIRYGSGSLEGYVSQDTLQLGDLVIPKQDFAEATSEPGLAFAFGKFDGILGLA 196
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ISV K VP Y +N GL++EP F+F+ E+GG FGG+D Y G+ T++
Sbjct: 197 YDTISVNKIVPPVYKALNSGLLDEPKFAFYLGDADKTEDGGVATFGGIDESKYTGKITWL 256
Query: 262 PVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
PV +K YW+ +GD + + TG AAI D+GTSL+A P+ + +N IG
Sbjct: 257 PVRRKAYWEVKFNGIGLGDEFAELENTG------AAI-DTGTSLIALPSGLAEILNSEIG 309
Query: 317 A----TGIVSQECK 326
A +G S +C+
Sbjct: 310 AKKGWSGQYSVDCE 323
>gi|74136511|ref|NP_001028152.1| gastricsin precursor [Monodelphis domestica]
gi|73621388|sp|Q689Z7.1|PEPC_MONDO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|51534970|dbj|BAD36918.1| pepsinogen C [Monodelphis domestica]
Length = 391
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 109/245 (44%), Positives = 159/245 (64%), Gaps = 4/245 (1%)
Query: 76 DADIVA---LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ D VA + NY+D+ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H++
Sbjct: 58 NKDAVAYEPITNYLDSFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNR 116
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ +SST+ G++ + YG+G+++ D V + ++VV +QEF + EP+ F +
Sbjct: 117 FSPSQSSTFTNGGQTYTLSYGSGSLTVVLGYDTVTVQNIVVSNQEFGLSESEPTSPFYYS 176
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILG+ + ++VG + V M+ QG ++EP+FSF+F+R + GGE++ GG+DP
Sbjct: 177 DFDGILGMAYPAMAVGNSPTVMQGMLQQGQLSEPIFSFYFSRQPTHQYGGELILGGVDPQ 236
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y G+ T+ PVTQ+ YWQ + + I Q TG+C+ GC AI D+GT LLA P ++
Sbjct: 237 LYSGQITWTPVTQEVYWQIGIEEFAIGNQATGWCSQGCQAIVDTGTFLLAVPQQYMSAFL 296
Query: 313 HAIGA 317
A GA
Sbjct: 297 QATGA 301
>gi|57046|emb|CAA30082.1| unnamed protein product [Rattus norvegicus]
Length = 402
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 155/239 (64%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTP Q F VIFDTGS+NLWVPS+KC AC H+ Y S S
Sbjct: 74 VVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSES 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 134 SSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF V +PV+ ++++Q ++ E VFS +++R + GGE+V GG DP HY+G
Sbjct: 193 GMGFPAQVVDGVIPVFDHILSQRVLKEEVFSVYYSRES-HLLGGEVVLGGSDPQHYQGNF 251
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + G T C GC A+ D+GTS ++GPT+ + + A+G
Sbjct: 252 HYVSISKAGSWQITMKGVSL-GPATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALGV 309
>gi|413942271|gb|AFW74920.1| hypothetical protein ZEAMMB73_522985 [Zea mays]
Length = 468
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 99/129 (76%), Positives = 120/129 (93%)
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
YNMV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYKG+HT+VPVT+KGYWQF+MG
Sbjct: 168 YNMVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMG 227
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGE 334
DV++DG++TGFCAGGCAAIADSGTSLLAGPT IIT++N IGA G+VSQECK VVSQYG+
Sbjct: 228 DVLVDGKSTGFCAGGCAAIADSGTSLLAGPTAIITEINEKIGAAGVVSQECKTVVSQYGQ 287
Query: 335 EIINMLLAK 343
+I+++LLA+
Sbjct: 288 QILDLLLAE 296
>gi|195433873|ref|XP_002064931.1| GK19045 [Drosophila willistoni]
gi|194161016|gb|EDW75917.1| GK19045 [Drosophila willistoni]
Length = 411
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 112/242 (46%), Positives = 154/242 (63%), Gaps = 4/242 (1%)
Query: 77 ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRS 135
A I L N M+ YFG IGIGTP Q F V+FDTGS+NLW+PS C + +AC H++Y S
Sbjct: 86 ATIEELGNSMNMYYFGTIGIGTPEQYFNVVFDTGSANLWIPSVHCASTDVACQQHNQYNS 145
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SSTY + ++ I YGTG+++GF + D V I L + +Q F EA +P +F FD
Sbjct: 146 SASSTYVASSQNFSIEYGTGSVTGFLAIDTVTINGLSIANQTFGEAITQPGSSFENVAFD 205
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYK 255
GILG+ +Q I+V VP +YN+ QGL++EPVF F+ RN +GGE++ GG D Y
Sbjct: 206 GILGMAYQTIAVDTVVPPFYNLYEQGLIDEPVFGFYLGRNGTATDGGELILGGSDESLYV 265
Query: 256 GEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
G +YVPV+Q+GYWQF + ++ +G C AIAD+GTSL+A P + +Q+N I
Sbjct: 266 GNLSYVPVSQQGYWQFAVNNITWNGTVV---CDNCQAIADTGTSLIACPFSAYSQLNQLI 322
Query: 316 GA 317
GA
Sbjct: 323 GA 324
>gi|195159708|ref|XP_002020720.1| GL15705 [Drosophila persimilis]
gi|194117670|gb|EDW39713.1| GL15705 [Drosophila persimilis]
Length = 408
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 116/262 (44%), Positives = 159/262 (60%), Gaps = 16/262 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI-ACYFHSKYRSGRSST 140
L N + +Y+ + IGTPPQ F ++ DTGSSNLWVPSSKC ++ +C H++Y S SS+
Sbjct: 81 LGNAFNTEYYLPVTIGTPPQEFILLIDTGSSNLWVPSSKCPATVKSCVSHNQYDSKSSSS 140
Query: 141 YKKNGKSADIHYGTG-----AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
Y NG + I Y + A+SG S+D V I +L ++ Q F E T EP TFL + FD
Sbjct: 141 YVANGTAFTIEYASKSEGGVALSGILSQDTVTIAELAIQRQVFAEITDEPEATFLSSPFD 200
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGGMDPDHY 254
G+ GLG+ IS+G P +YN+V QGL+ PVFS + NRN + +GGE+V GG+D +
Sbjct: 201 GMFGLGYASISIGGVTPPFYNLVAQGLIKHPVFSIYLNRNGTNATDGGELVLGGIDATLF 260
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G TYVPV+Q+GYWQF M ++ G+T FC C AI D GTSLL PT I ++N
Sbjct: 261 SGCLTYVPVSQQGYWQFVMTSAVLGGKT--FCT-HCQAILDVGTSLLVAPTAAIKKINQL 317
Query: 315 IG------ATGIVSQECKAVVS 330
+ A+G+ C + S
Sbjct: 318 LAVLNPKDASGVFLVNCSTIAS 339
>gi|193499289|gb|ACF18587.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|193499299|gb|ACF18592.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|210076783|gb|ACJ06745.1| pepsinogen [Siniperca chuatsi]
gi|312369796|gb|ADQ74489.1| pepsinogen [Siniperca chuatsi]
Length = 378
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 111/240 (46%), Positives = 158/240 (65%), Gaps = 7/240 (2%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F VIFDTGSSNLWVPS C S AC H K+ G+SST
Sbjct: 63 SMTNDADLSYYGIISIGTPPQSFKVIFDTGSSNLWVPSIYCN-SAACNNHDKFNPGKSST 121
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
Y+ NG I YGTG+++G+ D V +G L VK+Q F + T P + ++ A DGILG
Sbjct: 122 YRNNGSPLTIQYGTGSMTGYLGYDTVTVGGLAVKNQIFGLSQTEAPFMQYMRA--DGILG 179
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + +S A PV+ NM+N+GLVN+ +FS + +A+ ++G + FGG+DP+HY G T
Sbjct: 180 LAYPRLSASGATPVFDNMMNEGLVNQDLFSVYL--SANSQQGSVVTFGGVDPNHYYGSIT 237
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
++P++ + YWQ + V ++GQ C+GGC AI D+GTSL+ GP I+ +N +GA+G
Sbjct: 238 WIPLSSELYWQITVDSVTVNGQVVA-CSGGCQAIVDTGTSLIVGPQNSISNINSGVGASG 296
>gi|254572447|ref|XP_002493333.1| Vacuolar aspartyl protease (proteinase A) [Komagataella pastoris
GS115]
gi|238033131|emb|CAY71154.1| Vacuolar aspartyl protease (proteinase A) [Komagataella pastoris
GS115]
gi|328352648|emb|CCA39046.1| vacuolar aspartic proteinase precursor similar to S. cerevisiae
PEP4 (YPL154C) [Komagataella pastoris CBS 7435]
Length = 410
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 113/238 (47%), Positives = 154/238 (64%), Gaps = 4/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF E+ +GTPPQ+F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 88 LTNYLNAQYFTEVSLGTPPQSFKVILDTGSSNLWVPSKDC-GSLACFLHAKYDHDESSTY 146
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
KKNG S +I YG+G++ G+ S+D ++IGDL + +F EAT EP L F KFDGILGL
Sbjct: 147 KKNGSSFEIRYGSGSMEGYVSQDVLQIGDLTIPKVDFAEATSEPGLAFAFGKFDGILGLA 206
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP Y + L++EP F+F+ + + DE +GG FGG+D Y+G+ T+
Sbjct: 207 YDSISVNKIVPPIYKALELDLLDEPKFAFYLGDTDKDESDGGLATFGGVDKSKYEGKITW 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+PV +K YW+ V + + G A D+GTSL+A P+ + +N IGAT
Sbjct: 267 LPVRRKAYWEVSFDGVGLGSEYAELQKTGAA--IDTGTSLIALPSGLAEILNAEIGAT 322
>gi|109287596|emb|CAJ55260.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 109/251 (43%), Positives = 161/251 (64%), Gaps = 6/251 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L N+ D QY+GEI IGTP Q F V+FDTGSSNLWVPS +C AC H++Y S SST
Sbjct: 71 LTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSRQCSPLYSACVSHNRYDSSESST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK G + Y G I GFFS+D V++ D+ + Q F EA PS+ F+ A+FDG+LG+
Sbjct: 131 YKPKGTKITLTYAQGYIKGFFSQDIVRVADIPII-QFFTEAIALPSIPFIFARFDGVLGM 189
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ + ++G +PV+ N++++ +++E VFS +++R+++ GGEI+ GG DP HY G+ Y
Sbjct: 190 GYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSDPSHYTGDFHY 249
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT-- 318
V +++GYW D+ V I+ + C GC A D+GTS ++GP + I+ + IGAT
Sbjct: 250 VSTSREGYWHVDLKGVSIENKIV-LCHDGCTATIDTGTSFISGPASSISVLMETIGATLS 308
Query: 319 -GIVSQECKAV 328
G +CK +
Sbjct: 309 DGDYVIDCKKI 319
>gi|1246038|gb|AAB35842.1| pepsinogen A [turtles, Peptide, 361 aa]
Length = 361
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 177/287 (61%), Gaps = 18/287 (6%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA--------------LKNYMDAQYF 91
+ ++ K+G+S R +++++ L + + + + L NYMDA+YF
Sbjct: 1 LVTKVPLKKGKSLRQNLKEHGLLEDFNKKHPYNPASRYFPSLGDEFATEPLTNYMDAEYF 60
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G I IGTP Q+FTV+FDTGSSNLWVPS C S AC H+++ SSTY+ ++ I
Sbjct: 61 GTISIGTPAQDFTVVFDTGSSNLWVPSVTCS-SAACTQHNRFNPSDSSTYRATSQNLSIQ 119
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG+++G D+V++G LV +Q F + EP TF A DGILGL + I+ A
Sbjct: 120 YGTGSMTGILGYDNVQVGGLVDTNQIFGLSETEPGSTFYYAPMDGILGLAYPSIASSGAT 179
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM+++GLV++ +FS + + +DE+ G ++FGG D +Y G ++P++ + YW+
Sbjct: 180 PVFDNMMSEGLVSQDLFSVYLS--SDEQSGSFVMFGGNDTSYYSGSLNWIPLSAETYWEI 237
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
M + ++GQT C+GGC AI D+GTSLLAGP + ++ +N IGA+
Sbjct: 238 TMDSITMNGQTIA-CSGGCQAIIDTGTSLLAGPPSDVSNINSYIGAS 283
>gi|73621390|sp|Q9GMY3.1|PEPC_RHIFE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798666|dbj|BAB11755.1| pepsinogen C [Rhinolophus ferrumequinum]
Length = 389
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 109/254 (42%), Positives = 160/254 (62%), Gaps = 11/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C + AC H+++ +SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-TQACTGHTRFNPSQSSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+G+ +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGMAYPS 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+++G A M+ +G + PVFSF+ + + GG ++FGG+D Y+G+ + PVT
Sbjct: 187 LAMGGATTALQGMLQEGALTSPVFSFYLSNQQGSQNGGAVIFGGVDNSLYQGQIYWAPVT 246
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 247 QELYWQIGIEEFLIGGQASGWCSQGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 301
Query: 325 CKAVVSQYGEEIIN 338
QYG+ +N
Sbjct: 302 -----DQYGQFFVN 310
>gi|194218271|ref|XP_001501895.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 224 bits (571), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 173/288 (60%), Gaps = 8/288 (2%)
Query: 30 LYRIGL-KKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
+Y++ L KK+ N R L+ + R KY + + AD L+NYMD
Sbjct: 17 IYKVPLVKKKSLRQNLRENGLLEDFLKQHPRNPASKYFPKE---AATLADTQPLENYMDE 73
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
YFG I IGTP Q FTVIFDTGSSNLWVPS C S+AC H+++ SSTY+ +S
Sbjct: 74 AYFGTISIGTPAQEFTVIFDTGSSNLWVPSIYCS-SLACSDHNRFNPEDSSTYRATSESV 132
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G D V++G + +Q F + EP A FDGILGL + IS
Sbjct: 133 SITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISAS 192
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV+ N+ +QGLV++ +FS + ++D+E G ++FGG+DP +Y G +VPV+ +GY
Sbjct: 193 GATPVFDNIWDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDPSYYTGSLHWVPVSNEGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
WQ M V ++G++ C+GGC AI D+GTSLLAGPT+ I + +G
Sbjct: 251 WQITMDSVTVNGESIA-CSGGCQAIVDTGTSLLAGPTSAIDNIQSYLG 297
>gi|206609|gb|AAA42030.1| preprorenin (EC 3.4.99.19) [Rattus norvegicus]
Length = 402
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 155/239 (64%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTP Q F VIFDTGS+NLWVPS+KC AC H+ Y S S
Sbjct: 74 VVLTNYLDTQYYGEIGIGTPSQTFKVIFDTGSANLWVPSTKCGPLYTACEIHNLYDSSES 133
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 134 SSYMENGTEFTIHYGSGKVKGFLSQDVVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 192
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +V +PV+ ++++ ++ E VFS +++R + GGE+V GG DP HY+G
Sbjct: 193 GMGFPAQAVDGVIPVFDHILSHEVLKEEVFSVYYSRES-HLLGGEVVLGGSDPQHYQGNF 251
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + G T C GC A+ D+GTS ++GPT+ + + A+G
Sbjct: 252 HYVSISKAGSWQITMKGVSV-GPATLLCEEGCMAVVDTGTSYISGPTSSLQLIMQALGV 309
>gi|256075652|ref|XP_002574131.1| subfamily A1A unassigned peptidase (A01 family) [Schistosoma
mansoni]
gi|360043433|emb|CCD78846.1| subfamily A1A unassigned peptidase (A01 family) [Schistosoma
mansoni]
Length = 401
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 118/277 (42%), Positives = 166/277 (59%), Gaps = 5/277 (1%)
Query: 50 LDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDT 109
L+ + S + I K+ R ++G DI L+NY + +Y+GEI IGTPPQ F VIFDT
Sbjct: 38 LNYQWNLSVKQFINKWLQRLTSSKNG-IDIEYLENYQNIEYYGEISIGTPPQIFHVIFDT 96
Query: 110 GSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKI 168
GS LW+PS KC S +AC H KY S +S TYK NG + YGTG SGF S D V I
Sbjct: 97 GSPYLWIPSKKCDPSNLACQLHHKYDSSKSLTYKPNGALFYVQYGTGTASGFLSSDCVHI 156
Query: 169 GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG-KAVPVWYNMVNQGLVNEPV 227
G L + DQ F E +P F+ FDGI+G+GFQ+ S P++ NM+ Q LV++P+
Sbjct: 157 GSLNIVDQTFGEVINQPGKVFVNFHFDGIMGMGFQQTSQNSNPTPIFMNMIEQNLVDKPL 216
Query: 228 FSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCA 287
F+ + N N D+ GEI+FGG+D +Y G TY V + YW ++ + I+G+ FC
Sbjct: 217 FAVYLNLNEDKTTSGEIMFGGIDDRYYTGNLTYSDVVSEEYWMINIDGISINGEI--FCP 274
Query: 288 GGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
G A+ D+GT+L++GPT I +N +G+ + + E
Sbjct: 275 SGSTALIDTGTALISGPTEKINNINKYLGSIQMSNNE 311
>gi|223891|prf||1004236A renin
Length = 336
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 159/239 (66%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC HS Y S S
Sbjct: 10 VVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDS 69
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G + V Q F E T P + F+LA+FDG+L
Sbjct: 70 SSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTELPLIPFMLAQFDGVL 128
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG DP+HY+G+
Sbjct: 129 GMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGSDPEHYQGDF 187
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ + + A+GA
Sbjct: 188 GYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGA 245
>gi|440903924|gb|ELR54511.1| Renin, partial [Bos grunniens mutus]
Length = 404
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 169/298 (56%), Gaps = 11/298 (3%)
Query: 23 FSTP--NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
FS P RI LKK V L + + R L L V
Sbjct: 22 FSLPADTAAFRRIFLKKMP-----SVRESLKERGVDMARLGAEWSQLTKTLSFGNRTSPV 76
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSS 139
L NY+D QY+GEIGIGTPPQ F V+FDTGS+NLWVPS+KC AC HS Y S SS
Sbjct: 77 VLTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLYDSLESS 136
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+Y +NG IHYG+G + GF S+D V +G + V Q F E T P L F+LAKFDG+LG
Sbjct: 137 SYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAKFDGVLG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF +VG PV+ +++ Q ++ + VFS +++RN+ GGEIV GG DP +Y+
Sbjct: 196 MGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNS-HLLGGEIVLGGSDPQYYQENFH 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + TT C GC I D+G S ++GPT+ + + A+GA
Sbjct: 255 YVSISKPGSWQIRMKGVSVR-STTLLCEEGCMVIVDTGASYISGPTSSLRLLMEALGA 311
>gi|330688453|ref|NP_001193438.1| renin precursor [Bos taurus]
Length = 398
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 169/298 (56%), Gaps = 11/298 (3%)
Query: 23 FSTP--NGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
FS P RI LKK V L + + R L L V
Sbjct: 16 FSLPADTAAFRRIFLKKMP-----SVRESLKERGVDMARLGAEWSQLTKTLSFGNRTSPV 70
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSS 139
L NY+D QY+GEIGIGTPPQ F V+FDTGS+NLWVPS+KC AC HS Y S SS
Sbjct: 71 VLTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLYDSLESS 130
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+Y +NG IHYG+G + GF S+D V +G + V Q F E T P L F+LAKFDG+LG
Sbjct: 131 SYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAKFDGVLG 189
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF +VG PV+ +++ Q ++ + VFS +++RN+ GGEIV GG DP +Y+
Sbjct: 190 MGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRNS-HLLGGEIVLGGSDPQYYQENFH 248
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ G WQ M V + TT C GC I D+G S ++GPT+ + + A+GA
Sbjct: 249 YVSISKPGSWQIRMKGVSVR-STTLLCEEGCMVIVDTGASYISGPTSSLRLLMEALGA 305
>gi|871442|emb|CAA25391.1| renin [Mus musculus]
Length = 387
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 112/241 (46%), Positives = 157/241 (65%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY++ QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC HS Y S S
Sbjct: 56 VVLTNYLNTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDS 115
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDL--VVKDQEFIEATREPSLTFLLAKFDG 196
S+Y +NG IHYG+G + GF S+D V + + + Q F E T P + F+LAKFDG
Sbjct: 116 SSYMENGSDFTIHYGSGRVKGFLSQDSVTVSRVGGITVTQTFGEVTELPLIPFMLAKFDG 175
Query: 197 ILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKG 256
+LG+GF +VG PV+ ++++QG++ E VFS ++NR + GGE+V GG DP HY+G
Sbjct: 176 VLGMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYNRGS-HLLGGEVVLGGSDPQHYQG 234
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YV +++ WQ M V + G +T C GCA + D+G+S ++ PT+ + + A+G
Sbjct: 235 NFHYVSISKTDSWQITMKGVSV-GSSTLLCEEGCAVVVDTGSSFISAPTSSLKLIMQALG 293
Query: 317 A 317
A
Sbjct: 294 A 294
>gi|222425186|dbj|BAH20542.1| pepsinogen A-35 [Pongo abelii]
Length = 388
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/253 (45%), Positives = 160/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+ C H+ + SSTY
Sbjct: 68 LENYLDVEYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLVCMDHNLFNPQDSSTY 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D VK+G + +Q F + EP A FDGILGL
Sbjct: 127 KSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFFAPFDGILGLA 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 187 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 245 PVTVEGYWQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 303
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 304 DGDMVVSCSAISS 316
>gi|494607|pdb|1SMR|A Chain A, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880102|pdb|1SMR|C Chain C, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880104|pdb|1SMR|E Chain E, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
gi|157880106|pdb|1SMR|G Chain G, The 3-D Structure Of Mouse Submaxillary Renin Complexed
With A Decapeptide Inhibitor Ch-66 Based On The 4-16
Fragment Of Rat Angiotensinogen
Length = 335
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 112/239 (46%), Positives = 159/239 (66%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRS 138
V L NY+++QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC +AC HS Y S S
Sbjct: 7 VVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLYLACGIHSLYESSDS 66
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G + V Q F E T+ P + F+LA+FDG+L
Sbjct: 67 SSYMENGDDFTIHYGSGRVKGFLSQDSVTVGGITVT-QTFGEVTQLPLIPFMLAQFDGVL 125
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +VG PV+ ++++QG++ E VFS ++NR GGE+V GG DP HY+G+
Sbjct: 126 GMGFPAQAVGGVTPVFDHILSQGVLKEKVFSVYYNRGP-HLLGGEVVLGGSDPQHYQGDF 184
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV +++ WQ M V + G +T C GC + D+G+S ++ PT+ + + A+GA
Sbjct: 185 HYVSLSKTDSWQITMKGVSV-GSSTLLCEEGCEVVVDTGSSFISAPTSSLKLIMQALGA 242
>gi|50978946|ref|NP_001003194.1| renin precursor [Canis lupus familiaris]
gi|62287424|sp|Q6DYE7.1|RENI_CANFA RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|50058380|gb|AAT68959.1| preprorenin [Canis lupus familiaris]
Length = 403
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 110/246 (44%), Positives = 157/246 (63%), Gaps = 4/246 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTPPQ F V+FDTGS+NLWVPS++C AC H Y S S
Sbjct: 75 VVLTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTRCSPLYTACEIHCLYDSSES 134
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG + I YG+G + GF S+D V +G + V Q F E T P + F+LAKFDG+L
Sbjct: 135 SSYMENGTTFTIRYGSGKVKGFLSQDMVTVGGITVT-QTFGEVTELPLIPFMLAKFDGVL 193
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +VG PV+ ++++QG++ E VFS +++RN+ GGE+V GG DP +Y+G
Sbjct: 194 GMGFPAQAVGGVTPVFDHILSQGVLKEEVFSVYYSRNS-HLLGGEVVLGGSDPQYYQGNF 252
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
YV +++ G WQ M V + T C GC + D+G S ++GPT+ + + +GA
Sbjct: 253 HYVSISKTGSWQIKMKGVSVRSATL-VCEEGCMVVVDTGASYISGPTSSLRLLMDTLGAQ 311
Query: 319 GIVSQE 324
+ + E
Sbjct: 312 ELSTNE 317
>gi|406701140|gb|EKD04292.1| endopeptidase [Trichosporon asahii var. asahii CBS 8904]
Length = 824
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 127/260 (48%), Positives = 165/260 (63%), Gaps = 22/260 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQY+ I IGTPPQ F V+ DTGSSNLWVPS +C SIAC+ KY + +SS
Sbjct: 226 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 281
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF SED ++I L VKDQ F EAT+EP + F+ KFDGILG
Sbjct: 282 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 341
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV + P +YNM++Q L++E VFSF + E++GGE +FGG D +
Sbjct: 342 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRL--GSSEDDGGECIFGGYDKKWSDEKPI 399
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +KGYW+ ++ GD + + TG AAI D+GTSL+A PT I +N
Sbjct: 400 YVPVRRKGYWEVELEGIKFGDEELPLENTG------AAI-DTGTSLIALPTDIAEILNKE 452
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + +C V S
Sbjct: 453 IGAEKSWNGQYTVDCSKVPS 472
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/260 (47%), Positives = 161/260 (61%), Gaps = 22/260 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQY+ I IGTPPQ F V+ DTGSSNLWVPS +C SIAC+ KY + +SS
Sbjct: 527 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 582
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF SED ++I L VKDQ F EAT+EP + F+ KF G
Sbjct: 583 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFTVSFG 642
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV + P +YNM++Q L++E VFSF + E++GGE +FGG D +
Sbjct: 643 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRL--GSSEDDGGECIFGGYDKKWSDEKPI 700
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
YVPV +KGYW+ ++ GD + + TG AAI D+GTSL+A PT I +N
Sbjct: 701 YVPVRRKGYWEVELEGIKFGDEELPLENTG------AAI-DTGTSLIALPTDIAEILNKE 753
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + +C V S
Sbjct: 754 IGAEKSWNGQYTVDCSKVPS 773
>gi|241687194|ref|XP_002412838.1| aspartyl protease, putative [Ixodes scapularis]
gi|215506640|gb|EEC16134.1| aspartyl protease, putative [Ixodes scapularis]
Length = 320
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 118/236 (50%), Positives = 151/236 (63%), Gaps = 11/236 (4%)
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
+Y+G I IGTPPQ+F VIFDTGS+NLW+PSSKC + C H +Y S RSSTY+ +G++
Sbjct: 2 VEYYGPITIGTPPQDFQVIFDTGSANLWLPSSKCT-TKYCLHHHRYDSSRSSTYEADGRN 60
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YG+G + GF S+D +IG V Q EA + L A FDGILGL + I+V
Sbjct: 61 FTIVYGSGNVEGFISKDVCRIGSAKVSGQPLGEALVVGGESLLEAPFDGILGLAYPSIAV 120
Query: 208 GKAVPVWYNMVNQGLVNEP-VFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
VPV+ NM+ QGL+ E VFS + NR+ +EGGEI+FGG+D DHYKG TYVPVT K
Sbjct: 121 DGVVPVFDNMMKQGLLGEQNVFSVYLNRDPSSKEGGEILFGGIDHDHYKGSITYVPVTAK 180
Query: 267 GYWQFDMGDVMIDGQTTG----FCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
GYWQF +DG + C GC AIAD+GTSL+ GP + +N +G T
Sbjct: 181 GYWQF-----HVDGASKSVPELLCKDGCEAIADTGTSLITGPPEEVDSLNQYLGGT 231
>gi|222425182|dbj|BAH20540.1| pepsinogen A-15 [Pongo abelii]
Length = 388
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 116/253 (45%), Positives = 160/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+ + SSTY
Sbjct: 68 LENYLDVEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACMDHNLFNPQDSSTY 126
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D VK+G + +Q F + EP A FDGILGL
Sbjct: 127 KSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSESEPGSFLFFAPFDGILGLA 186
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 187 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 245 PVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 303
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 304 NGDMVVSCSAISS 316
>gi|395537495|ref|XP_003770734.1| PREDICTED: renin [Sarcophilus harrisii]
Length = 413
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 170/296 (57%), Gaps = 9/296 (3%)
Query: 23 FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
FS P+ L RI LKK N +L K+ F Y+ + L V L
Sbjct: 16 FSLPSDALQRIVLKKMPSIQEN---MKLKGKDLGKFNMEWLSYTKQLTLFNVMSP--VRL 70
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTY 141
N+ D QY+GEI IG P Q F V+FDTGS++ WVPSSKC AC FH +Y S +SSTY
Sbjct: 71 TNFEDTQYYGEISIGNPSQTFQVVFDTGSADFWVPSSKCSPLYTACVFHHQYDSTKSSTY 130
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K+NG I Y +G + GF SED V +G + + Q F E T P L F LAKFDG+LGLG
Sbjct: 131 KENGTEFKIQYASGQVMGFLSEDTVTVGGIKMT-QSFGEVTVLPLLPFGLAKFDGVLGLG 189
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
F +S+ K VP + N+++QG++ + VFS +++RN+ GGEI+ GG DP +Y+G Y+
Sbjct: 190 FPALSMSKIVPFFDNIISQGMLKKEVFSVYYSRNS-HVPGGEIILGGSDPKYYRGTFHYI 248
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++ G+WQ M V ++ C GC A D+G S + GPT+ + +V +G
Sbjct: 249 NISHPGFWQIQMNGVSVESNVLA-CQDGCIASVDTGASFITGPTSSMRKVMKMLGV 303
>gi|302415619|ref|XP_003005641.1| vacuolar protease A [Verticillium albo-atrum VaMs.102]
gi|261355057|gb|EEY17485.1| vacuolar protease A [Verticillium albo-atrum VaMs.102]
Length = 383
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 118/255 (46%), Positives = 153/255 (60%), Gaps = 22/255 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V + N+M+AQYF EI IGTPPQ F V+ DTGSSNLWVPS +C SIACY H+KY S SS
Sbjct: 74 VPVSNFMNAQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQQCS-SIACYLHTKYDSSDSS 132
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG +IHYG+G+++GF S+D V IGD+ +K+Q+F EAT EP L F +FDGILG
Sbjct: 133 TYKANGSEFEIHYGSGSLTGFVSQDTVTIGDIKIKNQDFAEATSEPGLAFAFGRFDGILG 192
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV K VP +Y MVNQ V+EPVF+F+ ++ + EI
Sbjct: 193 LGYDTISVNKIVPPFYQMVNQKAVDEPVFAFYLGDTNEQGDESEITRSS----------- 241
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
+ YW+ D+ + + T G AI D+GTSL P+T+ +N+ IGA
Sbjct: 242 ----PPQAYWEVDLDSISLGDNTAEL--DGHGAILDTGTSLNVLPSTLADMLNNEIGAKK 295
Query: 318 --TGIVSQECKAVVS 330
G S EC S
Sbjct: 296 GYNGQWSVECDKRAS 310
>gi|169731523|gb|ACA64894.1| progastricsin (predicted) [Callicebus moloch]
Length = 388
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/234 (47%), Positives = 149/234 (63%), Gaps = 2/234 (0%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ +SSTY
Sbjct: 67 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSKSSTYSS 125
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
N ++ + YG+G+++GFF D + + + V QEF + EP F+ AKFDGI+GL +
Sbjct: 126 NEQTFSLQYGSGSLTGFFGYDTLTVQSIQVPKQEFGLSENEPGTNFIYAKFDGIMGLAYP 185
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+SVG A M+ +G + PVFSF+ + N GG +VFGG+D Y G+ + PV
Sbjct: 186 ALSVGGATTAMQGMLQEGALTSPVFSFYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPV 244
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TQ+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ A GA
Sbjct: 245 TQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSAFLEATGA 298
>gi|296479430|tpg|DAA21545.1| TPA: renin [Bos taurus]
Length = 401
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 113/241 (46%), Positives = 154/241 (63%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTPPQ F V+FDTGS+NLWVPS+KC AC HS Y S S
Sbjct: 70 VVLTNYLDTQYYGEIGIGTPPQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLYDSLES 129
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G + V Q F E T P L F+LAKFDG+L
Sbjct: 130 SSYVENGTEFTIHYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLLPFMLAKFDGVL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF +VG PV+ +++ Q ++ + VFS +++R++ GGEIV GG DP +Y+
Sbjct: 189 GMGFPAQAVGGVTPVFDHILAQRVLTDDVFSVYYSRDSKNSHLLGGEIVLGGSDPQYYQE 248
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YV +++ G WQ M V + TT C GC I D+G S ++GPT+ + + A+G
Sbjct: 249 NFHYVSISKPGSWQIRMKGVSVR-STTLLCEEGCMVIVDTGASYISGPTSSLRLLMEALG 307
Query: 317 A 317
A
Sbjct: 308 A 308
>gi|253762215|gb|ACT35559.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 115/258 (44%), Positives = 166/258 (64%), Gaps = 17/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLW+PS C S AC H ++ +S+T+
Sbjct: 61 MTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTTF 119
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G+ + D V++G + V +Q F I T P + + A DGILGL
Sbjct: 120 KWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV QGLV++P+FS + + N+ E+G E+VFGG+D HY G+ T+
Sbjct: 178 AFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSSNS--EQGSEVVFGGIDSSHYTGQITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I +N +GA+
Sbjct: 236 IPLSSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE +++
Sbjct: 293 --------TNQYGEAVVS 302
>gi|395821502|ref|XP_003784077.1| PREDICTED: gastricsin-like [Otolemur garnettii]
Length = 390
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 107/236 (45%), Positives = 150/236 (63%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ NYM++ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H + +SST+
Sbjct: 66 ITNYMESFYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHHVFNPSQSSTF 124
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
NG++ + YG+G+++ D V I ++VV +QEF + EP++ F + FDGILG+
Sbjct: 125 SNNGQTYTLSYGSGSLTVVMGYDTVTIQNIVVNNQEFGLSENEPTVPFYYSAFDGILGMA 184
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+VG A V +M+ Q + +P+FSF+F+R + GGE++ GG+D Y GE +
Sbjct: 185 YPAIAVGNAPTVVQDMLQQNQLTQPIFSFYFSRQPTAQYGGELILGGVDSQLYSGEIVWT 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVTQ+ YWQ + + I Q TG C+ GC I D+GTSLL P I+ A GA
Sbjct: 245 PVTQEMYWQIAIQEFSIGNQATGLCSQGCQGIVDTGTSLLTVPQQYISSFVEATGA 300
>gi|401881725|gb|EJT46014.1| endopeptidase [Trichosporon asahii var. asahii CBS 2479]
Length = 528
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/260 (48%), Positives = 165/260 (63%), Gaps = 22/260 (8%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYM+AQY+ I IGTPPQ F V+ DTGSSNLWVPS +C SIAC+ KY + +SS
Sbjct: 192 VPLSNYMNAQYYAPITIGTPPQEFGVVLDTGSSNLWVPSVQCS-SIACF---KYDNSQSS 247
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TYK NG I YG+G++ GF SED ++I L VKDQ F EAT+EP + F+ KFDGILG
Sbjct: 248 TYKANGSEFAIRYGSGSLEGFVSEDTLEIAGLKVKDQLFAEATKEPGMAFVFGKFDGILG 307
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ ISV + P +YNM++Q L++E VFSF + E++GGE +FGG D +
Sbjct: 308 LGYNTISVNQIPPPFYNMIDQNLLDEKVFSFRL--GSSEDDGGECIFGGYDKKWSDEKPI 365
Query: 260 YVPVTQKGYWQFDM-----GDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
Y+PV +KGYW+ ++ GD + + TG AAI D+GTSL+A PT I +N
Sbjct: 366 YIPVRRKGYWEVELEGIKFGDEELPLENTG------AAI-DTGTSLIALPTDIAEILNKE 418
Query: 315 IGA----TGIVSQECKAVVS 330
IGA G + +C V S
Sbjct: 419 IGAEKSWNGQYTVDCSKVPS 438
>gi|4589842|dbj|BAA76892.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 121/289 (41%), Positives = 171/289 (59%), Gaps = 6/289 (2%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ R+ LKK K + R A + +S + + R Y + L N MD
Sbjct: 16 GILRVPLKKGK---SIREAMK-ESGVLHDYLANHRHYDPAYKFFSNFATAYEPLANNMDM 71
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+GEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+++ SST+ +
Sbjct: 72 SYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQACANHNEFDPNESSTFSTQDEFF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+ YG+G+++G F D V I + + +QEF + EP +FL + FDGILGL F IS G
Sbjct: 131 SLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGTSFLYSPFDGILGLAFPSISAG 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A V M+ + L++ PVFSF+ + + +GGE+VFGG+DP+ Y G+ T+ PVTQ Y
Sbjct: 191 GATTVMQKMLQENLLDFPVFSFYLS-GQEGSQGGELVFGGVDPNLYTGQITWTPVTQTTY 249
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ + D + GQ++G+C+ GC I D+GTSLL P + T++ IGA
Sbjct: 250 WQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQVFTELMQYIGA 298
>gi|45382395|ref|NP_990208.1| gastricsin precursor [Gallus gallus]
gi|4589840|dbj|BAA76893.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 121/289 (41%), Positives = 171/289 (59%), Gaps = 6/289 (2%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ R+ LKK K + R A + +S + + R Y + L N MD
Sbjct: 16 GILRVPLKKGK---SIREAMK-ESGVLHDYLANHRHYDPAYKFFSNFATAYEPLANNMDM 71
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+GEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+++ SST+ +
Sbjct: 72 SYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQACANHNEFDPNESSTFSTQDEFF 130
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+ YG+G+++G F D V I + + +QEF + EP +FL + FDGILGL F IS G
Sbjct: 131 SLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGTSFLYSPFDGILGLAFPSISAG 190
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A V M+ + L++ PVFSF+ + + +GGE+VFGG+DP+ Y G+ T+ PVTQ Y
Sbjct: 191 GATTVMQKMLQENLLDFPVFSFYLS-GQEGSQGGELVFGGVDPNLYTGQITWTPVTQTTY 249
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
WQ + D + GQ++G+C+ GC I D+GTSLL P + T++ IGA
Sbjct: 250 WQIGIEDFAVGGQSSGWCSQGCQGIVDTGTSLLTVPNQVFTELMQYIGA 298
>gi|222618904|gb|EEE55036.1| hypothetical protein OsJ_02714 [Oryza sativa Japonica Group]
Length = 290
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 105/154 (68%), Positives = 122/154 (79%)
Query: 61 SIRKYSLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSK 120
S R + RG+L +ALKN+++AQYFGEIG+G PPQNFTV+FDTGSSNLWVPS+K
Sbjct: 110 SYRGFRERGSLKNGTQNHPLALKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAK 169
Query: 121 CYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIE 180
C FS+ACYFH KY S SSTY +NG A IHYGTG+I G++S+D V IGDLVV +QEFIE
Sbjct: 170 CVFSLACYFHRKYESRSSSTYMENGTPASIHYGTGSIHGYYSQDQVTIGDLVVNNQEFIE 229
Query: 181 ATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
AT EP LTFL AKFDGILGLGF+EISV A PVW
Sbjct: 230 ATHEPGLTFLAAKFDGILGLGFKEISVEGADPVW 263
>gi|296198131|ref|XP_002746573.1| PREDICTED: gastricsin [Callithrix jacchus]
gi|18203304|sp|Q9N2D3.1|PEPC_CALJA RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|7008023|dbj|BAA90872.1| pepsinogen C [Callithrix jacchus]
Length = 388
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 113/255 (44%), Positives = 159/255 (62%), Gaps = 12/255 (4%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY
Sbjct: 67 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSASSTYSS 125
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
NG++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 126 NGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYP 185
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+S+G A M+ +G + PVFSF+ + N GG ++FGG+D Y G+ + PV
Sbjct: 186 ALSMGGATTAMQGMLQEGALTSPVFSFYLS-NQQGSSGGAVIFGGVDSSLYTGQIYWAPV 244
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQ 323
TQ+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ A GA Q
Sbjct: 245 TQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSAFLEATGA-----Q 299
Query: 324 ECKAVVSQYGEEIIN 338
E +YG+ ++N
Sbjct: 300 E-----DEYGQFLVN 309
>gi|166796432|gb|AAI59305.1| LOC496913 protein [Xenopus (Silurana) tropicalis]
Length = 381
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 112/276 (40%), Positives = 161/276 (58%), Gaps = 9/276 (3%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGD-------ADIVALKNYMDAQYFGEIGIGTPPQ 101
++ K +S R +R++ ++ + + L NYMD Y+ EI IGTPPQ
Sbjct: 17 KVPLKRFKSMREVMREHGIKAPIVDPASKYYNQYATAFEPLANYMDMSYYEEISIGTPPQ 76
Query: 102 NFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFF 161
NF V+FDTGSSNLWV S+ C S AC H + +SSTY N + + YGTG+++G
Sbjct: 77 NFLVLFDTGSSNLWVASTNCQ-SQACTNHPLFNPSQSSTYSSNQQQFSLQYGTGSLTGIL 135
Query: 162 SEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQG 221
D V I ++ + QEF + EP F+ A+FDGILGL + I+VG A V M+ Q
Sbjct: 136 GYDTVTIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLAYPSIAVGGATTVMQGMLQQN 195
Query: 222 LVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQ 281
L+NEPVF F+ + + + GGE+ FGG+D ++Y G+ + PVT + YWQ + I+GQ
Sbjct: 196 LLNEPVFGFYLS-GENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQ 254
Query: 282 TTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+G+C+ GC I D+GTSLL P +I + IGA
Sbjct: 255 ASGWCSQGCQGIVDTGTSLLTAPQSIFASLMQDIGA 290
>gi|395838792|ref|XP_003792290.1| PREDICTED: renin [Otolemur garnettii]
Length = 404
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 131/305 (42%), Positives = 174/305 (57%), Gaps = 9/305 (2%)
Query: 21 VVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIV 80
+ ST RI LKK V +L + + R S L V
Sbjct: 22 ISLSTDTSAFSRIFLKKMP-----SVREKLKERGVDMARLSAEWSQFTRRLSSGNSTSSV 76
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSS 139
L NY+D QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S SS
Sbjct: 77 VLTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSSDSS 136
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
+Y +NG I YGTG + GF S+D V +G L V Q F E T P + F+LAKFDG+LG
Sbjct: 137 SYMENGTEFTIQYGTGKVKGFLSQDVVTVGGLTVT-QGFGEVTELPLMPFMLAKFDGVLG 195
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
+GF +VG PV+ N+++Q ++ E VFS +++RN+ GGEIV GG DP +Y+G
Sbjct: 196 MGFPAQAVGGITPVFDNILSQRVLKEDVFSVYYSRNS-HLLGGEIVLGGSDPQYYQGNFH 254
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATG 319
YV +++ G WQ M V + TT C GC A+ D+G S ++GPT+ + + A+GA
Sbjct: 255 YVSISKTGSWQIKMKGVSVR-STTLLCEDGCMAVVDTGASYISGPTSSLRLLMKALGAQE 313
Query: 320 IVSQE 324
+ E
Sbjct: 314 LSMNE 318
>gi|109287598|emb|CAJ55261.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 106/238 (44%), Positives = 155/238 (65%), Gaps = 3/238 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L N+ D QY+GEI IGTP Q F V+FDTGSSNLWVPS +C AC H++Y S SST
Sbjct: 71 LTNFRDTQYYGEISIGTPAQIFKVVFDTGSSNLWVPSHQCSPLYSACVSHNRYDSSESST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK G + YG G I GF S+D V++ D+ + Q F EA PS+ F+ A FDG+LG+
Sbjct: 131 YKPKGTKITLTYGQGYIEGFLSQDIVRVADIPIT-QFFTEAIALPSIPFMYAHFDGVLGM 189
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ + ++G +PV+ N++++ +++E VFS +++R+++ GGEI+ GG DP HY G+ Y
Sbjct: 190 GYPKQAIGGVIPVFDNIMSEKVLSENVFSVYYSRHSESNTGGEIILGGSDPSHYTGDFHY 249
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
V +++GYW D+ V I+ + C GC A D+GTS ++GP + I+ + IGAT
Sbjct: 250 VSTSREGYWHVDLKGVSIENK-IALCHDGCTATIDTGTSFISGPASSISVLMETIGAT 306
>gi|193499293|gb|ACF18589.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 115/258 (44%), Positives = 166/258 (64%), Gaps = 17/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLW+PS C S AC H ++ +S+T+
Sbjct: 61 MTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQSTTF 119
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G+ + D V++G + V +Q F I T P + + A DGILGL
Sbjct: 120 KWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV QGLV++P+FS + + N+ E+G E+VFGG+D HY G+ T+
Sbjct: 178 AFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSSNS--EQGSEVVFGGIDSSHYTGQITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I +N +GA+
Sbjct: 236 IPLSSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE +++
Sbjct: 293 --------TNQYGEAVVS 302
>gi|354493821|ref|XP_003509038.1| PREDICTED: gastricsin-like [Cricetulus griseus]
gi|344238302|gb|EGV94405.1| Gastricsin [Cricetulus griseus]
Length = 391
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/309 (39%), Positives = 177/309 (57%), Gaps = 18/309 (5%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKY----SLRG 69
+ LL P++ L R+ LKK K R + KE + ++ + + +
Sbjct: 6 VALLCLPLL----EAALVRVPLKKMK-------TIRQNMKEKGVLKDFLKTHKYDPAQKY 54
Query: 70 NLGESGDADIVALK-NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
G GD ++ YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC
Sbjct: 55 RFGNFGDFSVLYEPIAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SEACT 113
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H +Y +SSTY G++ + YGTG+++GFF D + + + V +QEF + EP
Sbjct: 114 THPRYNPNKSSTYYTEGQTFSLQYGTGSLTGFFGYDTLTVQGIQVPNQEFGLSENEPGTN 173
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ A FDGI+GL + +S G A ++ +G +++P+F + GG+IVFGG
Sbjct: 174 FVYADFDGIMGLAYPGLSAGGATTAMQGLLQEGALSQPLFGVYLGSQQGSN-GGQIVFGG 232
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+D + Y GE T++PVTQ+ YWQ + D +I Q +G+C+ GCA I D+GTSLL P+ +
Sbjct: 233 VDENLYTGEITWIPVTQELYWQITIDDFLIGDQVSGWCSQGCAGIVDTGTSLLTMPSQYL 292
Query: 309 TQVNHAIGA 317
+ + IGA
Sbjct: 293 SDLLQTIGA 301
>gi|114607413|ref|XP_518465.2| PREDICTED: gastricsin isoform 2 [Pan troglodytes]
Length = 388
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 177/311 (56%), Gaps = 24/311 (7%)
Query: 40 FDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES-----------GDADIV-ALKNYMD 87
L+ ++ K+ +S R ++++ L G + GD + YMD
Sbjct: 11 LQLSEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLSVTYEPMAYMD 70
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
A YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY NG++
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTNGQT 129
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
+ YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL + +SV
Sbjct: 130 FSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPALSV 189
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
+A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVTQ+
Sbjct: 190 DEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQEL 248
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKA 327
YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 249 YWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLEATGA-----QE--- 300
Query: 328 VVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 --DEYGQFLVN 309
>gi|195114666|ref|XP_002001888.1| GI14567 [Drosophila mojavensis]
gi|193912463|gb|EDW11330.1| GI14567 [Drosophila mojavensis]
Length = 402
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 113/245 (46%), Positives = 152/245 (62%), Gaps = 4/245 (1%)
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHS 131
+S + I L N + Y+G IGIGTPPQ F V+FDTGSSNLWVPS +C + +AC H+
Sbjct: 73 DSNEYVIETLSNNQNMDYYGVIGIGTPPQYFNVVFDTGSSNLWVPSVQCLSTDVACQNHN 132
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+Y S SSTY NG+S I YGTG+++GF S D V I L + Q F EA +P+ +F
Sbjct: 133 QYNSSASSTYVPNGESFSIQYGTGSLTGFLSTDTVTINGLSIASQTFGEAISQPNGSFTG 192
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
FDGILG+G+ I+V VP +YN+ Q L++EP F F+ R+ + GG++V GG+D
Sbjct: 193 VPFDGILGMGYMSIAVDNVVPPFYNLYEQRLIDEPTFGFYLARDGSAQAGGQLVLGGIDS 252
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+ G TYV V Q+GYWQF + + G + C AIAD+GTSLLA P + T +
Sbjct: 253 QLFSGNLTYVSVVQQGYWQFVVNSAEMGGYVVCY---NCQAIADTGTSLLACPGSAYTML 309
Query: 312 NHAIG 316
N IG
Sbjct: 310 NQLIG 314
>gi|387014|gb|AAA60062.1| pepsinogen [Homo sapiens]
Length = 385
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 158/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 65 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 123
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 124 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 183
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVT
Sbjct: 184 LSVDEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 242
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 243 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 297
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 298 -----DEYGQFLVN 306
>gi|126309841|ref|XP_001370380.1| PREDICTED: gastricsin-like isoform 1 [Monodelphis domestica]
Length = 388
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 150/232 (64%), Gaps = 1/232 (0%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMD+ Y+GEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ +SSTY N
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
++VG A M+ Q ++ P+FSF+ + + GGE++FGG+D + Y G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Q+ YWQ + + I GQ TG+C+ GC AI D+GTSLL P ++ A G
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATG 297
>gi|241951846|ref|XP_002418645.1| Vacuolar aspartic proteinase, putative [Candida dubliniensis CD36]
gi|223641984|emb|CAX43948.1| Vacuolar aspartic proteinase, putative [Candida dubliniensis CD36]
Length = 419
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 118/243 (48%), Positives = 154/243 (63%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 96 LTNYLNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCT-SLACFLHAKYDHDASSTY 154
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 155 KANGSEFSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKFDGILGLA 214
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL+ +P F F+ + + DE +GG FGG D ++G+ T+
Sbjct: 215 YDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQGKITW 274
Query: 261 VPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+PV +K YW+ +GD + TG AAI D+GTSL+ P+++ +N I
Sbjct: 275 LPVRRKAYWEVSFEGIGLGDEYAELHKTG------AAI-DTGTSLITLPSSLAEIINAKI 327
Query: 316 GAT 318
GAT
Sbjct: 328 GAT 330
>gi|126309843|ref|XP_001370404.1| PREDICTED: gastricsin-like isoform 2 [Monodelphis domestica]
Length = 389
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 150/232 (64%), Gaps = 1/232 (0%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMD+ Y+GEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ +SSTY N
Sbjct: 67 YMDSSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ-SQACSGHARFNPSQSSTYSTN 125
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+G+ +
Sbjct: 126 GQTFSLQYGSGSLTGFFGYDTMTVQGIQVPNQEFGLSENEPGTNFIYAQFDGIMGMAYPA 185
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
++VG A M+ Q ++ P+FSF+ + + GGE++FGG+D + Y G+ + PVT
Sbjct: 186 LAVGGATTALQGMLQQNVLTNPIFSFYLSNQQSSQSGGEVIFGGVDNNLYSGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
Q+ YWQ + + I GQ TG+C+ GC AI D+GTSLL P ++ A G
Sbjct: 246 QELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATG 297
>gi|4505757|ref|NP_002621.1| gastricsin isoform 1 preproprotein [Homo sapiens]
gi|129796|sp|P20142.1|PEPC_HUMAN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|387015|gb|AAA60063.1| pepsinogen C [Homo sapiens]
gi|551176|gb|AAA60074.1| pepsinogen [Homo sapiens]
gi|1658286|gb|AAB18273.1| gastricsin [Homo sapiens]
gi|49522219|gb|AAH73740.1| Progastricsin (pepsinogen C) [Homo sapiens]
gi|119624464|gb|EAX04059.1| progastricsin (pepsinogen C) [Homo sapiens]
Length = 388
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 158/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|999902|pdb|1HTR|B Chain B, Crystal And Molecular Structures Of Human Progastricsin At
1.62 Angstroms Resolution
gi|2982065|pdb|1AVF|A Chain A, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
gi|2982067|pdb|1AVF|J Chain J, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
Length = 329
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 158/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 9 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 67
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 68 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 127
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVT
Sbjct: 128 LSVDEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 186
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 187 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 241
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 242 -----DEYGQFLVN 250
>gi|195575783|ref|XP_002077756.1| GD23099 [Drosophila simulans]
gi|194189765|gb|EDX03341.1| GD23099 [Drosophila simulans]
Length = 405
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/253 (45%), Positives = 161/253 (63%), Gaps = 7/253 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G IGIGTP Q F V+FDTGS+NLWVPSS+C + +AC H++Y S SST
Sbjct: 85 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSSQCLSTDVACQQHNQYNSSASST 144
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+ +G++ I YGTG++SG+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 145 FVSSGQNFSIQYGTGSVSGYLAMDTVTINGLAIANQTFGEAVSQPGASFTDVAFDGILGM 204
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q+I+ VP +YN+ +GL++EPVF F+ RN EGG++ GG D + GE TY
Sbjct: 205 GYQQIAEDNVVPPFYNLYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTY 264
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVTQ+GYWQF + ++ +G + GC AIAD+GTSL+A P+ Q+N+ IG I
Sbjct: 265 TPVTQQGYWQFSVNNITWNGTV---ISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVLI 321
Query: 321 VSQ---ECKAVVS 330
+ C V S
Sbjct: 322 QGEYYVPCSTVSS 334
>gi|195350353|ref|XP_002041705.1| GM16820 [Drosophila sechellia]
gi|194123478|gb|EDW45521.1| GM16820 [Drosophila sechellia]
Length = 405
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/253 (45%), Positives = 161/253 (63%), Gaps = 7/253 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G IGIGTP Q F V+FDTGS+NLWVPSS+C + +AC H++Y S SST
Sbjct: 85 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSSQCLATDVACQQHNQYNSSASST 144
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+ +G++ I YGTG++SG+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 145 FVSSGQNFSIQYGTGSVSGYLAMDTVTINGLAIANQTFGEAVSQPGASFTDVAFDGILGM 204
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q+I+ VP +YN+ +GL++EPVF F+ RN EGG++ GG D + GE TY
Sbjct: 205 GYQQIAEDNVVPPFYNLYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQNLIAGEMTY 264
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVTQ+GYWQF + ++ +G + GC AIAD+GTSL+A P+ Q+N+ IG I
Sbjct: 265 TPVTQQGYWQFSVNNITWNGTV---ISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVLI 321
Query: 321 VSQ---ECKAVVS 330
+ C V S
Sbjct: 322 QGEYYVPCSTVSS 334
>gi|90265707|dbj|BAE91900.1| Proteinase A [Ogataea minuta]
Length = 410
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 118/237 (49%), Positives = 150/237 (63%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVP S C S+ACY H KY SSTY
Sbjct: 88 LTNYVNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPGSDCS-SLACYLHQKYDHDSSSTY 146
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ GF S+D + +GDL++ Q+F EAT EP L F KFDGILGL
Sbjct: 147 KANGSEFAIRYGSGSLEGFVSQDTLTLGDLIIPKQDFAEATSEPGLAFAFGKFDGILGLA 206
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE-EGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN +N GL++EP F+F+ A E +GG FGG+D Y G+ T+
Sbjct: 207 YDTISVDKIVPPIYNALNLGLLDEPQFAFYLGDTAKSEADGGVATFGGVDETKYDGKITW 266
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+PV +K YW+ + + G G AAI D+GTSL+A P+ + +N IGA
Sbjct: 267 LPVRRKAYWEVKFDGIAL-GDEYATLDGYGAAI-DTGTSLIALPSQLAEILNSQIGA 321
>gi|327271277|ref|XP_003220414.1| PREDICTED: renin-like [Anolis carolinensis]
Length = 398
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/277 (41%), Positives = 167/277 (60%), Gaps = 17/277 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L NY+D QY+GEI IGTP Q F V+FDTGS+NLWVPS +C AC H++Y S RSST
Sbjct: 71 LTNYLDMQYYGEISIGTPAQIFKVVFDTGSANLWVPSQQCSPLYSACVSHNRYDSSRSST 130
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YK NG I YG G + GF S+D V++ D+ V Q F EA P+ F+ A+FDG+LG+
Sbjct: 131 YKPNGTEIAIQYGQGYVKGFLSQDIVRVADIPVV-QLFAEAIALPNKPFIYARFDGVLGM 189
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++ +PV+ ++++ +++E VFS +++RN++ GGEI+ GG DP +Y G+ Y
Sbjct: 190 GYPSQAIDGVIPVFDKIISERVLSEEVFSVYYSRNSEMNTGGEIILGGSDPSYYTGDFHY 249
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
V ++ GYW D+ V + G FC GC A D+G+S + GP + ++ + +IGAT +
Sbjct: 250 VSISTPGYWHIDLKGVSL-GSEMLFCHEGCTAAVDTGSSFITGPASAVSILMKSIGATLL 308
Query: 321 VSQ----ECKAVVSQYGEEIINMLLAKVSAKLSLRHY 353
+ ECK + LL +S L R Y
Sbjct: 309 EERDYVVECKKI----------HLLPDISFHLGDRSY 335
>gi|307167891|gb|EFN61280.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 431
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 174/322 (54%), Gaps = 56/322 (17%)
Query: 49 RLDSKEGESFRTSIRKYSL---RGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTV 105
R+ + + R +RK + + N +S + +L NY+D++Y+G I IGTPPQ F V
Sbjct: 21 RIQLHKMDPIRKRLRKIGIDLQQINFTKSNPSS-QSLYNYLDSEYYGNITIGTPPQQFKV 79
Query: 106 IFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADIHY-------GTGAI 157
+FDTGSSNLW+PS C + +AC H+KY S +S TYK N + Y +G++
Sbjct: 80 LFDTGSSNLWIPSILCSTANVACALHNKYDSTKSRTYKVNNTICSLQYDITSIPFNSGSV 139
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
SGF S D V + L V+ Q F EA E L ++A+FDGILG+G+ I+V PV+YN+
Sbjct: 140 SGFLSTDVVNVAGLNVQGQTFAEAIDELVLALVVAEFDGILGMGYSTIAVDGVTPVFYNL 199
Query: 218 VNQGLVNEPVFSFWFNRNA------------------------------------DEEEG 241
+ Q LV +PVFSF+ NR+ + G
Sbjct: 200 IKQKLVPQPVFSFYLNRHVFSYSIFKSISNKYIYNKKKYIYIAILKRIYNVYRDPSAKVG 259
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI------DGQTTGFCAGGCAAIAD 295
GE++ GG DP +Y G YV VT+KGYWQF M V I G+T C GGC AIAD
Sbjct: 260 GELILGGSDPAYYTGHFKYVDVTKKGYWQFLMDRVRITRTKFNKGRT--LCMGGCQAIAD 317
Query: 296 SGTSLLAGPTTIITQVNHAIGA 317
+G SL+ GPT+ I +N IGA
Sbjct: 318 TGMSLIVGPTSEIDIINKYIGA 339
>gi|301784222|ref|XP_002927531.1| PREDICTED: pepsin B-like [Ailuropoda melanoleuca]
Length = 390
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 103/236 (43%), Positives = 150/236 (63%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
NY+++ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+ + SSTY
Sbjct: 66 FTNYLNSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACTNHNMFNPSSSSTY 124
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ NG++ ++YG+G+++ D V + ++++ +QEF + EP+ F A FDGILG+
Sbjct: 125 RNNGQTYTLYYGSGSLTVLLGYDTVNVQNIIINNQEFGLSEIEPNNPFYYANFDGILGMA 184
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++VG A V +MV Q + +P+FSF+F+R E GGE++ GG+D Y GE +
Sbjct: 185 YPNLAVGNAPTVTQSMVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWT 244
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT++ YWQ + + ++ Q TG C+ GC AI D+GT +LA P I GA
Sbjct: 245 PVTREMYWQIAIDEFLVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGA 300
>gi|397526910|ref|XP_003833357.1| PREDICTED: gastricsin [Pan paniscus]
Length = 388
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 158/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLEATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|395535589|ref|XP_003769805.1| PREDICTED: chymosin-like [Sarcophilus harrisii]
Length = 382
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 116/259 (44%), Positives = 159/259 (61%), Gaps = 8/259 (3%)
Query: 63 RKYSLRGNLGESGDADIVA---LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSS 119
KYSL G+A VA L N++D+QYFG+I IGTPPQ FTV+FDTGSSNLWVPS
Sbjct: 45 NKYSLSSKYRRYGEAAKVASEPLTNFLDSQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSV 104
Query: 120 KCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFI 179
C S AC H ++ SST+ + I YGTG++ G D V + +V DQ F
Sbjct: 105 YCN-STACENHHRFSPSESSTFNSTEEPLSIQYGTGSMEGVLGYDTVIVSSIVDPDQIFG 163
Query: 180 EATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE 239
+T+EP F ++FDGILGLG+ ++V +A PV+ NM+N+ LV + +FS + NR+
Sbjct: 164 LSTQEPGNIFTYSEFDGILGLGYPSLAVDQATPVFDNMMNKHLVAQNLFSVYMNRHG--- 220
Query: 240 EGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTS 299
G + G +D +Y G +VP+T +GYWQF + + ++GQ C GGC AI D+GTS
Sbjct: 221 PGSMLTLGAIDSSYYTGSLHWVPITVQGYWQFSVDRITVNGQVVA-CDGGCQAILDTGTS 279
Query: 300 LLAGPTTIITQVNHAIGAT 318
LL GP+ I+ + IGAT
Sbjct: 280 LLVGPSYDISNIQSVIGAT 298
>gi|367000932|ref|XP_003685201.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
gi|357523499|emb|CCE62767.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
Length = 419
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/249 (45%), Positives = 156/249 (62%), Gaps = 6/249 (2%)
Query: 71 LGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFH 130
L E G + V L NY++AQY+ +I +GTP QNF VI DTGSSNLWVPS C S+ACY H
Sbjct: 86 LNEGGSHN-VPLSNYLNAQYYTDISLGTPKQNFKVILDTGSSNLWVPSKDCT-SLACYLH 143
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGD-LVVKDQEFIEATREPSLTF 189
SKY S+TY+KNG I YG+G++ G+ S D + IGD LV+ +Q+F EAT EP L F
Sbjct: 144 SKYDHDESTTYEKNGTKFTIQYGSGSMDGYISRDTLIIGDDLVIPEQDFAEATSEPGLAF 203
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGG 248
KFDGILGL + I+V K VP +YN + QG+++E F+F+ + N D + GGE FGG
Sbjct: 204 AFGKFDGILGLAYDTIAVNKVVPPFYNAIKQGILDENKFAFYLGDTNKDNKSGGEATFGG 263
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
D + G+ T++PV +K YW+ + + + G A D+GTSL+ P+ +
Sbjct: 264 YDKSKFTGDITWLPVRRKAYWEVKFDSIALGDEVASL--DGYGAAIDTGTSLITLPSGLA 321
Query: 309 TQVNHAIGA 317
+N IGA
Sbjct: 322 EVINTQIGA 330
>gi|34740274|dbj|BAC87742.1| pepsinogen [Paralichthys olivaceus]
Length = 377
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 131/330 (39%), Positives = 190/330 (57%), Gaps = 34/330 (10%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEG---ESFRTSIRKYSLRG 69
+L +L V FS L++I L K K AR D +E E +R Y+
Sbjct: 4 WLVVLSALVAFSE---CLHKIPLIKGK-------TARQDLQEKGLWEQYRKEF-PYNPMA 52
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
++G ++ N D Y+G I IGTPPQ+F+VIFDTGSSNLW+PS C S AC
Sbjct: 53 KFIQTGTE---SMTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACEN 108
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLT 188
H ++ +SST+ + I YGTG+++G+ + D V++G + V +Q F I + P +
Sbjct: 109 HKRFNPQQSSTFHWGNRPLSIQYGTGSMTGYLASDTVEVGGISVANQVFGISQSEAPFMA 168
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
+ K DGILGL FQ I+ VPV+ NM+ Q LV++P+FS + + N ++G E+VFGG
Sbjct: 169 HM--KADGILGLAFQSIASDNVVPVFDNMIKQNLVSQPLFSVYLSSN--NQQGSEVVFGG 224
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+D +HY G+ +++P+T YWQ M V I+GQT C+GGC AI D+GTSL+ GPT I
Sbjct: 225 IDGNHYTGQVSWIPLTSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTNDI 283
Query: 309 TQVNHAIGATGIVSQECKAVVSQYGEEIIN 338
+N +GA+ +QYGE +N
Sbjct: 284 NNMNSWVGAS----------TNQYGEATVN 303
>gi|311260416|ref|XP_003128442.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 394
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 103/233 (44%), Positives = 152/233 (65%), Gaps = 1/233 (0%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
Y++A YFGEI IGTPPQNF V+FDTGSSNLWVPS C S+AC H+++ +SSTY +
Sbjct: 73 YLEAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCK-SLACTTHARFNPSKSSTYSTD 131
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
++ + YG+G+++GFF D +KI + V DQEF + EP +FL A+FDGI+GL + +
Sbjct: 132 RQTFSLQYGSGSLTGFFGYDTLKIQSIQVPDQEFGLSETEPGTSFLYAQFDGIMGLAYPD 191
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+S G A ++ + + PVFSF+ + ++GGE+V GG+D Y G+ + PVT
Sbjct: 192 LSAGGATTAMQGLLQEDALTSPVFSFYLSNQQSSQDGGELVLGGVDSSLYTGQIYWAPVT 251
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
Q+ YWQ + + +I + +G+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 252 QELYWQIGIEEFLIGDEASGWCSEGCQAIVDTGTSLLTVPQDYLSDLVQATGA 304
>gi|327270926|ref|XP_003220239.1| PREDICTED: embryonic pepsinogen-like [Anolis carolinensis]
Length = 382
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 110/257 (42%), Positives = 158/257 (61%), Gaps = 15/257 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY+D +Y+G I IGTPPQ FTV+FDTGSSNLWVPS+ C C H ++ +SST+
Sbjct: 67 LMNYLDTEYYGTINIGTPPQAFTVVFDTGSSNLWVPSTYCS-DAPCQNHPRFDPSQSSTF 125
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ ++ I YGTG++ G D + + + V QEF ++ EP + F FDGILGLG
Sbjct: 126 ENTQQTMSIQYGTGSMQGILGYDTLTVTGITVPKQEFALSSSEPGVFFTYVPFDGILGLG 185
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+V PV+ NM+N+GLV E +FS + R G I FGG+D +Y G ++
Sbjct: 186 YPSIAVSDVTPVFDNMMNEGLVQENLFSVYLGRGG---TGSIITFGGIDESYYTGSINWI 242
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIV 321
PVT++GYWQ ++ ++++G+ C+ GC AI D+GTSL+AGP + I+ + +AIGAT
Sbjct: 243 PVTEQGYWQIELDSILVNGEAIA-CSDGCQAIVDTGTSLVAGPPSDISNLQNAIGAT--- 298
Query: 322 SQECKAVVSQYGEEIIN 338
QYG+ IN
Sbjct: 299 -------PGQYGQYDIN 308
>gi|426353119|ref|XP_004044046.1| PREDICTED: gastricsin [Gorilla gorilla gorilla]
Length = 388
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 114/254 (44%), Positives = 158/254 (62%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPA 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV +A MV +G + PVFS + + N GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDEATTAMQGMVQEGALTSPVFSVYLS-NQQGSSGGAVVFGGVDNSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|194762104|ref|XP_001963198.1| GF19727 [Drosophila ananassae]
gi|190616895|gb|EDV32419.1| GF19727 [Drosophila ananassae]
Length = 449
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 164/288 (56%), Gaps = 19/288 (6%)
Query: 46 VAARLDSKEGESFRTSIRKYSLRGNLGE-----------SGDADIVA---LKNYMDAQYF 91
+ ++ E R IR +L NL S D + A L N + Y+
Sbjct: 9 LVGQVSQSMAELKRIEIRPRNLTHNLQSEILLLKAKYLSSADESVEAKEILVNAANFAYY 68
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSSTYKKNGKSADI 150
GEI IGTPPQNF+V+FDTGSSN WVPSS C S +AC H++Y+S SSTY G + I
Sbjct: 69 GEISIGTPPQNFSVLFDTGSSNTWVPSSLCPASDVACQSHNQYKSSASSTYVPVGTNISI 128
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YGTG++ GF S D V+I L V +Q F EAT EP F FDGILGLGF +S G
Sbjct: 129 VYGTGSMEGFLSNDTVRIAGLNVTNQTFAEATAEPDGFFDSQPFDGILGLGFNTLSNGIN 188
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
PV NM+ QGL+++P FS + RN GGEI++GG DP Y G TYVPV+ YWQ
Sbjct: 189 TPV-DNMIAQGLLDKPEFSVYLRRNGSSLIGGEIIWGGTDPSIYHGSITYVPVSVPQYWQ 247
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
F + I+GQ C GC AIAD+GTSL+ P T +N + AT
Sbjct: 248 FTVDTGTINGQI--LCR-GCQAIADTGTSLIIVPKRAFTAINKQLNAT 292
>gi|253762219|gb|ACT35561.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 114/258 (44%), Positives = 165/258 (63%), Gaps = 17/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IG+PPQ+F+VIFDTGSSNLW+PS C S AC H ++ + +T+
Sbjct: 61 MTNDADLSYYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHRRFNPQQPTTF 119
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G+ + D V++G + V +Q F I T P + + A DGILGL
Sbjct: 120 KWGNQPLSIQYGTGSMTGYLAIDTVEVGGISVANQVFGISRTEAPFMAHMQA--DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV QGLV++P+FS + + N+ E+G E+VFGG+D HY G+ T+
Sbjct: 178 AFQTIASDNVVPVFDNMVKQGLVSQPLFSVYLSSNS--EQGSEVVFGGIDSSHYTGQITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ YWQ M V I+GQT C+GGC AI D+GTSL+ GPT+ I +N +GA+
Sbjct: 236 IPLSSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPTSDINNMNAWVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE +++
Sbjct: 293 --------TNQYGEAVVS 302
>gi|332267172|ref|XP_003282561.1| PREDICTED: pepsin A-5 [Nomascus leucogenys]
Length = 372
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 116/257 (45%), Positives = 162/257 (63%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++
Sbjct: 48 DEQPLENYLDMEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 106
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 107 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 166
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 167 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYSGS 224
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 225 LNWVPVTVEGYWQITVDSITMNGETIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 283
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 284 SENSDGDMVVSCSAISS 300
>gi|351707910|gb|EHB10829.1| Gastricsin [Heterocephalus glaber]
Length = 391
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 112/283 (39%), Positives = 167/283 (59%), Gaps = 15/283 (5%)
Query: 49 RLDSKEGESFRTSIRKYSLRGNLGESGDAD-IVALKN-------------YMDAQYFGEI 94
++ K+ +S R ++R L G+ ++ D I L N Y+DA YFGEI
Sbjct: 20 KIPLKKFKSIRETMRDKGLLGDFLKTHKQDHIRKLSNNFDHFSVLFEPMSYLDAAYFGEI 79
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
+GTPPQ+F V+FDTGSSNLWVPS C S+AC H ++ +SSTY G+S + YG+
Sbjct: 80 SLGTPPQSFQVLFDTGSSNLWVPSVYCQ-SLACTTHPRFNPSKSSTYTSTGQSFSLQYGS 138
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G+++G F D + I V QEF + +EP TF+ A+FDGI+GLG+ ++ G A
Sbjct: 139 GSLTGVFGYDTMTIQGTQVPKQEFGLSEQEPGTTFVYAQFDGIMGLGYPGLAAGGATTAL 198
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
++ +G +++P+FS + +GG ++ GG+D Y G+ ++ PVTQ+ YWQ +
Sbjct: 199 QGLIREGALSQPLFSVYLGSQQGSSDGGALILGGVDESLYNGQISWTPVTQELYWQIGIE 258
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
DV +D Q G+C+ GC I D+GTSLL P +T + AIGA
Sbjct: 259 DVQLDNQALGWCSQGCQGIVDTGTSLLTLPQQYLTTLIQAIGA 301
>gi|195473093|ref|XP_002088830.1| GE10927 [Drosophila yakuba]
gi|194174931|gb|EDW88542.1| GE10927 [Drosophila yakuba]
Length = 372
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 119/276 (43%), Positives = 166/276 (60%), Gaps = 16/276 (5%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESF-RTSIRKYSLRGNLGESGDADIVALKNYMDA 88
L+R+ + K + N V R + +S+ RT + SLR D L N M+
Sbjct: 19 LHRVPIFKEQ----NFVKTRQNVLAEKSYLRTKYQLPSLRS-------VDEEQLSNSMNM 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTP Q+F V+FD+GS+NLWVPS+ C S AC H++Y S SSTY NG+S
Sbjct: 68 AYYGAISIGTPAQSFKVLFDSGSANLWVPSNTCQ-SDACQTHNQYDSSASSTYVANGESF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G+ S D V + L ++ Q F E+T EP F A FDGILG+ ++ ++V
Sbjct: 127 SIQYGTGSLTGYLSTDTVDVNGLSIESQTFAESTNEPGTNFNDANFDGILGMAYESLAVD 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
P +YNMV+Q LV+ VFSF+ R+ GGE++ GG D Y G TYVP++++GY
Sbjct: 187 GVAPPFYNMVSQDLVDSSVFSFYLARDGTSTFGGELILGGSDSSLYSGSLTYVPISEQGY 246
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
WQF M +DG++ C C AIAD+GTSL+ P
Sbjct: 247 WQFTMDGSSVDGES--LC-DDCQAIADTGTSLIVAP 279
>gi|403261257|ref|XP_003923041.1| PREDICTED: gastricsin [Saimiri boliviensis boliviensis]
Length = 388
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 184/329 (55%), Gaps = 30/329 (9%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
F+CL L + ++ LKK K + R KE R ++ + R G
Sbjct: 8 FVCLQLL-------EAAVVKVPLKKFK-------SIRETMKEKGLLREFLKTHK-RDPAG 52
Query: 73 ESGDADIVA---LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYF 129
+ +D+ +YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC
Sbjct: 53 KYHFSDLSVSYEPMDYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTS 111
Query: 130 HSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTF 189
HS++ SSTY NG++ + YG+G+++G F D + + + V +QEF + EP F
Sbjct: 112 HSRFNPSASSTYSSNGQTFSLQYGSGSLTGLFGYDTLTVQSIQVPNQEFGLSENEPGTNF 171
Query: 190 LLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGM 249
+ A+FDGI+GL + +SVG A M+ + ++ PVFSF+ + N GG +VFGG+
Sbjct: 172 IYAQFDGIMGLAYPALSVGGATTAMQGMLQEDVLTSPVFSFYLS-NQQGSSGGAVVFGGV 230
Query: 250 DPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIIT 309
D Y G+ + PVTQ+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++
Sbjct: 231 DSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMS 290
Query: 310 QVNHAIGATGIVSQECKAVVSQYGEEIIN 338
A GA QE +YG+ ++N
Sbjct: 291 AFLEATGA-----QE-----DEYGQFLVN 309
>gi|194759256|ref|XP_001961865.1| GF15184 [Drosophila ananassae]
gi|190615562|gb|EDV31086.1| GF15184 [Drosophila ananassae]
Length = 403
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 178/309 (57%), Gaps = 25/309 (8%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGES--------------- 74
++RI + K + ++ VA + K + K+ LR +GE+
Sbjct: 18 IHRIKIHKTQPKKSHHVARHIAKK----VAIHVLKHELRHWVGETFNYDDGYDYSNDYPN 73
Query: 75 GDADIV--ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHS 131
D D L N M+ Y+GEI IGTPPQ F V+FDTGS+NLW+PS +C + +AC H+
Sbjct: 74 QDTDYTNEELGNSMNMYYYGEISIGTPPQYFNVVFDTGSANLWIPSVQCLSTDVACQQHN 133
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
+Y S SST+ ++ I YGTG++ G+ + D V I L + Q F EA +P +F
Sbjct: 134 QYNSSASSTFVAVNENFTIEYGTGSVKGYLATDTVTINGLAITGQTFGEAISQPGSSFTD 193
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
+FDGILG+GFQ+I++ VP +YN+ QGL+++PVF F+ RN +EGG++ GG D
Sbjct: 194 VEFDGILGMGFQQIAIDYVVPPFYNLYEQGLIDQPVFGFYLARNGTSDEGGQLTLGGTDY 253
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+ G+ TYVPVT++GYWQF + + +G +G AIAD+GTSL+ P T++
Sbjct: 254 NLIDGDLTYVPVTKQGYWQFAVNQITWNGTV---VSGPVQAIADTGTSLIVVPADAYTKI 310
Query: 312 NHAIGATGI 320
N IGA I
Sbjct: 311 NELIGAIYI 319
>gi|149725191|ref|XP_001501954.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 125/293 (42%), Positives = 178/293 (60%), Gaps = 30/293 (10%)
Query: 62 IRKYSLRGNLGESG--------------------DADIVA----LKNYMDAQYFGEIGIG 97
++K SLR NL E+G +A +A L+NYMD +YFG I IG
Sbjct: 23 VKKKSLRQNLRENGLLADFLKQHPRNPASKYFPKEAATLAATEGLENYMDEEYFGTISIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TP Q FTVIFDTGSSNLWVPS+ C S+AC H+++ SSTY+ +S I YGTG++
Sbjct: 83 TPAQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESVSITYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + EPS A FDGILGL + IS A PV+ N+
Sbjct: 142 TGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISASGATPVFDNI 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
+QGLV++ +FS + ++D+E G ++FGG+D +Y G +VPV+++ YWQ + +
Sbjct: 202 WDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQITVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVS 330
++G++ C+GGC AI D+GTSLLAGPT+ I + IGA+ S E AV+S
Sbjct: 260 MNGESIA-CSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDSSGE--AVIS 309
>gi|354487263|ref|XP_003505793.1| PREDICTED: renin-like [Cricetulus griseus]
Length = 403
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 113/239 (47%), Positives = 156/239 (65%), Gaps = 4/239 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S S
Sbjct: 75 VILTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYSACEIHSLYDSSES 134
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 135 SSYMENGTEFTIHYGSGKVKGFLSQDIVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 193
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +VG PV+ ++++Q ++ E VFS +++R++ GGE+V GG DP HY+G
Sbjct: 194 GMGFPAQAVGGVTPVFDHILSQRVLKEEVFSVYYSRDS-HLLGGEVVLGGSDPQHYQGNF 252
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
YV V++ G W+ M V + G T C GC + D+G S ++GPT+ + + +GA
Sbjct: 253 HYVSVSRTGSWEIAMKGVSV-GSATLLCEEGCVVVVDTGASYISGPTSSLKLIMQTLGA 310
>gi|328771090|gb|EGF81130.1| hypothetical protein BATDEDRAFT_16209 [Batrachochytrium
dendrobatidis JAM81]
Length = 400
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 173/290 (59%), Gaps = 15/290 (5%)
Query: 33 IGLKKRKFD------LNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYM 86
I LKKR L + V + ++ TS +Y L G S V L ++
Sbjct: 23 IKLKKRPHTQDTLNALFSNVQSVYSNRLAFQSETSEDQYILGGGAEHS-----VPLTDFA 77
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
+AQYFGEI IGTPPQ FTVIFDTGSSNLWVPS++C SIAC+ H +Y + SSTY NG
Sbjct: 78 NAQYFGEIQIGTPPQPFTVIFDTGSSNLWVPSTRCS-SIACWMHRRYDASESSTYVNNGT 136
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
I YGTGA+ G S+D V IG L +++Q F E+ +EP +TF + +FDGILGLGF IS
Sbjct: 137 EFAIQYGTGALEGVISQDTVTIGGLTIENQGFGESVKEPGITFAVGRFDGILGLGFDTIS 196
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
V K VP YN++N ++ P+F W ++ EE GEIVFG ++ DH+KG T+VPV +K
Sbjct: 197 VQKVVPPMYNLINNHQLDTPLFGVWLGSSSGEEG-GEIVFGAVNHDHFKGAVTWVPVVRK 255
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YW+ ++ V I G+ + A D+G+SL A P +N +G
Sbjct: 256 AYWEVELEGVTIGGKKLAIKSSRAA--IDTGSSLFALPVAEADAINGILG 303
>gi|257215794|emb|CAX83049.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 240
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 102/181 (56%), Positives = 132/181 (72%), Gaps = 1/181 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSST 140
LKNY+DAQY+G+I IGTPPQ F+V+FDTGSSNLWVPS C YF IAC H KY S +S+T
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHRKYDSSKSTT 117
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++SGF S D +++G L VK Q F EAT++P L F++AKFDGILG+
Sbjct: 118 YVPNGTDFSIRYGTGSLSGFLSTDSLQLGSLGVKGQTFGEATKQPGLVFVMAKFDGILGM 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ++VG PV+ NM+ QG+V+ PVFSF+ +RN GGE++ GG+D +Y GE Y
Sbjct: 178 AYPSLAVGGVTPVFVNMIKQGVVDSPVFSFYLSRNITNVLGGELMIGGIDDKYYTGEINY 237
Query: 261 V 261
V
Sbjct: 238 V 238
>gi|1705600|sp|P10977.3|CARPV_CANAL RecName: Full=Vacuolar aspartic protease; AltName: Full=ACP;
AltName: Full=Aspartate protease; Flags: Precursor
gi|1039462|gb|AAA79879.1| vacuolar aspartic proteinase precursor [Candida albicans]
Length = 419
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 154/243 (63%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 96 LTNYLNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCT-SLACFLHAKYDHDASSTY 154
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 155 KVNGSEFSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKFDGILGLA 214
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL+ +P F F+ + + DE +GG FGG D ++G+ T+
Sbjct: 215 YDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQGKITW 274
Query: 261 VPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+P+ +K YW+ +GD + TG AAI D+GTSL+ P+++ +N I
Sbjct: 275 LPIRRKAYWEVSFEGIGLGDEYAELHKTG------AAI-DTGTSLITLPSSLAEIINAKI 327
Query: 316 GAT 318
GAT
Sbjct: 328 GAT 330
>gi|326933881|ref|XP_003213026.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 389
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 121/304 (39%), Positives = 174/304 (57%), Gaps = 12/304 (3%)
Query: 17 LLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEG---ESFRTSIRKYSLRGNLGE 73
L+F V+ GL R+ LKK K + R KE + + R Y
Sbjct: 4 LIFTVLCLHLCEGLLRVPLKKGK-------SIREVMKESGVLHDYLANHRYYDPAYKFFS 56
Query: 74 SGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKY 133
+ L N MD Y+GEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+++
Sbjct: 57 NFATAYEPLANSMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQ-SQACANHNEF 115
Query: 134 RSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAK 193
SST+ + + YG+G+++G F D V I + + +QEF + EP FL +
Sbjct: 116 NPNESSTFSTQNEFFSLQYGSGSLTGIFGFDTVTIQGISITNQEFGLSETEPGTNFLYSP 175
Query: 194 FDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDH 253
FDGILGL F IS G A V M+ + L++ P+FSF+ + + +GGE++FGG++P+
Sbjct: 176 FDGILGLAFPAISAGGATTVMQQMLQENLLDSPIFSFYLS-GQEGSQGGELIFGGVNPNL 234
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
Y G+ ++ PVTQ YWQ + D + GQ++G+C+ GC AI D+GTSLL P + +++
Sbjct: 235 YTGQISWTPVTQTTYWQIGIEDFTVGGQSSGWCSQGCQAIVDTGTSLLTVPNQVFSELMQ 294
Query: 314 AIGA 317
IGA
Sbjct: 295 YIGA 298
>gi|407726059|dbj|BAM46127.1| pepsinogen C [Cynops pyrrhogaster]
Length = 385
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 152/253 (60%), Gaps = 6/253 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWV S+ C S AC H + +SSTY
Sbjct: 60 LTNYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS-SSACTNHPLFNPSQSSTY 118
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ I YGTG+++G D V I L + QEF + EP F+ A+FDGILGL
Sbjct: 119 STENQQFSIQYGTGSLTGILGYDTVSIQGLSITQQEFALSINEPGSNFVYAQFDGILGLA 178
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+ A V M+NQGL+++ +F F+ + + GGE++FGG+D ++Y GE T+
Sbjct: 179 YPSIAADGATTVMEGMMNQGLLSQNIFGFYMSEEG-TQPGGELIFGGVDSNYYTGEITWT 237
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVTQ+ YWQ + ++GQ TG+C+ GC I D+GTSLL P + + IGAT
Sbjct: 238 PVTQQMYWQIGIQGFAVNGQETGWCSQGCQGIVDTGTSLLTAPGQYMAALMQDIGATLDS 297
Query: 319 -GIVSQECKAVVS 330
G C +V S
Sbjct: 298 SGQYVVTCSSVTS 310
>gi|426368717|ref|XP_004051349.1| PREDICTED: pepsin A-5-like isoform 2 [Gorilla gorilla gorilla]
Length = 388
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 171/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+T CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGETIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|68485827|ref|XP_713194.1| hypothetical protein CaO19.9447 [Candida albicans SC5314]
gi|68485920|ref|XP_713148.1| hypothetical protein CaO19.1891 [Candida albicans SC5314]
gi|46434627|gb|EAK94031.1| hypothetical protein CaO19.1891 [Candida albicans SC5314]
gi|46434674|gb|EAK94077.1| hypothetical protein CaO19.9447 [Candida albicans SC5314]
Length = 419
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 117/243 (48%), Positives = 154/243 (63%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 96 LTNYLNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCT-SLACFLHAKYDHDASSTY 154
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 155 KVNGSEFSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKFDGILGLA 214
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL+ +P F F+ + + DE +GG FGG D ++G+ T+
Sbjct: 215 YDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQGKITW 274
Query: 261 VPVTQKGYWQ-----FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+P+ +K YW+ +GD + TG AAI D+GTSL+ P+++ +N I
Sbjct: 275 LPIRRKAYWEVLFEGIGLGDEYAELHKTG------AAI-DTGTSLITLPSSLAEIINAKI 327
Query: 316 GAT 318
GAT
Sbjct: 328 GAT 330
>gi|125986537|ref|XP_001357032.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
gi|54645358|gb|EAL34098.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
Length = 408
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 114/262 (43%), Positives = 159/262 (60%), Gaps = 16/262 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI-ACYFHSKYRSGRSST 140
L N + +Y+ + IGTPPQ F ++ DTGSSNLWVPSSKC ++ +C H++Y S SS+
Sbjct: 81 LGNAFNTEYYLPVTIGTPPQEFILLIDTGSSNLWVPSSKCPATVKSCVSHNQYDSKSSSS 140
Query: 141 YKKNGKSADIHYGTG-----AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
Y NG + I Y + A+SG S+D V I +L ++ Q F E T EP TFL + FD
Sbjct: 141 YVANGTAFTIEYASKSEGGVALSGILSQDTVTIAELAIQRQVFAEITDEPEPTFLSSPFD 200
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRN-ADEEEGGEIVFGGMDPDHY 254
G+ GLG+ IS+G P +YN+V QGL+ PVFS + NR+ + +GGE+V GG+D +
Sbjct: 201 GMFGLGYASISIGGVTPPFYNLVAQGLIKHPVFSIYLNRSGTNATDGGELVLGGIDATLF 260
Query: 255 KGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHA 314
G TYVPV+Q+GYWQF M ++ G+T FC C AI D GTSLL PT I ++N
Sbjct: 261 SGCLTYVPVSQQGYWQFVMTSAVLGGKT--FCT-HCQAILDVGTSLLVAPTAAIKKINQL 317
Query: 315 IG------ATGIVSQECKAVVS 330
+ ++G+ C + S
Sbjct: 318 LAVLNPQDSSGVFLVNCSTIAS 339
>gi|426368715|ref|XP_004051348.1| PREDICTED: pepsin A-5-like isoform 1 [Gorilla gorilla gorilla]
Length = 388
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 116/257 (45%), Positives = 162/257 (63%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++
Sbjct: 64 DEQPLENYLDMEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGETIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|432103960|gb|ELK30793.1| Gastricsin [Myotis davidii]
Length = 390
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 170/283 (60%), Gaps = 15/283 (5%)
Query: 49 RLDSKEGESFRTSIRKYS-----LRGNLGESGDA------DIVA---LKNYMDAQYFGEI 94
R+ K+G+S R ++ + L+ + E A D VA + NY+DA YFGEI
Sbjct: 19 RIILKKGKSIRQTMEEKGVLEKFLKNHRKEDPAAKYHFNNDAVAYEPITNYLDAFYFGEI 78
Query: 95 GIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+++ SST++ NG++ + YG+
Sbjct: 79 SIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACSNHNRFNPSLSSTFRNNGQTYTLSYGS 137
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G++S D V + ++VV +QEF + EP+ F + FDGILG+ + ++VG A V
Sbjct: 138 GSLSVVLGYDTVTVQNIVVNNQEFGLSENEPNDPFYYSDFDGILGMAYPNMAVGDAPTVM 197
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMG 274
M+ QG + P+FSF+F+R + GGE++ GG+D Y G+ + PVTQ+ YWQ +
Sbjct: 198 QGMLQQGQLTLPIFSFYFSRQPTRQYGGELILGGVDQQLYSGQIVWAPVTQELYWQIAIQ 257
Query: 275 DVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ I Q TG+C+ GC AI D+GT LLA P + A GA
Sbjct: 258 EFAIGDQATGWCSQGCQAIVDTGTFLLAVPQQYMGSFLQATGA 300
>gi|5921649|gb|AAD56283.1|AF156787_1 pepsinogen A form IIa [Pseudopleuronectes americanus]
Length = 378
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 114/258 (44%), Positives = 163/258 (63%), Gaps = 16/258 (6%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F V+FDTGSSNLW+PS C S AC H KY G SST
Sbjct: 63 SMTNDADLAYYGIISIGTPPQSFKVVFDTGSSNLWIPSIYCS-SAACNNHDKYNPGLSST 121
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
YKKNG S I YGTG+++G D V +G L VK+Q F + E + + + DGILGL
Sbjct: 122 YKKNGGSLSIQYGTGSMTGILGYDTVTVGGLAVKNQIFGLSQSEATFMQYM-QADGILGL 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ +S A PV+ NM+ +GLV++ +FS + + NA ++G + FGG+DP+HY G T+
Sbjct: 181 AYPSLSASGATPVFDNMMTEGLVDQDLFSVYLSPNA--QQGSVVTFGGVDPNHYNGAITW 238
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ + YWQ + V ++GQ C+GGC AI D+GTSL+ GP + I+ +N+ +GAT
Sbjct: 239 IPLSSELYWQITVDSVTVNGQVVA-CSGGCQAIVDTGTSLIVGPQSSISNINNYVGAT-- 295
Query: 321 VSQECKAVVSQYGEEIIN 338
SQ G+ ++N
Sbjct: 296 ---------SQNGDYVVN 304
>gi|158257160|dbj|BAF84553.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/298 (41%), Positives = 171/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ IT + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPITNIQSDIGASENSDGDMVVSCSAISS 316
>gi|344295434|ref|XP_003419417.1| PREDICTED: pepsin A-2/A-3-like [Loxodonta africana]
Length = 384
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 111/239 (46%), Positives = 154/239 (64%), Gaps = 4/239 (1%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q FTVIFDTGSSNLWVPS+ C S+AC H+++
Sbjct: 62 DTQTLENYLDVEYFGTIGIGTPAQEFTVIFDTGSSNLWVPSTYCS-SLACTNHNRFNPDD 120
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D VK+G + +Q F + EP + FDGI
Sbjct: 121 SSTYRSTSETVSITYGTGSMTGILGYDTVKVGGISDTNQIFGLSETEPGSFLYYSPFDGI 180
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ +QGLV++ +FS + ++DEE G ++FGG+D +Y G
Sbjct: 181 LGLAYPSISSSDATPVFDNIWDQGLVSQDLFSVYL--SSDEEGGSVVIFGGIDSSYYTGS 238
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
+VPV+ +GYWQ + V IDG++ C+ C AI D+GTSLLAGPTT I + +G
Sbjct: 239 LNWVPVSYEGYWQITLDSVSIDGESVA-CSDTCQAIIDTGTSLLAGPTTAIANIQEYLG 296
>gi|426250269|ref|XP_004018860.1| PREDICTED: gastricsin [Ovis aries]
Length = 431
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 147/234 (62%), Gaps = 1/234 (0%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H ++ SSTY
Sbjct: 68 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSLYCQ-SQACTSHPRFNPSLSSTYSS 126
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
N ++ + YG+G+++G D + + + V +QEF + EP FL AKFDGI+G+ +
Sbjct: 127 NEQTFSLQYGSGSLTGLLGYDTLTVQGIQVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYP 186
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+SV A V MV +G + P+FSF+ + ++GG ++FGG+D Y G+ + PV
Sbjct: 187 SLSVDGATTVLQGMVQEGALTSPIFSFYLSSQQGSQDGGAVIFGGVDSRLYTGQIYWAPV 246
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TQ+ YWQ + + +I Q TG+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 247 TQELYWQIGIEEFLIGDQATGWCSAGCQAIVDTGTSLLTVPQQFLSALLQATGA 300
>gi|213688|gb|AAA49530.1| pepsinogen [Rana catesbeiana]
Length = 384
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 153/236 (64%), Gaps = 1/236 (0%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD Y+GEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H ++ +SS+Y
Sbjct: 59 LANYMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACTNHPQFNPSQSSSY 117
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
N + + YGTG+++G D V+I ++ + QEF + EP F+ A+FDGILGL
Sbjct: 118 SSNQQQFSLQYGTGSLTGILGYDTVQIQNIAISQQEFGLSVTEPGTNFVYAQFDGILGLA 177
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ I+ G A V M+ Q L+N+P+F+F+ + + + GGE+ FGG+D ++Y G+ +
Sbjct: 178 YPSIAEGGATTVMQGMIQQNLINQPLFAFYLSGQQNSQNGGEVAFGGVDQNYYSGQIYWT 237
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
PVT + YWQ + ++GQ TG+C+ GC I D+GTSLL P ++ + + +IGA
Sbjct: 238 PVTSETYWQIGIQGFSVNGQATGWCSQGCQGIVDTGTSLLTAPQSVFSSLMQSIGA 293
>gi|12248414|dbj|BAB20092.1| pepsinogen A [Rana catesbeiana]
Length = 385
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/282 (40%), Positives = 164/282 (58%), Gaps = 17/282 (6%)
Query: 49 RLDSKEGESFRTSIRKYSLRGN-------------LGESGDADIVALKNYMDAQYFGEIG 95
++ ++GES R + + L G+ A L+NYMD +YFG I
Sbjct: 19 KVSLRKGESLRARLNRLGLLGDYLKKHHYNPATKYFPSLAQASGEPLQNYMDIEYFGTIS 78
Query: 96 IGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
IGTPPQ+FTVIFDTGSSNLWVPS C S AC H + +SST++ I YGTG
Sbjct: 79 IGTPPQSFTVIFDTGSSNLWVPSVYCS-SPACTNHHMFNPQQSSTFQATNTPVSIQYGTG 137
Query: 156 AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWY 215
++SGF D V++G++ + +Q F + EP + FDGILGL F ++ +A PV+
Sbjct: 138 SMSGFLGYDTVQVGNIQITNQIFGLSQSEPGSFLYYSPFDGILGLAFPSLASSQATPVFD 197
Query: 216 NMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGD 275
NM NQGL+ + +FS + + + G ++FGG+D +Y G +VP+T + YWQ +
Sbjct: 198 NMWNQGLIPQDLFSVYLSSQG--QSGSFVLFGGVDTSYYTGNLNWVPLTAETYWQITVDS 255
Query: 276 VMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ I GQ C+G C+AI D+GTSLLAGP+T I + + IGA
Sbjct: 256 ISIGGQVIA-CSGSCSAIVDTGTSLLAGPSTPIANIQYYIGA 296
>gi|126306831|ref|XP_001370729.1| PREDICTED: renin-like [Monodelphis domestica]
Length = 389
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 177/296 (59%), Gaps = 15/296 (5%)
Query: 23 FSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
FS P+ GL RI LKK ++ + + ++ K E+ + + G + L
Sbjct: 16 FSLPSDGLQRIALKKM---ISVKESMKMRGKHLENLNMAENSWH--------GVVSPIIL 64
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTY 141
NY D QY+GEI IG+PPQ F V+FDTGSS+ WVPSS+C AC FH++Y + +SSTY
Sbjct: 65 TNYEDTQYYGEINIGSPPQTFKVVFDTGSSDFWVPSSQCDPLYTACEFHNRYDASKSSTY 124
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG + IHY +G + GF S+D + IG++ V Q F E T P + F LA FDGILGLG
Sbjct: 125 KMNGSNFIIHYASGRVKGFLSQDILTIGEIKVT-QVFGEVTALPLIPFGLAWFDGILGLG 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ + S+ PV+ N++ +G++ E VFS +++R++ + GGE++ GG DP++Y+G Y+
Sbjct: 184 YPKRSMSGITPVFDNIMAEGVLKEDVFSIYYSRSSG-KNGGELILGGSDPNYYQGTFHYI 242
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
++ +WQ M V + C GC A+ D+GTS + GPT I + AIGA
Sbjct: 243 NTSRPHFWQIQMQGVAVKSYVLS-CEDGCPAVVDTGTSFITGPTDSIRGLMTAIGA 297
>gi|344302714|gb|EGW32988.1| hypothetical protein SPAPADRAFT_60314 [Spathaspora passalidarum
NRRL Y-27907]
Length = 417
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/237 (46%), Positives = 148/237 (62%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI +GTP Q+F VI DTGSSNLW+PS C S+AC+ HSKY SSTY
Sbjct: 94 LTNYLNAQYFTEISVGTPAQSFKVILDTGSSNLWIPSKDCT-SLACFLHSKYDHDASSTY 152
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IG L + Q+F EAT EP L F KFDGILGL
Sbjct: 153 KANGTEFSIQYGSGSMEGYISQDVLNIGGLQIPKQDFAEATSEPGLAFAFGKFDGILGLA 212
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV K VP YN +NQGL++EP F F+ + + DE +GG FGG D G+ T+
Sbjct: 213 YDTISVNKIVPPVYNAINQGLLDEPQFGFYLGDTSKDENDGGVATFGGYDKSLIDGKITW 272
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+PV +K YW+ + + + G A D+GTSL+ P+++ +N IGA
Sbjct: 273 LPVRRKAYWEVSFEGIGLGDEYAELSKTGAA--IDTGTSLITLPSSLAEIINAKIGA 327
>gi|219521036|gb|AAI71897.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/298 (41%), Positives = 171/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+T CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGETIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|23943854|ref|NP_055039.1| pepsin A-5 preproprotein [Homo sapiens]
gi|378522017|sp|P0DJD9.1|PEPA5_HUMAN RecName: Full=Pepsin A-5; AltName: Full=Pepsinogen-5; Flags:
Precursor
gi|20810074|gb|AAH29055.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|119594334|gb|EAW73928.1| pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|219520836|gb|AAI71889.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|223461673|gb|AAI47000.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/298 (41%), Positives = 171/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+T CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGETIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|57164325|ref|NP_001009299.1| renin precursor [Ovis aries]
gi|1710090|sp|P52115.1|RENI_SHEEP RecName: Full=Renin; AltName: Full=Angiotensinogenase; Flags:
Precursor
gi|896318|gb|AAA69809.1| renin [Ovis aries]
Length = 400
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 112/241 (46%), Positives = 153/241 (63%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S S
Sbjct: 70 VVLTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSLES 129
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG I+YG+G + GF S+D V +G + V Q F E T P F+LAKFDG+L
Sbjct: 130 SSYVENGTEFTIYYGSGKVKGFLSQDLVTVGGITVT-QTFGEVTELPLRPFMLAKFDGVL 188
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKG 256
G+GF +VG PV+ +++ Q ++ E VFS +++R++ GGEIV GG DP +Y+
Sbjct: 189 GMGFPAQAVGGVTPVFDHILAQRVLTEDVFSVYYSRDSKNSHLLGGEIVLGGSDPQYYQE 248
Query: 257 EHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YV +++ G WQ M V + TT C GC + D+G S ++GPT+ + + A+G
Sbjct: 249 NFHYVSISKPGSWQIRMKGVSVR-STTLLCEEGCMVVVDTGASYISGPTSSLRLLMEALG 307
Query: 317 A 317
A
Sbjct: 308 A 308
>gi|290974880|ref|XP_002670172.1| predicted protein [Naegleria gruberi]
gi|284083728|gb|EFC37428.1| predicted protein [Naegleria gruberi]
Length = 388
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/309 (38%), Positives = 178/309 (57%), Gaps = 15/309 (4%)
Query: 31 YRIGLKKRKFD--------LNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVAL 82
+ I L+KR +D L+ + + +K ++ R + + +L + G IV L
Sbjct: 11 FEIPLRKRAYDDARAASVKLSKMRSIQTQNKLRSLLYSATRLATTKPSL-KVGATPIVPL 69
Query: 83 KNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYK 142
K+Y D +Y+GEI IGTP Q F V+FDTGSSNLWVPS C ++C H++Y +SSTY
Sbjct: 70 KDYDDVEYYGEITIGTPAQTFKVVFDTGSSNLWVPSVACK-DLSCVRHARYNHTKSSTYV 128
Query: 143 KNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGF 202
NG+S +I YGTGA+ G S D V +G L +K Q F E T E + TFL AK DGI G F
Sbjct: 129 PNGQSFNITYGTGAVKGILSSDTVVVGGLAIKGQVFGETTNEYTDTFLNAKIDGICGFAF 188
Query: 203 QEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVP 262
I+V PV+ N++ Q LV++ +FSF+ ++ A ++ GG++ +Y G +YVP
Sbjct: 189 PNIAVDGVTPVFNNLMKQRLVDKNIFSFYMSKKAG-SGASAMILGGINSKYYTGSFSYVP 247
Query: 263 VTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVS 322
+ Q YW + D+ ++GQ C GC AI D+GTSL+AG ++ + + + V+
Sbjct: 248 LIQHNYWSIALDDIAMNGQGQSLCGFGCMAIVDTGTSLIAGTPDVMQPIINQLN----VA 303
Query: 323 QECKAVVSQ 331
++C + S
Sbjct: 304 EDCSNIDSN 312
>gi|89111566|dbj|BAE80442.1| pepsinogen B isozyme [Canis lupus familiaris]
Length = 374
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 107/245 (43%), Positives = 154/245 (62%), Gaps = 4/245 (1%)
Query: 76 DADIVALK---NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
+ D VA + NY+++ YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+
Sbjct: 41 NNDAVAYEPFTNYLNSYYFGEISIGTPPQNFLVVFDTGSSNLWVPSTYCQ-SQACSNHNT 99
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ SSTY+ NG++ ++YG+G+++ D V + ++V+ +QEF + EPS F A
Sbjct: 100 FNPSSSSTYRNNGQTYTLYYGSGSLTVLLGYDTVTVQNIVINNQEFGLSEIEPSNPFYYA 159
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
FDGILG+ + ++VG + V +MV QG + +P+FSF+F+R E GGE++ GG+D
Sbjct: 160 NFDGILGMAYPNLAVGDSPTVMQSMVQQGQLTQPIFSFYFSRQPTYEYGGELILGGVDTQ 219
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y GE + PVT++ YWQ + + +I Q TG C+ GC I D+GT L P +
Sbjct: 220 FYSGEIVWAPVTREMYWQVAIDEFLIGNQATGLCSQGCQGIVDTGTFPLTVPQQYLDSFV 279
Query: 313 HAIGA 317
A GA
Sbjct: 280 KATGA 284
>gi|194210206|ref|XP_001488754.2| PREDICTED: renin-like [Equus caballus]
Length = 391
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/292 (41%), Positives = 167/292 (57%), Gaps = 17/292 (5%)
Query: 48 ARLDSKEGESFRTSIRKY------------SLRGNLGESGDADIVALKNYMDAQYFGEIG 95
AR+ ++ S R S+R+ L V L NY+D QY+GEIG
Sbjct: 16 ARIFLRKMPSVRESLRERGVDVSRIGAEWSQFTKRLSRDNSTSPVVLTNYLDTQYYGEIG 75
Query: 96 IGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADIHYGT 154
IGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S SS+Y +NG I YG+
Sbjct: 76 IGTPPQTFKVIFDTGSANLWVPSTKCSPLYAACEIHSLYDSSESSSYMENGTEFTIRYGS 135
Query: 155 GAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVW 214
G + GF S+D V +G + V Q F E T P + F+LAKFDG+LG+GF +VG PV+
Sbjct: 136 GKVKGFLSQDMVTVGGITVT-QTFAEVTELPLIPFMLAKFDGVLGMGFPAQAVGGVTPVF 194
Query: 215 YNMVNQGLVNEPVFSFWFNRNADEEE--GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFD 272
++++Q ++ E VFS +++RN+ GGEIV GG DP +Y+G YV V++ WQ
Sbjct: 195 DHILSQRVLKEDVFSVYYSRNSKNSHLLGGEIVLGGSDPQYYQGNFHYVSVSKTDSWQIK 254
Query: 273 MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
M V + T C GC + D+G S ++GPT+ + + +GA + S E
Sbjct: 255 MKGVSVRSATL-LCEEGCMVVVDTGASYISGPTSSLRLLMETLGAKELSSDE 305
>gi|363743175|ref|XP_003642787.1| PREDICTED: renin-like [Gallus gallus]
Length = 451
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 107/244 (43%), Positives = 153/244 (62%), Gaps = 3/244 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSST 140
L NY+D QY+GEI IGTPPQ F V+FDTGS+NLWVPS KC AC HS+Y S +S T
Sbjct: 124 LTNYLDTQYYGEISIGTPPQTFKVVFDTGSANLWVPSCKCSPLYSACISHSRYDSSKSRT 183
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
Y NG I YGTG++ GF S+D V + D+ + Q F EAT P+ F+ A+FDG+LG+
Sbjct: 184 YIANGTGFAIRYGTGSVKGFLSQDVVMVSDIPII-QVFAEATVLPAFPFIFARFDGVLGM 242
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+ ++ PV+ +++Q ++ E VFS +++RN+ + GGEI+ GG DP +Y G+ Y
Sbjct: 243 GYPSQAIDGITPVFDRILSQQILKEDVFSVYYSRNSPLKPGGEIILGGTDPAYYTGDFHY 302
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+ +++ GYWQ M V + G FC GC+ D+G S + GP ++ + AIGA +
Sbjct: 303 LSISRSGYWQISMKGVSV-GAEMLFCKEGCSVAIDTGASYITGPAGPVSVLMKAIGAAEM 361
Query: 321 VSQE 324
E
Sbjct: 362 TEGE 365
>gi|194759276|ref|XP_001961875.1| GF15194 [Drosophila ananassae]
gi|190615572|gb|EDV31096.1| GF15194 [Drosophila ananassae]
Length = 411
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/242 (46%), Positives = 150/242 (61%), Gaps = 10/242 (4%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSS 139
+L NY + Y + IG PPQ FT++ DTGSSNLWVPS KC + C H+KY S SS
Sbjct: 83 SLSNYYNTAYSVAVEIGNPPQKFTLLIDTGSSNLWVPSIKCPATDRTCANHNKYNSAASS 142
Query: 140 TYKKNGKSADIHYGTG-----AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKF 194
T+ N + +I YG+ A+SGF S+D V I L +K+Q F E T EP FL A +
Sbjct: 143 TFVANNTNFNIDYGSNSGGSVALSGFLSQDTVTIAGLPIKNQVFAEITEEPENPFLNAPY 202
Query: 195 DGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEE-EGGEIVFGGMDPDH 253
DG+LGL + EIS+G P +YN++ QGL+ +PVFS + NRN GGE++ GG+D
Sbjct: 203 DGLLGLAYSEISIGGVTPPFYNLIQQGLIKKPVFSIYLNRNGTSAITGGELILGGIDSGL 262
Query: 254 YKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNH 313
++G TYVPV+ +GYWQF MG I+G T FC C AI D GTSL+ P +T+VN
Sbjct: 263 FRGCLTYVPVSTQGYWQFTMGSADING--TNFC-NKCEAILDVGTSLMVIPQNALTKVNQ 319
Query: 314 AI 315
+
Sbjct: 320 IL 321
>gi|222425194|dbj|BAH20546.1| pepsinogen A-28 [Pongo abelii]
gi|222425196|dbj|BAH20547.1| pepsinogen A-17 [Pongo abelii]
gi|222425202|dbj|BAH20550.1| pepsinogen A-71 [Pongo abelii]
Length = 388
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 161/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++
Sbjct: 64 DEQPLENYLDMEYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|56971217|gb|AAH88066.1| pga5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 382
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 170/288 (59%), Gaps = 17/288 (5%)
Query: 44 NRVAARLDSKEGESFRTSIRKYSLRGNLGES-------------GDADIVALKNYMDAQY 90
+ ++ ++GESFR +++ L G+ + + L+NYMD +Y
Sbjct: 11 SECVVKVPLRKGESFRNRLQRLGLLGDYLKKYPYNPASKYFPTLAQSSAEVLQNYMDIEY 70
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADI 150
+G I IGTPPQ FTVIFDTGS+NLWVPS C S AC H+++ +S+T++ I
Sbjct: 71 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 129
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YGTG++SGF D +++G++ + +Q F + EP + FDGILGL F I+ +A
Sbjct: 130 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 189
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
PV+ NM +QGL+ + +FS + ++D + G ++FGG+D +Y G +VP+T + YWQ
Sbjct: 190 TPVFDNMWSQGLIPQNLFSVYL--SSDGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 247
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ + I+GQ C+ C AI D+GTSL+ GPTT I + + IGA+
Sbjct: 248 ITLDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 294
>gi|395534129|ref|XP_003769100.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Sarcophilus
harrisii]
Length = 391
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 176/313 (56%), Gaps = 22/313 (7%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEG---ESFRTSIRKYSLRGNLGESGDADIVALKNY 85
G +RI LKK K + R KE E F + KY N + + L +Y
Sbjct: 16 GFFRIPLKKGK-------SIRDTMKEKGVLEDFLKT-HKYDPAKNYHFKDFSVALHLPSY 67
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+DA Y+GEI IGTPPQNF V+FDTG SNLWVPS C S AC H+++ +SSTY NG
Sbjct: 68 LDAAYYGEISIGTPPQNFLVLFDTGFSNLWVPSIYCQ-SQACSGHAQFSPSQSSTYSTNG 126
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
++ + YG+G+++GFF D + + + V +Q F + EP F+ A+FDGI+G+ + +
Sbjct: 127 QTFSLQYGSGSLTGFFGYDTITVQGIKVPNQVFGLSENEPGTNFVHAQFDGIMGMAYPAL 186
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG A M+ Q ++ P+FSF+ GGE++FGG+D + Y G+ + PVTQ
Sbjct: 187 AVGGATTALQGMLQQNILTNPIFSFYLGNQQSSXNGGEVIFGGVDNNLYTGQIYWAPVTQ 246
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQEC 325
+ YWQ + + I GQ TG+C+ GC AI D+GTSLL P ++ A GA
Sbjct: 247 ELYWQIGIQEFSIGGQATGWCSQGCQAIVDTGTSLLTVPQQYMSAFLQATGAQ------- 299
Query: 326 KAVVSQYGEEIIN 338
QYG+ +++
Sbjct: 300 ---QDQYGQYVVD 309
>gi|222425192|dbj|BAH20545.1| pepsinogen A-59 [Pongo abelii]
Length = 388
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 161/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++
Sbjct: 64 DEQPLENYLDMEYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|229576947|ref|NP_001153272.1| pepsinogen A precursor [Pongo abelii]
gi|222425188|dbj|BAH20543.1| pepsinogen A-19 [Pongo abelii]
gi|222425190|dbj|BAH20544.1| pepsinogen A-13 [Pongo abelii]
gi|222425204|dbj|BAH20551.1| pepsinogen A-41 [Pongo abelii]
Length = 388
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 161/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++
Sbjct: 64 DEQPLENYLDMEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|1619323|emb|CAA69878.1| aspartic protease [Trematomus bernacchii]
Length = 406
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/254 (43%), Positives = 166/254 (65%), Gaps = 10/254 (3%)
Query: 66 SLRGNLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSI 125
SLR LG S + + N+MDAQYFG+I +GTP QNF+V+FDTGSS+LWVPS+ C S
Sbjct: 65 SLR--LGRSSEK----IYNFMDAQYFGDITLGTPGQNFSVVFDTGSSDLWVPSTYC-VSQ 117
Query: 126 ACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREP 185
AC H ++R+ S++++ +G+ IHYG+G + G + + + K+QEF E+ EP
Sbjct: 118 ACASHKRFRAFESTSFRHDGRMFGIHYGSGHLLGVMGR-QLTVAGMTAKNQEFGESVYEP 176
Query: 186 SLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNA-DEEEGGEI 244
F++A+FDG+LG+G+ ++ PV+ NM+ Q L+++PVFSF+ +R E + GE+
Sbjct: 177 GSAFVMARFDGVLGMGYPALAEILGNPVFDNMLAQKLLDQPVFSFYLSRKQLVEPQRGEL 236
Query: 245 VFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGP 304
+ GG+D Y G ++PVT KGYWQ M +++ G + FC+ GC AI D+GTSL+AGP
Sbjct: 237 LLGGIDEALYNGPINWLPVTAKGYWQIKMDSLVVQG-VSPFCSHGCQAIIDTGTSLIAGP 295
Query: 305 TTIITQVNHAIGAT 318
T I + IGA+
Sbjct: 296 TEDILNIQQLIGAS 309
>gi|28573989|ref|NP_787961.1| CG33128 [Drosophila melanogaster]
gi|7296076|gb|AAF51371.1| CG33128 [Drosophila melanogaster]
gi|17862850|gb|AAL39902.1| LP12231p [Drosophila melanogaster]
gi|220956466|gb|ACL90776.1| CG33128-PA [synthetic construct]
Length = 405
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/240 (45%), Positives = 157/240 (65%), Gaps = 4/240 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G IGIGTP Q F V+FDTGS+NLWVPS++C + +AC H++Y S SST
Sbjct: 85 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSAQCLATDVACQQHNQYNSSASST 144
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+ +G++ I YGTG++SG+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 145 FVSSGQNFSIQYGTGSVSGYLAIDTVTINGLAIANQTFGEAVSQPGASFTDVAFDGILGM 204
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q+I+ VP +YN+ +GL++EPVF F+ RN +GG++ GG D + GE TY
Sbjct: 205 GYQQIAEDNVVPPFYNLYEEGLIDEPVFGFYLARNGSAVDGGQLTLGGTDQNLIAGEMTY 264
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVTQ+GYWQF + ++ +G + GC AIAD+GTSL+A P+ Q+N+ IG I
Sbjct: 265 TPVTQQGYWQFAVNNITWNGTV---ISSGCQAIADTGTSLIAAPSAAYIQLNNLIGGVPI 321
>gi|393214080|gb|EJC99573.1| endopeptidase [Fomitiporia mediterranea MF3/22]
Length = 400
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 153/253 (60%), Gaps = 9/253 (3%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L ++M+ QYF + +G+PPQ F VI DTGSSNLWVPS+KC S C H+KY S SS
Sbjct: 79 VPLSDFMNVQYFTNVTLGSPPQEFRVILDTGSSNLWVPSTKCR-SFGCSMHAKYNSSASS 137
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY++NG I YG+G++ GF S+D V IGDL + Q+F EAT++P F KFDGI G
Sbjct: 138 TYQENGTDIHITYGSGSMEGFVSKDVVTIGDLKIDGQDFAEATKDPGPAFAFGKFDGIFG 197
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
LG+ IS+ P +Y+MVNQGL+ P+FSF F + E++GGE FGG+D Y GE
Sbjct: 198 LGYDTISINHITPPFYSMVNQGLLGAPIFSFRF--GSSEDDGGEATFGGIDESAYTGEIN 255
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA-- 317
Y PV + +W+ ++ F + D+GTSL+ P + ++N IGA
Sbjct: 256 YAPVRSREHWEVELPKYAFGDNE--FILENTGGVIDTGTSLINLPVDVAEKLNAQIGAKK 313
Query: 318 --TGIVSQECKAV 328
TG + +CK V
Sbjct: 314 SRTGQYTVDCKKV 326
>gi|129797|sp|P03955.2|PEPC_MACFU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|38073|emb|CAA42426.1| pepsinogen C [Macaca fuscata]
Length = 377
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 157/254 (61%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 57 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 115
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 116 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPT 175
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV A MV +G + P+FS + + + GG +VFGG+D Y G+ + PVT
Sbjct: 176 LSVDGATTAMQGMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 234
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 235 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 289
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 290 -----DEYGQFLVN 298
>gi|119372302|ref|NP_001073276.1| pepsin A preproprotein [Homo sapiens]
gi|378521995|sp|P0DJD7.1|PEPA4_HUMAN RecName: Full=Pepsin A-4; AltName: Full=Pepsinogen-4; Flags:
Precursor
gi|387012|gb|AAA98529.1| pepsinogen [Homo sapiens]
gi|157170280|gb|AAI52845.1| Pepsinogen 4, group I (pepsinogen A) [synthetic construct]
gi|219520853|gb|AAI71920.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|219521176|gb|AAI71910.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|223462201|gb|AAI50660.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|261860840|dbj|BAI46942.1| pepsinogen 4, group I [synthetic construct]
Length = 388
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|119372298|ref|NP_001073275.1| pepsin A preproprotein [Homo sapiens]
gi|378521956|sp|P0DJD8.1|PEPA3_HUMAN RecName: Full=Pepsin A-3; AltName: Full=Pepsinogen-3; Flags:
Precursor
gi|182887917|gb|AAI60184.1| Pepsinogen 3, group I (pepsinogen A) [synthetic construct]
Length = 388
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWKAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|1246039|gb|AAB35843.1| pepsinogen 2 [tuna, Peptide, 360 aa]
Length = 360
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 112/238 (47%), Positives = 157/238 (65%), Gaps = 7/238 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G + IGTPPQ+F VIFDTGSSNLWVPS C S AC H K+ +SST+
Sbjct: 45 MTNDADLSYYGVVSIGTPPQSFKVIFDTGSSNLWVPSVYCS-SQACQNHKKFNPQQSSTF 103
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G + D V++G + V +Q F I + P + ++ K DGILGL
Sbjct: 104 KWGDQPLSIQYGTGSMTGRLASDIVEVGGISVNNQVFGISQSEAPFMAYM--KADGILGL 161
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV+QGLV++P+FS + + N+ ++G E+VFGG+D HY G+ T+
Sbjct: 162 AFQSIASDNVVPVFDNMVSQGLVSQPLFSVYLSSNS--QQGSEVVFGGIDSSHYTGQITW 219
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+T YWQ M V I+GQT C+GGC AI D+GTSL+ GP+ I +N +GA+
Sbjct: 220 IPLTSATYWQIQMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPSRDIYNMNAWVGAS 276
>gi|238883632|gb|EEQ47270.1| saccharopepsin precursor [Candida albicans WO-1]
Length = 419
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 116/243 (47%), Positives = 153/243 (62%), Gaps = 14/243 (5%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY++AQYF EI IGTP Q F VI DTGSSNLWVPS C S+AC+ H+KY SSTY
Sbjct: 96 LTNYLNAQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCT-SLACFLHAKYDHDASSTY 154
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K NG I YG+G++ G+ S+D + IGDLV+ Q+F EAT EP L F KFDGILGL
Sbjct: 155 KVNGSEFSIQYGSGSMEGYISQDVLTIGDLVIPGQDFAEATSEPGLAFAFGKFDGILGLA 214
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWF-NRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ ISV VP YN +NQGL+ +P F F+ + + DE + G FGG D ++G+ T+
Sbjct: 215 YDTISVNHIVPPIYNAINQGLLEKPQFGFYLGSTDKDENDDGLATFGGYDASLFQGKITW 274
Query: 261 VPVTQKGYWQFD-----MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAI 315
+P+ +K YW+ +GD + TG AAI D+GTSL+ P+++ +N I
Sbjct: 275 LPIRRKAYWEVSFEGIGLGDEYAELHKTG------AAI-DTGTSLITLPSSLAEIINAKI 327
Query: 316 GAT 318
GAT
Sbjct: 328 GAT 330
>gi|148691635|gb|EDL23582.1| progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 179/310 (57%), Gaps = 19/310 (6%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSL----RG 69
+ LL P++ L R+ LKK K + R KE + ++ + +
Sbjct: 6 VALLCLPLL----EAALIRVPLKKMK-------SIRETMKEQGVLKDFLKNHKYDPGQKY 54
Query: 70 NLGESGDADIV-ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
+ G+ GD ++ YMDA Y+GEI IGTPPQNF V+FDTGSSNLWV S C S AC
Sbjct: 55 HFGKFGDYSVLYEPMAYMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACT 113
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H++Y +SSTY G++ + YGTG+++GFF D +++ + V +QEF + EP
Sbjct: 114 THTRYNPSKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTN 173
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ A+FDGI+GL + +S G A M+ +G +++P+F + +GG+IVFGG
Sbjct: 174 FVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQQG-SDGGQIVFGG 232
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC-AGGCAAIADSGTSLLAGPTTI 307
+D + Y GE T++PVTQ+ YWQ + D +I Q +G+C + GC I D+GTSLL P
Sbjct: 233 VDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQY 292
Query: 308 ITQVNHAIGA 317
+ ++ IGA
Sbjct: 293 LNELLQTIGA 302
>gi|12043774|gb|AAG47643.1|AF275939_1 progastricsin [Salvelinus fontinalis]
Length = 387
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 124/326 (38%), Positives = 188/326 (57%), Gaps = 24/326 (7%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
+L ++L V + G++RI L K K + R+ K GE + +L+
Sbjct: 3 YLVIVLVCAVLAE---GIHRIPLVKHK-----SIRERMMEK-GEHL--PYQDPALKYFPD 51
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
E + + + NY D Y+G I IGTPPQ+F V+FDTGS+NLWV S C + AC H+K
Sbjct: 52 EFAGSTTMYINNYADTTYYGAITIGTPPQSFQVLFDTGSANLWVDSVLCN-TQACNTHTK 110
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ +SSTY NG++ + YGTG++SG F D V +G +++ +QE +T EP TF++A
Sbjct: 111 FNPQQSSTYSANGETFYLPYGTGSLSGVFGYDTVNVGGIILTNQEIGLSTDEPGQTFVVA 170
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGL + IS G+ PV N+++Q L+ +F+F+ R D ++G E+ FG +D
Sbjct: 171 QFDGILGLSYPSISAGQETPVMDNIMSQNLLQANLFAFYLTR--DGQQGSELSFGEVDNT 228
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y+G+ + PVT + YWQ + I+GQ TG+C GC AI D+GTS+L P+ I+ +
Sbjct: 229 KYQGQIYWTPVTSQTYWQIGVQGFQINGQETGWCGQGCQAIVDTGTSMLTAPSQIMGTLM 288
Query: 313 HAIGATGIVSQECKAVVSQYGEEIIN 338
+IGA QYG+ +N
Sbjct: 289 QSIGAQQ----------DQYGQYTVN 304
>gi|281183192|ref|NP_001162218.1| gastricsin precursor [Papio anubis]
gi|157939796|gb|ABW05535.1| progastricsin (predicted) [Papio anubis]
Length = 388
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 157/254 (61%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPT 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV A MV +G + P+FS + + + GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDGATTAMQGMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYLSALLQATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|148222689|ref|NP_001079037.1| pepsinogen 4, group I (pepsinogen A) precursor [Xenopus laevis]
gi|12082176|dbj|BAB20798.1| pepsinogen A [Xenopus laevis]
gi|213623162|gb|AAI69372.1| Pepsinogen A [Xenopus laevis]
gi|213626396|gb|AAI69374.1| Pepsinogen A [Xenopus laevis]
Length = 384
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 170/287 (59%), Gaps = 17/287 (5%)
Query: 45 RVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA-------------LKNYMDAQYF 91
++ ++GESFR +++ L G+ + + + L+NYMD +Y+
Sbjct: 14 ECVVKVPLRKGESFRNRLQRLGLLGDYLKKNPYNPASKYFPTLAQSSAETLQNYMDIEYY 73
Query: 92 GEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIH 151
G I IGTPPQ FTVIFDTGS+NLWVPS C S AC H+++ +SST++ I
Sbjct: 74 GTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SQACSNHNRFNPQQSSTFQATNTPVSIQ 132
Query: 152 YGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAV 211
YGTG++SGF D +++G++ + +Q F + EP + FDGILGL F I+ +A
Sbjct: 133 YGTGSMSGFLGYDTLQVGNIQISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQAT 192
Query: 212 PVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQF 271
PV+ NM +QGL+ + +FS + ++D + G ++FGG+D +Y G +VP+T + YWQ
Sbjct: 193 PVFDNMWSQGLIPQNLFSVYL--SSDGQTGSYVLFGGVDNSYYSGSLNWVPLTAETYWQI 250
Query: 272 DMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ V I+GQ C+ C AI D+GTSL+ GP+T I + + IGA+
Sbjct: 251 TLDSVSINGQVIA-CSQSCQAIVDTGTSLMTGPSTPIANIQNYIGAS 296
>gi|296474377|tpg|DAA16492.1| TPA: progastricsin (pepsinogen C) [Bos taurus]
Length = 421
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 147/234 (62%), Gaps = 1/234 (0%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ SSTY
Sbjct: 68 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTRFNHSLSSTYST 126
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
N ++ + YG+G+++G D + + + V +QEF + EP FL AKFDGI+G+ +
Sbjct: 127 NEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYP 186
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+SV A V M+ +G + PVFSF+ + ++GG ++FGG+D Y G+ + PV
Sbjct: 187 SLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNCLYTGQIYWAPV 246
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TQ+ YWQ + +I Q TG+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 247 TQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGA 300
>gi|345318884|ref|XP_001520972.2| PREDICTED: renin-like [Ornithorhynchus anatinus]
Length = 388
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 178/300 (59%), Gaps = 32/300 (10%)
Query: 59 RTSIRKY-SLRGNLGESG------DADI---------------VALKNYMDAQYFGEIGI 96
R S+RK SLR + E G DA++ + L NY+DAQYFGEIGI
Sbjct: 13 RISLRKMPSLRECMREQGQVTMKTDAELGFSAGPGSLENRTVPMLLTNYLDAQYFGEIGI 72
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
G+P Q F VIFDTGS+NLWVPS C AC H+ Y + +S TY +NG I Y +G
Sbjct: 73 GSPAQTFKVIFDTGSANLWVPSINCKPIHSACETHNLYDASQSQTYMENGTQIAISYVSG 132
Query: 156 AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWY 215
+ GF S+D V IG + V Q F E T P+ +F+ AKFDG+LG+G+ ++G PV+
Sbjct: 133 TVKGFLSQDLVTIGGIPVI-QMFAEITTLPTSSFMYAKFDGVLGMGYPAQAIGGITPVFD 191
Query: 216 NMVNQGLVNEPVFSFWFNRNADEEE---GGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFD 272
+++ Q ++ E VFS +++RN+ + GGEI+ GG DP +Y+G+ Y+ V++KG+WQ +
Sbjct: 192 HILTQHVLKEDVFSVYYSRNSKNDHMVPGGEIILGGRDPTYYQGDFYYLDVSKKGFWQVN 251
Query: 273 MGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI----VSQECKAV 328
M V +D +T FC GCAA+ D+G +L+ GP + + +GA I + +CK V
Sbjct: 252 MKGVSVD-RTLQFCQEGCAAMVDTGATLITGPVKDVKHMMDILGAQKIGGNMYAVDCKEV 310
>gi|194218273|ref|XP_001501915.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 121/290 (41%), Positives = 173/290 (59%), Gaps = 8/290 (2%)
Query: 30 LYRIGL-KKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
+Y++ L KK+ N R L + R KY R + A L+NYMD
Sbjct: 17 IYKVPLVKKKSLRQNLRENGLLADFLKQHPRNPASKYFPRE---AATLAATEGLENYMDE 73
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG I IGTP Q FTVIFDTGSSNLWVPS C S+AC H+++ SSTY+ +S
Sbjct: 74 EYFGTISIGTPAQEFTVIFDTGSSNLWVPSVYCS-SLACSDHNRFNPEDSSTYEATSESV 132
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G D V++G + +Q F + EPS A FDGILGL + IS
Sbjct: 133 SITYGTGSMTGVLGYDTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISAS 192
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV+ N+ +QGLV++ +FS + ++D+E G ++FGG+D +Y G +VPV+++ Y
Sbjct: 193 GATPVFDNIWDQGLVSQDLFSVYL--SSDDESGSVVMFGGIDSSYYSGSLNWVPVSEEAY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
WQ + + ++G++ C+GGC AI D+GTSLLAGP + I + IGA+
Sbjct: 251 WQITVDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGAS 299
>gi|5921651|gb|AAD56284.1|AF156788_1 pepsinogen A form IIb precursor [Pseudopleuronectes americanus]
Length = 376
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 111/258 (43%), Positives = 162/258 (62%), Gaps = 15/258 (5%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLW+PS C S AC H K+ +SST
Sbjct: 61 SMTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACENHQKFNPQQSST 119
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+K + I YGTG+++G + D V++G + + +Q F + E + + K DGILGL
Sbjct: 120 FKWGNQPLSIQYGTGSMTGNLASDTVQVGGISIANQVFGVSQTEAAFMASM-KADGILGL 178
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NM+ Q LV++P+FS + + NA ++G E++FGG D HY G+ ++
Sbjct: 179 AFQSIASDNVVPVFDNMIKQNLVSQPLFSVYLSSNA--QQGSEVIFGGTDSSHYTGQISW 236
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P+T YWQ M V I+GQT C+GGC AI D+GTSL+ GP + I+ +N +GA+
Sbjct: 237 IPLTSATYWQIKMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPNSDISNMNSWVGAS-- 293
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE ++
Sbjct: 294 --------TNQYGEATVS 303
>gi|355329701|dbj|BAL14144.1| pepsinogen 1 [Pagrus major]
Length = 369
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 108/239 (45%), Positives = 157/239 (65%), Gaps = 7/239 (2%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F VIFDTGSSNLW+PS C S AC H K+ G SST
Sbjct: 54 SMTNDADMAYYGIISIGTPPQSFKVIFDTGSSNLWIPSIYCS-SAACNNHDKFNPGLSST 112
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILG 199
Y+ NGKS I YGTG+++GF D V +G + VK+Q F + + P + ++ A DGILG
Sbjct: 113 YRHNGKSLQIQYGTGSMTGFLGYDTVTVGGIAVKNQIFGLSQSEAPFMQYMRA--DGILG 170
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHT 259
L + +S A PV+ NM+ +GLVN+ +FS + + N+ ++G + FGG+DP++Y G T
Sbjct: 171 LAYPRLSASGATPVFDNMMTEGLVNQDLFSVYLSSNS--QQGSVVTFGGVDPNYYTGSIT 228
Query: 260 YVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
++P++ + YWQ + V ++GQ C GGC AI D+GTSL+ GP + I+ +N +GA+
Sbjct: 229 WIPLSNELYWQITVDSVTVNGQVVA-CNGGCQAIVDTGTSLIVGPQSSISNINSYVGAS 286
>gi|444724642|gb|ELW65241.1| Chymosin [Tupaia chinensis]
Length = 381
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 165/283 (58%), Gaps = 17/283 (6%)
Query: 48 ARLDSKEGESFRTSIRKYSL------------RGNLGESGDADIVALKNYMDAQYFGEIG 95
R+ +G+S R ++++ L G SG L NY+D QYFG+I
Sbjct: 20 TRIPLHKGKSLRRALKEQGLLEDFLKTQQSTVSGKYFSSGVVASEPLTNYLDTQYFGKIT 79
Query: 96 IGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTG 155
IGTPPQ FTV+FDTGSS+LWVPS C S AC H ++ +SST++ K I YGTG
Sbjct: 80 IGTPPQEFTVVFDTGSSDLWVPSVYCD-SAACQNHQRFDPSKSSTFQNLDKPLSIQYGTG 138
Query: 156 AISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWY 215
++ GF D V + D+V Q +T+EP F A+FDGILGL + ++ +VPV+
Sbjct: 139 SMQGFLGYDTVTVSDIVDTHQTVGLSTQEPGNVFTYAEFDGILGLAYPSLAAEYSVPVFD 198
Query: 216 NMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGD 275
NM+ + LV + +FS + +RN ++G + G +D +Y G +VPVT + YWQF M
Sbjct: 199 NMMQKHLVAKDLFSVYMSRN---DQGSMLTLGAIDSSYYTGSLHWVPVTMQDYWQFTMDS 255
Query: 276 VMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
V I+G C GGC AI D+GTSL+AGP++ I + AIGAT
Sbjct: 256 VTINGVVVA-CDGGCQAILDTGTSLVAGPSSDILNIQQAIGAT 297
>gi|410974069|ref|XP_003993470.1| PREDICTED: pepsin A-like [Felis catus]
Length = 387
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/295 (41%), Positives = 169/295 (57%), Gaps = 32/295 (10%)
Query: 62 IRKYSLRGNLGESG---------------------DADIVA---LKNYMDAQYFGEIGIG 97
I+K +LR NL E G A ++A L+NYMD +YFG IGIG
Sbjct: 23 IKKKTLRENLIEHGLLDDFLKKYTFNPADKYFHDESATLIAAQPLENYMDMEYFGTIGIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TPPQ FTVIFDTGSSNLWVPS C S AC H ++ SSTY+ I YGTG++
Sbjct: 83 TPPQQFTVIFDTGSSNLWVPSVYCK-SPACTNHKRFNPQESSTYQATNNPVSIAYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + EP A FDGILGL + +IS A PV+ NM
Sbjct: 142 TGILGYDTVQVGGVSDTNQIFGLSETEPGSFLYYAPFDGILGLAYPQISASGATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
N+GLV++ +FS + + N ++ G ++FGG+D +Y G ++PV+ +GYWQ + +
Sbjct: 202 WNEGLVSQDLFSVYLSGN--DQSGSVVMFGGIDSSYYTGNLNWIPVSVEGYWQISVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAV 328
++GQ+ C GGC AI D+GTSLL GP+ I + IGA+ G + C A+
Sbjct: 260 MNGQSIA-CNGGCQAIVDTGTSLLTGPSNAIANIQSDIGASQNSYGQMGISCSAI 313
>gi|301625941|ref|XP_002942158.1| PREDICTED: pepsin A [Xenopus (Silurana) tropicalis]
Length = 384
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 170/288 (59%), Gaps = 17/288 (5%)
Query: 44 NRVAARLDSKEGESFRTSIRKYSLRGNLGES-------------GDADIVALKNYMDAQY 90
+ ++ ++GESFR +++ L G+ + + L+NYMD +Y
Sbjct: 13 SECVVKVPLRKGESFRNRLQRLGLLGDYLKKYPYNPASKYFPTLAQSSAEVLQNYMDIEY 72
Query: 91 FGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADI 150
+G I IGTPPQ FTVIFDTGS+NLWVPS C S AC H+++ +S+T++ I
Sbjct: 73 YGTISIGTPPQEFTVIFDTGSANLWVPSVYCS-SSACTNHNRFNPQQSTTFQATNTPVSI 131
Query: 151 HYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKA 210
YGTG++SGF D +++G++ + +Q F + EP + FDGILGL F I+ +A
Sbjct: 132 QYGTGSMSGFLGYDTLQVGNIKISNQMFGLSESEPGSFLYYSPFDGILGLAFPSIASSQA 191
Query: 211 VPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQ 270
PV+ NM +QGL+ + +FS + ++D + G ++FGG+D +Y G +VP+T + YWQ
Sbjct: 192 TPVFDNMWSQGLIPQNLFSVYL--SSDGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQ 249
Query: 271 FDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+ + I+GQ C+ C AI D+GTSL+ GPTT I + + IGA+
Sbjct: 250 IILDSISINGQVIA-CSQSCQAIVDTGTSLMTGPTTPIANIQYYIGAS 296
>gi|114637856|ref|XP_001145457.1| PREDICTED: pepsin A-5 isoform 6 [Pan troglodytes]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 124/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPASKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSIAYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|355561685|gb|EHH18317.1| hypothetical protein EGK_14890 [Macaca mulatta]
gi|355748551|gb|EHH53034.1| hypothetical protein EGM_13592 [Macaca fascicularis]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 157/254 (61%), Gaps = 12/254 (4%)
Query: 85 YMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKN 144
YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC HS++ SSTY N
Sbjct: 68 YMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHSRFNPSESSTYSTN 126
Query: 145 GKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQE 204
G++ + YG+G+++GFF D + + + V +QEF + EP F+ A+FDGI+GL +
Sbjct: 127 GQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIMGLAYPT 186
Query: 205 ISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVT 264
+SV A MV +G + P+FS + + + GG +VFGG+D Y G+ + PVT
Sbjct: 187 LSVDGATTAMQGMVQEGALTSPIFSVYLS-DQQGSSGGAVVFGGVDSSLYTGQIYWAPVT 245
Query: 265 QKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE 324
Q+ YWQ + + +I GQ +G+C+ GC AI D+GTSLL P ++ + A GA QE
Sbjct: 246 QELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGA-----QE 300
Query: 325 CKAVVSQYGEEIIN 338
+YG+ ++N
Sbjct: 301 -----DEYGQFLVN 309
>gi|444706374|gb|ELW47716.1| Renin [Tupaia chinensis]
Length = 401
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/279 (43%), Positives = 166/279 (59%), Gaps = 10/279 (3%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G RI LKK V L + ++ R + K++L L V L NY+D
Sbjct: 24 GFQRIFLKKMP-----SVRESLKERGADAARL-VAKWNLSKTLSLGNSTSPVVLTNYLDT 77
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRSSTYKKNGKS 147
QY+GEIGIGTP Q F V+FDTGS+NLWVPS+KC AC HS Y S SS+Y +NG
Sbjct: 78 QYYGEIGIGTPAQTFKVVFDTGSANLWVPSTKCSPLYTACEIHSLYDSSESSSYMENGTE 137
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
IHYG+G + GF S+D V +G + V Q F E T P + F+LAKFDG+LG+G +V
Sbjct: 138 FAIHYGSGKVRGFLSQDVVTVGGITVT-QTFGEVTELPVIPFMLAKFDGVLGMGLPAQAV 196
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
G PV+ ++++Q ++ E VFS ++++N+ GGEIV GG DP +Y+G YV V+ G
Sbjct: 197 GGVTPVFDHILSQRVLKEDVFSVYYSKNS-HVLGGEIVLGGSDPQYYQGHFHYVSVSSTG 255
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTT 306
WQ M V + T C GC A+ D+GTS ++GPT+
Sbjct: 256 SWQVKMKGVSVRSATL-LCENGCMAVVDTGTSYISGPTS 293
>gi|219521691|gb|AAI71808.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|219520803|gb|AAI71814.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|189066533|dbj|BAG35783.1| unnamed protein product [Homo sapiens]
gi|193785072|dbj|BAG54225.1| unnamed protein product [Homo sapiens]
gi|219521010|gb|AAI71815.1| Pepsinogen 3, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
IRK SLR L E G D L+NY+D +YFG IGI
Sbjct: 23 IRKKSLRRTLSERGLLKDFLKKHNLNPARKYFPQWKAPTLVDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY+ ++ I YGTG+
Sbjct: 83 GTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTYQSTSETVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISS 316
>gi|444706401|gb|ELW47743.1| Cathepsin E [Tupaia chinensis]
Length = 396
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 120/302 (39%), Positives = 178/302 (58%), Gaps = 14/302 (4%)
Query: 30 LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYS---LRGNLGESGDADIVALKNYM 86
L+R+ L+++ A SK +S +++++ +R + + L N
Sbjct: 19 LHRVPLRRQLSPWKKPWALPQLSKFWKSQNLDMKQFTKSCVRNQISKE------PLTNSF 72
Query: 87 DAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGK 146
+ QY+G + IG+P QNF+V+FDTGSS+ WV S C S AC H+K+ S RS+TY K G
Sbjct: 73 NMQYYGTVSIGSPLQNFSVLFDTGSSDFWVTSVYC-ISPACEKHTKFFSSRSNTYSKKGS 131
Query: 147 SADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEIS 206
+ I YG+G++SG D V +G L V DQEF E+ EP F+ A FDGILGLG+ +S
Sbjct: 132 NFFIEYGSGSLSGITGVDRVSVGGLTVVDQEFGESVTEPGQHFVYAAFDGILGLGYPSLS 191
Query: 207 VGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQK 266
V A PV+ NM+ +V +P+FS + + + + G E++FGG D H+ G ++PVT++
Sbjct: 192 VTGATPVFDNMIVHNMVAQPMFSVYMSSDIENGTGSELIFGGYDCSHFSGSLNWIPVTKQ 251
Query: 267 GYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT---GIVSQ 323
G+WQ + V + G T FC+ GC AI D+GTS + GP I +++ AIGAT GI
Sbjct: 252 GFWQIALDGVQV-GDTMMFCSKGCQAIVDTGTSRIIGPLNKIERLHRAIGATLVNGIYFV 310
Query: 324 EC 325
EC
Sbjct: 311 EC 312
>gi|387013|gb|AAA60061.1| pepsinogen A [Homo sapiens]
Length = 388
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 161/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS C S+AC H+++
Sbjct: 64 DEQPLENYLDMEYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS-SLACTNHNRFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|1065259|pdb|1PSO|E Chain E, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|5542461|pdb|1QRP|E Chain E, Human Pepsin 3a In Complex With A Phosphonate Inhibitor
Iva-Val-Val- Leu(P)-(O)phe-Ala-Ala-Ome
gi|157833570|pdb|1PSN|A Chain A, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|361132440|pdb|3UTL|A Chain A, Human Pepsin 3b
Length = 326
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 160/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY
Sbjct: 6 LENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTY 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ ++ I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 QSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 125 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 241
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 242 DGDMVVSCSAISS 254
>gi|29244579|ref|NP_080249.2| gastricsin precursor [Mus musculus]
gi|73921722|sp|Q9D7R7.1|PEPC_MOUSE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|12843461|dbj|BAB25990.1| unnamed protein product [Mus musculus]
gi|68534888|gb|AAH99409.1| Progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 178/310 (57%), Gaps = 19/310 (6%)
Query: 14 LCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSL----RG 69
+ LL P++ L R+ LKK K + R KE + ++ + +
Sbjct: 6 VALLCLPLL----EAALIRVPLKKMK-------SIRETMKEQGVLKDFLKNHKYDPGQKY 54
Query: 70 NLGESGDADIV-ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
+ G+ GD ++ YMDA Y+GEI IGTPPQNF V+FDTGSSNLWV S C S AC
Sbjct: 55 HFGKFGDYSVLYEPMAYMDASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQ-SEACT 113
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H++Y +SSTY G++ + YGTG+++GFF D +++ + V +QEF + EP
Sbjct: 114 THTRYNPSKSSTYYTQGQTFSLQYGTGSLTGFFGYDTLRVQSIQVPNQEFGLSENEPGTN 173
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ A+FDGI+GL + +S G A M+ +G +++P+F + GG+IVFGG
Sbjct: 174 FVYAQFDGIMGLAYPGLSSGGATTALQGMLGEGALSQPLFGVYLGSQQG-SNGGQIVFGG 232
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFC-AGGCAAIADSGTSLLAGPTTI 307
+D + Y GE T++PVTQ+ YWQ + D +I Q +G+C + GC I D+GTSLL P
Sbjct: 233 VDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGTSLLVMPAQY 292
Query: 308 ITQVNHAIGA 317
+ ++ IGA
Sbjct: 293 LNELLQTIGA 302
>gi|16974928|pdb|1FLH|A Chain A, Crystal Structure Of Human Uropepsin At 2.45 A Resolution
Length = 326
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 160/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY
Sbjct: 6 LENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTY 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ ++ I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 QSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 125 YPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 241
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 242 DGDMVVSCSAISS 254
>gi|129780|sp|P27677.1|PEPA2_MACFU RecName: Full=Pepsin A-2/A-3; AltName: Full=Pepsin III-2/III-1;
Flags: Precursor
gi|38069|emb|CAA42427.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 170/298 (57%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
+RK SLR NL E G D L+NY+D +YFG IGI
Sbjct: 23 VRKKSLRRNLSEHGLLKDFLKKHNFNPASKYFPQAEAPTLIDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SSTY+ + I YGTG+
Sbjct: 83 GTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNRFNPQDSSTYQSTSGTVSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISS 316
>gi|440905526|gb|ELR55898.1| Gastricsin [Bos grunniens mutus]
Length = 391
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 147/234 (62%), Gaps = 1/234 (0%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ SSTY
Sbjct: 68 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTRFNHSLSSTYST 126
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
N ++ + YG+G+++G D + + + V +QEF + EP FL AKFDGI+G+ +
Sbjct: 127 NEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYP 186
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+SV A V M+ +G + PVFSF+ + ++GG ++FGG+D Y G+ + PV
Sbjct: 187 SLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDSCLYTGQIYWAPV 246
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TQ+ YWQ + +I Q TG+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 247 TQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGA 300
>gi|149725185|ref|XP_001501907.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 124/302 (41%), Positives = 179/302 (59%), Gaps = 10/302 (3%)
Query: 30 LYRIGLKKRK-FDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
+Y + L KRK N R L+ + R KY + + A L+NY D
Sbjct: 17 IYNVPLVKRKSLRQNLRENGLLEDFLKQHPRNPASKYFPKE---AATLAATEGLENYKDE 73
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG I IGTPPQ FTVIFDTGSSNLWVPS+ C S+AC H+++ SSTY+ +S
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS-SLACSDHNRFNPEDSSTYEATSESI 132
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G + V++G + +Q F + EPS A FDGILGL + IS
Sbjct: 133 SITYGTGSMTGVLRYNTVRVGGIEDTNQIFGLSESEPSSFLYYAPFDGILGLAYPSISSS 192
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV+ N+ +QGLV++ +FS + ++D+E G ++F G+D +Y G +VPV+++ Y
Sbjct: 193 GATPVFDNIWDQGLVSQDLFSVYL--SSDDESGSMVIFSGIDSSYYSGSLCWVPVSEEAY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAV 328
WQ + + ++G++ C+GGC AI D+GTSLLAGP + I + IGA+ S E AV
Sbjct: 251 WQITVDSITMNGESIA-CSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDYSSE--AV 307
Query: 329 VS 330
+S
Sbjct: 308 IS 309
>gi|414871124|tpg|DAA49681.1| TPA: hypothetical protein ZEAMMB73_239621 [Zea mays]
Length = 299
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 96/127 (75%), Positives = 117/127 (92%)
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
MV QGL+++PVFSFWFNR+ADE EGGEIVFGGMD HYKG+HT+VPVT+KGYWQF+MGDV
Sbjct: 1 MVKQGLISDPVFSFWFNRHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDV 60
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQECKAVVSQYGEEI 336
++DG++TGFCAGGCAAIADSGTSLLAGP IIT++N IGA G+VSQECK VVSQYG++I
Sbjct: 61 LVDGKSTGFCAGGCAAIADSGTSLLAGPIAIITEINEKIGAAGVVSQECKTVVSQYGQQI 120
Query: 337 INMLLAK 343
+++LLA+
Sbjct: 121 LDLLLAE 127
>gi|146386352|gb|ABQ23964.1| cathepsin D [Oryctolagus cuniculus]
Length = 292
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 115/255 (45%), Positives = 157/255 (61%), Gaps = 21/255 (8%)
Query: 109 TGSSNLWVPSSKC-YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVK 167
TGSSNLWVPS C IAC+ H KY S +SSTY KNG + DIHYG+G++SG+ S+D V
Sbjct: 1 TGSSNLWVPSVHCKLLDIACWIHHKYNSKKSSTYVKNGTTFDIHYGSGSLSGYLSQDTVS 60
Query: 168 I-----GDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGL 222
+ + V+ Q F EAT++P +TF+ AKFDGILG+ + ISV +PV+ N++ Q L
Sbjct: 61 VPCTASSSIQVQKQIFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKL 120
Query: 223 VNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQT 282
V + VFSF+ NR+ + GGE++ GG+DP +Y+G +Y+ VT+K YWQ M D + G
Sbjct: 121 VEKNVFSFYLNRDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHM-DQLNVGSG 179
Query: 283 TGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGIVSQE----CKAVVSQYGEEIIN 338
C GGC AI D+GTSLL GP + ++ AIGA ++ E C+ V S
Sbjct: 180 LTLCEGGCEAIVDTGTSLLVGPVDEVRELQRAIGAVPLIQGEYIIPCEKVSS-------- 231
Query: 339 MLLAKVSAKLSLRHY 353
L V+ KL R Y
Sbjct: 232 --LPPVTLKLGGRDY 244
>gi|297688536|ref|XP_002821738.1| PREDICTED: pepsin A-4 [Pongo abelii]
Length = 388
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 160/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+ +
Sbjct: 64 DEQPLENYLDVEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|195470499|ref|XP_002087544.1| GE17593 [Drosophila yakuba]
gi|194173645|gb|EDW87256.1| GE17593 [Drosophila yakuba]
Length = 404
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 111/243 (45%), Positives = 160/243 (65%), Gaps = 6/243 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFS-IACYFHSKYRSGRSST 140
L N M+ Y+G IGIGTP Q F V+FDTGS+NLWVPS++C + +AC H++Y S SST
Sbjct: 84 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSAQCLATDVACQQHNQYNSSASST 143
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+ +G++ I YGTG+++G+ + D V I L + +Q F EA +P +F FDGILG+
Sbjct: 144 FVSSGQNFSIQYGTGSVAGYLAIDTVTINGLAIANQTFGEAVSQPGASFTDVAFDGILGM 203
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
G+Q+I+ VP +YN+ +GL++EPVF F+ RN EGG++ GG D + GE TY
Sbjct: 204 GYQQIAEDNVVPPFYNLYEEGLIDEPVFGFYLARNGSAVEGGQLTLGGTDQELIAGEMTY 263
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
PVT++GYWQF + ++ +G + GC AIAD+GTSL+A P+ Q+N+ IG GI
Sbjct: 264 TPVTEQGYWQFAVNNITWNGTV---ISSGCQAIADTGTSLIAAPSAAYIQLNNLIG--GI 318
Query: 321 VSQ 323
+ Q
Sbjct: 319 LIQ 321
>gi|426251840|ref|XP_004019629.1| PREDICTED: pepsin A-like [Ovis aries]
Length = 386
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 120/280 (42%), Positives = 170/280 (60%), Gaps = 27/280 (9%)
Query: 62 IRKYSLRGNLGESGD--------------------ADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E+G A +V+ L+NY+D +YFG IGIGT
Sbjct: 23 VKKKSLRQNLIENGKLKEFMKTHKYNLGSKYIREAATLVSDQPLQNYLDTEYFGTIGIGT 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
P Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG+++
Sbjct: 83 PAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSETLSITYGTGSMT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + IS A PV+ N+
Sbjct: 142 GILGYDTVEVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIW 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
+QGLV++ +FS + + N EE G ++FGG+D +Y G +VPV+ +GYWQ + + +
Sbjct: 202 DQGLVSQDLFSVYLSSN--EESGSVVMFGGIDSSYYSGSLNWVPVSVEGYWQITVDSITM 259
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+G++ C+ GC AI D+GTSLLAGPTT I+ + IGA+
Sbjct: 260 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGAS 298
>gi|222425198|dbj|BAH20548.1| pepsinogen A-36 [Pongo abelii]
Length = 388
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 160/257 (62%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+ +
Sbjct: 64 DEQPLENYLDMEYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|329665035|ref|NP_001192720.1| gastricsin precursor [Bos taurus]
Length = 391
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 105/234 (44%), Positives = 147/234 (62%), Gaps = 1/234 (0%)
Query: 84 NYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKK 143
+YMDA YFGEI IGTPPQNF V+FDTGSSNLWVPS C S AC H+++ SSTY
Sbjct: 68 DYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ-SQACTSHTRFNHSLSSTYST 126
Query: 144 NGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQ 203
N ++ + YG+G+++G D + + + V +QEF + EP FL AKFDGI+G+ +
Sbjct: 127 NEQTFSLQYGSGSLTGILGYDTLTVQGIKVPNQEFGLSKTEPGTNFLYAKFDGIMGMAYP 186
Query: 204 EISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPV 263
+SV A V M+ +G + PVFSF+ + ++GG ++FGG+D Y G+ + PV
Sbjct: 187 SLSVDGATTVLQGMLQEGALTSPVFSFYLSSQQGSQDGGAVIFGGVDNCLYTGQIYWAPV 246
Query: 264 TQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
TQ+ YWQ + +I Q TG+C+ GC AI D+GTSLL P ++ + A GA
Sbjct: 247 TQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGTSLLTVPQQFLSALLQATGA 300
>gi|73621386|sp|Q9GMY8.1|PEPA_SORUN RecName: Full=Pepsin A; Flags: Precursor
gi|9798656|dbj|BAB11750.1| pepsinogen A [Sorex unguiculatus]
Length = 387
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 110/251 (43%), Positives = 157/251 (62%), Gaps = 8/251 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NYMD +YFG I IGTPPQ FTVIFDTGSSNLWVPS C S AC H+++ +SST+
Sbjct: 67 LVNYMDMEYFGTISIGTPPQEFTVIFDTGSSNLWVPSIYCS-SPACSNHNRFDPQKSSTF 125
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
K ++ I YGTG+++G D V++ + +Q F + EP + FDGILGL
Sbjct: 126 KPTSQTVSIAYGTGSMTGVLGYDTVQVAGIADTNQIFGLSQSEPGSFLYYSPFDGILGLA 185
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ NM NQGLV++ +FS + + N ++ G ++FGG+D +Y G +V
Sbjct: 186 YPSISSSGATPVFDNMWNQGLVSQDLFSVYLSSN--DQSGSVVMFGGIDSSYYTGSLNWV 243
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
P++ +GYWQ + + ++GQ+ C GGC AI D+GTSLL+GPT I + IGA+
Sbjct: 244 PLSSEGYWQITVDSITMNGQSIA-CNGGCQAIVDTGTSLLSGPTNAIANIQSKIGASQNS 302
Query: 319 -GIVSQECKAV 328
G ++ C ++
Sbjct: 303 QGQMAVSCSSI 313
>gi|393212634|gb|EJC98134.1| endopeptidase [Fomitiporia mediterranea MF3/22]
Length = 401
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 181/330 (54%), Gaps = 24/330 (7%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
F +LL P+ +T +GG++++ L K V ++S + + L G G
Sbjct: 6 FAFILLLPI--ATASGGVHKLKLHKIP---RGNVDPTIESAYLSEKYSGQPQLPLMGTDG 60
Query: 73 ES------GD-----ADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC 121
S GD V L ++M+ QYF + +G+PPQ F VI DTGSSNLWVPS+KC
Sbjct: 61 PSHQFGSIGDKMTNGGHKVPLSDFMNVQYFTNVTLGSPPQEFRVILDTGSSNLWVPSTKC 120
Query: 122 YFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEA 181
S C H KY S SSTY++NG I YG+G + G S+D V IGDL + Q+F EA
Sbjct: 121 R-SFGCSKHVKYNSSVSSTYQENGTDIHIKYGSGDMEGIVSKDVVTIGDLKIDGQDFAEA 179
Query: 182 TREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEG 241
T++P F KFDGI GLG+ ISV P +Y+MVNQGL++ PV+SF F + E++G
Sbjct: 180 TKDPGPAFAFGKFDGIFGLGYDTISVNHITPPFYSMVNQGLLDAPVYSFRF--GSSEDDG 237
Query: 242 GEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLL 301
GE+VFGG+D Y GE Y PV + +W+ ++ + F + D+GTSL+
Sbjct: 238 GEVVFGGIDESAYSGEINYAPVRSREHWEVELPKYAFGDKE--FVLENTGGVIDTGTSLM 295
Query: 302 AGPTTIITQVNHAIGATGIVSQ---ECKAV 328
P + ++N IGA Q +CK V
Sbjct: 296 YLPVDVAEKLNAQIGANNRNGQYIVDCKKV 325
>gi|49019802|emb|CAD80095.2| pepsin A1 [Trematomus bernacchii]
Length = 375
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 113/258 (43%), Positives = 161/258 (62%), Gaps = 15/258 (5%)
Query: 81 ALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSST 140
++ N D Y+G I IGTPPQ+F+VIFDTGSSNLW+PS C S AC H K+ SST
Sbjct: 60 SMTNDADLSYYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS-SQACQNHRKFNPKESST 118
Query: 141 YKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGL 200
+K K I YGTG+++G+ + D V++G + V +Q F + E + +A DGILGL
Sbjct: 119 FKWGNKPLSIQYGTGSMTGYLASDTVEVGGISVANQVFGLSQTEAAFLAHMAA-DGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ MV + LV++P+FS + + N+ +G E+VFGG+D HY GE +
Sbjct: 178 AFQTIASDDVVPVFDMMVKERLVSQPLFSVYLSSNS--AQGSEVVFGGIDSSHYTGEIAW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ YWQ +M V I+GQT C+GGC AI D+GTSL+ GP + I+ +N +GA+
Sbjct: 236 IPLSSATYWQINMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPNSDISSMNSRVGAS-- 292
Query: 321 VSQECKAVVSQYGEEIIN 338
+QYGE +N
Sbjct: 293 --------TNQYGEATVN 302
>gi|395852554|ref|XP_003798803.1| PREDICTED: pepsin A-like [Otolemur garnettii]
Length = 387
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 121/297 (40%), Positives = 173/297 (58%), Gaps = 32/297 (10%)
Query: 62 IRKYSLRGNLGESG--------------------DADIV----ALKNYMDAQYFGEIGIG 97
+RK SLR +L E+G +A + L+NYMD +YFG IGIG
Sbjct: 23 VRKKSLRQSLAENGLLKDFLKNHHVNLASKYFPNEATTLLSAQPLENYMDTEYFGTIGIG 82
Query: 98 TPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAI 157
TP Q FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG++
Sbjct: 83 TPAQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQSSSTYQATSQTVSIAYGTGSM 141
Query: 158 SGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNM 217
+G D V++G + +Q F + EP A FDGILGL + IS A PV+ NM
Sbjct: 142 TGILGYDTVQVGGITDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNM 201
Query: 218 VNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVM 277
NQGLV++ +FS + + N ++ G ++FGG+D +Y GE ++P++ +GYWQ + +
Sbjct: 202 WNQGLVSQDLFSVFLSSN--DQSGSVVMFGGIDSSYYTGELNWIPLSSEGYWQITVDSIT 259
Query: 278 IDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ C+ GC AI D+GTSLL+GPT+ I + IGA+ G + C A+ S
Sbjct: 260 MNGEPIA-CSQGCQAIVDTGTSLLSGPTSPIANIQSYIGASEDSYGQMVISCSAINS 315
>gi|151553998|gb|AAI49645.1| PGA5 protein [Bos taurus]
Length = 381
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 128/307 (41%), Positives = 182/307 (59%), Gaps = 17/307 (5%)
Query: 31 YRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVA---LKNYMD 87
+RI L K+K N + + K E RT KY+L A +V+ L+NY+D
Sbjct: 13 HRIPLVKKKSLRQNLIE---NGKLKEFMRT--HKYNLGSKYIREA-ATLVSEQPLQNYLD 66
Query: 88 AQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKS 147
+YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++
Sbjct: 67 TEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSET 125
Query: 148 ADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISV 207
I YGTG+++G D V++G + +Q F + EP A FDGILGL + IS
Sbjct: 126 LSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISS 185
Query: 208 GKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKG 267
A PV+ N+ +QGLV++ +FS + + N EE G ++FG +D +Y G +VPV+ +G
Sbjct: 186 SGATPVFDNIWDQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEG 243
Query: 268 YWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQ 323
YWQ + + ++G++ C+ GC AI D+GTSLLAGPTT I+ + IGA +G V
Sbjct: 244 YWQITVDSITMNGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVI 302
Query: 324 ECKAVVS 330
C ++ S
Sbjct: 303 SCSSIDS 309
>gi|60391920|sp|P00794.3|CHYM_BOVIN RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|155967150|gb|ABU41411.1| preprochymosin [Bos taurus]
gi|225196434|gb|ACN82375.1| chymosin precursor [Bos taurus]
gi|296489363|tpg|DAA31476.1| TPA: chymosin precursor [Bos taurus]
Length = 381
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 127/323 (39%), Positives = 179/323 (55%), Gaps = 25/323 (7%)
Query: 15 CLLLFPVVFSTPNGG-LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRK--YSLRGNL 71
CL++ VF+ G + RI L K K + R KE ++K Y +
Sbjct: 3 CLVVLLAVFALSQGAEITRIPLYKGK-------SLRKALKEHGLLEDFLQKQQYGISSKY 55
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G+ V L NY+D+QYFG+I +GTPPQ FTV+FDTGSS+ WVPS C S AC H
Sbjct: 56 SGFGEVASVPLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKNHQ 114
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ +SST++ GK IHYGTG++ G D V + ++V Q +T+EP F
Sbjct: 115 RFDPRKSSTFQNLGKPLSIHYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVFTY 174
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A+FDGILG+ + ++ ++PV+ NM+N+ LV + +FS + +RN E + G +DP
Sbjct: 175 AEFDGILGMAYPSLASEYSIPVFDNMMNRHLVAQDLFSVYMDRNGQES---MLTLGAIDP 231
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y G +VPVT + YWQF + V I G C GGC AI D+GTS L GP++ I +
Sbjct: 232 SYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA-CEGGCQAILDTGTSKLVGPSSDILNI 290
Query: 312 NHAIGATGIVSQECKAVVSQYGE 334
AIGAT +QYGE
Sbjct: 291 QQAIGATQ----------NQYGE 303
>gi|402893203|ref|XP_003909790.1| PREDICTED: pepsin A-2/A-3-like [Papio anubis]
Length = 388
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 123/298 (41%), Positives = 169/298 (56%), Gaps = 33/298 (11%)
Query: 62 IRKYSLRGNLGESGD-------------------------ADIVALKNYMDAQYFGEIGI 96
+RK SLR NL E G D L+NY+D +YFG IGI
Sbjct: 23 VRKKSLRHNLSEHGLLKDFLKKHNFNPARKYFPQAEAPTLIDEQPLENYLDMEYFGTIGI 82
Query: 97 GTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGA 156
GTP Q+FTVIFDTGSSNLWVPS C S+AC H ++ SSTY+ + I YGTG+
Sbjct: 83 GTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHKRFNPQDSSTYQSTSGTLSITYGTGS 141
Query: 157 ISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYN 216
++G D V++G + +Q F + EP A FDGILGL + IS A PV+ N
Sbjct: 142 MTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDN 201
Query: 217 MVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDV 276
+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPV+ +GYWQ + +
Sbjct: 202 IWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSI 259
Query: 277 MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQECKAVVS 330
++G+ CA GC AI D+GTSLL GPT+ I + IGA+ G + C A+ S
Sbjct: 260 TMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISS 316
>gi|431910409|gb|ELK13482.1| Pepsin A [Pteropus alecto]
Length = 386
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 108/241 (44%), Positives = 155/241 (64%), Gaps = 4/241 (1%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q FTVIFDTGSSNLWVPS C S+ACY H+ +
Sbjct: 62 DTETLENYLDMEYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS-SLACYNHNVFNPED 120
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SST++ ++ I YGTG+++G D V++G + +Q F + EP A FDGI
Sbjct: 121 SSTFEATSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 180
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ +QGLV++ +FS + ++D++ G ++FGG+D +Y G
Sbjct: 181 LGLAYPSISASGATPVFDNLWDQGLVSQDLFSVYL--SSDDDSGSVVIFGGIDSSYYSGS 238
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VP++ + YWQ + V++DG+ C+ C AI D+GTSLLAGPTT I+ + IGA
Sbjct: 239 LNWVPLSSETYWQITVDSVILDGEAIA-CSATCQAIVDTGTSLLAGPTTAISSIQKYIGA 297
Query: 318 T 318
+
Sbjct: 298 S 298
>gi|193499295|gb|ACF18590.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 179/304 (58%), Gaps = 15/304 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ +I L+K K + R A R E + +L+ E + + + NY D
Sbjct: 16 GIVKIPLRKHK---SMREALREKGIE-----LPYQDPALKYQADEFAGSANMNINNYADT 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTPPQ+F V+FDTGS+NLWV S C + AC H+K+ +SST+ G+S
Sbjct: 68 TYYGAISIGTPPQSFQVLFDTGSANLWVDSVYCN-TEACNAHTKFNPQQSSTFTAKGQSF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+ YG G++ G F D V +G +V+ +QE +T EP TF +A+FDGILGL + IS G
Sbjct: 127 YLPYGAGSLYGVFGYDTVDVGGIVITNQEIGLSTNEPGETFAVAQFDGILGLSYPTISAG 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV NM++Q L+N +F+F+ ++ E++G E+ FGG+D Y+G+ + PVT + Y
Sbjct: 187 GATPVMANMISQNLLNADIFAFYL--SSGEQQGSELSFGGVDSSMYQGQIYWTPVTSETY 244
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQE 324
WQ + I+GQ +G+C+ GC +I D+GTS+L P+ ++ + AIGA G+ +
Sbjct: 245 WQIGVQGFQINGQESGWCSQGCQSIVDTGTSMLTAPSQLLGYIMQAIGAQQSQYGMYMVD 304
Query: 325 CKAV 328
C V
Sbjct: 305 CSQV 308
>gi|11493777|gb|AAG35646.1|AF203473_1 progastricsin [Salvelinus fontinalis]
Length = 383
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/326 (37%), Positives = 185/326 (56%), Gaps = 24/326 (7%)
Query: 13 FLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLG 72
+L ++L V + G++RI L K K + R+ K GE + +L+
Sbjct: 3 YLVIVLVCAVLAE---GIHRIPLVKHK-----SIRERMMEK-GEHL--PYQDPALKYFPD 51
Query: 73 ESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSK 132
E + + + NY D Y+G I IGTPPQ+F V+FDTGS+NLWV S C + AC H+K
Sbjct: 52 EFAGSTTMYINNYADTTYYGAITIGTPPQSFQVLFDTGSANLWVDSVLCN-TQACNTHTK 110
Query: 133 YRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLA 192
+ +SSTY NG++ + YG G++SG F D V +G +++ +QE +T EP F++A
Sbjct: 111 FNPQQSSTYSANGETFYLPYGAGSLSGVFGYDTVNVGGIILTNQEIGLSTDEPGQNFVVA 170
Query: 193 KFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPD 252
+FDGILGL + IS G+ PV NM++Q L+ +F+F+ R D ++G E+ FG +D
Sbjct: 171 QFDGILGLSYPSISAGQETPVMDNMMSQNLLQANIFAFYMTR--DGQQGSELSFGEVDNT 228
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
Y+G+ + PVT + YWQ + I+GQ TG+C GC AI D+GTS+L P I+ +
Sbjct: 229 KYQGQIYWTPVTSQTYWQIGIQGFQINGQETGWCGQGCQAIVDTGTSMLTAPRQIMGTLM 288
Query: 313 HAIGATGIVSQECKAVVSQYGEEIIN 338
+IGA QYG+ +N
Sbjct: 289 QSIGAQQ----------DQYGQYTVN 304
>gi|194900440|ref|XP_001979765.1| GG22202 [Drosophila erecta]
gi|190651468|gb|EDV48723.1| GG22202 [Drosophila erecta]
Length = 395
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 184/316 (58%), Gaps = 12/316 (3%)
Query: 12 FFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRG-- 69
+L L + + ++ G L RI ++ + + +A+R + G S + + KY++ G
Sbjct: 7 LWLSLWVLCLFWAKCQGQLIRIPMQFQA----SFMASRRQHRAGRS--SLLAKYNVVGGQ 60
Query: 70 NLGESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACY 128
+ L N ++ +Y G I IG+P Q F ++FDTGS+NLWVPS++C S+AC+
Sbjct: 61 EVTSRNGGATETLDNRLNLEYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSLKSVACH 120
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H +Y + SST+ +G+ I YGTG++SG ++D V IG LVV++Q F AT EP T
Sbjct: 121 HHHRYNASASSTFVPDGRRFSIAYGTGSLSGILAQDTVAIGQLVVRNQTFAMATHEPGPT 180
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F+ F GI+GLGF+ I+ + P++ +M +Q LV+E VFSF+ RN E GGE++FGG
Sbjct: 181 FVDTNFAGIVGLGFRPIAEQRIKPLFESMCDQQLVDECVFSFYLKRNGSERMGGELLFGG 240
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+D + G TYVP+T GYWQF + + + G T + AIAD+GTSLLA P
Sbjct: 241 VDKTKFSGSLTYVPLTHAGYWQFPLDGIELGGTTI---SRHRQAIADTGTSLLAAPPREY 297
Query: 309 TQVNHAIGATGIVSQE 324
+N +G + E
Sbjct: 298 LIINSLLGGLPTSNNE 313
>gi|367009330|ref|XP_003679166.1| hypothetical protein TDEL_0A06230 [Torulaspora delbrueckii]
gi|359746823|emb|CCE89955.1| hypothetical protein TDEL_0A06230 [Torulaspora delbrueckii]
Length = 408
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/246 (45%), Positives = 152/246 (61%), Gaps = 7/246 (2%)
Query: 76 DADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRS 135
D V L NY++AQY+ +I +GTP QNF VI DTGSSNLWVPS C S+AC+ HSKY
Sbjct: 77 DNHDVPLTNYLNAQYYTDISLGTPAQNFKVILDTGSSNLWVPSVDC-GSLACFLHSKYDH 135
Query: 136 GRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFD 195
SS+YK NG I YG+G++ G+ S+D + +GDLV+ Q+F EAT EP L F KFD
Sbjct: 136 SASSSYKPNGTDFAIQYGSGSLEGYISQDTLSVGDLVIPKQDFAEATSEPGLAFAFGKFD 195
Query: 196 GILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFN----RNADEEEGGEIVFGGMDP 251
GILGL + ISV K VP +YN ++Q L++EP F+F+ GG FGG+D
Sbjct: 196 GILGLAYNTISVDKVVPPFYNAIDQDLLDEPRFAFYLGDSSDSEDSSVSGGVATFGGIDK 255
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+KGE +++PV +K YW+ + + G G AAI D+GTSL+ P+ + +
Sbjct: 256 SKFKGEISWLPVRRKAYWEVKFEGIGL-GDEFAILDGHGAAI-DTGTSLITLPSGLAEII 313
Query: 312 NHAIGA 317
N IGA
Sbjct: 314 NSEIGA 319
>gi|355706340|gb|AES02605.1| napsin A aspartic peptidase [Mustela putorius furo]
Length = 258
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 102/203 (50%), Positives = 137/203 (67%), Gaps = 2/203 (0%)
Query: 73 ESGDADI-VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYF-SIACYFH 130
GD + V L NY++AQY+GEIG+GTPPQNF+V+FDTGSSNLWVPS +C+F S+ C+FH
Sbjct: 56 SPGDKPVFVPLSNYLNAQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPCWFH 115
Query: 131 SKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFL 190
++ S SS+++ NG I YGTG + G SED + IG + F EA EPSL F
Sbjct: 116 HRFNSKASSSFQPNGTKFAIQYGTGKLDGILSEDKLTIGGIKGASVIFGEALWEPSLVFT 175
Query: 191 LAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMD 250
A FDG+LGLGF ++VG P +V++GL+++P+FSF+ NR+ +GGE+V GG D
Sbjct: 176 FAHFDGVLGLGFPILAVGGVRPPLDTLVDEGLLDKPIFSFYLNRDPKAADGGELVLGGSD 235
Query: 251 PDHYKGEHTYVPVTQKGYWQFDM 273
P HY T++PVT YWQ M
Sbjct: 236 PAHYIPPLTFLPVTIPAYWQIHM 258
>gi|73620983|sp|P00792.2|PEPA_BOVIN RecName: Full=Pepsin A; Flags: Precursor
gi|24415088|emb|CAD55693.1| pepsinogen A [synthetic construct]
gi|37622272|gb|AAQ95219.1| pepsinogen A [Bos taurus]
Length = 372
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 175/296 (59%), Gaps = 31/296 (10%)
Query: 62 IRKYSLRGNLGESGD--------------------ADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E+G A +V+ L+NY+D +YFG IGIGT
Sbjct: 9 VKKKSLRQNLIENGKLKEFMRTHKYNLGSKYIREAATLVSEQPLQNYLDTEYFGTIGIGT 68
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
P Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG+++
Sbjct: 69 PAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSETLSITYGTGSMT 127
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + IS A PV+ N+
Sbjct: 128 GILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIW 187
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
+QGLV++ +FS + + N EE G ++FG +D +Y G +VPV+ +GYWQ + + +
Sbjct: 188 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 245
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
+G++ C+ GC AI D+GTSLLAGPTT I+ + IGA +G V C ++ S
Sbjct: 246 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDS 300
>gi|440909656|gb|ELR59541.1| Chymosin [Bos grunniens mutus]
Length = 381
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/323 (39%), Positives = 179/323 (55%), Gaps = 25/323 (7%)
Query: 15 CLLLFPVVFSTPNGG-LYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRK--YSLRGNL 71
CL++ VF+ G + RI L K K + R KE ++K Y +
Sbjct: 3 CLVVLLAVFALSQGAEITRIPLYKGK-------SLRKALKEHGLLEDFLQKQQYGVSSKY 55
Query: 72 GESGDADIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHS 131
G+ V L NY+D+QYFG+I +GTPPQ FTV+FDTGSS+ WVPS C S AC H
Sbjct: 56 SGFGEVASVPLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCK-SNACKNHQ 114
Query: 132 KYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLL 191
++ +SST++ GK IHYGTG++ G D V + ++V Q +T+EP F
Sbjct: 115 RFDPRKSSTFQNLGKPLSIHYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVFTY 174
Query: 192 AKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDP 251
A+FDGILG+ + ++ ++PV+ NM+N+ LV + +FS + +RN E + G +DP
Sbjct: 175 AEFDGILGMAYPSLASEYSIPVFDNMMNRHLVAQDLFSVYMDRNGQES---MLTLGAIDP 231
Query: 252 DHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQV 311
+Y G +VPVT + YWQF + V I G C GGC AI D+GTS L GP++ I +
Sbjct: 232 SYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA-CEGGCQAILDTGTSKLVGPSSDILNI 290
Query: 312 NHAIGATGIVSQECKAVVSQYGE 334
AIGAT +QYGE
Sbjct: 291 QQAIGATQ----------NQYGE 303
>gi|344246136|gb|EGW02240.1| Renin [Cricetulus griseus]
Length = 720
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 112/241 (46%), Positives = 156/241 (64%), Gaps = 5/241 (2%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCY-FSIACYFHSKYRSGRS 138
V L NY+D QY+GEIGIGTPPQ F VIFDTGS+NLWVPS+KC AC HS Y S S
Sbjct: 38 VILTNYLDTQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYSACEIHSLYDSSES 97
Query: 139 STYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGIL 198
S+Y +NG IHYG+G + GF S+D V +G ++V Q F E T P + F+LAKFDG+L
Sbjct: 98 SSYMENGTEFTIHYGSGKVKGFLSQDIVTVGGIIVT-QTFGEVTELPLIPFMLAKFDGVL 156
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
G+GF +VG PV+ ++++Q ++ E VFS +++R++ GGE+V GG DP HY+G
Sbjct: 157 GMGFPAQAVGGVTPVFDHILSQRVLKEEVFSVYYSRDS-HLLGGEVVLGGSDPQHYQGNF 215
Query: 259 TYVPVTQKGYWQFDMGDV--MIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIG 316
YV V++ G W+ M + + G T C GC + D+G S ++GPT+ + + +G
Sbjct: 216 HYVSVSRTGSWEIAMKGLRRVSVGSATLLCEEGCVVVVDTGASYISGPTSSLKLIMQTLG 275
Query: 317 A 317
A
Sbjct: 276 A 276
>gi|193499297|gb|ACF18591.1| pepsinogen C precursor [Siniperca chuatsi]
gi|253762213|gb|ACT35558.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 179/304 (58%), Gaps = 15/304 (4%)
Query: 29 GLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
G+ +I L+K K + R A R E + +L+ E + + + NY D
Sbjct: 16 GIVKIPLRKHK---SMREALREKGIE-----LPYQDPALKYQADEFAGSANMNINNYADT 67
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
Y+G I IGTPPQ+F V+FDTGS+NLWV S C + AC H+K+ +SST+ G+S
Sbjct: 68 TYYGAISIGTPPQSFQVLFDTGSANLWVDSVYCN-TEACNAHTKFNPQQSSTFTAKGQSF 126
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
+ YG G++ G F D V +G +V+ +QE +T EP TF +A+FDGILGL + IS G
Sbjct: 127 YLPYGAGSLYGVFGYDTVNVGGIVITNQEIGLSTNEPGETFAVAQFDGILGLSYPTISAG 186
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV NM++Q L+N +F+F+ ++ E++G E+ FGG+D Y+G+ + PVT + Y
Sbjct: 187 GATPVMDNMISQNLLNADIFAFYL--SSGEQQGSELSFGGVDSSMYQGQIYWTPVTSETY 244
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQE 324
WQ + I+GQ +G+C+ GC +I D+GTS+L P+ ++ + AIGA G+ +
Sbjct: 245 WQIGVQGFQINGQESGWCSQGCQSIVDTGTSMLTAPSQLLGYIMQAIGAQQNQYGMYMVD 304
Query: 325 CKAV 328
C V
Sbjct: 305 CSQV 308
>gi|307175238|gb|EFN65290.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 357
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 112/246 (45%), Positives = 140/246 (56%), Gaps = 4/246 (1%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NY + Y+G I IGTPPQ F VIFDTGS+NLW+PS KC + AC H++Y S S+
Sbjct: 31 VILSNYQNINYYGVITIGTPPQEFKVIFDTGSANLWIPSKKCNLT-ACLIHNQYNSTASN 89
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT-FLLAKFDGIL 198
TY I Y I G S D V + V++Q F E T + FL A FDGIL
Sbjct: 90 TYIAKNALIQIKYFNSIIDGLISTDIVNVAGFNVQNQTFAELTNMSNEELFLPAPFDGIL 149
Query: 199 GLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEH 258
GL + IS +PV+ NMVNQ LV+ +FSF+ NR+ E GE + GG DP HY G
Sbjct: 150 GLAYSYISDNNIIPVFDNMVNQNLVSSHIFSFYLNRDPSAELDGEFILGGSDPAHYDGNF 209
Query: 259 TYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
TYVPVT KG+WQF M + ++ C C AIAD+G GPT+ + +N IG T
Sbjct: 210 TYVPVTHKGFWQFTMDKIEVNN--ISLCQSSCQAIADTGMGETYGPTSDVKTINELIGTT 267
Query: 319 GIVSQE 324
I E
Sbjct: 268 NIDGME 273
>gi|222425200|dbj|BAH20549.1| pepsinogen A-50 [Pongo abelii]
Length = 388
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 115/257 (44%), Positives = 159/257 (61%), Gaps = 8/257 (3%)
Query: 78 DIVALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGR 137
D L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+ +
Sbjct: 64 DEQPLENYLDMEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACTNHNLFNPED 122
Query: 138 SSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGI 197
SSTY+ ++ I YGTG+++G D V++G + Q F + EP A FDGI
Sbjct: 123 SSTYQSTSETVSIAYGTGSMTGILGYDTVQVGGISDTSQIFGLSETEPGSFLYYAPFDGI 182
Query: 198 LGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGE 257
LGL + IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G
Sbjct: 183 LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGS 240
Query: 258 HTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+VPVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA
Sbjct: 241 LNWVPVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGA 299
Query: 318 T----GIVSQECKAVVS 330
+ G + C A+ S
Sbjct: 300 SENSDGDMVVSCSAISS 316
>gi|1585066|prf||2124254C pepsin:ISOTYPE=3c
Length = 326
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 160/253 (63%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY
Sbjct: 6 LENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTY 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ ++ I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 QSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 125 TPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I ++ IGA+
Sbjct: 183 PVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIAKIQSDIGASENS 241
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 242 DGDMVVSCSAISS 254
>gi|292658855|ref|NP_001001600.2| pepsin A preproprotein [Bos taurus]
Length = 386
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 175/296 (59%), Gaps = 31/296 (10%)
Query: 62 IRKYSLRGNLGESGD--------------------ADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E+G A +V+ L+NY+D +YFG IGIGT
Sbjct: 23 VKKKSLRQNLIENGKLKEFMRTHKYNLGSKYIREAATLVSEQPLQNYLDTEYFGTIGIGT 82
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
P Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG+++
Sbjct: 83 PAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSETLSITYGTGSMT 141
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + IS A PV+ N+
Sbjct: 142 GILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIW 201
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
+QGLV++ +FS + + N EE G ++FG +D +Y G +VPV+ +GYWQ + + +
Sbjct: 202 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 259
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
+G++ C+ GC AI D+GTSLLAGPTT I+ + IGA +G V C ++ S
Sbjct: 260 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDS 314
>gi|1585064|prf||2124254A pepsin:ISOTYPE=3a
gi|1585065|prf||2124254B pepsin:ISOTYPE=3b
Length = 326
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 113/253 (44%), Positives = 159/253 (62%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTV+FDTGSSNLWVPS C S+AC H+++ SSTY
Sbjct: 6 LENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS-SLACTNHNRFNPEDSSTY 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ ++ I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 QSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
IS A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +V
Sbjct: 125 TPSISSSGATPVFDNIWNQGLVSQDLFSVYL--SADDQSGSVVIFGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PVT +GYWQ + + ++G+ CA GC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVTVEGYWQITVDSITMNGEAIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENS 241
Query: 319 -GIVSQECKAVVS 330
G + C A+ S
Sbjct: 242 DGDMVVSCSAISS 254
>gi|296471634|tpg|DAA13749.1| TPA: pepsin A precursor [Bos taurus]
Length = 367
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 175/296 (59%), Gaps = 31/296 (10%)
Query: 62 IRKYSLRGNLGESGD--------------------ADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E+G A +V+ L+NY+D +YFG IGIGT
Sbjct: 9 VKKKSLRQNLIENGKLKEFMRTHKYNLGSKYIREAATLVSEQPLQNYLDTEYFGTIGIGT 68
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
P Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG+++
Sbjct: 69 PAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSETLSITYGTGSMT 127
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + IS A PV+ N+
Sbjct: 128 GILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIW 187
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
+QGLV++ +FS + + N EE G ++FG +D +Y G +VPV+ +GYWQ + + +
Sbjct: 188 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 245
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
+G++ C+ GC AI D+GTSLLAGPTT I+ + IGA +G V C ++ S
Sbjct: 246 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDS 300
>gi|431896476|gb|ELK05888.1| Chymosin [Pteropus alecto]
Length = 348
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 151/237 (63%), Gaps = 5/237 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L NY+D+QYFG+I IGTPPQ+FTV+FDTGSS+LWVPS C S AC H ++ S SST+
Sbjct: 66 LVNYLDSQYFGKISIGTPPQDFTVVFDTGSSDLWVPSVYCK-SDACKNHRRFNSSESSTF 124
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+K G+ I YGTG++ G D V + ++V Q +T+EP F +FDGILGL
Sbjct: 125 QKLGQPLSIQYGTGSMEGILGSDTVTVSNIVDSRQTVGLSTQEPGDVFTYFEFDGILGLA 184
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ ++ +VPV+ NM+ LV + +FS + +RN ++G + G +D +Y+G +V
Sbjct: 185 YPSLAAKDSVPVFDNMMKHHLVAQDLFSVYMSRN---DQGSMLTLGAIDSSYYRGSLHWV 241
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PVT + YWQF + V +DG C GGC AI D+GTS+L GP++ I + AIGAT
Sbjct: 242 PVTVREYWQFTVDSVTVDGVVVA-CDGGCQAILDTGTSMLVGPSSDILNIQQAIGAT 297
>gi|440893605|gb|ELR46308.1| Pepsin A, partial [Bos grunniens mutus]
Length = 388
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/296 (41%), Positives = 175/296 (59%), Gaps = 31/296 (10%)
Query: 62 IRKYSLRGNLGESGD--------------------ADIVA---LKNYMDAQYFGEIGIGT 98
++K SLR NL E+G A +V+ L+NY+D +YFG IGIGT
Sbjct: 25 VKKKSLRQNLIENGKLKEFMRTHKYNLGSKYIREAATLVSEQPLQNYLDTEYFGTIGIGT 84
Query: 99 PPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSADIHYGTGAIS 158
P Q+FTVIFDTGSSNLWVPS C S AC H+++ SSTY+ ++ I YGTG+++
Sbjct: 85 PAQDFTVIFDTGSSNLWVPSIYCS-SEACTNHNRFNPQDSSTYEATSETLSITYGTGSMT 143
Query: 159 GFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVGKAVPVWYNMV 218
G D V++G + +Q F + EP A FDGILGL + IS A PV+ N+
Sbjct: 144 GVLGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIW 203
Query: 219 NQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGYWQFDMGDVMI 278
+QGLV++ +FS + + N EE G ++FG +D +Y G +VPV+ +GYWQ + + +
Sbjct: 204 DQGLVSQDLFSVYLSSN--EESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITM 261
Query: 279 DGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA----TGIVSQECKAVVS 330
+G++ C+ GC AI D+GTSLLAGPTT I+ + IGA +G V C ++ S
Sbjct: 262 NGESIA-CSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDS 316
>gi|296208793|ref|XP_002751243.1| PREDICTED: chymosin [Callithrix jacchus]
gi|18203303|sp|Q9N2D2.1|CHYM_CALJA RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|7008025|dbj|BAA90873.1| prochymosin [Callithrix jacchus]
Length = 381
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 177/310 (57%), Gaps = 17/310 (5%)
Query: 11 GFFLCLLLFPVVFSTPNGGLYRIGLKKRKFDLNNRVAARLDSKEGESFRTSIRKYSLRGN 70
GF + L +F + + G+ RI L K K + R KE ++ + +
Sbjct: 3 GFVVLLAVFAL---SQASGIVRIPLHKGK-------SLRRALKERGLLEDFLKNHQHAVS 52
Query: 71 LGESGDADIVA--LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACY 128
S ++ + L NY+D QYFG+I IGTPPQ FTV+FDTGSS+LWVPS C S+AC
Sbjct: 53 RKHSNSREVASEFLTNYLDCQYFGKIYIGTPPQEFTVVFDTGSSDLWVPSVYCN-SVACQ 111
Query: 129 FHSKYRSGRSSTYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLT 188
H ++ +SST++ KS I YGTG++ G D V + +V Q +T+EP
Sbjct: 112 NHHRFDPSKSSTFQNMDKSLSIQYGTGSMQGLLGYDTVTVSSIVDPHQTVGLSTQEPGDV 171
Query: 189 FLLAKFDGILGLGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGG 248
F ++FDGILGL + ++ +VPV+ NM+++ LV + +FS + +RN E+G + G
Sbjct: 172 FTYSEFDGILGLAYPSLASEYSVPVFDNMMDRHLVAQDLFSVYMSRN---EQGSMLTLGA 228
Query: 249 MDPDHYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTII 308
+DP +Y G ++PVT + YWQF + V +DG C GGC AI D+GTS+L GP + I
Sbjct: 229 IDPSYYTGSLHWIPVTVQEYWQFTVDSVTVDGVVVA-CDGGCQAILDTGTSMLVGPGSDI 287
Query: 309 TQVNHAIGAT 318
+ AIGAT
Sbjct: 288 FNIQQAIGAT 297
>gi|190014572|dbj|BAG48264.1| pepsinogen 2 [Thunnus orientalis]
Length = 376
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 156/238 (65%), Gaps = 7/238 (2%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G + IGTPPQ+F VIFDTGSSNLWVPS C S AC K+ +SST+
Sbjct: 61 ITNDADLSYYGVVSIGTPPQSFKVIFDTGSSNLWVPSVYCS-SQACQNQKKFNPQQSSTF 119
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
K + I YGTG+++G + D V++G + V +Q F I + P + ++ K DGILGL
Sbjct: 120 KWGDQPLSIQYGTGSMTGRLASDIVEVGGISVNNQVFGISQSEAPFMAYM--KADGILGL 177
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
FQ I+ VPV+ NMV+QGLV++P+FS + + N+ ++G E+VFGG+D HY G+ T+
Sbjct: 178 AFQSIASDNVVPVFDNMVSQGLVSQPLFSVYLSSNS--QQGNEVVFGGIDSSHYTGKITW 235
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
+P+T YWQ M V I+GQT C+GGC AI D+GTSL+ GP+ I +N +GA+
Sbjct: 236 IPLTSATYWQIQMDSVTINGQTVA-CSGGCQAIIDTGTSLIVGPSRDIYNMNAWVGAS 292
>gi|317419142|emb|CBN81179.1| Pepsinogen 1 [Dicentrarchus labrax]
Length = 378
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/258 (43%), Positives = 161/258 (62%), Gaps = 18/258 (6%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
+ N D Y+G I IGTPPQ+F V+FDTGSSNLWVPS C S AC H K+ G SSTY
Sbjct: 64 MTNDADLAYYGIISIGTPPQSFKVVFDTGSSNLWVPSIYCN-SPACNNHDKFNPGLSSTY 122
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEF-IEATREPSLTFLLAKFDGILGL 200
+ NG I YGTG+++GF D V +G L V++Q F + + P + ++ A DGILGL
Sbjct: 123 RHNGSPLSIQYGTGSMTGFLGYDTVTVGGLAVRNQIFGMSKSEAPFMQYMRA--DGILGL 180
Query: 201 GFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTY 260
+ +S A PV+ NM+N+GLVN+ +FS + + N+ ++G + FGG+D +HY G T+
Sbjct: 181 AYPRLSASHATPVFDNMMNEGLVNQDLFSVYLSSNS--QQGSVVTFGGVDSNHYHGSITW 238
Query: 261 VPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGATGI 320
+P++ + YWQ + V ++GQ C GGC AI D+GTSL+ GP + I+ +N +GA+
Sbjct: 239 IPLSNELYWQITVDSVTVNGQVVA-CNGGCQAIVDTGTSLVVGPQSSISNINSMVGAS-- 295
Query: 321 VSQECKAVVSQYGEEIIN 338
SQ G+ ++N
Sbjct: 296 ---------SQNGDYVVN 304
>gi|157836875|pdb|3PSG|A Chain A, The High Resolution Crystal Structure Of Porcine
Pepsinogen
Length = 370
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 50 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 108
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 109 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 168
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 169 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 226
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 227 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 282
>gi|281339613|gb|EFB15197.1| hypothetical protein PANDA_017311 [Ailuropoda melanoleuca]
Length = 318
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 101/232 (43%), Positives = 147/232 (63%), Gaps = 1/232 (0%)
Query: 86 MDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNG 145
+ + YFGEI IGTPPQNF V+FDTGSSNLWVPS+ C S AC H+ + SSTY+ NG
Sbjct: 1 LQSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ-SQACTNHNMFNPSSSSTYRNNG 59
Query: 146 KSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEI 205
++ ++YG+G+++ D V + ++++ +QEF + EP+ F A FDGILG+ + +
Sbjct: 60 QTYTLYYGSGSLTVLLGYDTVNVQNIIINNQEFGLSEIEPNNPFYYANFDGILGMAYPNL 119
Query: 206 SVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQ 265
+VG A V +MV Q + +P+FSF+F+R E GGE++ GG+D Y GE + PVT+
Sbjct: 120 AVGNAPTVTQSMVQQDQLTQPIFSFYFSRQPTYEYGGELILGGVDSQFYSGEIVWTPVTR 179
Query: 266 KGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGA 317
+ YWQ + + ++ Q TG C+ GC AI D+GT +LA P I GA
Sbjct: 180 EMYWQIAIDEFLVSNQATGLCSQGCQAIVDTGTYMLAVPQQFIGSFLQTTGA 231
>gi|157837066|pdb|5PEP|A Chain A, X-Ray Analyses Of Aspartic Proteases. Ii.
Three-Dimensional Structure Of The Hexagonal Crystal
Form Of Porcine Pepsin At 2.3 Angstroms Resolution
Length = 326
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 111/253 (43%), Positives = 158/253 (62%), Gaps = 8/253 (3%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 125 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT--- 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENS 241
Query: 319 -GIVSQECKAVVS 330
G + C ++ S
Sbjct: 242 DGEMVISCSSIAS 254
>gi|73620984|sp|P81497.2|PEPA_SUNMU RecName: Full=Pepsin A; Flags: Precursor
gi|9798654|dbj|BAB11749.1| pepsinogen A [Suncus murinus]
Length = 387
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/290 (40%), Positives = 168/290 (57%), Gaps = 8/290 (2%)
Query: 30 LYRIGLKKRKFDLNNRVA-ARLDSKEGESFRTSIRKYSLRGNLGESGDADIVALKNYMDA 88
LY++ L K+K N + L + KY E D +V NYMD
Sbjct: 17 LYKVPLVKKKSLRQNLIENGLLKDFLAKHNVNPASKYFPTEAATELADQPLV---NYMDM 73
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG IGIGTPPQ FTVIFDTGSSNLWVPS C S AC H+++ +SST++ ++
Sbjct: 74 EYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS-SPACSNHNRFNPQKSSTFQSTSQTL 132
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G D V++ + +Q F + EP + FDGILGL + I+
Sbjct: 133 SIAYGTGSMTGVLGYDTVQVAGIADTNQIFGLSQTEPGSFLYYSPFDGILGLAYPNIASS 192
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV+ NM NQGLV++ +FS + + N ++ G ++FGG+D +Y G +VP++ +GY
Sbjct: 193 GATPVFDNMWNQGLVSQDLFSVYLSSN--DQSGSVVIFGGIDSSYYTGNLNWVPLSSEGY 250
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
WQ + + ++GQ C+G C AI D+GTSLL+GP I + +IGA+
Sbjct: 251 WQITVDSITMNGQAIA-CSGSCQAIVDTGTSLLSGPNNAIANIQKSIGAS 299
>gi|118572685|sp|P00791.3|PEPA_PIG RecName: Full=Pepsin A; Flags: Precursor
Length = 385
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 65 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 124 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 184 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 241
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 242 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 297
>gi|164604|gb|AAA31096.1| pepsinogen A precursor [Sus scrofa]
Length = 385
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 65 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 124 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 184 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 241
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 242 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 297
>gi|397516556|ref|XP_003846117.1| PREDICTED: LOW QUALITY PROTEIN: pepsin A-5, partial [Pan paniscus]
Length = 274
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 113/246 (45%), Positives = 156/246 (63%), Gaps = 8/246 (3%)
Query: 89 QYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTYKKNGKSA 148
+YFG IGIGTP Q+FTV+FDTGSSNLWVPS CY S+AC H+++ SSTYK K+
Sbjct: 2 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCY-SLACTNHNRFNPQDSSTYKSTSKTV 60
Query: 149 DIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLGFQEISVG 208
I YGTG+++G D V++G + +Q F + EP A FDGILGL + IS
Sbjct: 61 SITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLXFAPFDGILGLAYPSISSS 120
Query: 209 KAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYVPVTQKGY 268
A PV+ N+ NQGLV++ +FS + +AD++ G ++FGG+D +Y G +VPVT +GY
Sbjct: 121 GATPVFDNIWNQGLVSQDLFSVYL--SADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGY 178
Query: 269 WQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT----GIVSQE 324
WQ + + ++G+T CA GC AI D+GTSLL GPT+ I + IGA+ G +
Sbjct: 179 WQITVDSITMNGKTIA-CAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVS 237
Query: 325 CKAVVS 330
C A+ S
Sbjct: 238 CSAISS 243
>gi|253723303|pdb|2PSG|A Chain A, Refined Structure Of Porcine Pepsinogen At 1.8 Angstroms
Resolution
Length = 370
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 50 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 108
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 109 EATXQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 168
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 169 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 226
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 227 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 282
>gi|292658825|ref|NP_999038.2| pepsin A preproprotein [Sus scrofa]
gi|121073319|gb|ABM47074.1| pepsinogen A [Sus scrofa]
Length = 385
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 152/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +Y G IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 65 LENYLDTEYLGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 123
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP + A FDGILGL
Sbjct: 124 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSSLYYAPFDGILGLA 183
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N D G ++ GG+D +Y G +V
Sbjct: 184 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDD--SGSVVLLGGIDSSYYTGSLNWV 241
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 242 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 297
>gi|50294061|ref|XP_449442.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528756|emb|CAG62418.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/257 (43%), Positives = 156/257 (60%), Gaps = 13/257 (5%)
Query: 80 VALKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSS 139
V L NYMDAQYF +I +GTPPQ F VI DTGSSNLWVPS C S+AC+ H+KY +SS
Sbjct: 81 VPLSNYMDAQYFADISLGTPPQKFKVILDTGSSNLWVPSVDC-GSLACFLHNKYDHSQSS 139
Query: 140 TYKKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILG 199
TY K+G+ I YG+G+I G+ SED+++IGDL +++Q+F E T EP L F KFDGILG
Sbjct: 140 TYIKDGRPLSISYGSGSIEGYISEDNLQIGDLTIQNQKFGETTSEPGLAFAFGKFDGILG 199
Query: 200 LGFQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADE-------EEGGEIVFGGMDPD 252
L + I+ P +Y+ + Q L++E FSF+ D +GG GG+D
Sbjct: 200 LAYDTIAQDDITPPFYSAIQQHLLDESKFSFYLKSVNDPAAEGGSASDGGVFTLGGVDSS 259
Query: 253 HYKGEHTYVPVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVN 312
+KG+ + V ++ YW+ + + + Q+TG AAI D+GTSL+ P+ + +N
Sbjct: 260 KFKGDLIPLHVRRQAYWEVPLNAIKLGDQSTGKLENTGAAI-DTGTSLITLPSDMAEIIN 318
Query: 313 HAIGA----TGIVSQEC 325
IGA TG + EC
Sbjct: 319 AQIGAKKGWTGQYTLEC 335
>gi|494476|pdb|1PSA|A Chain A, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|494478|pdb|1PSA|B Chain B, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|67463919|pdb|1YX9|A Chain A, Effect Of Dimethyl Sulphoxide On The Crystal Structure Of
Porcine Pepsin
Length = 326
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 153/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N ++ G ++ GG+D +Y G +V
Sbjct: 125 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSN--DDSGSVVLLGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 238
>gi|157836865|pdb|3PEP|A Chain A, Revised 2.3 Angstroms Structure Of Porcine Pepsin.
Evidence For A Flexible Subdomain
Length = 326
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 107/237 (45%), Positives = 153/237 (64%), Gaps = 4/237 (1%)
Query: 82 LKNYMDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKCYFSIACYFHSKYRSGRSSTY 141
L+NY+D +YFG IGIGTP Q+FTVIFDTGSSNLWVPS C S+AC H+++ SST+
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS-SLACSDHNQFNPDDSSTF 64
Query: 142 KKNGKSADIHYGTGAISGFFSEDHVKIGDLVVKDQEFIEATREPSLTFLLAKFDGILGLG 201
+ + I YGTG+++G D V++G + +Q F + EP A FDGILGL
Sbjct: 65 EATSQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGILGLA 124
Query: 202 FQEISVGKAVPVWYNMVNQGLVNEPVFSFWFNRNADEEEGGEIVFGGMDPDHYKGEHTYV 261
+ IS A PV+ N+ +QGLV++ +FS + + N ++ G ++ GG+D +Y G +V
Sbjct: 125 YPSISASGATPVFDNLWDQGLVSQDLFSVYLSSN--DDSGSVVLLGGIDSSYYTGSLNWV 182
Query: 262 PVTQKGYWQFDMGDVMIDGQTTGFCAGGCAAIADSGTSLLAGPTTIITQVNHAIGAT 318
PV+ +GYWQ + + +DG+T C+GGC AI D+GTSLL GPT+ I + IGA+
Sbjct: 183 PVSVEGYWQITLDSITMDGETIA-CSGGCQAIVDTGTSLLTGPTSAIANIQSDIGAS 238
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.421
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,081,764,059
Number of Sequences: 23463169
Number of extensions: 275469987
Number of successful extensions: 555726
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3457
Number of HSP's successfully gapped in prelim test: 1728
Number of HSP's that attempted gapping in prelim test: 542243
Number of HSP's gapped (non-prelim): 5951
length of query: 359
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 216
effective length of database: 9,003,962,200
effective search space: 1944855835200
effective search space used: 1944855835200
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)